Wyniki wyszukiwania

Sortuj według:

Ogranicz wyniki do:

On nearly selfoptimizing strategies for multiarmed bandit problems with controlled arms

100%

Drabik E.

Applicationes Mathematicae

1995-1996

tom 23

nr 4

449-473

Two kinds of strategies for a multiarmed Markov bandit problem with controlled arms are considered: a strategy with forcing and a strategy with randomization. The choice of arm and control function in both cases is based on the current value of the average cost per unit time functional. Some simulation results are also presented.

On adaptive control of Markov chains using nonparametric estimation

63%

Drabik E., Stettner Ł.

Applicationes Mathematicae

2000

tom 27

nr 2

143-152

Two adaptive procedures for controlled Markov chains which are based on a nonparametric window estimation are shown.

Ograniczanie wyników

2 Applicationes Mathematicae

2 Drabik E.

1 Stettner Ł.

1 2000

1 1996

Wyniki wyszukiwania

On nearly selfoptimizing strategies for multiarmed bandit problems with controlled arms

On adaptive control of Markov chains using nonparametric estimation