Gaussian Two-Armed Bandit and Optimization of Batch Data Processing

A. V. Kolnogorov

doi:10.1134/S0032946018010076

Authors: Kolnogorov A.V.¹
Affiliations:
1. Department of Applied Mathematics and Information Science
Issue: Volume 54, No. 1 (2018)
Pages: 84-100
Section: Large Systems
URL: https://journal-vniispk.ru/0032-9460/article/view/166491
DOI: https://doi.org/10.1134/S0032946018010076
ID: 166491

Abstract

We consider the minimax setting for the two-armed bandit problem with normally distributed incomes whose mathematical expectations and variances are a priori unknown. This setting arises naturally in the optimization of batch data processing, where two alternative processing methods with different a priori unknown efficiencies are available. During the control process, the more efficient method must be identified and then applied predominantly. We use the main theorem of game theory to find the minimax strategy and minimax risk as the Bayesian ones corresponding to the worst-case prior distribution. To compute them, we derive a recursive integro-difference equation. We show that batch data processing barely increases the minimax risk if the number of batches is large enough.
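
The batch setting described in the abstract can be illustrated with a small simulation. The sketch below is not the minimax strategy of the paper: it assumes a simple explore-then-greedy rule (one batch per method, then the method with the higher sample mean), and the function name `run_batch_bandit` and all of its parameters are hypothetical. The loss it reports is the expected income forgone relative to always applying the better method, i.e. the gap between the two means times the number of items processed by the worse method.

```python
import random


def run_batch_bandit(means, stds, n_batches, batch_size, seed=0):
    """Simulate a two-armed Gaussian bandit where the arm is fixed per batch.

    Illustrative rule (an assumption, not the paper's strategy): process one
    batch with each arm, then always pick the arm with the higher sample mean.
    Returns (counts, regret): items processed per arm, and the expected income
    lost compared with always using the arm that has the larger true mean.
    """
    rng = random.Random(seed)
    sums = [0.0, 0.0]   # cumulative income observed per arm
    counts = [0, 0]     # number of items processed per arm
    for b in range(n_batches):
        if b < 2:
            arm = b  # initial exploration: one batch for each arm
        else:
            # greedy choice by the current sample means
            arm = 0 if sums[0] / counts[0] >= sums[1] / counts[1] else 1
        # the chosen arm is applied to every item in the batch
        sums[arm] += sum(rng.gauss(means[arm], stds[arm]) for _ in range(batch_size))
        counts[arm] += batch_size
    best = max(means)
    regret = sum((best - means[a]) * counts[a] for a in range(2))
    return counts, regret


if __name__ == "__main__":
    counts, regret = run_batch_bandit((0.0, 1.0), (1.0, 1.0), n_batches=50, batch_size=100)
    print("items per arm:", counts, "regret:", regret)
```

With well-separated means, the greedy rule locks onto the better method after the two exploratory batches, so the loss stays at roughly one batch of the worse method. When the means are close relative to the noise, the same rule can lock onto the wrong method, which is the kind of worst case a minimax analysis is designed to control.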

About the author

A. Kolnogorov

Department of Applied Mathematics and Information Science

Corresponding author
Email: kolnogorov53@mail.ru
Russia, Moscow