TY - GEN
T1 - Learning and sharing in a changing world
T2 - 2011 Information Theory and Applications Workshop, ITA 2011
AU - Liu, Haoyang
AU - Liu, Keqin
AU - Zhao, Qing
PY - 2011
Y1 - 2011
N2 - We consider decentralized restless multi-armed bandit problems with unknown dynamics and multiple players. The reward state of each arm transits according to an unknown Markovian rule when it is played and evolves according to an arbitrary unknown random process when it is passive. Players activating the same arm at the same time collide and suffer from reward loss. The objective is to maximize the long-term reward by designing a decentralized arm selection policy to address unknown reward models and collisions among players. A decentralized policy is constructed that achieves a regret with logarithmic order. The result finds applications in communication networks, financial investment, and industrial engineering.
AB - We consider decentralized restless multi-armed bandit problems with unknown dynamics and multiple players. The reward state of each arm transits according to an unknown Markovian rule when it is played and evolves according to an arbitrary unknown random process when it is passive. Players activating the same arm at the same time collide and suffer from reward loss. The objective is to maximize the long-term reward by designing a decentralized arm selection policy to address unknown reward models and collisions among players. A decentralized policy is constructed that achieves a regret with logarithmic order. The result finds applications in communication networks, financial investment, and industrial engineering.
UR - http://www.scopus.com/inward/record.url?scp=79955764815&partnerID=8YFLogxK
U2 - 10.1109/ITA.2011.5743588
DO - 10.1109/ITA.2011.5743588
M3 - Conference Proceeding
AN - SCOPUS:79955764815
SN - 9781457703614
T3 - 2011 Information Theory and Applications Workshop, ITA 2011 - Conference Proceedings
SP - 240
EP - 246
BT - 2011 Information Theory and Applications Workshop, ITA 2011 - Conference Proceedings
Y2 - 6 February 2011 through 11 February 2011
ER -