TY - GEN
T1 - Distributed learning under imperfect sensing in cognitive radio networks
AU - Liu, Keqin
AU - Zhao, Qing
AU - Krishnamachari, Bhaskar
PY - 2010
Y1 - 2010
N2 - We consider a cognitive radio network, where M distributed secondary users search for spectrum opportunities among N independent channels without information exchange. The occupancy of each channel by the primary network is modeled as a Bernoulli process with unknown mean which represents the unknown traffic load of the primary network. In each slot, a secondary transmitter chooses one channel to sense and subsequently transmit if the channel is sensed as idle. Sensing is considered to be imperfect, i.e., an idle channel can be sensed as busy and vice versa. Users transmitting on the same channel collide and none of them can transmit successfully. The objective is to maximize the system throughput under the collision constraint imposed by the primary network while ensuring synchronized channel selection between each secondary transmitter and its receiver. The performance of a channel selection policy is measured by the system regret, defined as the expected total performance loss with respect to the optimal performance under the ideal scenario where all channel means are known to all users and collisions among users are eliminated throughput perfect scheduling. We show that the optimal system regret has the same logarithmic order as the centralized counterpart with perfect sensing. An order-optimal decentralized policy is constructed to achieve the logarithmic order of the system regret while ensuring fairness among all users.
AB - We consider a cognitive radio network, where M distributed secondary users search for spectrum opportunities among N independent channels without information exchange. The occupancy of each channel by the primary network is modeled as a Bernoulli process with unknown mean which represents the unknown traffic load of the primary network. In each slot, a secondary transmitter chooses one channel to sense and subsequently transmit if the channel is sensed as idle. Sensing is considered to be imperfect, i.e., an idle channel can be sensed as busy and vice versa. Users transmitting on the same channel collide and none of them can transmit successfully. The objective is to maximize the system throughput under the collision constraint imposed by the primary network while ensuring synchronized channel selection between each secondary transmitter and its receiver. The performance of a channel selection policy is measured by the system regret, defined as the expected total performance loss with respect to the optimal performance under the ideal scenario where all channel means are known to all users and collisions among users are eliminated throughput perfect scheduling. We show that the optimal system regret has the same logarithmic order as the centralized counterpart with perfect sensing. An order-optimal decentralized policy is constructed to achieve the logarithmic order of the system regret while ensuring fairness among all users.
KW - Cognitive radio
KW - decentralized multi-armed bandit
KW - distributed learning
KW - imperfect observation
KW - regret
UR - http://www.scopus.com/inward/record.url?scp=79957994125&partnerID=8YFLogxK
U2 - 10.1109/ACSSC.2010.5757646
DO - 10.1109/ACSSC.2010.5757646
M3 - Conference Proceeding
AN - SCOPUS:79957994125
SN - 9781424497218
T3 - Conference Record - Asilomar Conference on Signals, Systems and Computers
SP - 671
EP - 675
BT - Conference Record of the 44th Asilomar Conference on Signals, Systems and Computers, Asilomar 2010
T2 - 44th Asilomar Conference on Signals, Systems and Computers, Asilomar 2010
Y2 - 7 November 2010 through 10 November 2010
ER -