TY - GEN
T1 - Cell selection using distributed Q-learning in heterogeneous networks
AU - Kudo, Toshihito
AU - Ohtsuki, Tomoaki
PY - 2013/12/1
Y1 - 2013/12/1
N2 - Cell selection with cell range expansion (CRE) that is a technique to expand a pico cell range virtually by adding a bias value to the pico received power, instead of increasing transmit power of the pico base station (PBS), can make coverage, cell-edge throughput, and overall network throughput improved. Many studies about CRE have used a common bias value among all user equipments (UEs), while the optimal bias values that minimize the number of UE outages vary from one UE to another. The optimal bias value that minimizes the number of UE outages depends on several factors such as the dividing ratio of radio resources between macro base stations (MBSs) and PBSs, it is given only by the trial and error method. In this paper, we propose a scheme to select a cell by using Q-learning algorithm where each UE learns which cell to select to minimize the number of UE outages from its past experience independently. Simulation results show that, compared to the practical common bias value setting, the proposed scheme reduces the number of UE outages and improves network throughput in the most cases. Moreover, instead of the degradation of the performances, it also solves the storage problem of our previous work.
AB - Cell selection with cell range expansion (CRE) that is a technique to expand a pico cell range virtually by adding a bias value to the pico received power, instead of increasing transmit power of the pico base station (PBS), can make coverage, cell-edge throughput, and overall network throughput improved. Many studies about CRE have used a common bias value among all user equipments (UEs), while the optimal bias values that minimize the number of UE outages vary from one UE to another. The optimal bias value that minimizes the number of UE outages depends on several factors such as the dividing ratio of radio resources between macro base stations (MBSs) and PBSs, it is given only by the trial and error method. In this paper, we propose a scheme to select a cell by using Q-learning algorithm where each UE learns which cell to select to minimize the number of UE outages from its past experience independently. Simulation results show that, compared to the practical common bias value setting, the proposed scheme reduces the number of UE outages and improves network throughput in the most cases. Moreover, instead of the degradation of the performances, it also solves the storage problem of our previous work.
UR - http://www.scopus.com/inward/record.url?scp=84893301735&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84893301735&partnerID=8YFLogxK
U2 - 10.1109/APSIPA.2013.6694368
DO - 10.1109/APSIPA.2013.6694368
M3 - Conference contribution
AN - SCOPUS:84893301735
SN - 9789869000604
T3 - 2013 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2013
BT - 2013 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2013
T2 - 2013 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2013
Y2 - 29 October 2013 through 1 November 2013
ER -