Abstract
We investigate an online regression problem in which the learner makes predictions sequentially while only the limited information on features is observable. In this paper, we propose a general setting for the limitation of the available information, where the observed information is determined by a function chosen from a given set of observation functions. Our problem setting is a generalization of the online sparse linear regression problem, which has been actively studied. For our general problem, we present an algorithm by combining multi-armed bandit algorithms and online learning methods. This algorithm admits a sublinear regret bound when the number of observation functions is constant. We also show that the dependency on the number of observation functions is inevitable unless additional assumptions are adopted. To mitigate this inefficiency, we focus on a special case of practical importance, in which the observed information is expressed through linear combinations of the original features. We propose efficient algorithms for this special case. Finally, we also demonstrate the efficiency of the proposed algorithms by simulation studies using both artificial and real data.
Original language | English |
---|---|
Pages | 1599-1607 |
Number of pages | 9 |
Publication status | Published - 2018 |
Event | 21st International Conference on Artificial Intelligence and Statistics, AISTATS 2018 - Playa Blanca, Lanzarote, Canary Islands, Spain Duration: 2018 Apr 9 → 2018 Apr 11 |
Conference
Conference | 21st International Conference on Artificial Intelligence and Statistics, AISTATS 2018 |
---|---|
Country/Territory | Spain |
City | Playa Blanca, Lanzarote, Canary Islands |
Period | 18/4/9 → 18/4/11 |
ASJC Scopus subject areas
- Statistics and Probability
- Artificial Intelligence