Advantage Mapping: Learning Operation Mapping for User-Preferred Manipulation by Extracting Scenes with Advantage Function

Rintaro Hasegawa, Yosuke Fukuchi, Kohei Okuoka, Michita Imai

研究成果: Conference contribution

抄録

When a user manipulates a system, a user input through an interface, or an operation, is converted to the user's intended action according to the mapping that links operations and actions, which we call "operation mapping". Although many operation mappings are created by designers assuming how a typical user would operate the system, the optimal operation mapping may vary from user to user. The designer cannot prepare in advance all possible operation mappings. One approach to solve this problem involves autonomous learning of an operation mapping during the operation. However, existing methods require manual preparation of scenes for learning mappings. We propose advantage mapping, which enables the efficient learning of operation mappings. Working from the idea that scenes in which the user's desired action is predictable are useful for learning operation mappings, advantage mapping extracts scenes according to the magnitude of entropy in the output of the action value function acquired from reinforcement learning. In our experiment, the user's ideal operation mapping was more accurately obtained from the scenes selected by advantage mapping than from learning through actual play.

本文言語English
ホスト出版物のタイトルHAI 2022 - Proceedings of the 10th Conference on Human-Agent Interaction
出版社Association for Computing Machinery, Inc
ページ95-103
ページ数9
ISBN(電子版)9781450393232
DOI
出版ステータスPublished - 2022 12月 5
イベント10th Conference on Human-Agent Interaction, HAI 2022 - Christchurch, New Zealand
継続期間: 2022 12月 52022 12月 8

出版物シリーズ

名前HAI 2022 - Proceedings of the 10th Conference on Human-Agent Interaction

Conference

Conference10th Conference on Human-Agent Interaction, HAI 2022
国/地域New Zealand
CityChristchurch
Period22/12/522/12/8

ASJC Scopus subject areas

  • 人工知能
  • 人間とコンピュータの相互作用
  • ソフトウェア

フィンガープリント

「Advantage Mapping: Learning Operation Mapping for User-Preferred Manipulation by Extracting Scenes with Advantage Function」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル