TY - GEN
T1 - Solving hanabi
T2 - 29th AAAI Conference on Artificial Intelligence, AAAI 2015
AU - Osawa, Hirotaka
N1 - Publisher Copyright:
Copyright © 2015, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.
PY - 2015
Y1 - 2015
N2 - A unique behavior of humans is modifying one's unobservable behavior based on the reaction of others for cooperation. We used a card game called Hanabi as an evaluation task of imitating human reflective intelligence with artificial intelligence. Hanabi is a cooperative card game with incomplete information. A player cooperates with an opponent in building several card sets constructed with the same color and ordered numbers. However, like a blind man's bluff, each player sees the cards of all other players except his/her own. Also, communication between players is restricted to information about the same numbers and colors, and the player is required to read his/his opponent's intention with the opponent's hand, estimate his/her cards with incomplete information, and play one of them for building a set. We compared human play with several simulated strategies. The results indicate that the strategy with feedbacks from simulated opponent's viewpoints achieves more score than other strategies.
AB - A unique behavior of humans is modifying one's unobservable behavior based on the reaction of others for cooperation. We used a card game called Hanabi as an evaluation task of imitating human reflective intelligence with artificial intelligence. Hanabi is a cooperative card game with incomplete information. A player cooperates with an opponent in building several card sets constructed with the same color and ordered numbers. However, like a blind man's bluff, each player sees the cards of all other players except his/her own. Also, communication between players is restricted to information about the same numbers and colors, and the player is required to read his/his opponent's intention with the opponent's hand, estimate his/her cards with incomplete information, and play one of them for building a set. We compared human play with several simulated strategies. The results indicate that the strategy with feedbacks from simulated opponent's viewpoints achieves more score than other strategies.
UR - http://www.scopus.com/inward/record.url?scp=84964573697&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84964573697&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84964573697
T3 - AAAI Workshop - Technical Report
SP - 37
EP - 43
BT - Computer Poker and Imperfect Information - Papers Presented at the 29th AAAI Conference on Artificial Intelligence, Technical Report
PB - AI Access Foundation
Y2 - 25 January 2015 through 30 January 2015
ER -