Adaptive Two-stage Learning Algorithm for Repeated Games
Wataru Fujita, Koichi Moriyama, Ken-ichi Fukui, Masayuki Numao
2016
Abstract
In our society, people engage in a variety of interactions. To analyze such interactions, we consider these interactions as a game and people as agents equipped with reinforcement learning algorithms. Reinforcement learning algorithms are widely studied with a goal of identifying strategies of gaining large payoffs in games; however, existing algorithms learn slowly because they require a large number of interactions. In this work, we constructed an algorithm that both learns quickly and maximizes payoffs in various repeated games. Our proposed algorithm combines two different algorithms that are used in the early and later stages of our algorithm. We conducted experiments in which our proposed agents played ten kinds of games in self-play and with other agents. Results showed that our proposed algorithm learned more quickly than existing algorithms and gained sufficiently large payoffs in nine games.
DownloadPaper Citation
in Harvard Style
Fujita W., Moriyama K., Fukui K. and Numao M. (2016). Adaptive Two-stage Learning Algorithm for Repeated Games . In Proceedings of the 8th International Conference on Agents and Artificial Intelligence - Volume 1: ICAART, ISBN 978-989-758-172-4, pages 47-55. DOI: 10.5220/0005711000470055
in Bibtex Style
@conference{icaart16,
author={Wataru Fujita and Koichi Moriyama and Ken-ichi Fukui and Masayuki Numao},
title={Adaptive Two-stage Learning Algorithm for Repeated Games},
booktitle={Proceedings of the 8th International Conference on Agents and Artificial Intelligence - Volume 1: ICAART,},
year={2016},
pages={47-55},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005711000470055},
isbn={978-989-758-172-4},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 8th International Conference on Agents and Artificial Intelligence - Volume 1: ICAART,
TI - Adaptive Two-stage Learning Algorithm for Repeated Games
SN - 978-989-758-172-4
AU - Fujita W.
AU - Moriyama K.
AU - Fukui K.
AU - Numao M.
PY - 2016
SP - 47
EP - 55
DO - 10.5220/0005711000470055