Adaptive Two-stage Learning Algorithm for Repeated Games

Wataru Fujita, Koichi Moriyama, Ken-ichi Fukui, Masayuki Numao

2016

Abstract

In our society, people engage in a variety of interactions. To analyze such interactions, we consider these interactions as a game and people as agents equipped with reinforcement learning algorithms. Reinforcement learning algorithms are widely studied with a goal of identifying strategies of gaining large payoffs in games; however, existing algorithms learn slowly because they require a large number of interactions. In this work, we constructed an algorithm that both learns quickly and maximizes payoffs in various repeated games. Our proposed algorithm combines two different algorithms that are used in the early and later stages of our algorithm. We conducted experiments in which our proposed agents played ten kinds of games in self-play and with other agents. Results showed that our proposed algorithm learned more quickly than existing algorithms and gained sufficiently large payoffs in nine games.

Download


Paper Citation


in Harvard Style

Fujita W., Moriyama K., Fukui K. and Numao M. (2016). Adaptive Two-stage Learning Algorithm for Repeated Games . In Proceedings of the 8th International Conference on Agents and Artificial Intelligence - Volume 1: ICAART, ISBN 978-989-758-172-4, pages 47-55. DOI: 10.5220/0005711000470055

in Bibtex Style

@conference{icaart16,
author={Wataru Fujita and Koichi Moriyama and Ken-ichi Fukui and Masayuki Numao},
title={Adaptive Two-stage Learning Algorithm for Repeated Games},
booktitle={Proceedings of the 8th International Conference on Agents and Artificial Intelligence - Volume 1: ICAART,},
year={2016},
pages={47-55},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005711000470055},
isbn={978-989-758-172-4},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 8th International Conference on Agents and Artificial Intelligence - Volume 1: ICAART,
TI - Adaptive Two-stage Learning Algorithm for Repeated Games
SN - 978-989-758-172-4
AU - Fujita W.
AU - Moriyama K.
AU - Fukui K.
AU - Numao M.
PY - 2016
SP - 47
EP - 55
DO - 10.5220/0005711000470055