TRANSFER LEARNING FOR MULTI-AGENT COORDINATION

Peter Vrancx, Yann-Michaël De Hauwere, Ann Nowé

2011

Abstract

Transfer learning leverages an agent’s experience in a source task in order to improve its performance in a related target task. Recently, this technique has received attention in reinforcement learning settings. Training a reinforcement learning agent on a suitable source task allows the agent to reuse this experience to significantly improve performance on more complex target problems. Currently, reinforcement learning transfer approaches focus almost exclusively on speeding up learning in single agent systems. In this paper we investigate the potential of applying transfer learning to the problem of agent coordination in multi-agent systems. The idea underlying our approach is that agents can determine how to deal with the presence of other agents in a relatively simple training setting. By then generalizing this knowledge, the agents can use this experience to speed up learning in more complex multi-agent learning tasks.

Download


Paper Citation


in Harvard Style

Vrancx P., De Hauwere Y. and Nowé A. (2011). TRANSFER LEARNING FOR MULTI-AGENT COORDINATION . In Proceedings of the 3rd International Conference on Agents and Artificial Intelligence - Volume 2: ICAART, ISBN 978-989-8425-41-6, pages 263-272. DOI: 10.5220/0003185602630272

in Bibtex Style

@conference{icaart11,
author={Peter Vrancx and Yann-Michaël De Hauwere and Ann Nowé},
title={TRANSFER LEARNING FOR MULTI-AGENT COORDINATION},
booktitle={Proceedings of the 3rd International Conference on Agents and Artificial Intelligence - Volume 2: ICAART,},
year={2011},
pages={263-272},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003185602630272},
isbn={978-989-8425-41-6},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 3rd International Conference on Agents and Artificial Intelligence - Volume 2: ICAART,
TI - TRANSFER LEARNING FOR MULTI-AGENT COORDINATION
SN - 978-989-8425-41-6
AU - Vrancx P.
AU - De Hauwere Y.
AU - Nowé A.
PY - 2011
SP - 263
EP - 272
DO - 10.5220/0003185602630272