ON TEMPORAL DIFFERENCE ALGORITHMS FOR CONTINUOUS SYSTEMS

Alexandre Donzé

2005

Abstract

This article proposes a general, intuitive, and rigorous framework for designing temporal difference algorithms to solve optimal control problems in continuous time and space. Within this framework, we derive a version of the classical TD(λ) algorithm as well as a new TD algorithm that is similar but designed to be more accurate and to converge as fast as TD(λ) does for the best values of λ, without the burden of finding these values.

Paper Citation


in Harvard Style

Donzé A. (2005). ON TEMPORAL DIFFERENCE ALGORITHMS FOR CONTINUOUS SYSTEMS. In Proceedings of the Second International Conference on Informatics in Control, Automation and Robotics - Volume 1: ICINCO, ISBN 972-8865-29-5, pages 55-62. DOI: 10.5220/0001183700550062

in Bibtex Style

@conference{icinco05,
author={Alexandre Donzé},
title={ON TEMPORAL DIFFERENCE ALGORITHMS FOR CONTINUOUS SYSTEMS},
booktitle={Proceedings of the Second International Conference on Informatics in Control, Automation and Robotics - Volume 1: ICINCO},
year={2005},
pages={55-62},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001183700550062},
isbn={972-8865-29-5},
}


in EndNote Style

TY - CONF
JO - Proceedings of the Second International Conference on Informatics in Control, Automation and Robotics - Volume 1: ICINCO
TI - ON TEMPORAL DIFFERENCE ALGORITHMS FOR CONTINUOUS SYSTEMS
SN - 972-8865-29-5
AU - Donzé A.
PY - 2005
SP - 55
EP - 62
DO - 10.5220/0001183700550062