Article

Gradient descent methods in learning classifier systems: Improving XCS performance in multistep problems

M. Butz, D. Goldberg, and P. Lanzi.
IEEE Transactions on Evolutionary Computation, (2005)

Abstract

The accuracy-based XCS classifier system has been shown to solve typical data mining problems in a machine-learning competitive way. However, successful applications in multistep problems, modeled by a Markov decision process, were restricted to very small problems. Until now, the temporal difference learning technique in XCS was based on deterministic updates. However, since a prediction is actually generated by a set of rules in XCS and Learning Classifier Systems in general, gradient-based update methods are applicable. The extension of XCS to gradient-based update methods results in a classifier system that is more robust and more parameter independent, solving large and difficult maze problems reliably. Additionally, the extension to gradient methods highlights the relation of XCS to other function approximation methods in reinforcement learning.

BibTeX key: Butz:2005
entry type: article
year: 2005
journal: IEEE Transactions on Evolutionary Computation
pages: 452-473
volume: 9
timestamp: 2009.10.06
owner: butz
Document: http://www.coboslab.psychologie.uni-wuerzburg.de/fileadmin/ext00209/user_upload/Publications/2005/ButzGoldbergLanzi2005GradientDescentMethods.pdf
