Аннотация

The search for finite-state controllers for partially observable Markov decision processes (POMDPs) is often based on approaches like gradient ascent, attractive because of their relatively low computational cost. In this paper, we illustrate a basic problem with gradient-based methods applied to POMDPs, where the sequential nature of the decision problem is at issue, and propose a new stochastic local search method as an alternative.

Описание

CiteULike

Линки и ресурсы

тэги

сообщество

  • @darius
  • @dblp
@darius- тэги данного пользователя выделены