BibSonomy
::
author
::
tag
user
group
author
concept
BibTeX key
search:all
A blue social bookmark and publication sharing system.
tags
·
relations
·
groups
·
popular
help
·
blog
·
about
username:
password:
myFriends
myRelations
mySearch
myPDF
myDuplicates
myBibTeX
login
·
register
bookmarks
publications
(12)
previous | 1
2
|
next
Hierarchical Policy Gradient Algorithms
M.
Ghavamzadeh
and Sridhar
Mahadevan
Proceedings of the Twentieth Conference on Machine Learning (ICML-2003)
(2003)
to
daanbib
by
idsia
and
2 other people
on 2008-03-11 14:52:34
|
BibTeX
Hierarchical Policy Gradient Algorithms
M.
Ghavamzadeh
and Sridhar
Mahadevan
Proceedings of the Twentieth Conference on Machine Learning (ICML-2003)
(2003)
to
daanbib
by
schaul
and
2 other people
on 2008-02-26 12:05:08
|
BibTeX
Bayesian Policy Gradient Algorithms.
Mohammad
Ghavamzadeh
and Yaakov
Engel
NIPS
457-464 (2006)
to
dblp
by
dblp
on 2007-10-25 00:00:00
|
URL
|
BibTeX
Bayesian actor-critic algorithms.
Mohammad
Ghavamzadeh
and Yaakov
Engel
ICML
297-304 (2007)
to
dblp
by
dblp
on 2007-10-22 00:00:00
|
URL
|
BibTeX
The Workshop Program at the Nineteenth National Conference on Artificial Intelligence.
Ion
Muslea
and Virginia
Dignum
and Daniel D.
Corkill
and Catholijn M.
Jonker
and Frank
Dignum
and Silvia
Coradeschi
and Alessandro
Saffiotti
and Dan
Fu
and Jeff
Orkin
and William
Cheetham
and Kai
Goebel
and Piero P.
Bonissone
and Leen-Kiat
Soh
and Randolph M.
Jones
and Robert E.
Wray III
and Matthias
Scheutz
and Daniela Pucci de
Farias
and Shie
Mannor
and Georgios
Theocharous
and Doina
Precup
and Bamshad
Mobasher
and Sarabjot S.
Anand
and Bettina
Berendt
and Andreas
Hotho
and Hans W.
Guesgen
and Michael T.
Rosenstein
and Mohammad
Ghavamzadeh
AI Magazine
26
103-108 (2005)
to
dblp
by
dblp
on 2007-09-05 00:00:00
|
URL
|
BibTeX
Hierarchical multi-agent reinforcement learning.
Mohammad
Ghavamzadeh
and Sridhar
Mahadevan
and Rajbala
Makar
Autonomous Agents and Multi-Agent Systems
13
197-229 (2006)
to
dblp
by
dblp
on 2007-02-07 00:00:00
|
URL
|
BibTeX
Learning to Communicate and Act Using Hierarchical Reinforcement Learning.
Mohammad
Ghavamzadeh
and Sridhar
Mahadevan
AAMAS
1114-1121 (2004)
to
dblp
by
dblp
on 2004-10-13 00:00:00
|
URL
|
BibTeX
Hierarchical Policy Gradient Algorithms.
Mohammad
Ghavamzadeh
and Sridhar
Mahadevan
ICML
226-233 (2003)
to
dblp
by
dblp
and
2 other people
on 2003-09-22 00:00:00
|
URL
|
BibTeX
Hierarchical multi-agent reinforcement learning.
Rajbala
Makar
and Sridhar
Mahadevan
and Mohammad
Ghavamzadeh
Agents
246-253 (2001)
to
dblp
by
dblp
on 2002-12-09 00:00:00
|
URL
|
BibTeX
Continuous-Time Hierarchical Reinforcement Learning.
Mohammad
Ghavamzadeh
and Sridhar
Mahadevan
ICML
186-193 (2001)
to
dblp
by
dblp
on 2002-11-27 00:00:00
|
URL
|
BibTeX
previous | 1
2
|
next
Showing 10 items per page. Show
10
,
25
,
50
,
100
items per page.
tags
daanbib
dblp