achakraborty > reinforcement-learning | BibSonomy

bookmarks (hide)44
display
all
bookmarks only
bookmarks per page
5
10
20
50
100
sort by
added at
title
RSS
BibTeX
XML

3AI and Deep Learning in 2017 – A Year in Review – WildML
http://www.wildml.com/2017/12/ai-and-deep-learning-in-2017-a-year-in-review/
6 years ago by @achakraborty
show all tags
2017
article
artificial-intelligence
blog
deep-learning
reinforcement-learning
review
2017articleartificial-intelligenceblogdeep-learningreinforcement-learningreview
(0)
copydelete
- community post
- history of this post
1CMSC389F: Reinforcement Learning
This is CMSC389F, the University of Maryland's theoretical introduction to the art of reinforcement learning. An introductory course taught by Kevin Chen and Zack Khan, CMSC389F covers topics including markov decision processes, monte carlo methods, policy gradient methods, exploration, and application towards real environments in broad strokes .
6 years ago by @achakraborty
show all tags
course
lectures
reinforcement-learning
slides
umd
courselecturesreinforcement-learningslidesumd
(0)
copydelete
- community post
- history of this post
1Generative Temporal Models with Spatial Memory for Partially Observed Environments - Nurture.AI
In Model-based Reinforcement Learning, Generative And Temporal Models Of Environments Can Be Leveraged To Boost Agent Performance, Either By Tuning The Agent's Representations During Training Or Via Use As Part Of An Explicit Planning Mechanism. However, Their Application In Practice Has Been Limited To Simplistic Environments, Due To The Difficulty Of Training Such Models In Larger, Potentially Partially-observed And 3d Environments. In This Work We Introduce A Novel Action-conditioned Generative Model Of Such Challenging Environments. The Model Features A Non-parametric Spatial Memory System In Which We Store Learned, Disentangled Representations Of The Environment. Low-dimensional Spatial Updates Are Computed Using A State-space Model That Makes Use Of Knowledge On The Prior Dynamics Of The Moving Agent, And High-dimensional Visual Observations Are Modelled With A Variational Auto-encoder. The Result Is A Scalable Architecture Capable Of Performing Coherent Predictions Over Hundreds Of Time Steps Across A Range Of Partially Observed 2d And 3d Environments.
6 years ago by @achakraborty
show all tags
nurture.ai
paper
reinforcement-learning
nurture.aipaperreinforcement-learning
(0)
copydelete
- community post
- history of this post
2CS294 Deep Reinforcement Learning (Berkeley) - Fall 2017 - YouTube
https://www.youtube.com/playlist?list=PLkFD6_40KJIznC9CDbVTjAF2oyt8_VAe3
6 years ago by @achakraborty
show all tags
berkeley
course
deep-learning
playlist
reinforcement-learning
videos
youtube
berkeleycoursedeep-learningplaylistreinforcement-learningvideosyoutube
(0)
copydelete
- community post
- history of this post
1Article: Using OpenAI with ROS | The Construct
Anything you need to know about OpenAI with ROS. In this post we describe how to apply the OpenAI Gym to the control of a drone that runs with ROS.
6 years ago by @achakraborty
show all tags
article
artificial-intelligence
blog
reinforcement-learning
robotics
ros
articleartificial-intelligenceblogreinforcement-learningroboticsros
(0)
copydelete
- community post
- history of this post
1DeepMind papers at ICLR 2018 | DeepMind
https://deepmind.com/blog/deepmind-papers-iclr-2018/
6 years ago by @achakraborty
show all tags
2018
collection
conference
deep-learning
deepmind
iclr
paper
reinforcement-learning
robotics
2018collectionconferencedeep-learningdeepmindiclrpaperreinforcement-learningrobotics
(0)
copydelete
- community post
- history of this post
1Hallucinogenic Deep Reinforcement Learning using Python and Keras
A step-by-step guide to reproducing the World Models paper https://arxiv.org/pdf/1803.10122.pdf
7 years ago by @achakraborty
show all tags
article
blog
deep-learning
keras
python
reinforcement-learning
articleblogdeep-learningkeraspythonreinforcement-learning
(0)
copydelete
- community post
- history of this post
1Simple Reinforcement Learning with Tensorflow Part 8: Asynchronous Actor-Critic Agents (A3C)
https://medium.com/emergent-future/simple-reinforcement-learning-with-tensorflow-part-8-asynchronous-actor-critic-agents-a3c-c88f72a5e9f2
7 years ago by @achakraborty
show all tags
A2C
article
blog
reinforcement-learning
tensorflow
tutorial
A2Carticleblogreinforcement-learningtensorflowtutorial
(0)
copydelete
- community post
- history of this post
2Intuitive RL: Intro to Advantage-Actor-Critic (A2C)
https://hackernoon.com/intuitive-rl-intro-to-advantage-actor-critic-a2c-4ff545978752
7 years ago by @achakraborty
show all tags
A2C
article
blog
reinforcement-learning
tutorial
A2Carticleblogreinforcement-learningtutorial
(0)
copydelete
- community post
- history of this post
1The Mathematics of 2048: Optimal Play with Markov Decision Processes
http://jdlm.info/articles/2018/03/18/markov-decision-process-2048.html
7 years ago by @achakraborty
show all tags
article
blog
reinforcement-learning
articleblogreinforcement-learning
(0)
copydelete
- community post
- history of this post
1After Millions of Trials, These Simulated Humans Learned to Do Perfect Backflips and Cartwheels
https://gizmodo.com/after-millions-of-trials-these-simulated-humans-learne-1825156125
7 years ago by @achakraborty
show all tags
article
news
reinforcement-learning
simulation
articlenewsreinforcement-learningsimulation
(0)
copydelete
- community post
- history of this post
1Towards a Virtual Stuntman – The Berkeley Artificial Intelligence Research Blog
http://bair.berkeley.edu/blog/2018/04/10/virtual-stuntman/
7 years ago by @achakraborty
show all tags
animation
article
berkeley
blog
deep-learning
reinforcement-learning
research
robotics
animationarticleberkeleyblogdeep-learningreinforcement-learningresearchrobotics
(0)
copydelete
- community post
- history of this post
1Arxiv Insights - YouTube
Through my PhD on Deep Learning based robotics, I read a lot of papers on Machine Learning, Reinforcement Learning and AI in general. But papers can be a bit...
7 years ago by @achakraborty
show all tags
arxiv
deep-learning
lectures
playlist
reinforcement-learning
robotics
videos
youtube
arxivdeep-learninglecturesplaylistreinforcement-learningroboticsvideosyoutube
(0)
copydelete
- community post
- history of this post
1idsia | BibSonomy
https://www.bibsonomy.org/user/idsia
7 years ago by @achakraborty
show all tags
bibsonomy
neural-networks
profile
reinforcement-learning
robotics
bibsonomyneural-networksprofilereinforcement-learningrobotics
(0)
copydelete
- community post
- history of this post
1Shangtong | CV
https://shangtongzhang.github.io/
7 years ago by @achakraborty
show all tags
deep-learning
profile
reinforcement-learning
research
resume
deep-learningprofilereinforcement-learningresearchresume
(0)
copydelete
- community post
- history of this post
1Awesome-rl
Reinforcement learning resources curated
7 years ago by @achakraborty
show all tags
collection
reinforcement-learning
resources
collectionreinforcement-learningresources
(0)
copydelete
- community post
- history of this post
1Reinforcement Learning Introduction
Introduction to Reinforcement Learning, including a definition, analysis of the motivations and limitations of AI, and an overview of the technology along with its applications.
7 years ago by @achakraborty
show all tags
reinforcement-learning
resources
reinforcement-learningresources
(0)
copydelete
- community post
- history of this post
1Two Minute Papers - YouTube
Awesome research for everyone. Two new science videos every week. You'll love it! Our links: Web → https://cg.tuwien.ac.at/~zsolnai/
7 years ago by @achakraborty
show all tags
collection
deep-learning
graphics
machine-learning
paper
reinforcement-learning
research
tutorial
videos
youtube
collectiondeep-learninggraphicsmachine-learningpaperreinforcement-learningresearchtutorialvideosyoutube
(0)
copydelete
- community post
- history of this post
2Asynchronous methods for deep reinforcement learning | the morning paper
Asynchronous methods for deep reinforcement learning Mnih et al. ICML 2016 You know something interesting is going on when you see a scalability plot that looks like this: That’s a superlinear speedup as we increase the number of threads, giving a 24x performance improvement with 16 threads as compared to a single thread. The result…
7 years ago by @achakraborty
show all tags
deep-learning
reinforcement-learning
deep-learningreinforcement-learning
(0)
copydelete
- community post
- history of this post
4How to build your own AlphaZero AI using Python and Keras
The codebase contains a replica of the AlphaZero methodology, built in Python and Keras. Gain a deeper understanding of how AlphaZero works and adapt the code to plug in new games.
7 years ago by @achakraborty
show all tags
article
blog
deep-learning
reinforcement-learning
tutorial
articleblogdeep-learningreinforcement-learningtutorial
(0)
copydelete
- community post
- history of this post
1The Fido Project
An open-source machine learning library targeted towards embedded electronics and robotics.
7 years ago by @achakraborty
show all tags
c++
deep-learning
embedded
library
machine-learning
reinforcement-learning
robotics
c++deep-learningembeddedlibrarymachine-learningreinforcement-learningrobotics
(0)
copydelete
- community post
- history of this post
1Learning From Scratch by Thinking Fast and Slow with Deep Learning and Tree Search · David Barber
https://davidbarber.github.io/blog/2017/11/07/Learning-From-Scratch-by-Thinking-Fast-and-Slow-with-Deep-Learning-and-Tree-Search/
7 years ago by @achakraborty
show all tags
article
blog
deep-learning
reinforcement-learning
articleblogdeep-learningreinforcement-learning
(0)
copydelete
- community post
- history of this post
1CS 294 Deep Reinforcement Learning, Fall 2017
http://rll.berkeley.edu/deeprlcourse/
7 years ago by @achakraborty
show all tags
berkeley
course
deep-learning
reinforcement-learning
berkeleycoursedeep-learningreinforcement-learning
(0)
copydelete
- community post
- history of this post
3AlphaGo Zero: Learning from scratch | DeepMind
We introduce AlphaGo Zero, the latest evolution of AlphaGo, the first computer program to defeat a world champion at the ancient Chinese game of Go. Zero is even more powerful and is arguably the strongest Go player in history. Previous versions of AlphaGo initially trained on thousands of human amateur and professional games to learn how to play Go. AlphaGo Zero skips this step and learns to play simply by playing games against itself, starting from completely random play. In doing so, it quickly surpassed human level of play and defeated the previously published champion-defeating version of AlphaGo by 100 games to 0.
7 years ago by @achakraborty
show all tags
article
deepmind
google
reinforcement-learning
articledeepmindgooglereinforcement-learning
(0)
copydelete
- community post
- history of this post
1Reinforcement Learning Nptel - YouTube
https://www.youtube.com/playlist?list=PLuWx2S0SyaDctJtVKHhmjYACmHZ3nX9ew
7 years ago by @achakraborty
show all tags
course
iit
lectures
nptel
playlist
reinforcement-learning
youtube
courseiitlecturesnptelplaylistreinforcement-learningyoutube
(0)
copydelete
- community post
- history of this post
2Neural Information Processing Systems - Videos
https://www.facebook.com/pg/nipsfoundation/videos/
7 years ago by @achakraborty
show all tags
deep-learning
facebook
lectures
machine-learning
reinforcement-learning
videos
deep-learningfacebooklecturesmachine-learningreinforcement-learningvideos
(0)
copydelete
- community post
- history of this post
1MIT 6.S094: Deep Learning for Self-Driving Cars - YouTube - YouTube
These are lectures for course 6.S094: Deep Learning for Self-Driving Cars taught in Winter 2017. Course website: http://cars.mit.edu Contact: deepcars@mit.ed...
7 years ago by @achakraborty
show all tags
deep-learning
lectures
mit
motion-planning
playlist
reinforcement-learning
videos
youtube
deep-learninglecturesmitmotion-planningplaylistreinforcement-learningvideosyoutube
(0)
copydelete
- community post
- history of this post
1Flood Sung
http://www.floodsung.com/
7 years ago by @achakraborty
show all tags
blog
deep-learning
machine-learning
profile
reinforcement-learning
research
blogdeep-learningmachine-learningprofilereinforcement-learningresearch
(0)
copydelete
- community post
- history of this post
1RL Course by David Silver - YouTube
https://www.youtube.com/playlist?list=PLzuuYNsE1EZAXYR4FJ75jcJseBmo4KQ9-
7 years ago by @achakraborty
show all tags
course
lectures
playlist
reinforcement-learning
videos
youtube
courselecturesplaylistreinforcement-learningvideosyoutube
(0)
copydelete
- community post
- history of this post
1lukasw | BibSonomy
https://www.bibsonomy.org/user/lukasw
7 years ago by @achakraborty
show all tags
bibsonomy
deep-learning
profile
reinforcement-learning
research
bibsonomydeep-learningprofilereinforcement-learningresearch
(0)
copydelete
- community post
- history of this post
1Why AlphaGo Zero is a Quantum Leap Forward in Deep Learning
https://medium.com/intuitionmachine/the-strange-loop-in-alphago-zeros-self-play-6e3274fcdd9f
7 years ago by @achakraborty
show all tags
article
blog
deep-learning
google
reinforcement-learning
articleblogdeep-learninggooglereinforcement-learning
(0)
copydelete
- community post
- history of this post
1Reinforcement Learning: The quirks – Towards Data Science
I have been working on Reinforcement Learning for the past few months and all I can say about it: It is different. A writeup of the common quirks and frustrations of Reinforcement Learning I have…
7 years ago by @achakraborty
show all tags
article
blog
nvidia
reinforcement-learning
udacity
articleblognvidiareinforcement-learningudacity
(0)
copydelete
- community post
- history of this post
1Deep Reinforcement Learning | DeepMind
Humans excel at solving a wide variety of challenging problems, from low-level motor control through to high-level cognitive tasks. Our goal at DeepMind is to create artificial agents that can achieve a similar level of performance and generality. Like a human, our agents learn for themselves to achieve successful strategies that lead to the greatest long-term rewards.
7 years ago by @achakraborty
show all tags
article
blog
deep-learning
deepmind
google
reinforcement-learning
articleblogdeep-learningdeepmindgooglereinforcement-learning
(0)
copydelete
- community post
- history of this post
1[1611.01578] Neural Architecture Search with Reinforcement Learning
https://arxiv.org/abs/1611.01578
7 years ago by @achakraborty
show all tags
2016
neural-networks
reinforcement-learning
search
2016neural-networksreinforcement-learningsearch
(0)
copydelete
- community post
- history of this post
2Roboschool
https://blog.openai.com/roboschool/
7 years ago by @achakraborty
show all tags
artificial-intelligence
examples
openai
reinforcement-learning
robotics
simulation
artificial-intelligenceexamplesopenaireinforcement-learningroboticssimulation
(0)
copydelete
- community post
- history of this post
1Deep Reinforcement Learning: Pong from Pixels
Musings of a Computer Scientist.
7 years ago by @achakraborty
show all tags
article
blog
deep-learning
machine-learning
reinforcement-learning
tutorial
articleblogdeep-learningmachine-learningreinforcement-learningtutorial
(0)
copydelete
- community post
- history of this post
1OpenAI’s Goofy Sumo-Wrestling Bots Are Smarter Than They Look
It could be a virtual blood sport in some absurdist techno-future.
7 years ago by @achakraborty
show all tags
article
artificial-intelligence
news
openai
reinforcement-learning
articleartificial-intelligencenewsopenaireinforcement-learning
(0)
copydelete
- community post
- history of this post
1CMU 10703: Deep RL and Control
https://katefvision.github.io/
7 years ago by @achakraborty
show all tags
cmu
control-theory
controller
course
lectures
machine-learning
reinforcement-learning
slides
cmucontrol-theorycontrollercourselecturesmachine-learningreinforcement-learningslides
(0)
copydelete
- community post
- history of this post
2DeepMind Open Source – Datasets | DeepMind
https://deepmind.com/research/open-source/open-source-datasets/
7 years ago by @achakraborty
show all tags
dataset
deep-learning
google
machine-learning
reinforcement-learning
datasetdeep-learninggooglemachine-learningreinforcement-learning
(0)
copydelete
- community post
- history of this post
1Deep Reinforcement Learning, Decision Making, a... - compute - Quora
https://compute.quora.com/Deep-Reinforcement-Learning-Decision-Making-and-Control
7 years ago by @achakraborty
show all tags
article
blog
controller
deep-learning
machine-learning
quora
reinforcement-learning
articleblogcontrollerdeep-learningmachine-learningquorareinforcement-learning
(0)
copydelete
- community post
- history of this post
1OpenAI Gym
https://gym.openai.com/read-only.html
7 years ago by @achakraborty
show all tags
examples
opensource
reinforcement-learning
examplesopensourcereinforcement-learning
(0)
copydelete
- community post
- history of this post
2UCL Course on RL
http://www0.cs.ucl.ac.uk/staff/d.silver/web/Teaching.html
7 years ago by @achakraborty
show all tags
2015
course
reinforcement-learning
resources
slides
2015coursereinforcement-learningresourcesslides
(0)
copydelete
- community post
- history of this post
2Learning Reinforcement Learning (with Code, Exercises and Solutions) – WildML
http://www.wildml.com/2016/10/learning-reinforcement-learning/
7 years ago by @achakraborty
show all tags
course
github
machine-learning
programming
reinforcement-learning
repository
coursegithubmachine-learningprogrammingreinforcement-learningrepository
(0)
copydelete
- community post
- history of this post
1MvdP Projects & Publications
http://www.cs.ubc.ca/~van/papers/
7 years ago by @achakraborty
show all tags
controller
design
graphics
paragc
publications
reinforcement-learning
research
controllerdesigngraphicsparagcpublicationsreinforcement-learningresearch
(0)
copydelete
- community post
- history of this post

⟨⟨
⟨
1
⟩
⟩⟩

publications (hide)23
display
all
publications only
publications per page
5
10
20
50
100
sort by
added at
title
author
publication date
entry type
help for advanced sorting...
RSS
BibTeX
RDF
more...

3A Survey on Policy Search for Robotics
M. Deisenroth$^*$, G. Neumann$^*$, and J. Peters. Foundations and Trends in Robotics, (2013)
6 years ago by @achakraborty
show all tags
2013
book
reinforcement-learning
robotics
search
survey
2013bookreinforcement-learningroboticssearchsurvey
(0)
copydeleteadd this publication to your clipboard
1Solving the Rubik's Cube Without Human Knowledge
S. McAleer, F. Agostinelli, A. Shmakov, and P. Baldi. (2018)cite arxiv:1805.07470Comment: First three authors contributed equally. Submitted to NIPS 2018.
6 years ago by @achakraborty
show all tags
2018
arxiv
games
paper
reinforcement-learning
2018arxivgamespaperreinforcement-learning
(0)
copydeleteadd this publication to your clipboard
8Mastering the game of Go with deep neural networks and tree search
D. Silver, A. Huang, C. Maddison, A. Guez, L. Sifre, G. van den Driessche, J. Schrittwieser, I. Antonoglou, V. Panneershelvam, M. Lanctot and 10 other author(s). Nature, (January 2016)
6 years ago by @achakraborty
show all tags
2016
article
deep-learning
game
go
google
nature
reinforcement-learning
2016articledeep-learninggamegogooglenaturereinforcement-learning
(0)
copydeleteadd this publication to your clipboard
1Teaching Deep Convolutional Neural Networks to Play Go
C. Clark, and A. Storkey. (2014)cite arxiv:1412.3409Comment: 9 pages, 8 figures, 5 tables. Corrected typos, minor adjustment to table format.
6 years ago by @achakraborty
show all tags
2014
arxiv
cnn
deep-learning
game
go
reinforcement-learning
2014arxivcnndeep-learninggamegoreinforcement-learning
(0)
copydeleteadd this publication to your clipboard
2Reinforcement learning in robotics: A survey
J. Kober, J. Bagnell, and J. Peters. The International Journal of Robotics Research, 32 (11): 1238--1274 (August 2013)
7 years ago by @achakraborty
show all tags
2013
journal
reinforcement-learning
robotics
survey
2013journalreinforcement-learningroboticssurvey
(0)
copydeleteadd this publication to your clipboard
3World Models
D. Ha, and J. Schmidhuber. (2018)cite arxiv:1803.10122.
7 years ago by @achakraborty
show all tags
2018
arxiv
deep-learning
reinforcement-learning
2018arxivdeep-learningreinforcement-learning
(0)
copydeleteadd this publication to your clipboard
4Deep Reinforcement Learning: An Overview
Y. Li. (2017)cite arxiv:1701.07274.
7 years ago by @achakraborty
show all tags
2017
arxiv
deep-learning
paper
reinforcement-learning
review
2017arxivdeep-learningpaperreinforcement-learningreview
(0)
copydeleteadd this publication to your clipboard
3Reinforcement and Imitation Learning for Diverse Visuomotor Skills
Y. Zhu, Z. Wang, J. Merel, A. Rusu, T. Erez, S. Cabi, S. Tunyasuvunakool, J. Kramár, R. Hadsell, N. de Freitas and 1 other author(s). (2018)cite arxiv:1802.09564Comment: 13 pages, 6 figures.
7 years ago by @achakraborty
show all tags
2018
arxiv
deepmind
imitation-learning
reinforcement-learning
robotics
stanford
2018arxivdeepmindimitation-learningreinforcement-learningroboticsstanford
(0)
copydeleteadd this publication to your clipboard
1Truncated Horizon Policy Search: Combining Reinforcement Learning and Imitation Learning
W. Sun, J. Bagnell, and B. Boots. International Conference on Learning Representations, (2018)
7 years ago by @achakraborty
show all tags
2018
iclr
imitation-learning
paper
reinforcement-learning
2018iclrimitation-learningpaperreinforcement-learning
(0)
copydeleteadd this publication to your clipboard
3A survey of robot learning from demonstration
B. Argall, S. Chernova, M. Veloso, and B. Browning. Robotics and Autonomous Systems, 57 (5): 469 - 483 (2009)
7 years ago by @achakraborty
show all tags
2009
reinforcement-learning
robotics
survey
2009reinforcement-learningroboticssurvey
(0)
copydeleteadd this publication to your clipboard
3Learning Synergies between Pushing and Grasping with Self-supervised Deep Reinforcement Learning
A. Zeng, S. Song, S. Welker, J. Lee, A. Rodriguez, and T. Funkhouser. (2018)cite arxiv:1803.09956Comment: Under review at the International Conference On Intelligent Robots and Systems (IROS) 2018. Project webpage: http://vpg.cs.princeton.edu.
7 years ago by @achakraborty
show all tags
2018
arxiv
paper
reinforcement-learning
research
robot-arm
robotics
2018arxivpaperreinforcement-learningresearchrobot-armrobotics
(0)
copydeleteadd this publication to your clipboard
2Constructing Temporal Abstractions Autonomously in Reinforcement Learning
P. Bacon, and D. Precup. AI Magazine, 39 (1): 39--50 (March 2018)
7 years ago by @achakraborty
show all tags
2018
paper
reinforcement-learning
temporal
2018paperreinforcement-learningtemporal
(0)
copydeleteadd this publication to your clipboard
2OpenAI Gym
G. Brockman, V. Cheung, L. Pettersson, J. Schneider, J. Schulman, J. Tang, and W. Zaremba. (2016)cite arxiv:1606.01540.
7 years ago by @achakraborty
show all tags
2016
arxiv
paper
reinforcement-learning
2016arxivpaperreinforcement-learning
(0)
copydeleteadd this publication to your clipboard
4DeepMimic: Example-Guided Deep Reinforcement Learning of Physics-Based Character Skills
X. Peng, P. Abbeel, S. Levine, and M. van de Panne. (2018)cite arxiv:1804.02717.
7 years ago by @achakraborty
show all tags
2018
arxiv
berkeley
deep-learning
graphics
reinforcement-learning
robotics
siggraph
2018arxivberkeleydeep-learninggraphicsreinforcement-learningroboticssiggraph
(0)
copydeleteadd this publication to your clipboard
4Deep Reinforcement Learning for Vision-Based Robotic Grasping: A Simulated Comparative Evaluation of Off-Policy Methods
D. Quillen, E. Jang, O. Nachum, C. Finn, J. Ibarz, and S. Levine. (2018)cite arxiv:1802.10264Comment: 8 pages.
7 years ago by @achakraborty
show all tags
2018
arxiv
deep-learning
grasp
reinforcement-learning
robot-arm
robotics
2018arxivdeep-learninggraspreinforcement-learningrobot-armrobotics
(0)
copydeleteadd this publication to your clipboard
4Deep Reinforcement Learning that Matters
P. Henderson, R. Islam, P. Bachman, J. Pineau, D. Precup, and D. Meger. (2017)cite arxiv:1709.06560Comment: Accepted to the Thirthy-Second AAAI Conference On Artificial Intelligence (AAAI), 2018.
7 years ago by @achakraborty
show all tags
2017
arxiv
deep-learning
reinforcement-learning
2017arxivdeep-learningreinforcement-learning
(0)
copydeleteadd this publication to your clipboard
1Neural Task Programming: Learning to Generalize Across Hierarchical Tasks
D. Xu, S. Nair, Y. Zhu, J. Gao, A. Garg, L. Fei-Fei, and S. Savarese. (2017)cite arxiv:1710.01813.
7 years ago by @achakraborty
show all tags
2018
arxiv
machine-learning
paper
reinforcement-learning
robotics
2018arxivmachine-learningpaperreinforcement-learningrobotics
(0)
copydeleteadd this publication to your clipboard
4Neural Optimizer Search with Reinforcement Learning
I. Bello, B. Zoph, V. Vasudevan, and Q. Le. (2017)cite arxiv:1709.07417Comment: ICML 2017 Conference paper.
7 years ago by @achakraborty
show all tags
2017
arxiv
deep-learning
optimization
reinforcement-learning
2017arxivdeep-learningoptimizationreinforcement-learning
(0)
copydeleteadd this publication to your clipboard
4Mastering the game of Go without human knowledge
D. Silver, J. Schrittwieser, K. Simonyan, I. Antonoglou, A. Huang, A. Guez, T. Hubert, L. Baker, M. Lai, A. Bolton and 7 other author(s). Nature, (October 2017)
7 years ago by @achakraborty
show all tags
2017
deep-learning
deepmind
google
paper
reinforcement-learning
2017deep-learningdeepmindgooglepaperreinforcement-learning
(0)
copydeleteadd this publication to your clipboard
2Autonomous Agents Modelling Other Agents: A Comprehensive Survey and Open Problems
S. Albrecht, and P. Stone. (2017)cite arxiv:1709.08071Comment: 42 pages, submitted for review to Artificial Intelligence Journal. Keywords: multiagent systems, agent modelling, opponent modelling, survey, open problems.
7 years ago by @achakraborty
show all tags
2017
artificial-intelligence
arxiv
collection
paper
problem
reinforcement-learning
research
survey
2017artificial-intelligencearxivcollectionpaperproblemreinforcement-learningresearchsurvey
(0)
copydeleteadd this publication to your clipboard

⟨⟨
⟨
1
2
⟩
⟩⟩