achakraborty > reinforcement-learning article

bookmarks (hide)17
display
all
bookmarks only
bookmarks per page
5
10
20
50
100
sort by
added at
title
RSS
BibTeX
XML

3AI and Deep Learning in 2017 – A Year in Review – WildML
http://www.wildml.com/2017/12/ai-and-deep-learning-in-2017-a-year-in-review/
6 years ago by @achakraborty
show all tags
2017
article
artificial-intelligence
blog
deep-learning
reinforcement-learning
review
2017articleartificial-intelligenceblogdeep-learningreinforcement-learningreview
(0)
copydelete
- community post
- history of this post
1Article: Using OpenAI with ROS | The Construct
Anything you need to know about OpenAI with ROS. In this post we describe how to apply the OpenAI Gym to the control of a drone that runs with ROS.
6 years ago by @achakraborty
show all tags
article
artificial-intelligence
blog
reinforcement-learning
robotics
ros
articleartificial-intelligenceblogreinforcement-learningroboticsros
(0)
copydelete
- community post
- history of this post
1Hallucinogenic Deep Reinforcement Learning using Python and Keras
A step-by-step guide to reproducing the World Models paper https://arxiv.org/pdf/1803.10122.pdf
6 years ago by @achakraborty
show all tags
article
blog
deep-learning
keras
python
reinforcement-learning
articleblogdeep-learningkeraspythonreinforcement-learning
(0)
copydelete
- community post
- history of this post
1Simple Reinforcement Learning with Tensorflow Part 8: Asynchronous Actor-Critic Agents (A3C)
https://medium.com/emergent-future/simple-reinforcement-learning-with-tensorflow-part-8-asynchronous-actor-critic-agents-a3c-c88f72a5e9f2
6 years ago by @achakraborty
show all tags
A2C
article
blog
reinforcement-learning
tensorflow
tutorial
A2Carticleblogreinforcement-learningtensorflowtutorial
(0)
copydelete
- community post
- history of this post
2Intuitive RL: Intro to Advantage-Actor-Critic (A2C)
https://hackernoon.com/intuitive-rl-intro-to-advantage-actor-critic-a2c-4ff545978752
6 years ago by @achakraborty
show all tags
A2C
article
blog
reinforcement-learning
tutorial
A2Carticleblogreinforcement-learningtutorial
(0)
copydelete
- community post
- history of this post
1The Mathematics of 2048: Optimal Play with Markov Decision Processes
http://jdlm.info/articles/2018/03/18/markov-decision-process-2048.html
7 years ago by @achakraborty
show all tags
article
blog
reinforcement-learning
articleblogreinforcement-learning
(0)
copydelete
- community post
- history of this post
1After Millions of Trials, These Simulated Humans Learned to Do Perfect Backflips and Cartwheels
https://gizmodo.com/after-millions-of-trials-these-simulated-humans-learne-1825156125
7 years ago by @achakraborty
show all tags
article
news
reinforcement-learning
simulation
articlenewsreinforcement-learningsimulation
(0)
copydelete
- community post
- history of this post
1Towards a Virtual Stuntman – The Berkeley Artificial Intelligence Research Blog
http://bair.berkeley.edu/blog/2018/04/10/virtual-stuntman/
7 years ago by @achakraborty
show all tags
animation
article
berkeley
blog
deep-learning
reinforcement-learning
research
robotics
animationarticleberkeleyblogdeep-learningreinforcement-learningresearchrobotics
(0)
copydelete
- community post
- history of this post
4How to build your own AlphaZero AI using Python and Keras
The codebase contains a replica of the AlphaZero methodology, built in Python and Keras. Gain a deeper understanding of how AlphaZero works and adapt the code to plug in new games.
7 years ago by @achakraborty
show all tags
article
blog
deep-learning
reinforcement-learning
tutorial
articleblogdeep-learningreinforcement-learningtutorial
(0)
copydelete
- community post
- history of this post
1Learning From Scratch by Thinking Fast and Slow with Deep Learning and Tree Search · David Barber
https://davidbarber.github.io/blog/2017/11/07/Learning-From-Scratch-by-Thinking-Fast-and-Slow-with-Deep-Learning-and-Tree-Search/
7 years ago by @achakraborty
show all tags
article
blog
deep-learning
reinforcement-learning
articleblogdeep-learningreinforcement-learning
(0)
copydelete
- community post
- history of this post
3AlphaGo Zero: Learning from scratch | DeepMind
We introduce AlphaGo Zero, the latest evolution of AlphaGo, the first computer program to defeat a world champion at the ancient Chinese game of Go. Zero is even more powerful and is arguably the strongest Go player in history. Previous versions of AlphaGo initially trained on thousands of human amateur and professional games to learn how to play Go. AlphaGo Zero skips this step and learns to play simply by playing games against itself, starting from completely random play. In doing so, it quickly surpassed human level of play and defeated the previously published champion-defeating version of AlphaGo by 100 games to 0.
7 years ago by @achakraborty
show all tags
article
deepmind
google
reinforcement-learning
articledeepmindgooglereinforcement-learning
(0)
copydelete
- community post
- history of this post
1Why AlphaGo Zero is a Quantum Leap Forward in Deep Learning
https://medium.com/intuitionmachine/the-strange-loop-in-alphago-zeros-self-play-6e3274fcdd9f
7 years ago by @achakraborty
show all tags
article
blog
deep-learning
google
reinforcement-learning
articleblogdeep-learninggooglereinforcement-learning
(0)
copydelete
- community post
- history of this post
1Reinforcement Learning: The quirks – Towards Data Science
I have been working on Reinforcement Learning for the past few months and all I can say about it: It is different. A writeup of the common quirks and frustrations of Reinforcement Learning I have…
7 years ago by @achakraborty
show all tags
article
blog
nvidia
reinforcement-learning
udacity
articleblognvidiareinforcement-learningudacity
(0)
copydelete
- community post
- history of this post
1Deep Reinforcement Learning | DeepMind
Humans excel at solving a wide variety of challenging problems, from low-level motor control through to high-level cognitive tasks. Our goal at DeepMind is to create artificial agents that can achieve a similar level of performance and generality. Like a human, our agents learn for themselves to achieve successful strategies that lead to the greatest long-term rewards.
7 years ago by @achakraborty
show all tags
article
blog
deep-learning
deepmind
google
reinforcement-learning
articleblogdeep-learningdeepmindgooglereinforcement-learning
(0)
copydelete
- community post
- history of this post
1Deep Reinforcement Learning: Pong from Pixels
Musings of a Computer Scientist.
7 years ago by @achakraborty
show all tags
article
blog
deep-learning
machine-learning
reinforcement-learning
tutorial
articleblogdeep-learningmachine-learningreinforcement-learningtutorial
(0)
copydelete
- community post
- history of this post
1OpenAI’s Goofy Sumo-Wrestling Bots Are Smarter Than They Look
It could be a virtual blood sport in some absurdist techno-future.
7 years ago by @achakraborty
show all tags
article
artificial-intelligence
news
openai
reinforcement-learning
articleartificial-intelligencenewsopenaireinforcement-learning
(0)
copydelete
- community post
- history of this post
1Deep Reinforcement Learning, Decision Making, a... - compute - Quora
https://compute.quora.com/Deep-Reinforcement-Learning-Decision-Making-and-Control
7 years ago by @achakraborty
show all tags
article
blog
controller
deep-learning
machine-learning
quora
reinforcement-learning
articleblogcontrollerdeep-learningmachine-learningquorareinforcement-learning
(0)
copydelete
- community post
- history of this post

⟨⟨
⟨
1
⟩
⟩⟩

publications (hide)1
display
all
publications only
publications per page
5
10
20
50
100
sort by
added at
title
author
publication date
entry type
help for advanced sorting...
RSS
BibTeX
RDF
more...

8Mastering the game of Go with deep neural networks and tree search
D. Silver, A. Huang, C. Maddison, A. Guez, L. Sifre, G. van den Driessche, J. Schrittwieser, I. Antonoglou, V. Panneershelvam, M. Lanctot and 10 other author(s). Nature, (January 2016)
6 years ago by @achakraborty
show all tags
2016
article
deep-learning
game
go
google
nature
reinforcement-learning
2016articledeep-learninggamegogooglenaturereinforcement-learning
(0)
copydeleteadd this publication to your clipboard

⟨⟨
⟨
1
⟩
⟩⟩

BibSonomy

bookmarks (hide)17
display
all
bookmarks only
bookmarks per page
5
10
20
50
100
sort by
added at
title
RSS
BibTeX
XML

3AI and Deep Learning in 2017 – A Year in Review – WildML

1Article: Using OpenAI with ROS | The Construct

1Hallucinogenic Deep Reinforcement Learning using Python and Keras

1Simple Reinforcement Learning with Tensorflow Part 8: Asynchronous Actor-Critic Agents (A3C)

2Intuitive RL: Intro to Advantage-Actor-Critic (A2C)

1The Mathematics of 2048: Optimal Play with Markov Decision Processes

1After Millions of Trials, These Simulated Humans Learned to Do Perfect Backflips and Cartwheels

1Towards a Virtual Stuntman – The Berkeley Artificial Intelligence Research Blog

4How to build your own AlphaZero AI using Python and Keras

1Learning From Scratch by Thinking Fast and Slow with Deep Learning and Tree Search · David Barber

3AlphaGo Zero: Learning from scratch | DeepMind

1Why AlphaGo Zero is a Quantum Leap Forward in Deep Learning

1Reinforcement Learning: The quirks – Towards Data Science

1Deep Reinforcement Learning | DeepMind

1Deep Reinforcement Learning: Pong from Pixels

1OpenAI’s Goofy Sumo-Wrestling Bots Are Smarter Than They Look

1Deep Reinforcement Learning, Decision Making, a... - compute - Quora

publications (hide)1
display
all
publications only
publications per page
5
10
20
50
100
sort by
added at
title
author
publication date
entry type
help for advanced sorting...
RSS
BibTeX
RDF
more...

8Mastering the game of Go with deep neural networks and tree search

browse

related tags

concepts

tags

bookmarks (hide)17 displayallbookmarks onlybookmarks per page5102050100 sort byadded attitle RSSBibTeXXML

publications (hide)1 displayallpublications onlypublications per page5102050100 sort byadded attitleauthorpublication dateentry typehelp for advanced sorting... RSSBibTeXRDFmore...

browse

related tags

tags

bookmarks (hide)17
display
all
bookmarks only
bookmarks per page
5
10
20
50
100
sort by
added at
title
RSS
BibTeX
XML

publications (hide)1
display
all
publications only
publications per page
5
10
20
50
100
sort by
added at
title
author
publication date
entry type
help for advanced sorting...
RSS
BibTeX
RDF
more...