A generally intelligent agent must be able to teach itself how to solve
problems in complex domains with minimal human supervision. Recently, deep
reinforcement learning algorithms combined with self-play have achieved
superhuman proficiency in Go, Chess, and Shogi without human data or domain
knowledge. In these environments, a reward is always received at the end of the
game, however, for many combinatorial optimization environments, rewards are
sparse and episodes are not guaranteed to terminate. We introduce Autodidactic
Iteration: a novel reinforcement learning algorithm that is able to teach
itself how to solve the Rubik's Cube with no human assistance. Our algorithm is
able to solve 100% of randomly scrambled cubes while achieving a median solve
length of 30 moves -- less than or equal to solvers that employ human domain
knowledge.
Description
[1805.07470] Solving the Rubik's Cube Without Human Knowledge
%0 Generic
%1 mcaleer2018solving
%A McAleer, Stephen
%A Agostinelli, Forest
%A Shmakov, Alexander
%A Baldi, Pierre
%D 2018
%K 2018 arxiv games paper reinforcement-learning
%T Solving the Rubik's Cube Without Human Knowledge
%U http://arxiv.org/abs/1805.07470
%X A generally intelligent agent must be able to teach itself how to solve
problems in complex domains with minimal human supervision. Recently, deep
reinforcement learning algorithms combined with self-play have achieved
superhuman proficiency in Go, Chess, and Shogi without human data or domain
knowledge. In these environments, a reward is always received at the end of the
game, however, for many combinatorial optimization environments, rewards are
sparse and episodes are not guaranteed to terminate. We introduce Autodidactic
Iteration: a novel reinforcement learning algorithm that is able to teach
itself how to solve the Rubik's Cube with no human assistance. Our algorithm is
able to solve 100% of randomly scrambled cubes while achieving a median solve
length of 30 moves -- less than or equal to solvers that employ human domain
knowledge.
@misc{mcaleer2018solving,
  abstract      = {A generally intelligent agent must be able to teach itself how to solve
problems in complex domains with minimal human supervision. Recently, deep
reinforcement learning algorithms combined with self-play have achieved
superhuman proficiency in Go, Chess, and Shogi without human data or domain
knowledge. In these environments, a reward is always received at the end of the
game, however, for many combinatorial optimization environments, rewards are
sparse and episodes are not guaranteed to terminate. We introduce Autodidactic
Iteration: a novel reinforcement learning algorithm that is able to teach
itself how to solve the Rubik's Cube with no human assistance. Our algorithm is
able to solve 100% of randomly scrambled cubes while achieving a median solve
length of 30 moves -- less than or equal to solvers that employ human domain
knowledge.},
  added-at      = {2018-06-17T20:23:21.000+0200},
  archiveprefix = {arXiv},
  author        = {McAleer, Stephen and Agostinelli, Forest and Shmakov, Alexander and Baldi, Pierre},
  biburl        = {https://www.bibsonomy.org/bibtex/246608c1bc89217deb14c3340ce0447bf/achakraborty},
  description   = {[1805.07470] Solving the Rubik's Cube Without Human Knowledge},
  eprint        = {1805.07470},
  interhash     = {965666151bbd287f4d059b3127ece38a},
  intrahash     = {46608c1bc89217deb14c3340ce0447bf},
  keywords      = {2018 arxiv games paper reinforcement-learning},
  note          = {First three authors contributed equally. Submitted to NIPS 2018},
  timestamp     = {2018-06-17T20:23:21.000+0200},
  title         = {Solving the {Rubik's Cube} Without Human Knowledge},
  url           = {https://arxiv.org/abs/1805.07470},
  year          = {2018}
}