News
A deep reinforcement learning algorithm can solve the Rubik's Cube puzzle in a fraction of a second. The work is a step toward making AI systems that can think, reason, plan and make decisions.
At a high level, reinforcement learning follows the insight derived from Pavlov’s dogs: it’s possible to teach an agent to master complex, novel tasks through only positive and negative feedback.
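To make the feedback idea concrete, here is a minimal sketch of tabular Q-learning, a classic reinforcement learning method and far simpler than the deep learning approach used on the Rubik's Cube. Everything in it is an illustrative assumption rather than anything taken from the research: the five-cell corridor, the reward values, and the names `step`, `train` and `greedy_policy` are all hypothetical. The agent starts in the middle cell, receives +1 for reaching the right end and -1 for falling off the left end, and is told nothing else.

```python
import random

# Hypothetical toy world: a corridor of cells 0..4.
N_STATES = 5
PIT, GOAL, START = 0, 4, 2
LEFT, RIGHT = 0, 1


def step(state, action):
    """Move one cell and return (next_state, reward, done)."""
    nxt = state - 1 if action == LEFT else state + 1
    if nxt == GOAL:
        return nxt, 1.0, True    # positive feedback
    if nxt == PIT:
        return nxt, -1.0, True   # negative feedback
    return nxt, 0.0, False       # no feedback on ordinary moves


def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Learn action values from reward signals alone (tabular Q-learning)."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state = START
        for _ in range(100):  # cap episode length
            # Explore at random sometimes, and whenever the two values tie.
            if rng.random() < epsilon or q[state][LEFT] == q[state][RIGHT]:
                action = rng.choice((LEFT, RIGHT))
            else:
                action = LEFT if q[state][LEFT] > q[state][RIGHT] else RIGHT
            nxt, reward, done = step(state, action)
            # Nudge the estimate toward reward plus discounted future value.
            target = reward if done else reward + gamma * max(q[nxt])
            q[state][action] += alpha * (target - q[state][action])
            state = nxt
            if done:
                break
    return q


def greedy_policy(q):
    """Best-known action for each non-terminal cell."""
    return {
        s: "right" if q[s][RIGHT] > q[s][LEFT] else "left"
        for s in range(1, N_STATES - 1)
    }


if __name__ == "__main__":
    print(greedy_policy(train()))
```

The agent is never told which direction is correct. It only sees the reward after each move, and over repeated episodes the value estimates come to favor the moves that lead toward the goal.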