Previously, DeepMind has used reinforcement learning to teach programs to master various games such as the Chinese board game ‘Go,’ the Japanese strategy game ‘Shogi,’ chess, and challenging Atari video games, where earlier AI programs were taught the rules first during training.
DeepMind has introduced MuZero, an algorithm that (by combining a tree-based search with a learned model) achieves superhuman performance in several challenging and visually complex domains, without knowing their underlying dynamics. MuZero learns a model that, when applied iteratively, predicts the quantities most directly relevant to planning.
Paper: https://www.nature.com/articles/s41586-020-03051-4
Full Paper: https://arxiv.org/pdf/1911.08265.pdf
submitted by /u/ai-lover
[link] [comments]