MCTS

Mastering Atari Games with Limited Data

We propose EfficientZero, a sample-efficient, model-based visual RL algorithm built on MuZero. It is the first algorithm to achieve super-human performance on Atari games with 100k environment steps of data. We hope EfficientZero’s low sample complexity and high performance can bring RL closer to real-world applicability.

2021-11-03

NeurIPS 2021