We propose EfficientZero, a sample-efficient, model-based visual RL algorithm built on MuZero. It is the first algorithm to achieve super-human performance on Atari games with 100k environment steps of data. We hope that EfficientZero’s low sample complexity and high performance can bring RL closer to real-world applicability.