Starting from random play and given no domain knowledge, AlphaZero achieved within 24 hours a superhuman level of play in the games of chess, shogi, and Go
by veen
In this paper, we generalise this approach into a single AlphaZero algorithm that can achieve, tabula rasa, superhuman performance in many challenging domains.