This study explores the scalability of deep reinforcement learning and shows how you can use 768 CPU cores to cut training time down from 10 hours to 21 minutes—enough time to master multiple classic Atari* games.