Ayush Garg

Search

SearchSearch

Recently Updated

  • Deepseek V4

    Apr 25, 2026

    • On-Policy Distillation

      Apr 25, 2026

      • Pretraining

        Apr 25, 2026

        • Supervised Fine-Tuning (SFT)

          Apr 25, 2026

          Home

          ❯

          List of Notes

          ❯

          Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm

          Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm

          Oct 02, 2025, 1 min read

          Refer to AlphaZero Supplementary Data

          I am getting a lot of my information from ChatGPT

          Paper details:

          Graph View

          Backlinks

          • No backlinks found

          Created by Ayush Garg using Quartz , © 2026

          • GitHub
          • Linkedin
          • Blog
          • Twitter