Ayush Garg

Search

SearchSearch

Recently Updated

  • vCPU

    Oct 28, 2025

    • Byte Pair Encoding

      Oct 27, 2025

      • C++ Optimizations

        Oct 27, 2025

        • Verifiers

          Oct 17, 2025

          Home

          ❯

          List of Notes

          ❯

          Transformer

          Transformer

          Apr 14, 2025, 1 min read

          The original transformer architecture from the Attention is All You Need paper

          Graph View

          Backlinks

          • Cohere Transformer Challenge

          Created by Ayush Garg using Quartz , © 2025

          • GitHub
          • Linkedin
          • Blog
          • Twitter