Ayush Garg

Search

Recently Updated

Pareto Principle
Jun 18, 2026
Bits
Jun 18, 2026
Magnitude of a normalized floating-point number
Jun 18, 2026
Mixed Precision Training
Jun 18, 2026

❯

❯

Vision Tokens

Apr 18, 2025, 1 min read

Vision tokens are used in Vision Transformers

Vision tokens are small chunks / patches of an image that are treated like words in a sentence

Eg. 224 x 224 pixel image if split into 16 x 16 patches gives you 196 patches (similar to 196 words)

Graph View

Backlinks

No backlinks found

Created by Ayush Garg using Quartz , © 2026

GitHub
Linkedin
Blog
Twitter