Ayush Garg
Rope to Nope and Back Again: A New Hybrid Attention Strategy
Apr 22, 2025, 1 min read
Paper Link: https://arxiv.org/pdf/2501.18795v1