Rachit Bansal, Aston Zhang, Rishabh Tiwari, Lovish Madaan, Sai Surya Duvvuri, Fnu Devvrit, David Brandfonbrener, David Alvarez-Melis, Prajjwal Bhargava, Mihir Kale,
and Samy Jelassi,
“Let's (not) just put things in Context: Test-time Training for Long-context LLMs”,
submitted, 2025.
Samy Jelassi, Clara Mohri, David Brandfonbrener, Alex Gu, Nikhil Vyas, Nikhil Anand, David Alvarez-Melis, Yuanzhi Li, Sham M. Kakade, and Eran Malach,
“Mixture of Parrots: Experts improve memorization more than reasoning”,
13th International Conference on Learning Representations (ICLR), 2025.
Oral presentation (top 10%) at the “Mathematics of modern machine learning” workshop, NeurIPS 2024.