FRESH Hacker News
Home
Accelerating Gemma 4: faster inference with multi-token prediction drafters
588 points by amrrs