Accelerating Gemma 4: faster inference with multi-token prediction drafters

607 points | by amrrs 21 hours ago

291 comments