FRESH Hacker News
Home
DSpark: Speculative decoding accelerates LLM inference [pdf]
651 points by aurenvale