FRESH Hacker News
Two different tricks for fast LLM inference
192 points by swah