FRESH

Hacker News
FairyFuse: Multiplication-Free LLM Inference on CPUs via Fused Ternary Kernels
24 points by PaulHoule
- Paper looks great. No GitHub link that I can find though. Maybe I'll take a crack at an implementation if I've got some extra free time.