FRESH Hacker News
Home
FairyFuse: Multiplication-Free LLM Inference on CPUs via Fused Ternary Kernels
24 points by PaulHoule