FRESH Hacker News
Home
GateGPT: 56k tokens per second Transformer (KV cache) on FPGA at 80 MHz
35 points by laxmena