GateGPT: 56k tokens per second Transformer (KV cache) on FPGA at 80 MHz

22 points | by laxmena 2 hours ago

7 comments