Cerebras runs Llama 3.1 70B at 2,100 tokens per second, live demo available

6 points | by modeless 3 hours ago

1 comment