Cerebras Inference now 3x faster: Llama3.1-70B breaks 2,100 tokens/s

144 points | by campers 2 days ago

84 comments