Two different tricks for fast LLM inference

194 points | by swah a month ago

81 comments