Executing programs inside transformers with exponentially faster inference

265 points | by u1hcw9nx a day ago

103 comments