Search-R1: Training LLMs to Reason and Leverage Search Engines with RL

101 points | by jonbaer 4 days ago

12 comments