Is One Layer Enough? A Single Transformer Layer Matches Full-Parameter RL Train

93 points | by tcp_handshaker 5 hours ago

21 comments