Reward models for LMs are fundamentally broken

2 points | by panthertrax 8 hours ago

No comments yet.