Show HN: A new benchmark for testing LLMs for deterministic outputs

58 points | by khurdula 2 days ago

24 comments