FRESH Hacker News
Home
Show HN: A new benchmark for testing LLMs for deterministic outputs
58 points by khurdula