Hacker News new | past | comments | ask | show | jobs | submit login

On my Thematic Generalization Benchmark (https://github.com/lechmazur/generalization, 810 questions), the Claude 4 models are the new champions.





Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: