The text in the model card says the results are from March (including the Gemini 2.5 Pro results), and o3 hadn't been released yet at that point.
Is this maybe not the updated card, even though the blog post claims there is one? Sure, the timestamp is late April, but I seem to remember the first model card for 2.5 Pro only came out in the last couple of weeks.
Not sure why this is being downvoted, but it's absolutely true.
If you're using these models to generate code daily, the costs add up.
Sure, I'll give a really tough problem to o3 (and probably via ChatGPT rather than the API), but on general coding tasks there isn't a meaningful enough difference to justify 4x the cost.