My guess is that they've done a lot of tuning to improve diff based code editing...

laborcontract · 2025-05-06T18:22:36 1746555756

Update 2: I've been using this model in both aider and cline and I've haven't gotten a diff matching error yet, even with some pretty difficult substitutions across different places in multiple files. The overall feel of this model is nice.

I don't have a formal benchmark but there's a notable improvement in code generation due to this alone.

I've had gemini chug away on plans that have taken ~1 hour to implement. (~80mln tokens spent) A good portion of that energy was spent fixing mistakes made by cline/aider/roo due to search/replace mistakes. If this model gets anywhere close to 100% on diffs then this is a BFD. I estimate this will translate to a 50-75% productivity boost on long context coding tasks. I hope the initial results i'm seeing hold up!

I'm surprised by the reaction in the rest of the thread. A lot unproductive complaining, a lot of off topic stuff, nothing talking about the model itself.

Any thoughts from anyone else using the updated model?

esperent · 2025-05-07T00:09:35 1746576575

Have you been using the Gemini 2.5 pro "Experimental" or "3-25" models in Cline? I've been using both over the last week and got quite a few diff errors, maybe 1/10 of edit so that 92% tracks for me.

Does this 2.5 pro "Preview" feel like an improvement if you had used the others?

laborcontract · 2025-05-07T08:56:07 1746608167

Yep i’ve been using the old and new models in cline. I can’t tell any difference outside of the improvement with diffs, but that’s good enough for me.

vessenes · 2025-05-07T15:17:34 1746631054

Question; are you calling it with “aider -model gemini”? And if so do you see 05-04 listed or the old one?

okdood64 · 2025-05-06T17:24:24 1746552264

What do you mean by agentic work in this context?

laborcontract · 2025-05-06T17:38:44 1746553124

Knowing when to call functions, generating the proper function calling text structure, properly executing functions in sequence, knowing when it's completed its objective, and doing that over an extended context window.