Discussion about this post

Neural Foundry

Those benchmark comparisons are wild; Gemini 3 Deep Think really does stand out. Your point about judgement is what matters most: the ability to recognize when you're going down a rabbit hole is crucial. Tasks that used to take weeks now getting done in weekends is the kind of compression we're seeing across the board.

