by mfiguiere on 6/10/25, 8:15 PM with 199 comments
by DanMcInerney on 6/10/25, 9:09 PM
It would be interesting if there was a model that was specifically trained on task-oriented data. It's my understanding they're trained on all data available, but I wonder if it can be fine-tuned or given some kind of reinforcement learning on breaking down general tasks to specific implementations. Essentially an agent-specific model.
by chad1n on 6/10/25, 9:03 PM
by manmal on 6/10/25, 8:23 PM
by mark_l_watson on 6/11/25, 4:18 AM
I have dreamed of having powerful AI ever since I read Bertram Raphael's great book Mind Inside Matter around 1978, getting hooked on AI research and sometimes practical applications for my life since then.
I can easily afford $200 for a Pro account but I get this nagging feeling that LLMs are not the final path to the powerful AI I have always dreamed of and I don't want to support this level of hype.
I have lived through a few AI winters and I worry that accountants will tally up the costs, environmental and money, versus the benefits and that we collectively have an 'oh shit' moment.
by swyx on 6/10/25, 8:40 PM
sama's highlight[0]:
> "The plan o3 gave us was plausible, reasonable; but the plan o3 Pro gave us was specific and rooted enough that it actually changed how we are thinking about our future."
I kept nudging the team to go the whole way to just let o3 be their CEO but they didn't bite yet haha
by WhitneyLand on 6/10/25, 9:03 PM
This announcement adds o3-pro, which pairs with o3 in the same way the o4 models go together.
It should be called o3-high, but to align with the $200 pro membership it’s called pro instead.
That said o3 is already an incredibly powerful model. I prefer it over the new Anthropic 4 models and Gemini 2.5. It’s raw power seems similar to those others, but it’s so good at inline tool use it usually comes out ahead overall.
Any non-trivial code generation/editing should be using an advanced reasoning model, or else you’re losing time fixing more glitches or missing out on better quality solutions.
Of course the caveat is cost, but there’s value on the frontier.
by eru on 6/12/25, 9:29 AM
by ChrisArchitect on 6/10/25, 8:39 PM
OpenAI dropped the price of o3 by 80%
by tiahura on 6/10/25, 8:23 PM
by honeybadger1 on 6/12/25, 9:23 AM
by nickandbro on 6/11/25, 1:27 AM
https://www.svgviewer.dev/s/c3j6TEAP
in case anyone is interested
by vintagedave on 6/12/25, 9:05 AM
Does anyone know what it did or returned? I had not seen anything, nor have I read anything, about issues here.
by ikerino on 6/11/25, 2:33 AM
Have completed around a dozen chats with o3-pro so far. Can't say I'm impressed, output feels qualitatively very similar to regular o3.
Tried feeding in loads of context as suggested in the article but generally feels like a miss.
by conradfr on 6/12/25, 12:17 PM
by paul7986 on 6/12/25, 2:36 AM
It created the image showing each month but when you looked at each month it was so janky ... February 31st and other huge errors!
I'm not using image creation to create 3d art for fun or art sake im trying to use it to create utility images to share for discussion with friends & co-workers. The above is just one of many ways it fails when creating utility images!
by mmsc on 6/10/25, 8:22 PM
by carmelion on 6/10/25, 8:33 PM