Erik Craddock (@eriklink)

Initial Thoughts on GPT-OSS | Drew Breunig

There are two schools of thought when it comes to agent building.

Some people think you should shove your entire task into a giant model and let it sort it out, with plenty of thinking. It’s expensive, it’s slow, but it (allegedly) requires less upfront work.

Others think you should design your task as composable steps, where you can measure the accuracy of each step. For most steps, you only need a small model! You don’t need o3 to churn through 3 minutes of tokens to summarize an email body or detect sentiment.
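To make the second school concrete, here is a minimal sketch of a task split into composable steps, each scored on its own labeled examples. Everything in it is an assumption for illustration: `Step`, `step_accuracy`, and `run_pipeline` are hypothetical names, and `summarize` and `detect_sentiment` are toy stand-ins for whatever small model you would actually call.

```python
from dataclasses import dataclass
from typing import Callable, Sequence


@dataclass
class Step:
    """One composable unit of the task (hypothetical structure)."""
    name: str
    run: Callable[[str], str]  # in practice, a call to a small model


def step_accuracy(step: Step, labeled: Sequence[tuple[str, str]]) -> float:
    """Fraction of (input, expected) pairs this step gets exactly right."""
    if not labeled:
        raise ValueError("need at least one labeled example")
    correct = sum(1 for text, expected in labeled if step.run(text) == expected)
    return correct / len(labeled)


def run_pipeline(steps: Sequence[Step], text: str) -> dict[str, str]:
    """Feed each step the previous step's output and keep every intermediate result."""
    results: dict[str, str] = {}
    current = text
    for step in steps:
        current = step.run(current)
        results[step.name] = current
    return results


# Toy stand-ins for small-model calls (assumptions, not real models).
def summarize(body: str) -> str:
    # Keep only the first sentence of the email body.
    return body.split(".")[0].strip() + "."


def detect_sentiment(text: str) -> str:
    # Crude keyword check in place of a sentiment model.
    negative_words = ("broken", "refund", "angry")
    return "negative" if any(w in text.lower() for w in negative_words) else "positive"


summary_step = Step("summary", summarize)
sentiment_step = Step("sentiment", detect_sentiment)

if __name__ == "__main__":
    email = "The order arrived broken. Please send a refund."
    print(run_pipeline([summary_step, sentiment_step], email))
    labeled = [("I love it", "positive"), ("It is broken", "negative")]
    print(step_accuracy(sentiment_step, labeled))
```

Because each step is scored separately, a weak step shows up on its own and can be swapped for a larger model without touching the rest of the pipeline.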

OpenAI released its open-weight model, gpt-oss, today. It comes in two sizes, 120B and 20B, the latter of which runs briskly on my Mac Studio. I’m sure I’ll have more impressions as I use it in anger over the next few weeks, but here are my initial thoughts:

Link by Drew Breunig, via Drew Breunig