Skip to content
DuranteDurante
ALL SYSTEMSGet Access

27 weeks · 54 posts · Written while building

Field notes from a personal AI OS in flight

Every Tuesday, an evergreen essay on what I'm learning while shipping DuranteOS. Every Friday, a dispatch from the week. Roughly 108,000 words and counting — for builders who'd rather watch the foundation get poured than read the press release.

Subscribe · Tuesday essay

Around 3,800 builders read this weekly.

Wrapper Wars: The Week Cursor, Windsurf, and MiniMax All Went Vertical

On Oct 29, 2025, three competing AI coding stacks shipped their own purpose-built models within twelve hours of each other — Cursor 2.0 with Composer, Windsurf with SWE-1.5 at 950 tok/s, and MiniMax M2 as the open-weight challenger. The day before, Anthropic permanently retired Claude 3.5 Sonnet. Here is what that week meant, written from the outside before I had built anything to defend.

I am writing this on a Thursday in late October 2025 with no product to defend.

Worth saying out loud at the top, because most commentary about platform shifts is written by people who already shipped a thing the shift threatens. I have a thesis I have been turning over for months — that the next operating system layer is the agent, not the app — and I have been sketching what a personal AI OS might look like. I have not built it. I have a name for it (DOS, for DuranteOS) and a notebook full of architectural choices, and that is approximately the entire scaffolding so far.

So when I say what this week's news means, I mean it from the position of someone trying to decide whether the thesis still survives contact with the market. Not from the position of someone whose roadmap just got threatened.

The week of October 27, 2025 will, I think, be looked back on as the week the agentic developer-tooling stack vertically integrated overnight.

Within roughly twelve hours on Wednesday Oct 29, three coordinated launches landed:

  • Cursor 2.0 shipped with Composer, Cursor's first proprietary frontier coding model, "4x faster than similarly intelligent models" with sub-30-second turns and a multi-agent interface running up to eight parallel agents via git worktrees (source)
  • Windsurf released SWE-1.5, trained on GB200 NVL72 chips and served via Cerebras silicon at 950 tokens/sec — "6x faster than Haiku 4.5, 13x faster than Sonnet 4.5" (source)
  • MiniMax M2, an open-weights Chinese challenger (10B active / 200B total parameters, runs locally on a 256GB Mac Studio, $0.30/$1.20 per million tokens, Anthropic-API compatible) had been released two days earlier on Oct 27 and got mass adoption that Wednesday (source)

The day before all of this, on Tuesday Oct 28, Anthropic permanently retired both Claude 3.5 Sonnet snapshots (source) — forcing every agentic system that had pinned to 3.5 into a same-day migration.

Four days. Four major moves. One unmistakable message.

The week's signal in one sentence

The days of "I build on Claude or GPT" as a moat are over. Either you own the model layer, or you own a layer the model layer cannot reach.

The wrapper-wars era and what just ended it

For roughly the last three years, the dominant indie-developer-tools playbook has looked something like this:

  1. Pick a frontier API (Anthropic, OpenAI, lately Google)
  2. Wrap it in a coherent UI with carefully tuned prompts
  3. Sell the experience, not the intelligence
  4. Treat the model as commodity infrastructure

This worked. It produced Cursor (initially), Continue, Aider, Cody, dozens of category leaders. The thesis was sound: the model is the engine; the IDE is the chassis; the chassis is where the money is.

What happened this week is that the chassis builders started shipping their own engines.

PlayerMove (Oct 27–30, 2025)What it means
CursorShipped Composer (own coding model) + multi-agent UI"We are not a Claude wrapper. We are a Claude competitor."
WindsurfShipped SWE-1.5 at 950 tok/s on Cerebras"Latency, not intelligence, is the new battleground."
MiniMaxOpen-weights 10B-active model, runs on a Mac Studio"Frontier-quality is no longer hostage to an API."
AnthropicRetired Claude 3.5 Sonnet permanently"Your two-year-old wrapper just stopped working."

The pattern is not subtle. Three IDEs/labs proved within seventy-two hours that the IDE-as-thin-Claude-wrapper architecture has a ceiling, and they have crashed through it. Anthropic, in the same week, demonstrated the cost of being downstream of someone else's deprecation calendar.

What this means if you sell to developers

Three observations, in increasing order of how uncomfortable they are.

What worked last year vs. what works now

The 2024 wrapper playbook
  • Wrap the best frontier model
  • Fine-tune prompts; wait for the next model release to get better for free
  • Charge $20-40/mo for "AI in your editor"
  • Treat the underlying model as commodity
  • Differentiate on UX polish
The 2026 vertical playbook
  • Own the model OR own a layer the model cannot reach (memory, identity, workflow)
  • Optimize the loop (latency, parallelism, ergonomics) not the answer
  • Charge for outcomes the model alone cannot produce
  • Treat the underlying model as a replaceable substrate — but assume someone will out-execute you on it
  • Differentiate on accumulated context

If you are building a coding tool, the wrapper-wars era ended Wednesday. The Cursor and Windsurf playbook is a reasonable one if you have the capital to train your own model. It is fatal if you do not.

If you are building anything else — and I am increasingly convinced the most interesting work is "anything else" — the question becomes more interesting: what layer are you building that the model providers cannot collapse into themselves?

The DOS thesis, restated in light of the week

I want to be careful here. I have not shipped DOS. I am going to lay out the thesis this week's news pressure-tested, not the product I have built on top of it.

The thesis I have been working on for months is roughly this:

  • The next operating system layer is the personal agent, not the application
  • The substrate (Claude, GPT, Gemini, eventually open-weights) is replaceable and will commoditize
  • What does not commoditize is the operator's accumulated context — their identity, their decisions, their preferences, their corrections, their judgment expressed over time
  • Whoever owns the operator's accumulated context owns the relationship; whoever owns the relationship has a defensible position when the substrate shifts under them

This week's news is the strongest evidence I have so far that the second bullet is right. The substrate IS commoditizing. Cursor and Windsurf bypassing the API is one form of evidence. MiniMax M2 running on a Mac Studio at frontier-adjacent quality is another. Anthropic deprecating Sonnet 3.5 mid-cycle is a third. The substrate is now plural, replaceable, and on a deprecation calendar.

The thing the news does not answer is the third bullet — whether the operator's accumulated context is a defensible asset, or just a nice-to-have. That is the experiment I am about to run by actually building DOS.

The four candidate layers a model provider cannot collapse into themselves (yet)

  1. Operator identity — the rules, voice, refusal patterns, taste preferences that make the agent recognizably yours. Lives in the operator's hands, not the substrate's.
  2. Persistent memory — months or years of accumulated facts, decisions, and corrections. Survives any model swap. I have not yet built this; it is a hypothesis I am about to test.
  3. Capability composition — the named workflows, packs, and behaviors that turn raw inference into something an operator can recall, reuse, and refine. Sketched on paper; not shipped.
  4. Cross-substrate orchestration — a single point through which inference is routed across many providers, with metering, audit, and operator control. Vapor, this week. Will be infrastructure if the thesis holds.

These are the four layers I think survive the wrapper wars. They are also the four layers that the new vertical-integration moves (Cursor's Composer, Windsurf's SWE-1.5) do not address — those moves are about owning the generation of code; the DOS thesis is about owning the operator's relationship to that generation.

I will know in twelve months whether that distinction matters or whether I was wrong.

Three angles I am watching for the next four weeks

Putting forward predictions because predictions are the unit of accountability:

What I would tell an indie founder in this category right now

If your product is a thin wrapper around a single frontier API, you have approximately six months to either (a) shift to a layer the model providers cannot collapse into themselves, or (b) accept that your differentiation will commoditize as the model providers ship their own opinionated tools. Neither outcome is bad — but pretending the current shape is durable is.

The boring takeaway

The week of October 27, 2025 was the week the AI coding stack stopped being a flat market of API-wrappers and started looking like the early-2010s database market — vertically integrated leaders (Cursor, Windsurf), open-source challengers (MiniMax), and a primary infrastructure provider (Anthropic) whose deprecation choices ripple through every layer above them.

That market shape eventually consolidated to three or four players who owned both the storage engine and the query layer. The agent market is on track to consolidate the same way, on roughly the same time-scale.

I am not in that consolidation lane. I am one layer up — or I plan to be — building toward the operator's accumulated context, identity, and capability orchestration. That is the bet I am about to make this year. The week's news did not invalidate the thesis. If anything, it sharpened the case for it.

I will be watching the next dispatch slot to see what the second wave looks like.

— Lucas


Sources verified the week of Oct 27, 2025: Cursor 2.0 announcement · Simon Willison on Composer · Simon Willison on SWE-1.5 · Simon Willison on MiniMax M2 · Anthropic API release notes · Claude Skills GA Oct 16 · Claude Haiku 4.5 Oct 15

Was this page helpful?

The 27-week arc · A single body of work

Twenty-seven weeks. Two posts a week. Six months of writing while building.

Week

Tuesday evergreen

Friday dispatch