27 weeks · 54 posts · Written while building

Field notes from a personal AI OS in flight

Every Tuesday, an evergreen essay on what I'm learning while shipping DuranteOS. Every Friday, a dispatch from the week. Roughly 108,000 words and counting — for builders who'd rather watch the foundation get poured than read the press release.

Start here

If you're thinking about writing while building

After Twenty-Seven Weeks: What the Series Was For, and What I'm Building Next

If you're considering the founding cohort

The One Reference Customer Strategy: GTM for a Personal AI OS, Sketched Before the Customer Signs

If you want the architectural spine

The Decomposition Discipline I Am Trying to Codify Inside DOS

Subscribe · Tuesday essay

Around 3,800 builders read this weekly.

Oct 31, 2025

Wrapper Wars: The Week Cursor, Windsurf, and MiniMax All Went Vertical

On Oct 29, 2025, three competing AI coding stacks shipped their own purpose-built models within twelve hours of each other — Cursor 2.0 with Composer, Windsurf with SWE-1.5 at 950 tok/s, and MiniMax M2 as the open-weight challenger. The day before, Anthropic permanently retired Claude 3.5 Sonnet. Here is what that week meant, written from the outside before I had built anything to defend.

I am writing this on a Thursday in late October 2025 with no product to defend.

Worth saying out loud at the top, because most commentary about platform shifts is written by people who already shipped a thing the shift threatens. I have a thesis I have been turning over for months — that the next operating system layer is the agent, not the app — and I have been sketching what a personal AI OS might look like. I have not built it. I have a name for it (DOS, for DuranteOS) and a notebook full of architectural choices, and that is approximately the entire scaffolding so far.

So when I say what this week's news means, I mean it from the position of someone trying to decide whether the thesis still survives contact with the market. Not from the position of someone whose roadmap just got threatened.

The week of October 27, 2025 will, I think, be looked back on as the week the agentic developer-tooling stack vertically integrated overnight.

Within roughly twelve hours on Wednesday Oct 29, three coordinated launches landed:

Cursor 2.0 shipped with Composer, Cursor's first proprietary frontier coding model, "4x faster than similarly intelligent models" with sub-30-second turns and a multi-agent interface running up to eight parallel agents via git worktrees (source)
Windsurf released SWE-1.5, trained on GB200 NVL72 chips and served via Cerebras silicon at 950 tokens/sec — "6x faster than Haiku 4.5, 13x faster than Sonnet 4.5" (source)
MiniMax M2, an open-weights Chinese challenger (10B active / 200B total parameters, runs locally on a 256GB Mac Studio, $0.30/$1.20 per million tokens, Anthropic-API compatible) had been released two days earlier on Oct 27 and got mass adoption that Wednesday (source)

The day before all of this, on Tuesday Oct 28, Anthropic permanently retired both Claude 3.5 Sonnet snapshots (source) — forcing every agentic system that had pinned to 3.5 into a same-day migration.

Four days. Four major moves. One unmistakable message.

The week's signal in one sentence

The days of "I build on Claude or GPT" as a moat are over. Either you own the model layer, or you own a layer the model layer cannot reach.

The wrapper-wars era and what just ended it

For roughly the last three years, the dominant indie-developer-tools playbook has looked something like this:

Pick a frontier API (Anthropic, OpenAI, lately Google)
Wrap it in a coherent UI with carefully tuned prompts
Sell the experience, not the intelligence
Treat the model as commodity infrastructure

This worked. It produced Cursor (initially), Continue, Aider, Cody, dozens of category leaders. The thesis was sound: the model is the engine; the IDE is the chassis; the chassis is where the money is.

What happened this week is that the chassis builders started shipping their own engines.

Player	Move (Oct 27–30, 2025)	What it means
Cursor	Shipped Composer (own coding model) + multi-agent UI	"We are not a Claude wrapper. We are a Claude competitor."
Windsurf	Shipped SWE-1.5 at 950 tok/s on Cerebras	"Latency, not intelligence, is the new battleground."
MiniMax	Open-weights 10B-active model, runs on a Mac Studio	"Frontier-quality is no longer hostage to an API."
Anthropic	Retired Claude 3.5 Sonnet permanently	"Your two-year-old wrapper just stopped working."

The pattern is not subtle. Three IDEs/labs proved within seventy-two hours that the IDE-as-thin-Claude-wrapper architecture has a ceiling, and they have crashed through it. Anthropic, in the same week, demonstrated the cost of being downstream of someone else's deprecation calendar.

What this means if you sell to developers

Three observations, in increasing order of how uncomfortable they are.

What worked last year vs. what works now

The 2024 wrapper playbook

Wrap the best frontier model
Fine-tune prompts; wait for the next model release to get better for free
Charge $20-40/mo for "AI in your editor"
Treat the underlying model as commodity
Differentiate on UX polish

The 2026 vertical playbook

Own the model OR own a layer the model cannot reach (memory, identity, workflow)
Optimize the loop (latency, parallelism, ergonomics) not the answer
Charge for outcomes the model alone cannot produce
Treat the underlying model as a replaceable substrate — but assume someone will out-execute you on it
Differentiate on accumulated context

If you are building a coding tool, the wrapper-wars era ended Wednesday. The Cursor and Windsurf playbook is a reasonable one if you have the capital to train your own model. It is fatal if you do not.

If you are building anything else — and I am increasingly convinced the most interesting work is "anything else" — the question becomes more interesting: what layer are you building that the model providers cannot collapse into themselves?

The DOS thesis, restated in light of the week

I want to be careful here. I have not shipped DOS. I am going to lay out the thesis this week's news pressure-tested, not the product I have built on top of it.

The thesis I have been working on for months is roughly this:

The next operating system layer is the personal agent, not the application
The substrate (Claude, GPT, Gemini, eventually open-weights) is replaceable and will commoditize
What does not commoditize is the operator's accumulated context — their identity, their decisions, their preferences, their corrections, their judgment expressed over time
Whoever owns the operator's accumulated context owns the relationship; whoever owns the relationship has a defensible position when the substrate shifts under them

This week's news is the strongest evidence I have so far that the second bullet is right. The substrate IS commoditizing. Cursor and Windsurf bypassing the API is one form of evidence. MiniMax M2 running on a Mac Studio at frontier-adjacent quality is another. Anthropic deprecating Sonnet 3.5 mid-cycle is a third. The substrate is now plural, replaceable, and on a deprecation calendar.

The thing the news does not answer is the third bullet — whether the operator's accumulated context is a defensible asset, or just a nice-to-have. That is the experiment I am about to run by actually building DOS.

The four candidate layers a model provider cannot collapse into themselves (yet)

Operator identity — the rules, voice, refusal patterns, taste preferences that make the agent recognizably yours. Lives in the operator's hands, not the substrate's.
Persistent memory — months or years of accumulated facts, decisions, and corrections. Survives any model swap. I have not yet built this; it is a hypothesis I am about to test.
Capability composition — the named workflows, packs, and behaviors that turn raw inference into something an operator can recall, reuse, and refine. Sketched on paper; not shipped.
Cross-substrate orchestration — a single point through which inference is routed across many providers, with metering, audit, and operator control. Vapor, this week. Will be infrastructure if the thesis holds.

These are the four layers I think survive the wrapper wars. They are also the four layers that the new vertical-integration moves (Cursor's Composer, Windsurf's SWE-1.5) do not address — those moves are about owning the generation of code; the DOS thesis is about owning the operator's relationship to that generation.

I will know in twelve months whether that distinction matters or whether I was wrong.

Three angles I am watching for the next four weeks

Putting forward predictions because predictions are the unit of accountability:

What I would tell an indie founder in this category right now

If your product is a thin wrapper around a single frontier API, you have approximately six months to either (a) shift to a layer the model providers cannot collapse into themselves, or (b) accept that your differentiation will commoditize as the model providers ship their own opinionated tools. Neither outcome is bad — but pretending the current shape is durable is.

The boring takeaway

The week of October 27, 2025 was the week the AI coding stack stopped being a flat market of API-wrappers and started looking like the early-2010s database market — vertically integrated leaders (Cursor, Windsurf), open-source challengers (MiniMax), and a primary infrastructure provider (Anthropic) whose deprecation choices ripple through every layer above them.

That market shape eventually consolidated to three or four players who owned both the storage engine and the query layer. The agent market is on track to consolidate the same way, on roughly the same time-scale.

I am not in that consolidation lane. I am one layer up — or I plan to be — building toward the operator's accumulated context, identity, and capability orchestration. That is the bet I am about to make this year. The week's news did not invalidate the thesis. If anything, it sharpened the case for it.

I will be watching the next dispatch slot to see what the second wave looks like.

— Lucas

Sources verified the week of Oct 27, 2025: Cursor 2.0 announcement · Simon Willison on Composer · Simon Willison on SWE-1.5 · Simon Willison on MiniMax M2 · Anthropic API release notes · Claude Skills GA Oct 16 · Claude Haiku 4.5 Oct 15

Was this page helpful?

The 27-week arc · A single body of work

Twenty-seven weeks. Two posts a week. Six months of writing while building.

Week

Tuesday evergreen

Friday dispatch

W01

2025-10-28

The Agent Is the Product: Why Intelligence Replaces Interface

2025-10-31

Wrapper Wars: The Week Cursor, Windsurf, and MiniMax All Went Vertical

W02

2025-11-04

Memory Replaces Lock-In: Designing the Substrate of Personal AI

2025-11-07

The Week the Substrate Caught Up: Kimi K2 Thinking Lands

W03

2025-11-11

Why I Just Left a Steady Practice to Build a Personal AI Operating System

2025-11-14

The Week the Harness Ate the Model

W04

2025-11-18

The Decomposition Discipline I Am Trying to Codify Inside DOS

2025-11-21

Antigravity, Gemini 3, Grok 4.1: The Week the Frontier Re-Rendered

W05

2025-11-25

Four Copies, One Source of Truth: The Sync Pattern I Want to Commit to Before It Hurts

2025-11-28

Opus 4.5 Lands on Thanksgiving Week: Anthropic's Quiet Counter-Punch

W06

2025-12-02

Plugin Architecture for Hooks: The Pattern I Want Before the Hook Becomes a God-Function

2025-12-05

The Runtime Is the Moat Now: Anthropic Buys Bun, Claude Code Hits a Billion

W07

2025-12-09

Credit-Metered API Gateways: The Pattern I Want Before I Have Anything to Meter

2025-12-12

Three Frontier Models in 23 Days: Stop Picking a Winner, Start Picking a Router

W08

2025-12-16

Sentinel, Sketched: Convention-Driven Onboarding Before I Build It

2025-12-19

The Spec Goes Public, the Substrate Goes to War: Skills Opens While Codex, Gemini Flash, and Nemotron Sprint

W09

2025-12-23

Memory, Sketched: The Knowledge Graph I Was Designing Before MemPalace Shipped

2025-12-26

Christmas Wasn't Quiet: Open Weights Caught Up, Nvidia Bought Inference, Frontier Labs Bought Loyalty

W10

2025-12-30

Council, Sketched: The Three-Round Debate Protocol I Want to Formalize Before I Build It

2026-01-02

The Practitioners' Year-End: While the West Took PTO, DeepSeek Shipped Architecture

W11

2026-01-06

Altyaa's Wedge: The Brazilian-Portuguese SMB Bet Studio Is Pointed At

2026-01-09

Default Claude: Microsoft Flips the Switch as the Substrate Distributes by Default

W12

2026-01-13

Builder's Compass: Two Years In, ~3,800 Subscribers, and What I've Learned Teaching Architecture

2026-01-16

The Agent Surface Splits in Two: Anthropic Goes Vertical While China Goes Substrate

W13

2026-01-20

Failure Patterns, Sketched: The Bookkeeping Discipline I Want for the Agent's Mistakes

2026-01-23

The Rules Layer Solidifies: Constitution, Hardware, and Export Controls Land in One Week

W14

2026-01-27

The One Reference Customer Strategy: GTM for a Personal AI OS, Sketched Before the Customer Signs

2026-01-30

Vertical Integration Eats Horizontal AI: Maia, Meta's Capex, and Anthropic-ServiceNow

W15

2026-02-03

Hexagonal in Practice: The Ports and Adapters I'm Pulling Studio Toward

2026-02-06

Coding Becomes the Flagship: Opus 4.6 and GPT-5.3-Codex Land the Same Day

W16

2026-02-10

TDD for AI Agents, Sketched: The Translation I Want to Commit to Before the Eval Suite Exists

2026-02-13

The Bifurcation Hardens: Anthropic's $380B Meets China's Open-Weights Frontier

W17

2026-02-17

Refactoring the Hook Pipeline: The Fowler Walkthrough I'm Three Refactorings Into

2026-02-20

The Sonnet Window: Free-Tier Skills, Code Security, and the Race for Agentic Primitives

W18

2026-02-24

Skills, Packs, and Hooks: The Three-Layer Model I'm Pulling DOS Toward

2026-02-27

The Moat Above the Model: Distillation, Mobile Coding, and MCP Battlegrounds

W19

2026-03-03

The Knowledge Portfolio for an Indie Founder Building in Public, Audited at Week 25

2026-03-06

Procurement Is the New Benchmark: OpenAI Wins the DoW While Anthropic Gets Banned

W20

2026-03-10

Six Months of DOS: What Changed, What Endured, What Surprised

2026-03-13

The Counter-Sovereignty Move: Anthropic Sues, Then Diversifies

W21

2026-03-17

MCP in Production: What Works When the Protocol Becomes the Boundary

2026-03-20

Cursor Becomes a Model Lab: Composer 2 and the Unbundling at the Agent Layer

W22

2026-03-24

Routing by Sovereignty Class: The Architecture That Survives the Procurement Decade

2026-03-27

The Injunction the Pentagon Won't Honor: Anthropic Wins, the State Defies

W23

2026-03-31

Sentinel's First Scan: What the Convention Catalog Found Across Eleven Projects

2026-04-03

The Agent Stack Swallows the IDE, the IPO, and the Courtroom

W24

2026-04-07

Forking MemPalace: The 48-Hour Integration Retro

2026-04-10

Project Glasswing: When the Frontier Bifurcates on Safety

W25

2026-04-14

The 90-Day Open-Weights Bakeoff: What Actually Routes Where in DOS Today

2026-04-17

Routines Eat the Workflow: The Harness Becomes the Product

W26

2026-04-21

The Substrate Portability Pact: What DOS Refuses to Couple to, Even as the Harness Consolidates

2026-04-24

The Substrate Signs a Ten-Year Lease: Gigawatts, Governance, and the Closing Window for Indie Posture

W27

2026-04-28

After Twenty-Seven Weeks: What the Series Was For, and What I'm Building Next

2026-05-01

The Week Exclusivity Died: Substrate Goes Plural, Rules Layer Becomes the Moat