27 semanas · 54 textos · Escritos durante a construção

Notas de campo de um SO de IA pessoal em voo

Toda terça-feira, um ensaio perene sobre o que aprendi enquanto envio o DuranteOS. Toda sexta, um boletim da semana. Cerca de 108 mil palavras e contando — para construtores que preferem ver a fundação ser lançada a ler o press release.

Comece por aqui

Se você está pensando em escrever enquanto constrói

After Twenty-Seven Weeks: What the Series Was For, and What I'm Building Next

Se você está considerando o grupo fundador

The One Reference Customer Strategy: GTM for a Personal AI OS, Sketched Before the Customer Signs

Se você quer a espinha arquitetônica

The Decomposition Discipline I Am Trying to Codify Inside DOS

Assinar · Ensaio de terça

Cerca de 3.800 construtores leem isso toda semana.

Mar 10, 2026

Six Months of DOS: What Changed, What Endured, What Surprised

Twenty-six weeks of building a personal AI operating system in public. Twenty essays in this series so far. Roughly twelve packs in various states of readiness. Around eighty workflows. About 3,800 newsletter subscribers. Zero paying customers, with four months left on the founding-customer clock. Here is the honest half-year retrospective — what survived contact with reality through Q1, what I was wrong about, what I never saw coming.

I am writing this on a Tuesday in mid-March, twenty-six weeks and roughly two hundred and seventy commits into building DuranteOS. Studio has been live for sixty days. The Hexagonal migration is essentially done. The procurement-week dispatch I shipped Friday reframed the architecture argument I have been making for two months as a sovereignty risk priced at the federal-procurement tier.

Six months is the right shape for a retrospective. It is long enough that the load-bearing decisions have had time to compound, short enough that I remember the alternatives I considered and rejected, and exactly the moment at which the founding-customer hunt's clock has burned half its time. The series itself runs for another seven weeks; this essay is the mid-series check-in, not the closing argument.

This is the twentieth essay in the series I started writing six weeks ago when I committed to making the case for DOS — the personal AI operating system I left a steady income to build — out in public, in detail, with the receipts. Twenty essays in is the right moment to step back, audit honestly, and decide what the back half looks like.

The half-year compressed into one sentence

The thesis held. The technical architecture held. The go-to-market plan was wrong in three concrete ways, two of which I have corrected and one of which I am still working on. The single biggest surprise was how much the writing helped.

What endured

Three load-bearing decisions made in the first month that I have not had to revise.

One. The Algorithm as the primary discipline. The Ideal State Criteria decomposition — turning every multi-step request into a numbered list of falsifiable claims before any work begins — has not been adjusted in twenty-six weeks beyond minor tuning. It is the most important load-bearing piece of DOS and the one I would point to first if asked "what is the actual product."

Two. The four-copy distribution model backed by a mechanical sync invariant. I designed it grumbling about the overhead. Five months later I would not change a single line of the manifest schema. The sync-check tool has caught seven distinct multi-copy drifts that would have shipped to customers. Worth every minute of the build cost.

Three. Builder's Compass as the only acquisition channel. I considered paid ads in month two and again in month four and decided against both times. The trust capital from over two years of weekly writing has done what no paid acquisition could do at the founding rate. The single most underrated decision of the half-year.

Decision	Made in	Still standing?
Algorithm + ISC discipline	Month 1	Yes — minor tuning only
Four-copy + sync-check	Month 2	Yes — schema unchanged
Memory layer as SQLite knowledge graph (sketched, MemPalace fork pending)	Month 1	Yes — design intuition matched the upstream that ships in April
Studio gateway (one Bearer token, multi-provider)	Month 3	Yes — added providers, no architectural changes
Hexagonal Architecture for Studio	Month 5	Yes — boundaries holding mid-migration
Builder's Compass as the only acquisition channel	Month 4	Yes — still the most efficient channel
One reference customer at founding rate	Month 5	Yes — still hunting, July deadline

What changed

Three things I was meaningfully wrong about.

What I believed in month 1 vs what I believe now

Month 1 belief

"Ten paying operators in the first six months" — too many, too shallow, would dilute the depth
"I can sell DOS at standard pricing from day one" — wrong; the price band needed a founding rate first
"Direct outbound to my consulting network will produce the founding cohort" — wrong; the cohort is coming from Builder's Compass readers, not from prior consulting clients

Month 6 belief

One paying operator by July with a deep relationship is correct; ten would be wrong
A founding rate is correct; standard pricing is the right price for the next phase, not the founding phase
Public writing is the channel; the trust capital from BC readers is dramatically higher quality than the trust capital from prior clients

The first two corrections were applied in months 2-3. The third was the most expensive — I spent two months in late 2025 doing outbound to consulting contacts and produced approximately zero qualified founding-customer conversations. The pivot to BC-driven inbound was the second quarter's best decision.

What surprised

Three things I genuinely did not see coming.

The three biggest surprises of the half-year

The agent's ability to debate itself usefully. The Council pattern was a side experiment. It became the single most-used decision-making primitive in DOS — far more than I projected. Multi-specialist debate, with the synthesis preserving disagreement explicitly, is the architectural review I always wished I had access to as a solo founder. I now run it on every non-trivial decision, even though the formal orchestrator is still a sketch (the informal version has been running for fourteen weeks).
The compounding rate of public writing. I expected Builder's Compass to grow linearly. It grew super-linearly: 400 subscribers at end of year one, around 3,800 today. The mechanism — readers forwarding individual issues to other senior engineers — was visible in the data but I had not modeled how powerful it would become. The list is now my single most valuable professional asset.
The substrate-bifurcation pace. I expected the Western frontier to consolidate and the open-weights tier to stay months behind. I was wrong on both. The bifurcation that hardened in W16 — Western labs at $380B priced as enterprise distribution, Chinese labs shipping frontier-class open weights on non-US silicon — was not on my month-1 map. The architecture I committed to in month 3 turned out to be exactly what survives the bifurcation, which is partly luck and partly the discipline of building toward portability rather than speed.

The numbers, honestly

Metric	At month 1	At month 6	Where I expected to be
DOS paying customers	0	0 (1 in active negotiation)	0–1
DOS skills/workflows	~10	~80	~40
DOS packs	2	~12 (3 production-grade)	~6
Studio gateway operational	inactive	active (Altyaa via gateway, ~60 days live)	inactive
Builder's Compass subscribers	~3,200	~3,800	~3,500
Founding-cohort conversations completed	0	11	~8
Founding-cohort second conversations	0	4	~3
Posts published in this series	0	20	12
Failure-corpus signals (informal)	0	~30 estimated	unknown
Personal runway remaining (months)	12	6	6

The clock is the most important number. Six months of runway remaining means a binding deadline of roughly July that the founding-customer hunt has to clear. The negotiation in progress has to convert; if it does not, the next four months get the same intensity from the next two prospects. The runway number is the one I have not learned how to lie to myself about, and I am not going to start.

What I would not change

Three decisions that have been questioned (by readers, advisors, and my own occasional doubt) and that I would still make.

One. Leaving the consulting income when I did. The depth of work made possible by full attention is not approximate to half-attention; it is categorically different. I would have a worse product and a worse list if I had tried to do this part-time.

Two. Running Altyaa as the revenue line alongside DOS. The optimal-focus advice would say "shut down the second product." The optimal-focus advice is wrong here because Altyaa funds DOS's runway and validates the shared infrastructure (Studio gateway). The two products together are more interesting than DOS alone would be. The Studio launch in early January would not have made sense without Altyaa as the first internal customer.

Three. Public writing on DOS during the build, not after. The conventional wisdom says "build in private until you have something to show; then announce loudly." I did the opposite. The trust capital from public iteration through twenty essays has done what a single launch announcement could not. Writing was the marketing, the architectural-thinking forcing function, and the trust-building all at once.

What I would change if starting over

Two specific moves I would make differently.

One. I would invest in direct sales / GTM earlier. The current rebalance toward founder-customer development should have started in month two, not month five. The opportunity cost of the delay is at least three months of cohort-building — and it shows up directly in the founding-customer-conversation count being four under what it could have been with an earlier start.

Two. I would write the eval suite sooner. Building it in month six instead of month two meant four months of agent-behavior changes that I cannot precisely audit. Not catastrophic — the operator (me) caught most regressions — but unsupportable at scale. If I had a second founding customer right now, the eval suite would already be saving us both time. I am building it this month per the W16 essay.

The single discipline that mattered most

If forced to attribute the half-year's progress to one habit, I would name recurring-signal heuristic. The rule that any failure or correction repeating four or more times in 30 days requires structural fix has driven nearly every meaningful architectural improvement in DOS. The discipline is mechanical: count the signals, cross the threshold, do the work. The leverage is enormous, even running informally.

What I am betting on for the back half

The thesis remains: the agent is the product, intelligence replaces interface, memory replaces lock-in. The decade-shape of the bet is unchanged. What is in flux for the next six months is execution, in three specific lanes:

A small thank-you, named explicitly

A non-trivial number of readers will have made it through some of the twenty essays. A smaller number will have made it through all twenty. To both groups: thank you.

The writing was for me first — it forced the architectural intuitions into legible shape and the strategic bets into named claims. But the writing only worked because there were readers who would scrutinize it. Several of the essays were materially improved by reader replies: the memory replaces lock-in post benefited from a Brazilian DBA who pushed back on my confidence-decay heuristic, the Council pattern post sharpened after a former Stripe engineer questioned my non-convergence framing, and the Hexagonal Architecture post was directly improved by a reader who has been doing this longer than I have correcting two of my Cockburn citations.

The series is the audit. The audit is the asset. You helped build it. Thank you.

Where the series began

The thesis that frames everything.

The decision behind the series

The financial and identity bet.

The current chapter

The hunt that defines the next four months.

What continues weekly

The newsletter the writing extends from.

The series continues for another seven weeks. The work continues past it. The clock continues. The first reference customer is four months away on the schedule and not yet on the contract; that gap closes between now and July, or it does not.

Either way, this blog will keep producing the receipts. The next seven essays cover the build — Sentinel shipping, MemPalace forking once the upstream lands in April, the routing-bakeoff results, the first customer conversation if it converts, the post-mortem if it does not. The honest version of either kind of report is the only version worth writing. I will write them.

Until then, thank you for reading.

Was this page helpful?

O arco de 27 semanas · Um corpo de trabalho

Vinte e sete semanas. Dois textos por semana. Seis meses de escrita durante a construção.

Semana

Ensaio de terça

Boletim de sexta

W01

2025-10-28

The Agent Is the Product: Why Intelligence Replaces Interface

2025-10-31

Wrapper Wars: The Week Cursor, Windsurf, and MiniMax All Went Vertical

W02

2025-11-04

Memory Replaces Lock-In: Designing the Substrate of Personal AI

2025-11-07

The Week the Substrate Caught Up: Kimi K2 Thinking Lands

W03

2025-11-11

Why I Just Left a Steady Practice to Build a Personal AI Operating System

2025-11-14

The Week the Harness Ate the Model

W04

2025-11-18

The Decomposition Discipline I Am Trying to Codify Inside DOS

2025-11-21

Antigravity, Gemini 3, Grok 4.1: The Week the Frontier Re-Rendered

W05

2025-11-25

Four Copies, One Source of Truth: The Sync Pattern I Want to Commit to Before It Hurts

2025-11-28

Opus 4.5 Lands on Thanksgiving Week: Anthropic's Quiet Counter-Punch

W06

2025-12-02

Plugin Architecture for Hooks: The Pattern I Want Before the Hook Becomes a God-Function

2025-12-05

The Runtime Is the Moat Now: Anthropic Buys Bun, Claude Code Hits a Billion

W07

2025-12-09

Credit-Metered API Gateways: The Pattern I Want Before I Have Anything to Meter

2025-12-12

Three Frontier Models in 23 Days: Stop Picking a Winner, Start Picking a Router

W08

2025-12-16

Sentinel, Sketched: Convention-Driven Onboarding Before I Build It

2025-12-19

The Spec Goes Public, the Substrate Goes to War: Skills Opens While Codex, Gemini Flash, and Nemotron Sprint

W09

2025-12-23

Memory, Sketched: The Knowledge Graph I Was Designing Before MemPalace Shipped

2025-12-26

Christmas Wasn't Quiet: Open Weights Caught Up, Nvidia Bought Inference, Frontier Labs Bought Loyalty

W10

2025-12-30

Council, Sketched: The Three-Round Debate Protocol I Want to Formalize Before I Build It

2026-01-02

The Practitioners' Year-End: While the West Took PTO, DeepSeek Shipped Architecture

W11

2026-01-06

Altyaa's Wedge: The Brazilian-Portuguese SMB Bet Studio Is Pointed At

2026-01-09

Default Claude: Microsoft Flips the Switch as the Substrate Distributes by Default

W12

2026-01-13

Builder's Compass: Two Years In, ~3,800 Subscribers, and What I've Learned Teaching Architecture

2026-01-16

The Agent Surface Splits in Two: Anthropic Goes Vertical While China Goes Substrate

W13

2026-01-20

Failure Patterns, Sketched: The Bookkeeping Discipline I Want for the Agent's Mistakes

2026-01-23

The Rules Layer Solidifies: Constitution, Hardware, and Export Controls Land in One Week

W14

2026-01-27

The One Reference Customer Strategy: GTM for a Personal AI OS, Sketched Before the Customer Signs

2026-01-30

Vertical Integration Eats Horizontal AI: Maia, Meta's Capex, and Anthropic-ServiceNow

W15

2026-02-03

Hexagonal in Practice: The Ports and Adapters I'm Pulling Studio Toward

2026-02-06

Coding Becomes the Flagship: Opus 4.6 and GPT-5.3-Codex Land the Same Day

W16

2026-02-10

TDD for AI Agents, Sketched: The Translation I Want to Commit to Before the Eval Suite Exists

2026-02-13

The Bifurcation Hardens: Anthropic's $380B Meets China's Open-Weights Frontier

W17

2026-02-17

Refactoring the Hook Pipeline: The Fowler Walkthrough I'm Three Refactorings Into

2026-02-20

The Sonnet Window: Free-Tier Skills, Code Security, and the Race for Agentic Primitives

W18

2026-02-24

Skills, Packs, and Hooks: The Three-Layer Model I'm Pulling DOS Toward

2026-02-27

The Moat Above the Model: Distillation, Mobile Coding, and MCP Battlegrounds

W19

2026-03-03

The Knowledge Portfolio for an Indie Founder Building in Public, Audited at Week 25

2026-03-06

Procurement Is the New Benchmark: OpenAI Wins the DoW While Anthropic Gets Banned

W20

2026-03-10

Six Months of DOS: What Changed, What Endured, What Surprised

2026-03-13

The Counter-Sovereignty Move: Anthropic Sues, Then Diversifies

W21

2026-03-17

MCP in Production: What Works When the Protocol Becomes the Boundary

2026-03-20

Cursor Becomes a Model Lab: Composer 2 and the Unbundling at the Agent Layer

W22

2026-03-24

Routing by Sovereignty Class: The Architecture That Survives the Procurement Decade

2026-03-27

The Injunction the Pentagon Won't Honor: Anthropic Wins, the State Defies

W23

2026-03-31

Sentinel's First Scan: What the Convention Catalog Found Across Eleven Projects

2026-04-03

The Agent Stack Swallows the IDE, the IPO, and the Courtroom

W24

2026-04-07

Forking MemPalace: The 48-Hour Integration Retro

2026-04-10

Project Glasswing: When the Frontier Bifurcates on Safety

W25

2026-04-14

The 90-Day Open-Weights Bakeoff: What Actually Routes Where in DOS Today

2026-04-17

Routines Eat the Workflow: The Harness Becomes the Product

W26

2026-04-21

The Substrate Portability Pact: What DOS Refuses to Couple to, Even as the Harness Consolidates

2026-04-24

The Substrate Signs a Ten-Year Lease: Gigawatts, Governance, and the Closing Window for Indie Posture

W27

2026-04-28

After Twenty-Seven Weeks: What the Series Was For, and What I'm Building Next

2026-05-01

The Week Exclusivity Died: Substrate Goes Plural, Rules Layer Becomes the Moat