Skip to content
DuranteDurante
ALL SYSTEMSGet Access

27 weeks · 54 posts · Written while building

Field notes from a personal AI OS in flight

Every Tuesday, an evergreen essay on what I'm learning while shipping DuranteOS. Every Friday, a dispatch from the week. Roughly 108,000 words and counting — for builders who'd rather watch the foundation get poured than read the press release.

Subscribe · Tuesday essay

Around 3,800 builders read this weekly.

Vertical Integration Eats Horizontal AI: Maia, Meta's Capex, and Anthropic-ServiceNow

Microsoft launched the Maia 200 inference chip on Monday and posted FY26 Q2 earnings on Wednesday — capex up 66% to $37.5B in a single quarter, Azure up 39%. The same Wednesday, Anthropic announced ServiceNow as the default Build Agent distribution channel for Claude. Meta guided 2026 capex to $115-135B. Ai2 released the SERA-32B open coding agents as the open-source counter-current, reproducible at $400. The five-day window is one signal: hyperscalers are owning the vertical stack — chip, model, distribution, capex — rather than buying its parts. Indie founders just got a much sharper picture of who controls what.

The first week of February's lead-up was supposed to be a normal post-Davos cadence week. Big-tech earnings, some lab launches, the usual mid-cycle news.

It was not normal. It was the week the vertical stack got bought.

Microsoft launched its own inference accelerator on Monday, posted earnings on Wednesday with capex up sixty-six percent in a single quarter, announced Anthropic as a deeper distribution partner the same day, and surrounded all of it with capex forecasts from Meta that nearly double 2025 spend. Ai2 quietly shipped the open-source counter-current — open-coding-agents reproducible at four hundred dollars — on Tuesday afternoon while the hyperscalers were busy.

Five days. Five separate moves. One pattern that has been forming for two months and finally clicked into focus this week. Hyperscalers are not buying the stack anymore. They are building it — chip, model, distribution channel, capex commitment — and the build cycle just compressed enough that the indie founder's strategic map needs an update.

I am writing this six days after Studio shipped, twenty weeks into building DOS, and the way I describe what Studio is changes after this week. The "credit-metered gateway over multiple substrates" framing held for two months. After Maia 200 and the Anthropic-ServiceNow channel announcement, the framing sharpens to something more specific: an indie's defense against vertical integration that the indie did not initiate.

The week's signal in one sentence

The hyperscalers are vertically integrating the AI stack — chip, model, distribution, capex — at a speed that closes most "build a horizontal AI tool and sell to enterprises directly" plays inside this calendar year. The indie wedges that survive are the ones that route across the vertical fortresses, not the ones that try to compete with them.

The hook: Microsoft Maia 200 plus FY26 Q2

The single most consequential thing of the five-day window came in two parts that landed two days apart.

On Mon Jan 26, Microsoft unveiled the Maia 200 inference accelerator (source). 3nm, 750W, 216GB HBM3e. Microsoft's claim: 30% better performance-per-dollar than the current Azure fleet, FP8 throughput above Google's TPU v7, FP4 roughly 3x AWS Trainium. Inference-specific silicon, designed in-house, shipping to Azure customers this year.

On Wed Jan 28, Microsoft posted FY26 Q2 earnings (source). Revenue $81.3B (+17%). Microsoft Cloud $51.5B. Azure +39%. Capex $37.5B for a single quarter, up 66% year-over-year, with roughly two-thirds going to short-lived GPU and CPU. Nadella's framing on the call: "we are only at the beginning phases of AI diffusion."

Read the two announcements together. The chip and the spend were paired by design. Microsoft is not waiting for inference cost to come down through general-market silicon competition. It is building the silicon itself, deploying it on a capex schedule that dwarfs every prior cloud build-out, and using its own accounting to reset the inference-cost-per-token line in its own favor.

The implication for an indie founder is sharper than the press release suggests. Inference cost-per-token is about to become non-uniform. Hyperscalers running their own silicon will see step-function drops in internal cost that will not immediately reflect in public API pricing. The arbitrage window between "what it costs Microsoft to serve a Claude inference inside Azure" and "what it charges for that inference at the API" is going to widen this year, and the founders who route through hyperscaler-subsidized inference paths will have unit economics that founders on direct-API paths do not.

That is the architectural argument for a routing gateway, restated by Microsoft's earnings call. Studio's value proposition just got re-validated by the most boring possible source: a single line in an earnings statement.

The Anthropic distribution channel

The same Wednesday, the second half of the vertical-integration story shipped.

On Wed Jan 28, Anthropic announced ServiceNow as the default model for ServiceNow's Build Agent (source). Distribution into ServiceNow's twenty-nine-thousand-seat workforce surface, governance via the AI Control Tower, and ServiceNow's own forward guidance that Build Agent traction will quadruple in twelve months on the back of the Claude integration.

This is the same playbook as the Microsoft 365 Copilot default flip from three weeks ago. Anthropic does not need to acquire the enterprise customer; ServiceNow already has them. Anthropic does not need to write the workflow integration; Build Agent already shipped it. The only thing the partnership creates is Claude as the default model running inside an existing enterprise surface.

For an indie founder, the compounding pattern is the alarm. Three weeks ago Microsoft Copilot. This week ServiceNow. Salesforce, Workday, SAP, Atlassian — pick any enterprise productivity surface with twenty thousand seats or more, and the betting odds say Anthropic announces a similar default-model partnership inside Q1. The agent-distribution channel is collapsing into a small number of governed enterprise platforms that route to one of two or three frontier models by default, with the indie tooling layer either inside (via Skills, MCP, AGENTS.md) or invisible.

If you are building a tool today, the question is no longer "how do I get enterprise customers." The question is "how does my tool ride inside ServiceNow Build Agent / Microsoft Copilot / Salesforce Agent Cloud's default surface, exposed to the operator already inside someone else's product." That is a fundamentally different distribution thesis than "we'll build a great product and they'll come."

The Meta tell

Capex commitments are usually a quiet earnings-day footnote. This week, Meta's was the loudest data point of the entire window outside the Microsoft pair.

On Wed Jan 28, Meta posted Q4 2025 ($59.9B revenue, $8.88 EPS) and guided 2026 capex to $115-135B (source). The 2025 capex was $72.2B. The 2026 guide is nearly double. The framing: Meta Superintelligence Labs build-out.

Meta's capex is not just a Meta data point. It is a market signal about what the floor of frontier-model spend looks like in 2026. If Meta is committing 2x for the full year, Microsoft is committing roughly $150B at run-rate, and Google's prior-quarter guidance is in the same neighborhood, the aggregate hyperscaler spend on AI infrastructure in 2026 is somewhere north of half a trillion dollars. The capacity that volume buys is going to compress inference cost-per-token through 2027, and the founders building product economics on 2025-pricing assumptions are about to be wrong by an order of magnitude.

The compression cuts two ways. Inference subsidies for hyperscaler-routed workloads get larger. Indie founders building on direct frontier-API pricing pay full freight. The unit-economics gap between routed and direct widens. Whichever side you build for, you should be deliberate about it now.

The Ai2 counter-current

The other Tuesday landed largely uncovered.

On Tue Jan 27, the Allen Institute for AI shipped Open Coding Agents — including SERA-32B (source). Open weights on Qwen 3 backbone, full training recipe published, Claude Code integration out of the box, reproducible at approximately four hundred dollars of inference. The Olmo "open all the way down" pattern, applied to the coding-agent stack.

For most readers this is a curiosity. For founders who have been watching the substrate-commoditization curve for three months, this is the logical next chapter. Ai2's SERA release follows the same arc as DeepSeek-V3.2 (open weights), GLM-Image-on-Ascend (open weights without US silicon), and now SERA-32B (open weights with full reproducibility recipe). Each release lowers the floor of what an indie founder can build with at near-zero marginal cost.

The Ai2 release is the counter-current to the vertical-integration story. While hyperscalers consolidate the stack, Ai2 publishes a reproducible recipe for assembling an alternative stack. Both motions are real. Both will run in parallel through 2026. Founders who treat them as alternatives — pick a side, hedge your architecture across both — survive whichever side wins.

The pattern: vertical integration vs. open reproducibility

The five-day window in one frame

The 'we're still in the horizontal era' frame (no longer true)
  • Inference cost-per-token uniform across the market
  • Distribution surfaces still TAM-able directly to enterprises
  • Indie tooling can stay outside hyperscaler integration loops
  • Frontier-model API pricing reflects underlying compute cost
  • Build great product, sell to customers, win on quality
The 'vertical stacks are the unit' frame (this week's evidence)
  • Microsoft Maia 200 + $37.5B Q2 capex repricing inference internally
  • Meta committing $115-135B 2026 capex, doubling 2025
  • Anthropic-ServiceNow channel deal stacking on Microsoft Copilot default
  • Ai2 SERA-32B as the open-reproducibility counter-current
  • Indie tooling either rides inside someone's stack or rides on open weights — middle ground compresses

If you wanted evidence for the year's prevailing narrative — "the substrate is plural, the protocols are public goods, the workflow layer is the moat, the rules layer is being codified" — this week added a fifth refinement: the substrate is being vertically integrated by hyperscalers at calendar-quarter speed. Indie founders have to choose which vertical stack(s) they ride inside, and which open-substrate alternative(s) they hedge against. Building outside both is the strategy that does not survive 2026.

Two angles for an indie founder

What an indie founder building on the substrate should do this week

  1. Treat your distribution thesis as routing across vertical stacks, not as direct sales. The Anthropic-ServiceNow announcement following the Microsoft Copilot default is the second event in a series. Three more partnerships of similar shape are coming inside Q1. The question to ask of every feature you ship: does this run as a Skill inside Build Agent / Copilot / Agent Cloud, or does it require the operator to leave that surface to use it? Surfaces inside the default ride compound; surfaces outside it do not.
  2. Build product economics around BYO-key + multi-vendor routing now. Maia 200 and Meta's capex guide together signal that 2026 inference cost-per-token will become non-uniform across providers, with hyperscaler-subsidized paths cheaper than direct API paths. The architecture that captures each downward repricing the day it ships is the architecture that wins on unit economics. The architecture that locks into a single provider's API pricing pays for the cost that hyperscalers absorb. Studio is built for this. Founders building anything similar should ship the routing layer this quarter.
  3. Hedge against the open-weights counter-current with a real adapter, not a TODO. SERA-32B reproducible at $400 is the kind of release that cumulatively lowers the floor of the substrate. Five more releases of this shape across 2026 — and there will be five — make the open-weights tier a structurally credible alternative. Founders who keep an open-weights adapter in their gateway as a real path, not as a backlog ticket, retain optionality the closed-stack lock-ins surrender. Cost: one adapter and a feature flag. Benefit: optionality through whichever scenario wins.

What this changes for DOS

Two design decisions hardened this week, both of them coming out of the vertical-integration framing rather than any single news item.

One. Studio's positioning sharpens from "credit-metered gateway over multiple substrates" to "the indie's defense against vertical-integration lock-in." The technical spec is identical. The story is sharper. The story matters because it determines what kind of operator the product is for and what alternatives they evaluate against. After this week, Studio is positioned against not "OpenRouter or X" but "the cost of being inside one hyperscaler's vertical stack with no exit."

Two. The Skills-as-distribution thesis from Week 11 accelerates. With ServiceNow announcing Build Agent integration, the addressable surface for a DOS-authored Skill expanded by tens of thousands of seats this week. The first DOS Skill — the Sentinel sketch — was supposed to ship this week. After Wednesday's Anthropic-ServiceNow announcement, it ships before Friday. The compounding cost of waiting is now visible in real partner-directory math.

That is the kind of forcing function the vertical-integration week is structurally designed to produce, and this year the forcing function is louder and less negotiable than I expected when I started this dispatch.

What I am watching for next week

The thread that runs through the week

Hyperscalers are vertically integrating the AI stack — chip, model, distribution channel, capex commitment — at calendar-quarter speed. Microsoft shipped the chip. Anthropic shipped the channel. Meta committed the capex. Ai2 shipped the open-reproducibility counter-current. The "horizontal AI" market that the prior eighteen months were organized around is closing this year.

For an indie founder, the playbook reduces. Route through someone's vertical stack via Skills, MCP, AGENTS.md. Build product economics around BYO-key multi-vendor routing. Hedge with a real open-weights adapter. Bet on the operator loop on top — context engineering, memory, convention catalogs, failure pipelines — because that is the only piece of the stack the hyperscalers cannot vertically integrate, since it is per-operator and not per-stack.

That is what Studio is, after this week. The framing sharpened. The architecture did not change. The market did the hardening on its own schedule.

— Lucas


Sources verified the week of Jan 26-30, 2026: Microsoft Maia 200 launch (Jan 26) · Microsoft FY26 Q2 results (Jan 28) · Ai2 Open Coding Agents / SERA-32B (Jan 27) · Anthropic × ServiceNow Build Agent default (Jan 28) · Meta Q4 2025 results + 2026 capex guide (Jan 28)

Was this page helpful?

The 27-week arc · A single body of work

Twenty-seven weeks. Two posts a week. Six months of writing while building.

Week

Tuesday evergreen

Friday dispatch