Pular para o conteúdo
DuranteDurante
ALL SYSTEMSGet Access

27 semanas · 54 textos · Escritos durante a construção

Notas de campo de um SO de IA pessoal em voo

Toda terça-feira, um ensaio perene sobre o que aprendi enquanto envio o DuranteOS. Toda sexta, um boletim da semana. Cerca de 108 mil palavras e contando — para construtores que preferem ver a fundação ser lançada a ler o press release.

Assinar · Ensaio de terça

Cerca de 3.800 construtores leem isso toda semana.

Antigravity, Gemini 3, Grok 4.1: The Week the Frontier Re-Rendered

In four days, the agentic-coding stack acquired two new harnesses (Antigravity, Grok Agent Tools) and one frontier-tier challenger (Gemini 3 Pro at half Sonnet's price). Anthropic answered not with a model but with $30B of Azure compute. The model layer is converging on parity. The differentiator is moving — visibly, this week — to the harness, the tool surface, and the compute contract.

If you blinked between Monday and Thursday this week, you missed an entire competitive re-rendering of the AI substrate.

On Monday Nov 17, xAI shipped Grok 4.1, taking #1 on LMArena Text Arena (1483 Elo) and beating its predecessor 64.78% in blind preference tests (xAI announcement). On Tuesday Nov 18, Google shipped Gemini 3 Pro simultaneously into Search, the Gemini app, AI Studio, Vertex AI, the Gemini CLI, and a new agent IDE called Antigravity — Google's first true "everywhere on day one" model launch (Google blog, Antigravity launch). Same day, Microsoft, NVIDIA, and Anthropic announced a strategic partnership in which Anthropic commits $30B of Azure spend, NVIDIA invests up to $10B, and Microsoft up to $5B — locking Claude as the only frontier model on AWS + Azure + GCP (Microsoft announcement). On Wednesday Nov 19, xAI followed with Grok 4.1 Fast plus an Agent Tools API — 2M-token context, server-side tools (web, X search, code exec, doc retrieval), aggressive pricing (xAI Fast announcement).

Four days. Two new agent harnesses. One frontier-tier challenger at half Sonnet's price. One $45B compute deal. The "wait for Anthropic to ship Opus 4.5" story everyone was tracking became, in retrospect, the least interesting story of the week.

The week's signal in one sentence

The model layer is converging on parity. The differentiator is moving — visibly, this week — to the harness layer, the tool surface, and the compute contract.

Antigravity is the warning shot

Of everything that happened this week, Google's Antigravity matters most for anyone building on top of frontier models.

Antigravity is a fork of VS Code optimized for agent-first workflows. What makes it strategically interesting is not the IDE itself — Cursor and Windsurf already exist — but that it ships with day-one support for Claude Sonnet 4.5 and OpenAI models alongside Gemini 3 Pro. Google is publicly saying: we will compete on the agent runtime, not on locking you to our model.

That is a structural admission identical to the one OpenAI made last week with GPT-5.1's apply_patch / shell tools and the named ecosystem co-developers. Two of the three frontier labs have now publicly admitted, in product launches, that the harness is the surface area they are competing on.

LabMove this weekStrategic admission
GoogleAntigravity ships with Claude + OpenAI as peer models"The harness is more important than the model exclusivity"
xAIGrok 4.1 Fast + Agent Tools API at 2M context"Long-horizon agents need server-side tools and we will price aggressively to win them"
Anthropic$30B Azure deal, no new model"We will compete on guaranteed compute and tri-cloud distribution, not weekly model drops"
OpenAI(no shipped news in window — still riding GPT-5.1 launch from prior week)(silent, by inference)

Antigravity is the warning shot specifically because Google had every incentive to make it Gemini-only. They chose multi-model. The reading is simple: even Google does not believe the model is the moat.

Gemini 3 Pro lands cheaper than Sonnet, matching it on coding

The numbers Google shipped Tuesday:

  • #1 on LMArena at 1501 Elo (above Grok 4.1's Monday score)
  • 76.2% on SWE-bench Verified (matching Sonnet 4.5)
  • $2/$12 per million tokens in/out under 200k context — undercutting Sonnet meaningfully

Simon Willison's same-day review is the most useful senior-engineer take I have read on it: "Gemini 2.5 upgraded to match the leading rival models." He transcribed a 3.5-hour council meeting for $1.42 — but with timestamp drift across the long context. A calibration shot for indie founders breathlessly retweeting benchmarks. The model is good. It is not magic.

For an indie founder building a personal AI OS, the price point matters more than the benchmark. Sonnet 4.5 has been the default substrate for serious agentic work since September. Gemini 3 Pro just made the pricing math real for "what if I run my agentic loop on Gemini for the parts where Claude's strengths are not load-bearing?"

That question did not have a serious answer last week. It does this week.

Anthropic's quiet move was the most strategic

Anthropic shipped no new model in the Nov 17-20 window. Opus 4.5 did not land until Nov 24, four days outside the window. What Anthropic shipped instead was the $30B Azure / Microsoft / NVIDIA partnership.

That is not a model launch. It is a distribution and compute lock-in play that compounds for the next 24+ months.

What the Microsoft + NVIDIA + Anthropic deal actually does

  1. Tri-cloud availability. Claude is now the only frontier model available on AWS, Azure, and GCP simultaneously. Enterprise procurement teams that were forced to pick one cloud just lost that constraint. That alone is worth multi-billions in expansion revenue.
  2. Compute floor through 2027. $30B of Azure spend + 1GW compute commitment + $10B from NVIDIA + $5B from MSFT means Anthropic has guaranteed inference capacity through the GPT-6 era. No cloud-driven supply crunch can starve them.
  3. De-risks the Claude bet. For an indie founder building on Claude Code (which is where I sit), this is the most important news of the week. The substrate I am committing to has just secured itself for the duration of my 12-month runway and beyond. I do not have to factor "what if Anthropic loses cloud capacity in mid-2026" into my planning anymore.

The deal will not generate headlines for another six months. It is, structurally, the more important news of the week.

What I am watching for the next week

What this changes for DOS

For the personal AI OS I am ten weeks into building, the week confirmed three planning assumptions:

  1. Substrate-portability is now a forced architectural choice, not a hopeful one. Antigravity's multi-model default is the same admission GPT-5.1's named co-developers made last week. Build the harness; let the model float underneath.
  2. The Anthropic bet is now safer than it was on Monday. The Azure deal is the most underrated news of the week. For DOS specifically — built on Claude Code — the substrate just secured itself through 2027.
  3. Pricing pressure on inference is real and accelerating. Gemini 3 Pro at half Sonnet's price for matching coding capability means the credit-metered gateway architecture I am about to build (routing inference across providers based on cost + capability) is going to matter sooner than I thought.

That last one shifts a build priority. The gateway was on my Q1 2026 roadmap. After this week I am moving it forward.

— Lucas


Sources verified the week of Nov 17-20, 2025: xAI Grok 4.1 launch (Nov 17) · Google Gemini 3 launch (Nov 18) · Antigravity announcement (Nov 18) · CNBC on Gemini 3 (Nov 18) · Microsoft + NVIDIA + Anthropic partnership (Nov 18) · Simon Willison on Gemini 3 (Nov 18) · xAI Grok 4.1 Fast + Agent Tools (Nov 19)

Was this page helpful?

O arco de 27 semanas · Um corpo de trabalho

Vinte e sete semanas. Dois textos por semana. Seis meses de escrita durante a construção.

Semana

Ensaio de terça

Boletim de sexta