OpenAI ↔ Microsoft Divorce, Context Pointers vs Skills & Codex's 10x Token Gift

Theo's "OpenAI and Microsoft broke up" deep-dive

The day's framing tweet in agent business news is Theo's pinned video about the OpenAI ↔ Microsoft contract restructuring. The video has hit 47K likes / 419 RTs in 24 hours and the thread underneath has the cleanest collection of takes on the post-breakup compute landscape:

  • "OpenAI and Microsoft broke up. The impact of this is massive and I don't think enough people understand why. Put a lot of time into tracking the history of the deal to help you all understand 🫡" — https://x.com/theo/status/2051535315287060790

Highest-signal replies:

  • TFisPython: "Azure distribution was the moat most people weren't counting. The builders who routed openai APIs through azure now have a very different conversation with their finance team."
  • derrick_dao: "The split was inevitable once Azure stopped being the cheapest place to run inference. OAI needs vertical control — custom silicon, owned data centers, direct power deals — the playbook Google ran with TPUs in 2017. Microsoft trades exclusivity for Anthropic margin. Both got the divorce they wanted."
  • Tyler Gardner: "The real killer: Azure credits were OpenAI's oxygen. Without Microsoft's infra subsidy, each GPT-4 query costs ~$0.03. That math doesn't work at consumer prices."
  • tonitrades_: "Microsoft still owns 49% of OpenAI profits. The breakup just ended compute exclusivity. More access, not less control."
  • bettercallsalva: "The deal history matters because model access is also distribution power. I'd watch who controls defaults inside Office, Azure, and enterprise procurement."

Companion data point from yesterday's swyx chart (recirculating in this thread): OAI ~850B valuation / ~30B ARR; Anthropic ~900B / ~44B ARR. Azure-credits-as-oxygen is increasingly the framing of choice for why Anthropic + frontier labs need their own substrate.

Code with Claude is tomorrow — bcherny + Thariq tease announcements

Both bcherny and Thariq Shihipar (trq212) are flying into Anthropic's developer conference Code with Claude. Posts from late tonight:

Worth tuning the next 48 hours of feeds to this — historically when Thariq says "fun announcements" it's something like the no-flicker renderer or /schedule once.

Mattpocock's "Context Pointers" — the AGENTS.md design pattern

Matt Pocock's Context Pointers post is the most-quoted concept of the day in the AI-coding community (~507 likes, 18 RTs and growing rapidly). The thesis: AGENTS.md should not be a long document — it should be an index of links to other documents.

"Context Pointers are a crucial concept for helping AI navigate your codebase. They're the links you put in documents to link to others, so that AI can find its way around without being overwhelmed.

'Our AGENTS.md used to be huge, now it's mostly context pointers.'

'These modules are easy to navigate: that shared function acts as a context pointer.'

'This is how skills work. The harness finds them, pulls their description into context, and adds a context pointer to the main SKILL.md file.'" — https://x.com/mattpocockuk/status/2051587758238371858

Mattpocock's followup (when asked submodules-with-pointers vs many flat skills): "I would picture the paths you want your agent to take to find information, and see how many pointers they need to traverse to get there."

Field reports & extensions in the replies:

  • Phoenix: "been doing this for months now. our AGENTS.md went from 2000 lines to ~200 with context pointers to specific module docs. the trick nobody mentions: pointer depth matters. too many hops and the agent burns tokens traversing. we cap at 2 levels deep."
  • Boran Cakir: "I had pointers that looked self-explanatory so the agent never bothered opening them. Logs looked fine, output drifted. Prompting helped a bit, forcing retrieval at the harness was the only thing that actually held up."
  • Aleksandr Fulha: "this only works if the harness resolves the pointer automatically. if the model has to chase the link itself you're back to instruction-and-hope. half ours were proud naming convention until we caught claude guessing what was on the other side instead of opening the file."
  • adi: "one test i do is that all my UPPERCASE.md files are reachable via links from AGENTS.md."
  • Mia: "Context pointers feel like the grown-up version of 'just dump the whole repo in the prompt'. The useful bit isn't compression on its own, it's preserving navigation. If the model can follow the map when it gets lost, you need far less brittle up-front context."

Counter-data — Vercel's "AGENTS.md outperforms skills in our agent evals" post (linked by Jeremy Mack): a compressed 8KB docs index in AGENTS.md achieved 100% on Next.js 16 API evals; skills maxed at 79%. Read: https://vercel.com/blog/agents-md-outperforms-skills (URL inferred from quote — verify before sharing).

Adjacent: mattpocock is also previewing a /prototype skill ("builds a TUI to help you shortcut through state transitions, SO much better than a spec for providing fine-grained feedback") — https://x.com/mattpocockuk/status/2051914199081652641 — and he's about to ship an AI Coding Dictionary (https://x.com/mattpocockuk/status/2051709391355683062). Repo trajectory: mattpocock/skills is at 60K stars vs garrytan/gstack 90K — "I'm coming for you Garry."

Codex's 5/5 party gift: rate limits 10x'd

Following yesterday's confirmation, the OpenAI 5/5 Party gift dropped today: Codex rate limits 10x'd for everyone who signed up to the in-person event. The party math is now visible:

  • LLMJunky: "Codex rate limits 10x'd as a gift to the 5th of next month for everyone who signed up for the 5/5 party. This is madness. @sama you crazy bro ✌️" — https://x.com/LLMJunky/status/2051465480620089456
  • LLMJunky again, salty (he didn't sign up because he didn't realize OpenAI was paying $900 flights + hotels): "To everyone posting pictures with @sama and team at the OpenAI 5/5 Party, just know I hope you get a SEVERE 3rd degree papercut on your way home." (and: "yeah i'm salty like that. hope it happens on the same finger you press 'Accept Edits' with too") — https://x.com/LLMJunky/status/2051870592282894416

The Aether Oracle sub-thread is the most interesting commentary on what this means for the comparative pricing landscape: "That's what happens when two things are similar in quality but one is 10x more expensive." TickerTrends: "Recent limits on Codex seem to be leaps above Claude Code."

Codex vs Claude Code download numbers — Sam Altman objects

The TickerTrends "Codex 46.0M weekly vs Claude Code 491K weekly npm downloads" chart from May 4 is still being chewed on. Sam Altman replied directly:

"codex is doing great but this is not possibly accurate" — https://x.com/sama/status/...

Key debunks in the replies:

  • paradite_ (Zhu Liang): "1. npm is no longer an official download channel for claude code. it has been deprecated for months. 2. 46M codex download from npm is unlikely organic. probably some kind of misconfiguration or spam. (it is 130m per week now, which absurd)"
  • morqon: "codex is approaching 5M users, not 46M, something needs double-checking" — TickerTrends concedes it's "developer npm package downloads, a different metric than DAU."
  • Alex_Rogov_js: "Downloads ≠ adoption. Codex benefits from VS Code's 40M install base — it gets bundled with existing workflows. 491K intentional installs vs 46M passive ones isn't the same metric."

Mitsuhiko's alternative explanation, posted today, is the most useful thing here:

"I really think 'clanker/CI gone wild' explains this more than 'they all decided to go to codex at the same time'. Or something started depending on it." — https://x.com/mitsuhiko/status/2051673494971175175

Jai Jalan extends: "The CI attribution problem is the hard part. When agents are the ones committing and retriggering pipelines, your standard 'what changed' query returns hundreds of commits with no human intent behind them. Diagnosing root cause means tracing agent behavior, not code diffs."

This is the first time a "your CI is being run by agents" hypothesis has been used to explain a public download chart — worth flagging as a new failure mode for npm-download-as-adoption metrics.

Steipete's CLI shipping rampage — 10 .sh tools in a day

Peter Steinberger ships at terminal velocity again. Tonight's drop is a single tweet listing ten .sh CLIs built with Codex, plus a fs-safe primitive and CodexBar 0.24:

The deepest reply in the CLI thread is from kuma 18: "This is exactly the kind of tiny CLI stack that becomes useful once agents need real local handles." And from Jehangeer H: *"Feels less like 'a bunch of tools' and more like an orchestration layer over half the internet's communication + archives. MCP-to-CLI in particular is doing a lot of heavy lifting there."*

Adjacent: kiosa sums up the meta-narrative perfectly: "Anthropic banned it, he hired a team, and now Codex is writing CLI wrappers for literally everything, the lobster army is real."

Simon Willison: "AI started a cafe in Stockholm — please keep humans in the loop"

Simon's must-read post of the day is on Andon Labs' new AI-run business experiment — a cafe in Stockholm (sequel to last year's San Francisco AI-run retail store). His framing:

"AI-run business experiments are interesting and fun up to the point where they waste the time of humans who haven't opted into the experiments — I think they need to keep their own human operators in the loop for outbound actions that affect other people." — https://x.com/simonw/status/2051788176071745592 Post: https://simonwillison.net/2026/May/5/our-ai-started-a-cafe-in-stockholm/

Top reply, from Virgil: *"yes, and 'human in the loop' can still be fake consent. if the human on the other end never opted in, you've just outsourced your product testing to strangers. a lot of agentic outbound feels less like automation and more like conscription."*

Mian Shafiq's field report: "We hit the exact same wall building autonomous WhatsApp outreach. The moment we added a 'human-approves-first-message' rule for new contacts, complaints dropped to near zero. Consent changes more than you'd expect — even response rates went up."

Zara's framing of the asymmetry: "when an agent acts on N humans, even a 1% screwup rate becomes someone's bad day. Outbound-action thresholds should be 10x stricter than internal-tooling thresholds, and most 'look an agent ran my business' demos blur the two."

Bun Zig→Rust port: experimental branch, not a switch (confirmed)

Following yesterday's docs/PORTING.md discovery, Simon Willison confirmed the scope:

"Confirmed that this is an experimental branch, not a decision to switch languages." — https://x.com/simonw/status/2051611966683463764

Mitsuhiko's commentary today:

"The combination of this, and Bun doing a clanker supported rewrite to Rust is at the very least a damn interesting experiment. I'm not ready for that future, but I'm glad someone is pushing the boundaries." — https://x.com/mitsuhiko/status/2051569924850470962

So: Bun is exploring a Rust port to better support coding-agent contributors, but is not deciding to migrate. Robobun remains the top contributor.

Cofounder 2: "Run an entire company with agents"

Cofounder 2 by @ndrewpignanelli launched today. Boosted hard by Jerry Liu (LlamaIndex CEO):

"Run an entire company with agents 🔥 It's always awesome to see companies continuing to innovate on AI-native UI/UX, particularly around multi-agent coordination, to solve deeply complex tasks beyond what a single user can easily define via a chat interface." — https://x.com/jerryjliu0/status/2051783364555034951

The pitch: "Infrastructure for the one-person billion-dollar company — orchestrating agents across engineering, sales, marketing, ops, and design." Read: https://x.com/ndrewpignanelli/status/2051323944452411773

Worth pairing with Simon's piece above as the policy/product split this week: one camp shipping agent-runs-the-company products, another camp asking what about the humans on the receiving end of those agents.

Mitsuhiko: "Why does everyone want managers to be ICs?"

The day's hottest meta-thread on engineering org dynamics in the AI era. Two posts:

This is the cleanest pushback I've seen this month against the "everyone is a developer now" pitch. Pair with mattpocock's "AI negligence" thread from May 3 — the bottom-up team complaint is converging.

Quick hits

Off-topic but funny

  • All of .de went offline today for several hours. Mitsuhiko: *"How the hell do you take all of .de offline? Deutschland hat fertig."* and: "It's actually kinda funny how little I notice because very little of value or importance actually uses a .de domain."https://x.com/mitsuhiko/status/2051765430809719181
  • Theo's vesting cliff PSA: he watched "the vast majority of reply guys" misuse the term in a thread on equity, dropped a clarification thread (vesting cliff = time until you start vesting, fully vested = time until you're done; not the cliff that "ends your salary"). It's funnier than it should be. — https://x.com/theo/status/2051546385946947823