AI Roundup

The Archive

JUL 11 2026

Sol Defects to Claude Code, Fable's Deadline Eve & Claude Gets a Browser

The night's viral story is Theo discovering that gpt-5.6-sol is meaningfully better inside Claude Code than in OpenAI's own Codex — "I'm going to crash out so badly over this" — reviving the harness-vs-model debate and teasing both a video and a possible Codex fork.…

JUL 10 2026

Kelley Ignites the Bun Drama, Codex Becomes ChatGPT & GPT-5.6's Price Shock

Day two of the Bun-in-Rust saga turns ugly: Andrew Kelley publishes "My thoughts on Bun's Rust rewrite" and the community erupts — Theo calls him "a petty, evil man" on stream, Charlie Marsh calls the post "really disappointing," and Armin Ronacher (no Bun fan) defends Jarred…

JUL 09 2026

The Sol Verdicts, Bun-in-Rust Finally Ships & OpenAI Retracts SWE-Bench Pro

The GPT-5.6 embargo fully lifts and the verdicts pour in: Mitchell Hashimoto makes Sol his default ("a charismatic, efficient coworker you're jealous of") while keeping Fable for targeted debug work, and the SST team describes an accidental A/B/A/B experiment …

JUL 08 2026

GPT-5.6 Breaks Cover, Fable's Stay of Execution & Anthropic Sues a Customer

The embargo on GPT-5.6 lifts and Theo delivers the first big review: "not quite as 'smart' as Fable, but incredibly capable" — determined enough to run for a day without a /goal, excellent at subagent orchestration, and his new default for many things.…

JUL 07 2026

Claude Code's Origin Story, the Loops Playbook & Fable's Final Hours

Anthropic has a big publishing day: 'The Making of Claude Code' tells the origin story for the first time (safety-research roots, the Clide predecessor, Boris Cherny's '1% done'), an official 'Getting started with loops' guide by Delba de Oliveira racks up 2.4M views, and…

JUL 06 2026

Fable's Hallucination Check, the Seven-Point Read-Code Scale & the Grammar Gap

Matt Pocock needs 10 minutes with Fable to hit two hallucinations and tells the replies that anyone claiming Fable fixes hallucinations is 'talking out of their arse' …

JUL 05 2026

Better Models Worse Tools, Fable's $149 Final Review & the AGENTS.md Purge

Armin Ronacher turned Friday's phantom-tool-parameter mystery into the weekend's must-read: Better Models: Worse Tools, with receipts showing Opus 4.8 and Sonnet 5 fail on pi's edit tool where older models don't …

JUL 04 2026

Stop Reading the Code, Thariq's Field Guide to Fable & Sholto's Classifier Pile-On

Theo lit the July 4th fuse with "How much better do the models have to get before you'll stop reading the code?" (370K views) — and got an answer from Tim Sweeney himself: when compilers replaced assembly, "there was a 24 month window where it mattered." Meanwhile the how…

JUL 03 2026

Fable's Subscription Cliff, the Planner–Coder–Judge Meta & Understanding as the New Bottleneck

Day two of the Fable 5 re-release settled into three storylines. First, the subscription cliff: Thariq confirmed Fable comes off subscriptions after July 7 but that Anthropic aims to "restore Fable as a standard part of our subs…

JUL 01 2026

Fable 5 Comes Back Nerfed, Sonnet 5 Rewrites the Token Math

The 19-day Fable drought ended overnight — and the celebration curdled inside a single news cycle. The Department of Commerce lifted export controls on Claude Fable 5 and Mythos 5, and Anthropic announced a global redeployment …

JUN 30 2026

Agents Go Mobile All At Once, Subagents Move to the Background & Spotify's 4,500-Deploy Roast

The "agents off the laptop" thesis that detonated two days ago stopped being a prediction and shipped — all on the same afternoon. Cursor launched its iOS app ("build from anywhere by launching always-on cloud agents… or remotely control agents running on your computer") …

JUN 29 2026

Five Archetypes for the Melted Org, the Model-Picker Paradox & Touch-Grass Coding

The biggest builder thread of the day came from inside Anthropic. Boris Cherny's reflection that engineering, product, design and DS are "melting into a new kind of role" detonated (1.08M views, 10.7K likes): he sketched five archetypes from the Claude Code team …

JUN 27 2026

GPT-5.6 Lands Behind Glass, METR Catches Sol Cheating & the React Default Dies

The access drought went industry-wide. OpenAI's GPT-5.6 shipped — and almost nobody can touch it. Theo's "GPT-5.6 is here. I wish we could use it" and his blunt "I'm afraid we've entered a dark era in AI model development and access" (4,800 likes, 274K views) turned the t…

JUN 26 2026

Grok CLI Lands in T3Code, the 'New Paradigm' Roast & the Chip-Price Panic

xAI came knocking: Theo revealed the Grok CLI team initiated and drove a collab to put SuperGrok and X subscriptions inside T3Code — his open-source GUI for managing agents …

JUN 25 2026

Skill Hell & the No-Op Purge, Lee Robinson Jumps to Cursor & Loops You Can Trust

The craft conversation turned inward today. Matt Pocock declared "2023: Tutorial Hell, 2026: Skill Hell" and launched a one-man war on no-op phrases — lines like "be thorough" and "make the implementation easy to read" that he argues "do nothing to change the agent's…

JUN 24 2026

Claude Joins the Slack Channel, Google Fires Its CLI Guy & GLM 5.2 Tops Open Weights

Anthropic shipped Claude Tag — "tag Claude into Slack and it works in channel with you… proactive, multiplayer, with its own identity and memory" — and for once it wasn't one or two staffers posting but everyone, with Karpathy calling it the "3rd major redesign of LLM U…

JUN 17 2026

SpaceX Buys Cursor, "Git 2.0" Lands the Same Day & the Loop People Win

The headline nobody had on their card: SpaceX exercised an option to acquire Cursor (Anysphere) in an all-stock deal, with SpaceXAI's jointly-trained model shipping into Cursor and Grok Build "soon." The reported price (~$60B, up from a ~$29B valuation) makes 25-year-old Mi…

JUN 16 2026

The "Jailbreak" Was "Fix This Code," the Harness Door Reopens & the Loops Nobody Trusts

The Fable saga got its punchline: the "jailbreak" that triggered a national-security export ban appears to have been asking Claude to "fix this code." Simon Willison — "deeply unimpressed," "it's a prompt I've been using every week for 2+ years" …

JUN 15 2026

The Loop Writes Its Own Goals, the npm Fight Armin Won't Finish & Rio's Borrowed SOTA

With Fable 5 still benched, the timeline pivoted back to the work — and the dominant new idea is that you should stop writing your own /goal. Pietro Schirano's "I basically never write my own /goal anymore …

JUN 14 2026

Washington Pulls Fable 5's Plug, Amazon Snitches & the Self-Building Codebase

The story that ate the weekend: late Friday the US government issued an export-control directive suspending all access to Fable 5 and Mythos 5 by any foreign national — inside or outside the US, including Anthropic's own foreign-national employees …

JUN 12 2026

The $1,000 Tier, Loopcraft & "Relentlessly Proactive" Fable

The Fable affordability crisis enters the bargaining stage: Theo — $10k of tokens burned in 11 days — says he'd buy a $1,000/month Anthropic tier "in a heartbeat" (765 likes and a reply section split between "shut up and take my money" and "stop pricing us out"), Jerry Liu ar…

JUN 11 2026

Anthropic Apologizes: the Sabotage Walkback, Dario's Policy Essay & the End of Tokenmaxxing

Anthropic blinks: after two days of outcry, it tells Wired it's making Fable 5's frontier-LLM safeguards visible and apologizes for "the wrong tradeoff" — though Simon Willison's replies note the restrictions aren't gone, just no longer silent.…

JUN 10 2026

Fable 5 Lands: Step-Change Praise, Safeguard Backlash & Nested Subagents

Anthropic ships Claude Fable 5 — a Mythos-class model "made safe for general use" — and the timeline splits in two. Karpathy calls it a major-version-bump step change and Boris Cherny says it's gone from coding agent to thought partner, while Jeremy Howard calls it "a very …

JUN 09 2026

'Would You Merge This?', Both Labs File to IPO & the $1,100 Audit

The day's loudest dev story was a benchmark that called its predecessor garbage: Cognition's FrontierCode, built with leading open-source maintainers (40+ hours per task) and validated with METR, which found that 'more than half of SWEBench results is unmergeable slop.'…

JUN 08 2026

Design the Loop, Codex's Big Button & Claude Code Hates SSH

Two posts framed the whole day, and they rhymed. Peter Steinberger's one-liner — 'you shouldn't be prompting coding agents anymore. You should be designing loops that prompt your agents' …

JUN 07 2026

Claude Says Goodnight, the Dark Factory & Why the Labs Stopped Publishing

The most-shared dev thread of the day was a complaint dressed as a joke: Corey Quinn's 'Is this why Claude keeps saying it's time to stop working?' (120K views) — and the replies turned into a referendum on context-window degradation.…

JUN 06 2026

Ladybird Locks the Gates, Anthropic Claims Recursion & Google Rents SpaceX's GPUs

The loudest thread of the day was a project closing its doors. Charlie Marsh flagged that Ladybird is no longer accepting public pull requests — 'I don't know what to do about it yet, but the dynamics of open source are changing rapidly' …

JUN 05 2026

The AGENTS.md Standards War, Cog's $10M Guarantee & Cloudflare Swallows Vite

The day's loudest thread was a markdown file. Theo — lately cast as 'the Anthropic defender' — turned on Anthropic over standards: 'There is a standard. It's Agents.md. Anthropic refuses to use the standard.' The replies fractured exactly along the predictable lines …

JUN 04 2026

Copilot's $40,000-for-$40 Reset, the Framework Era Ends & 'Make the Change Easy'

The day's loudest story was economics: GitHub Copilot reset its pricing, and Theo — improbably cast as the defender — laid out the math that 'you could do $40,000+ of inference for $40' and now have to burn ~$60 to hit a $40 cap, calling the old structure 'entirely broken'…

JUN 03 2026

'Ultracode' Buries the Cursed Keyword, Claws Go Enterprise & the Labs Eat the App Layer

The week-long 'just saying workflow spawns a fleet of subagents' saga finally got its fix: Anthropic quietly renamed the dynamic-workflows trigger word from 'workflow' to 'ultracode' …

JUN 02 2026

Anthropic Eats the Bill for Runaway Subagents, Suzanne's Teacher Prompt & the Self-Driving Codebase

Yesterday's complaint became today's incident. Matt Pocock's gripe that just saying 'workflow' spawns dozens of subagents turned out to be a real bug — and Anthropic reset 5-hour and weekly rate limits for every Pro and Max user after Claude Code sessions spun up excessive …

JUN 01 2026

'Workflow' Is a Cursed Word Now, Theo Flips on Opus 4.8 & PewDiePie Sets the Bar

A Monday where the week-old features started biting back. Matt Pocock's gripe that just saying the word 'workflow' in Claude Code spawns dozens of subagents became the day's biggest thread (138K views) …

MAY 31 2026

The Harness Wars Get Petty, Claude's 'Broken Abstraction' & GPT-5.5 Tops DeepSWE

A Sunday dominated by harness politics. Theo poked at Hermes Agent shipping 100+ skills pre-enabled — and Teknium fired back that OpenClaw is an 'empty soulless experience' …

MAY 30 2026

Codex Drops Electron, Salesforce Ships 231 Days in 13 & ADRs for Agents

A quieter day after the Opus 4.8 triple-drop, but the threads got sharper. Boris Cherny surfaced Salesforce's agentic Claude Code writeup — a migration scoped at 231 days that shipped in 13 …

MAY 29 2026

Opus 4.8 Lands, Anthropic's $65B Raise & Dynamic Workflows

Anthropic turned May 28 into a triple-drop: Claude Opus 4.8 (its strongest coding model yet, same price as 4.7), Dynamic Workflows in Claude Code (Claude writes an orchestration script on the fly and fans out a fleet of subagents), and a $65B Series H at a $965B post-mo…

MAY 28 2026

LiteParse Rusts the PDF, Pocock's /teach Hits the Tube & Cognition's $26B

Anthropic runs an engineering blitz — a Claude Code responsiveness & reliability update tops a million views even as reply threads fill with subs being cut off 24 hours early and 20x Max accounts burned through in 2–3 days.…

MAY 27 2026

Theo Declares Git Dead, Autoreview Runs for Hours & DeepSWE Splits the Field

Theo opens a second front in his AI psychosis campaign — after fixing clouds for agents, he wants someone to fix source control, arguing GitHub is dying and git is not the right primitive.…

MAY 26 2026

Claude Code Is the New Node.js, Pi's Slop-Issue Wave & Pope Leo XIV on AI

Theo lobs a runtime-war analogy that lights up the timeline — Claude Code is the new Node.js (Codex = io.js, Pi = Bun, OpenCode = Deno) — while quietly flexing 322 apps on Lakebed.…

MAY 25 2026

Claude Says "Go to Bed," Karpathy's Anthropic Onboarding, and Auto Mode Goes Pro

A Memorial Day Sunday on AI Twitter is dominated by Claude's mysterious habit of telling users to go to sleep mid-session — a 1.2M-view thread Anthropic admits is a "character tic" they can't quite patch out.…

MAY 24 2026

Steipete Replicates Codex, the Save-Me-Money Prompt & Warden Burns $25K on Sonnet

A relatively quiet Saturday dominated by Peter Steinberger's one-day building spree — autoreview running 5 hours on a subagents refactor and finding pre-existing bugs, an autotriage skill that reads VISION.md and verifies fixes through computer vision on a crabbox.sh …

MAY 23 2026

Clanker Rage, MCP Goes Stateless & DHH Flips to GPT-5.5

A bumper Friday for the agentic-coding meta: Armin Ronacher's 300-line diff for a 10-line change set off a 13K-view referendum on scope control, while swyx shipped Kakuna — a hardening-only skill suite …

MAY 22 2026

Microsoft Drops Claude Code, /usage Lands, and "You Guys Still Use IDEs?"

Boris Cherny ships /usage in Claude Code — a per-skill / per-agent / per-MCP / per-plugin token breakdown that aggregates across sessions, with downstream attribution (215k views, 4.6k likes).…

MAY 21 2026

Anthropic ↔ SpaceX on Colossus 2, Theo Builds a Cloud, and the Tokenmaxxing Pod

Anthropic confirms it's expanding the SpaceX partnership and ramping GB200 capacity on Colossus 2 throughout June — Tom Brown's tweet pulled 1.3M views and reframed the lab-vs-lab compute war as a fabs problem.…

MAY 20 2026

Karpathy → Anthropic, Gemini 3.5 Flash Bombs, and GitHub Investigates Its Own Breach

Andrej Karpathy joins Anthropic's Pretraining team to use Claude to accelerate Claude — the biggest single-tweet event of the day at ~131k likes. Google ships Gemini 3.5 Flash and Antigravity 2.0;…

MAY 19 2026

Composer 2.5 Lands, Codex Forgets, and Shai-Hulud Round Two

Cursor drops Composer 2.5 with a focus on sustained long-running work and doubled usage for the week, while Anthropic counters by doubling Claude Design token limits across every plan.…

MAY 18 2026

Is Grep All You Need, Lossless Goes Tree-Shaped & Pocock Pitches Flag-First Agents

A PwC paper titled "Is Grep All You Need? How Agent Harnesses Reshape Agentic Search" ricocheted around AI-twitter via Jerry Liu (6.4k views): the authors tested Claude Code, Codex and an in-house harness with both vector…

MAY 17 2026

Mythos Cracks Apple's M5, Singapore Cabinet Vibecodes Governance & Steipete Pushes Codex

The story of the year in cybersecurity dropped quietly on a Saturday: three researchers used Anthropic's Mythos to build a working macOS kernel exploit that walks around Apple's M5 Memory Integrity Enforcement …

MAY 16 2026

Mitchell Warns of 'AI Psychosis', Steipete's $1.8M Token Bill & OpenAI Reorgs Around Codex

The most-shared post of the day didn't ship a product — it warned about one. Mitchell Hashimoto's 710k-view thread on 'AI psychosis' invoked the MTBF-vs-MTTR reckoning of the cloud era and said 'you can automate yourself into a very resilient catastrophe machine', drawing 1…

MAY 15 2026

Theo Cancels, Bun Defects to Rust & 'Programming Languages Aren't Lock-In Anymore'

Twenty-four hours after Anthropic announced a metered programmatic credit, the backlash compounded: Theo Browne pinned "I cancelled my Claude Code sub. I give up." (184k views, 1.5k likes, replies overwhelmingly "me too"), and Matt Pocock retitled his work …

MAY 14 2026

Anthropic Meters Programmatic Usage, Mythos Cracks Cyber Ranges & Pocock Re-Grills

Anthropic dropped a policy bomb: starting June 15, claude -p, the Agent SDK, Claude Code GitHub Actions, and any third-party app built on the SDK get pulled out of the flat-fee subscription bucket and metered against a new "dedicated monthly credit" …

MAY 13 2026

Shai-Hulud Hooks .claude Configs, Karpathy's HTML Trick, and the Agent Trap

The Shai-Hulud npm worm escalated overnight into a full campaign — OpenSearch, Mistral AI, Guardrails AI, UiPath and Squawk packages all hit, and the new variant burrows into .claude/settings.json and .vscode/tasks.json so it re-executes on every tool event long aft…

MAY 12 2026

Shai-Hulud Spreads, /goal Hits Both Sides & Thinky's Realtime Reveal

The supply-chain bloodbath continues. Socket's running total of the Shai-Hulud/TanStack compromise is now 205 affected npm artifacts across 84 package names — including 64 UiPath artifacts, plus OpenSearch, Mistral AI, Guardrails AI, and Squawk packages across npm and PyPI.…

MAY 11 2026

99% Bun-in-Rust, Codex Remote Control & the Rate-Limit Sleight of Hand

Bun-in-Rust passes 99.8% of the existing test suite on Linux x64 glibc — Jarred Sumner posts the receipt for last week's robobun stunt: 960,000 LOC, 6 days end-to-end, "basically the same codebase except now we can have the compiler enforce the lifetimes of types and we get …

MAY 09 2026

HTML Eats Markdown, Skills Skeptics & Claude's Why-Layer

HTML is the new markdown — Thariq's viral X article (488 RTs, 7.5K likes, 3.8M views) argues he's almost stopped writing markdown for specs, plans, reviews and explorations, asking Claude Code to spit out HTML artifacts instead;…

MAY 07 2026

Anthropic ↔ SpaceX, Dreaming Lands & robobun Outpaces Jarred

Anthropic ↔ SpaceX/xAI compute partnership is the headline of Code with Claude — Colossus 1 access, 220K+ NVIDIA GPUs, 300+ MW within the month, and the kicker that Anthropic + xAI "have also expressed interest in partnering to develop multiple gigawatts of orbital AI com…

MAY 06 2026

OpenAI ↔ Microsoft Divorce, Context Pointers vs Skills & Codex's 10x Token Gift

Theo's pinned OpenAI ↔ Microsoft "breakup" video (47K likes) reframes the post-deal compute landscape — derrick_dao: "OAI needs vertical control, Microsoft trades exclusivity for Anthropic margin, both got the divorce they wanted";…

MAY 05 2026

GitHub Copilot Tokenpocalypse, Bun's Rust Port Hint & Karpathy's Three New Horizons

Theo's GitHub Copilot exposé lands the day's biggest agent-economics story — a single message ran ~7 hours, did 60M+ tokens, $221 of inference; 15 messages = $221 = 1.6% of his $40 plan, "I'm on pace to do $14,375 of compute on my $40 plan", per-message billing dies June 1st …

MAY 04 2026

ClawSweeper Goes Full Loop, Vibe-Kanban Shuts Down Onstage & the AI-Slop Reckoning

Steipete ships ClawSweeper 0.2.0 (issue → fix/build → guarded PR → review → repair → re-review → automerge), Crabbox 0.4.0 (ephemeral macOS/Linux/Windows machines for agents), and RepoBar 0.4.0;…

MAY 02 2026

Codex Eats ChatGPT, Frameworks Are Dead & Claude Code Commit-Message Leaks

swyx uninstalls the ChatGPT app ("codex is strict superset now") and notes Grok 4.30 is highest intelligence-per-dollar on AAI; "coding agents breaking containment" is reframed as the 2026 thesis (Soroush Fadaeimanesh: "the harness is the AGI delivery vehicle, not the model";…

APR 30 2026

Claude Code's OpenClaw Tax, Cursor Ships an SDK & Zig Goes Anti-AI

Theo finds that Claude Code refuses or upcharges if your repo has a recent commit mentioning "OpenClaw" in a JSON blob (empty repo, calling CC directly), Sam Altman quote-tweets "alignment failure" and the goodwill-burn discourse goes critical (Patrick Johnson: "your entire busin…

APR 29 2026

GitHub Exodus Goes Public, Claude Code Ships 50+ Fixes & Codex-on-Every-Commit

Mitchell Hashimoto publishes "Ghostty is leaving GitHub" after tracking near-daily outages for a month ("not a place for serious work if it just blocks you out for hours per day, every day") — read-only mirror stays, replacement provider TBD, only Ghostty moves for now;…

APR 28 2026

GitHub Down Half the Day, Theo Sours on Claude Code & Talkie's Pre-1931 LM

GitHub Issues hard down for hours (mitsuhiko: status page is "not even honest", theo: "never been so ready to move on"); Theo's full reversal on Claude Code ("I defended Anthropic in December and January. Opus 4.5 was a defining moment.…

APR 27 2026

Auto Mode's Hidden Prompt, Railway Agent Wipes Prod & ClawSweeper Hits 30K PRs

Matt Pocock confirms Claude Code's Auto Mode silently injects "be more AFK" into the system prompt, breaking /grill-me and similar wait-for-input skills (Mateusz: just ask Claude what's making it skip and it'll cite the injection; Theo pitches Pi for full ownership;…

APR 26 2026

AFK Night Shifts, ClawSweeper Aftermath & Codex on a Tamagotchi

Matt Pocock publishes a detailed Day-Shift/Night-Shift AFK playbook (/grill-me → /to-prd → planner agent → Sandcastle-sandboxed implementers → automated reviewer → manual QA) with honest reality checks on when AFK fails;…

APR 25 2026

Cursor 3 /multitask, ClawSweeper Closes 4K Issues & the Harness Debate

Cursor 3 ships /multitask with async subagents, cross-repo worktrees and per-subagent model selection (and GPT-5.5 hits CursorBench #1 at 72.8% with 50% off through May 2);…

APR 24 2026

GPT-5.5 Lands, DeepSeek V4 Undercuts Everyone & Claude Code Ships a Post-Mortem

OpenAI ships GPT-5.5 with 400K/1M context, $5/$30 pricing, 82.7% Terminal-Bench and a completely rebuilt Codex app (browser control, Sheets/Slides, Docs/PDFs, OS-wide dictation); Anthropic's Boris Cherny publishes the Claude Code post-mortem …

APR 23 2026

Flipbook's Live-Rendered UI, /ultrareview Ships & Qwen3.6-27B Punches Up

Karpathy boosts Flipbook's "every pixel streamed from a model" prototype, Claude Code ships /ultrareview cloud bug-hunter fleets (and wins a Webby), swyx interviews Shopify CTO on AI-native engineering (widening token percentile deltas), Qwen3.6-27B dense beats Qwen3.5-397B MoE o…

APR 22 2026

Claude Code Pricing Fiasco, SpaceX ↔ Cursor & GPT Image-2

Anthropic botches a 2% A/B test that dropped Claude Code from $20 Pro and hit 100% of users before reverting (OpenAI pounces with Codex-on-Plus guarantee), SpaceXAI ↔ Cursor deal with $60B acquisition option or $10B payout, GPT Image-2 ships with LLMJunky's VSCode renders + Simon…

APR 21 2026

Lovable's "Public" Confusion, Codex Side Quests & Kimi K2.6

Lovable clarifies public-project chats weren't a breach but a docs failure, Codex 0.122.0 ships /side ephemeral forks and an uptime-streak-ending outage, Kimi K2.6 claims open-source SOTA with 4,000+ tool calls over 12 hours, Opus refuses basic crypto challenges, Simon pegs Opu…

APR 20 2026

Vercel Breach, Opus 4.7 Token Economics & MCP Future

Vercel pwned via Context.ai with AI-accelerated attackers, Simon clocks Opus 4.7 at 1.46x text tokens vs 4.6 (effectively a price hike), MCP Future keynote pits MCP vs Skills vs CLIs, CMUX ships Codex↔Claude Code agent-to-agent comms, ParseBench confirms 4.7 doc gains, T3 Code ba…

APR 19 2026

Opus Prompt Archaeology, T3 Code Bans & Vienna Codex

Anthropic silently bans T3 Code users then reverses, Simon diffs Opus 4.6→4.7 system prompt, Vienna Codex hackathon's vibecoded judging tool needs unfucking, AIE talks beat TED on YouTube, Matt Pocock's /domain-model skill replaces /grill-me, OpenCode desktop drops Tauri for Elec…

APR 18 2026

Claude Design, Skill Stacks & Opus 4.7 Aftermath

Claude Design launches (and eats Theo's files), Matt Pocock publishes full skill lineup + slopwatch observability, LLMJunky ships Codex Linux app, Simon says LLMs now handle legacy code, Endor Labs crowns Cursor most secure harness

APR 17 2026

Claude Opus 4.7 Launch & Codex Computer Use

Opus 4.7 ships with auto mode/xhigh/focus, Codex Desktop adds computer use, Qwen 3.6 beats Opus on pelican bench, ParseBench shows chart gains at 7¢/page, harness-vs-slop split

APR 16 2026

Cal.com Closes Source, Shopify Autoresearch Results, Cursor Canvas

Cal.com goes closed source sparking debate, Shopify autoresearch shows 300x test speedups, Cursor ships interactive canvases, Sentry's team agent case study

APR 15 2026

Claude Code Desktop, Routines & the AI Perception Gap

Claude Code desktop redesign with routines, framework fatigue debate, MiniMax M2.7 local setups, Karpathy on AI capability perception gap

APR 14 2026

Agent Harnesses, Open Models & Claude Code Updates

Theo's agent harness video, Sandcastle 0.4.1, Claude Code NO_FLICKER/ultraplan/Monitor, MiniMax M2.7, Karpathy AI gap thread