Everything That Happened in AI Today (Monday, May 4, 2026)

Around the Horn Digest: Everything That Happened in AI Today (Monday, May 4, 2026)

The White House started considering pre-release vetting of AI models; Anthropic and OpenAI both married private equity on the same day; Anthropic CCO Paul Smith launched a separate $1.5B mid-market spinout; DeepSeek's V4-Pro became the first Chinese model at frontier parity; Mayo Clinic's AI spotted pancreatic cancer up to three years before diagnosis.

Written By
Grant Harvey
Grant Harvey
May 5, 2026
27 minute read

Around the Horn Digest: Everything That Happened in AI Today (Monday, May 4, 2026)

Anthropic and OpenAI both hooked up with private equity on the same day, the White House started considering pre-release vetting of AI models, Anthropic's CCO Paul Smith spun out a separate $1.5B company to scale Claude to mid-size businesses, Mayo Clinic's AI spotted pancreatic cancer up to three years before diagnosis, and a Palantir-linked super PAC was caught paying TikTok influencers to fear-monger about Chinese AI.

Welcome to the Around the Horn Digest, your daily readout of every AI story worth knowing about. Today was the day enterprise AI grew up: Anthropic and OpenAI both announced parallel joint ventures with major asset managers and PE firms, Anthropic CCO Paul Smith spun out a separate $1.5B mid-market vehicle, Bret Taylor's Sierra raised another $950M, Cisco swallowed an Israeli AI security startup for $400M, Long Lake bet $6.3B that AI can reshape corporate travel, and the Trump White House quietly started reversing course on its noninterventionist AI stance. On the medical side, AI quietly had one of its best days ever. Politically, things got loud.

Let's get into it.

Previous digests: Weekend May 2-3 | Thu Apr 30 | Wed Apr 29 | Tue Apr 28 | Mon Apr 27 | Fri Apr 24 | Thu Apr 23 | Mon Apr 20 | Mon Apr 13

Monthly skill digests: AI Skill — March Part 3 | AI Skill — March Part 2 | AI Skill — March

🆕 NEW From The Neuron

  • When three of AI's top builders tell you coding is solved, pay attention to what they mean — Boris Cherny (Claude Code), Greg Brockman (OpenAI), and Andrej Karpathy all gave Sequoia AI Ascent 2026 talks last week saying the same thing: writing code is essentially solved. Cherny says he hasn't typed a line in 2026 and ships dozens of PRs from his phone; Karpathy says he's never felt more behind as a programmer; Brockman thinks the next era is about scarce compute and managing agents. So if "the code part" isn't the bottleneck anymore, what is? Our breakdown of what these three actually mean by "solved," what's still hard (taste, context, knowing what to build), and what shifts for everyone else.
Advertisement

Around the Horn — Tuesday, May 5, 2026

The big news today: Anthropic and OpenAI both announced parallel enterprise AI joint ventures with major asset managers and private equity firms, and the synchronized timing is the story. OpenAI finalized a $10 billion partnership with TPG, Brookfield, Advent, and Bain to help businesses deploy its software at scale. Anthropic countered with its own asset-manager coalition. The pitch to enterprises is that model access alone won't get the deal done; you need integration partners who can rewire workflows, manage change, and push past corporate antibodies.

Box CEO Aaron Levie called it an "explosively growing" trend, noting that turning model capability into stable business-process impact requires IT upgrades, context provisioning, workflow modernization, and adoption work that pure-play AI labs aren't built to do themselves. Translation: the frontier labs both said the quiet part out loud. They can ship the models; PE firms own the playbook for actually deploying software inside Fortune 500 companies. Your enterprise AI vendor list just shrank, and the integration partners got a permanent middle seat at the table.

Bret Taylor's Sierra picking up another $950M on the same day (now north of $15B valuation, $150M+ ARR, serving 40%+ of the Fortune 50) is the second data point. The third: Anthropic CCO Paul Smith launched a new spinout company backed by $1.5B from Blackstone, Hellman & Friedman, and Goldman Sachs to embed engineers directly with mid-size businesses and solve their on-prem and integration bottlenecks. Three Anthropic distribution announcements in one day is not a coincidence; it's a strategy. Enterprise AI is consolidating fast, and the pure-play labs need distribution muscle, not just better benchmarks.

🏆 TOP 5 NEWS (Around the Horn)

Advertisement

Honorable Mentions

🍪 TOP TREATS TO TRY

  • Cursor Team Kit gives you the internal CI watcher, compiler-error checker, control-cli/UI harnesses for local verification and profiling, deslop code cleaner, fix-ci tool, and other shipping workflows that Cursor developers actually use themselves; everything runs without third-party services, free to try (community Codex-plugin riffs from Matthew Lam, Queue / Studio17_x, and Ray Fernando).
  • Unity AI entered open beta with a project-aware in-editor agentic assistant (plus AI Gateway and MCP Server) trained on 20+ years of Unity best practices that automates tasks, generates assets from designs/images, drives Editor actions, and keeps you in creative control, free trial then $10/mo for Personal (included for Pro/Enterprise) (announcement).
  • Pocket TTS by Kyutai Labs released open-source 100M-parameter models that generate high-quality real-time speech in six languages on CPU (no GPU needed), with improved English quality and the same compact size, free to try (code, announcement).
  • XGrammar-2 from MLC gives you fast customizable structured generation that guarantees 100% correct tool calling and complex agent outputs via a composable Structural Tag DSL with 80x faster grammar compilation, cross-grammar caching, and native integrations into vLLM, SGLang, and TensorRT-LLM, free to try (code, launch).
  • HiL-Bench from Scale AI tests whether agents recognize missing or ambiguous information and proactively ask targeted clarifying questions instead of guessing wrong (GPT-5.5 leads at 29% Pass@3, Claude Opus 4.7 at 27.67%), free to try (announcement).
  • Saperly is the first phone carrier built for AI agents; provision a real phone number in seconds via any MCP-compatible agent for unified calling, messaging, and SMS with stable caller ID, audit trails, and webhook handoff, first number free for 30 days then $2.50/mo + usage (launch post).
  • Hermes Agent by Nous Research is an open-source agent that grows with you by learning your projects, auto-generating skills, maintaining persistent memory in secure sandboxes, and reaching you across Telegram/Discord/Slack/email/CLI with delegated subagents and natural language scheduling, free to try (launch reaction).
  • MathNet from MIT gives you free public access to 30,676 expert-authored Olympiad-level math problems across 47 countries, 17 languages, and 40+ years for evaluating LLMs on problem-solving and retrieval-augmented generation, free to try.
  • Bonsai 1.7B Apple Silicon edition drops in as an optimized inference build of the ternary model running ~42% faster decode (442 t/s on M4 Max) with custom Metal kernels tuned by an autonomous engineering agent, free to try.
Advertisement

💻 TECH CORNER

Two weekend reads worth your time if you care about agent infrastructure and the local-vs-cloud AI debate:

  • a16z's David Booth tried to "raise" his AI like a human (and posted his work). Booth built a personal knowledge graph on top of Karpathy's auto-compiling LLM wiki gist, then speed-ran four phases of "education": foundational culture docs (including a16z's culture playbook), his own writing (preferential attachment blog, lighthouse playbook), ~20 books and blogs from his core canon, plus ~50k words of Wispr Flow voice monologue. He paired it with a nightly "sleep protocol" Codex automation that distills the day's inputs and decays stale memories. Sunday's rabbit hole was "context traces," inspired by Rohit Krishnan on agent-markets and Komoroske's slime mold coordination frame. The thesis: hierarchical coordination won't scale as the graph grows 10-100x because "locally reasonable compression can turn into false institutional state" and saving that false state as canonical drifts you from reality. His translation of slime mold theory into agent environment design: set gradients, create focal points, let local actors move with partial context, preserve traces in the environment, decay stale signal. Current research question: "what is the digital equivalent of a pheromone trail?" Bonus self-awareness: he saw Supermemory's SMFS launch hours later and posted "shoulda just used supermemory lol." Asked how much time he saved: "wasn't the point. i spent lots of time lol."
  • Hacker News debated whether rolling your own local AI is worth the hassle, responding to The Register's local AI guide on running Qwen3.6-27B with llama.cpp + Cline/Pi to escape rate limits. The cleanest framework came from commenter 0xbadcafebee: 95% of people should pay for a subscription; local only makes sense for privacy, constant token churn, latency, or availability (he also shared a price-per-request comparison tool for picking subscriptions). Hardware reality check: a 24GB RTX 3090 Ti runs ~€2,000, and commenter beej71, who actively wants to run local, conceded "in my testing, 24 GB doesn't get you much brainpower." The Register's recommended local model also doesn't hit gpt-5.4-mini quality, so the suggested workaround is routing Kimi K2.6 through OpenRouter as your "Anthropic/OpenAI is down" backup. Privacy counterpoint from jen20: opt-out training defaults at GitHub Copilot and OpenAI make subscription trust harder than people admit. The sleeper question (raised by xscott): how much "compaction" Codex and Claude do is to keep context fresh vs. to save their runtime costs? If your "1M token context" gets constantly summarized behind the scenes, are you really getting 1M tokens of benefit? Local models let you use the full window on your own terms. No public benchmarks on this yet; if anyone runs the test, it's a story.

🏢 Big Tech & Major Companies

Advertisement

💼 AI Productivity, Labor & Economics

🤖 AI Agents & Infrastructure

Advertisement

💻 AI Coding & Developer Tools

🔬 AI Research & Models

Advertisement

🤖 Robotics & Embodied AI

🏛️ AI Policy, Governance & Safety

Advertisement

🛠️ AI Tools & Products (Niche)

📊 Fundraising & Deals Roundup

Sorted by deal size descending. Lead-story and Top 5 deals appear here once for completeness.

  • Cerebras IPO — tracking toward $26.6B+ valuation; deep OpenAI relationship including a $10B+ multi-year deal and $1B loan.
  • OpenAI — $10B joint venture with TPG, Brookfield, Advent, and Bain to deploy enterprise AI.
  • Long Lake — $6.3B acquisition of Amex GBT (corporate travel, all-cash, AI-led thesis).
  • Lattice Semiconductor — $1.65B acquisition of Georgia-based AMI ($1B cash + $650M stock) for AI/data center firmware.
  • Anthropic CCO Paul Smith spinout — $1.5B from Blackstone, Hellman & Friedman, and Goldman Sachs to scale Claude to mid-size businesses.
  • SAP / Prior Labs — €1B+ committed over four years to acquire Prior Labs as an independent open-models lab.
  • Katie Haun (Haun Ventures) — $1B for new venture funds expanding into AI agents.
  • Sierra — $950M Series E led by Tiger and GV at $15.8B valuation.
  • Cisco / Astrix — $400M acquisition of Israeli AI security startup.
  • Panthalassa — $140M Series B led by Peter Thiel for wave-powered ocean compute.
  • Enzo Health — $20M for AI-powered post-acute and home-health workflows.
Advertisement

🌍 International AI

That's a Wrap

That's 100+ stories from today alone. If you scrolled all the way down here, you now know more about Anthropic's three-headed enterprise distribution play than the analyst whose pitch deck just got obsoleted overnight. Condolences to the strategy slide they shipped on Friday.

For the daily version (bite-sized, 5-minute reads), make sure you're subscribed to The Neuron. We send six issues a week, and yes, we read all of this so you don't have to.

See you tomorrow.

P.S: Know someone who'd find this useful? Forward this to them and tell them to subscribe here.

Grant Harvey

Grant Harvey is the Lead Writer of The Neuron, where he continues to lead the publication's daily coverage of AI news, tools, and trends.

The Neuron Logo

Don't fall behind on AI. Get the AI trends & tools you need to know. Join 700,000+ professionals from top companies like Microsoft, Apple, Salesforce and more.

Property of TechnologyAdvice. © 2026 TechnologyAdvice. All Rights Reserved

Advertiser Disclosure: Some of the products that appear on this site are from companies from which TechnologyAdvice receives compensation. This compensation may impact how and where products appear on this site including, for example, the order in which they appear. TechnologyAdvice does not include all companies or all types of products available in the marketplace.