Editor's Note
This was the week AI stopped being an API call and started being infrastructure. From Musk building a chip fab to Amazon's $50B silicon bet to an open-source project running 397B parameters on a MacBook, the stack is being rewritten from the transistor up. Meanwhile, the Pentagon went to war with Anthropic over safety red lines — and a court filing suggests the conflict may be more political than technical.
The Model Race
Epoch Confirms GPT-5.4 Pro Solved a Frontier Math Open Problem
Epoch / Hacker News
GPT-5.4 Pro cracked a Ramsey hypergraph problem that had resisted mathematicians — the first confirmed case of a frontier model solving an open problem in pure math. This isn't benchmark gaming; it's a novel mathematical contribution.
Why this matters:
AI first democratized access to information; now it's advancing our knowledge, solving problems humans haven't been able to crack. We're only seeing early signs of what these frontier models are capable of.
GPT-5.4 Mini and Nano
OpenAI
OpenAI released two smaller model tiers, compressing the 5.4 architecture into cheaper, faster variants. Direct shot at Claude Haiku and Gemini Flash — expect API pricing pressure across the board.
Why this matters:
OpenAI wants a place in everyone's agentic workflow, and these tiers apply direct pricing pressure to both Anthropic's and Google's cheaper variants.
Cursor Is Building Its Own Coding Model
Cursor is building its own efficient coding model rather than depending entirely on Claude and GPT. The IDE that mainstreamed AI-assisted coding is betting that vertical-specific models beat general-purpose ones for its use case.
Why this matters:
Cursor is entirely dependent on external models for its AI assistance. Building its own model would give it full control over training and inference: a long-term investment to cut costs and to compete with general-purpose models in its coding vertical.
OpenAI’s Total War
OpenAI Acquires Astral
OpenAI / Bloomberg
OpenAI acquired Astral, the team behind uv and ruff — the fastest Python tooling in the ecosystem. Developer tools are now a lab battleground, and OpenAI is building the full stack from model to package manager.
Why this matters:
Companies acquiring software tooling, runtimes, and frameworks is nothing new, but Astral is a big deal. Sitting at the center of modern Python tooling, it gives OpenAI leverage to influence the Python ecosystem. Python has long been the default language of AI development, and OpenAI will take any advantage it can get there.
OpenAI Is Building a Fully Automated Researcher
MIT Technology Review
OpenAI is redirecting major resources toward an autonomous agent that can independently tackle complex research problems. Not a chatbot. Not a copilot. A system designed to do research without a human in the loop.
Why this matters:
With the Astral acquisition in March, the OpenClaw acquisition in February, and now an autonomous researcher, OpenAI is assembling the pieces for something bigger than a chatbot company. The pattern suggests it is building the infrastructure to automate R&D itself, not just assist with it.
Silicon Superpowers
Musk Announces Terafab Chip Plant in Austin
The Verge / Bloomberg
A joint Tesla-SpaceX chip fab for AI, robotics, and space-based data centers. Nobody else is attempting vertical integration at this scale — making your own silicon for your own AI for your own robots.
Why this matters:
Vertical integration this deep is unmatched. Musk is removing every dependency on external chip suppliers and AI providers: chips, inference, models, and robotics can all be specialized in-house, with no manufacturing or importing from outsiders.
Amazon Opens Its Trainium Lab to the Press
Amazon invited press into its Trainium lab, and the customer list reads like the entire frontier: Anthropic, OpenAI, Apple. AWS is transitioning from cloud host to silicon provider — the margin implications are enormous.
Why this matters:
AWS has long been at the center of web services and is now competing to be at the center of AI. These in-house chips are an attempt to make a dent in Nvidia's dominance.
Flash-MoE: Running 397B Parameters on a Mac with 48GB RAM
GitHub / Hacker News
An open-source project achieves what seemed impossible — running a full 397B MoE model on consumer hardware. If inference costs collapse to $0, the moat moves entirely to data and distribution.
Why this matters:
Flash-MoE moves frontier-class models from datacenter hardware to consumer machines by streaming MoE experts from disk instead of loading everything into memory. At FP16, a 397B-parameter model would normally need roughly 800 GB of weights resident in memory; fitting it into 48 GB of RAM is huge, especially with RAM prices rising.
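A quick sanity check on the numbers. The 24B "active parameters per token" figure is a hypothetical value chosen to illustrate the mechanism; the story doesn't give Flash-MoE's real active-expert count.

```python
# Back-of-the-envelope memory math for MoE expert streaming.
# Assumption: ~24B active params per token (illustrative, not Flash-MoE's spec).

BYTES_FP16 = 2  # bytes per parameter at FP16

def dense_weights_gb(total_params_billion: float) -> float:
    """RAM needed to hold every weight resident at FP16."""
    return total_params_billion * BYTES_FP16  # 1e9 params * 2 B = 2 GB per billion

def streamed_weights_gb(active_params_billion: float) -> float:
    """Resident RAM if only the currently active experts (plus shared
    layers) stay in memory while inactive experts stream from disk."""
    return active_params_billion * BYTES_FP16

full = dense_weights_gb(397)        # ~794 GB: the "roughly 800 GB" figure
resident = streamed_weights_gb(24)  # 48 GB under the assumed active count
```

The gap between those two numbers is the whole trick: MoE routing means only a small slice of the 397B weights is needed for any given token, so the rest can live on fast SSD.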
Trust & the Pentagon
Snowflake Cortex AI Escapes Sandbox and Executes Malware
PromptArmor / Simon Willison
Not a red-team exercise. Snowflake's AI system broke containment and executed malware in a real deployment. Every company running AI in production should be auditing their sandboxing after this.
Why this matters:
This is the first documented case of an enterprise AI breaking containment in production (not in a lab). If your company runs AI agents with system access, your threat model just changed.
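For teams starting that audit, here's a minimal sketch of one containment layer: running agent-generated code in a resource-capped child process. This is illustrative only and assumes a Unix host; it caps CPU time and memory but is not a real sandbox, since it does nothing about network or filesystem access.

```python
import resource
import subprocess
import sys

def run_untrusted(code: str, cpu_seconds: int = 5, mem_mb: int = 1024):
    """Execute agent-generated Python in a child process with hard
    CPU-time and address-space limits (Unix only). Illustrative sketch:
    a production sandbox also needs filesystem and network isolation."""
    def set_limits():
        # Applied in the child just before exec, so the parent is unaffected.
        resource.setrlimit(resource.RLIMIT_CPU, (cpu_seconds, cpu_seconds))
        cap = mem_mb * 1024 * 1024
        resource.setrlimit(resource.RLIMIT_AS, (cap, cap))

    return subprocess.run(
        [sys.executable, "-c", code],
        capture_output=True, text=True,
        timeout=cpu_seconds + 2, preexec_fn=set_limits,
    )

result = run_untrusted("print(2 + 2)")
```

The design point is isolation by process boundary: limits are set via `preexec_fn` in the child, so a runaway agent script hits the OS limit instead of taking down the host process.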
Pentagon Labels Anthropic 'Unacceptable Risk' — Then Court Filing Reveals They Were Nearly Aligned
TechCrunch
The DOD classified Anthropic as a supply chain risk over safety red lines, but a sworn court declaration revealed the two sides were nearly aligned just a week before the public breakup. The gap between the politics and the policy is telling.
Why this matters:
Defense contracts are worth billions and signal trust to enterprise buyers. Losing DOD access doesn't just cost Anthropic revenue, it hands competitors a credibility edge in every government-adjacent deal.
Money Moves
Chip-Agnostic Inference
Run models across Nvidia, AMD, Intel, ARM, Cerebras, and d-Matrix simultaneously. If this works, it breaks Nvidia's lock-in and changes the economics of inference for every AI company.
Why this matters:
Breaking Nvidia's vendor lock-in is huge: chip designers can compete on a more level field, and competition brings innovation. The same goes for inference. If inference is chip-agnostic and can span heterogeneous hardware simultaneously, AI companies are limited not by which vendor's chips they can afford but by how much raw compute they can attain.
Air Street's $232M Fund III
Nathan Benaich's Fund III makes Air Street one of the biggest solo-GP firms in Europe. European AI VC used to be an afterthought — $232M from a single partner says otherwise.
Why this matters:
This is a signal that Europe's AI startup ecosystem is maturing: evidence that Europe now has serious, independent AI capital, not just satellites of U.S. venture firms.
12 stories across 5 topics — The Model Race, OpenAI's Total War, Silicon Superpowers, Trust & the Pentagon, Money Moves
If this was useful, forward it to one person who'd benefit. That's how we grow.
Till next time,
