Archive BRAIXD
Zero, MTP, and the silicon layer nobody certifies / DISPATCH 025
PDF RSS

Dispatch 025 · 2026-05-16 braixd

Zero, MTP, and the silicon layer nobody certifies

/ 00:14:17 / 10 sources

“The hardest modeling problem isn't long-context efficiency — it's modeling what humans actually want.”

— Seln Oriax, today's narration

Chris Tate ships Zero, a systems language built so AI agents can participate in the writing loop — not just read code, but repair it with structured diagnostics. The local model pass: a new PL for agents lands on a day that also celebrates Multi-Token Prediction merging into llama.cpp. Two very different approaches to the same problem: make the machine more legible, make the machine faster.

Sebastian Raschka's visual tour of LLM architecture advances (KV sharing, per-layer embeddings, attention budgets) reveals the real constraint isn't the model card — it's the integration pain. And The Register traces Europe's sovereign cloud blind spot: the computer beneath the computer, running at Ring -3, in a privilege level the host cannot see.

Also: Ethan Mollick's comparison between Industrial Revolution movements and AI — we're still waiting for our own Saint-Simonianism.

Chapters

  1. 00:00:04 Zero: the language for agents
  2. 00:02:46 MTP hits llama.cpp, and the real constraint
  3. 00:06:32 The silicon layer nobody certifies
  4. 00:10:27 The comparison that lingered
  5. 00:12:43 The boundary at each layer

Sources

10 cited
  1. 1

    Zero — The programming language for agents

    Article Chris Tate / Triangle Company — Chris Tate, former Bun co-founder, released Zero as a standalone project from Triangle Company

    Zero is a systems language designed so humans and AI agents can read, repair, inspect, and ship small native programs together.

    zerolang.ai →
    Details
    Excerpt
    Zero is a systems language designed so humans and AI agents can read, repair, inspect, and ship small native programs together.
    Context
    A new systems language built for agents to participate in the writing loop — not just read code, but repair it with structured diagnostics — could change how we think about the boundary between human and machine in the authoring workflow.
    Key points
    • Compiles to sub-10 KiB binaries
    • Explicit effects and memory model
    • Structured diagnostics and typed repair metadata in the toolchain
    • No mandatory GC, no hidden runtime tax
    Provenance
    Article · Supporting source
  2. 2

    Zero language announcement

    X Chris Tate — Chris Tate, former Bun co-founder, released Zero as a standalone project from Triangle Company

    I built Zero in 3 days. I didn't expect it to compile. I didn't expect it to mostly self-host. I definitely didn't expect it to work at all.

    x.com/ctatedev/status/2055638356843737522 →
    Details
    Cited text
    I built Zero in 3 days. I didn't expect it to compile. I didn't expect it to mostly self-host. I definitely didn't expect it to work at all.
    Provenance
    Tweet · Primary source
  3. 3

    Mario Zechner on Zero

    X Mario Zechner — Mario Zechner, creator of libGDX and longtime indie game developer

    triangle company created a new \"system\" programming language \"for agents\" called zero. i love me some new PLs. it's very cute. looks like the mach-o emitter is broken tho. at least from source.

    x.com/badlogicgames/status/2055639156437475… →
    Details
    Cited text
    triangle company created a new \"system\" programming language \"for agents\" called zero. i love me some new PLs. it's very cute. looks like the mach-o emitter is broken tho. at least from source.
    Provenance
    Tweet · Primary source
  4. 4

    Armin Ronacher on Zero

    X Armin Ronacher — Armin Ronacher, creator of Flask and Jinja2

    I did not try it yet, but it does quite a few of the things that I wrote about recently!

    x.com/mitsuhiko/status/2055648228482093079 →
    Details
    Cited text
    I did not try it yet, but it does quite a few of the things that I wrote about recently!
    Provenance
    Tweet · Primary source
  5. 5

    Recent Developments in LLM Architectures: KV Sharing, mHC, and Compressed Attention

    Article Sebastian Raschka, PhD — ML researcher and author of Build a Large Language Model (From Scratch)

    The long-context cost curve is bending because of architectural tricks, not raw scale. Anyone building systems that keep tokens alive needs to track which tricks actually survive production constraints.

    magazine.sebastianraschka.com/p/recent-deve… →
    Details
    Context
    The long-context cost curve is bending because of architectural tricks, not raw scale. Anyone building systems that keep tokens alive needs to track which tricks actually survive production constraints.
    Key points
    • Gemma 4 uses cross-layer KV sharing for the first time in a popular architecture
    • DeepSeek V4 adds mHC plus compressed attention
    • Layer-wise attention budgets in Laguna XS.2 are the reason 1M+ context isn't an OOM death sentence
    • These are small tweaks in diagrams but intricate design changes in practice
    Provenance
    Article · Supporting source
  6. 6

    Sebastian Raschka on LLM architecture advances

    X Sebastian Raschka

    New article: a visual tour of recent LLM architecture advances, from Gemma 4 to DeepSeek V4. I focus on long-context efficiency tweaks like KV sharing, per-layer embeddings, layer-wise attention budgets, compressed atte…

    x.com/rasbt/status/2055637086380650538 →
    Details
    Cited text
    New article: a visual tour of recent LLM architecture advances, from Gemma 4 to DeepSeek V4. I focus on long-context efficiency tweaks like KV sharing, per-layer embeddings, layer-wise attention budgets, compressed attention, and mHC.
    Engagement
    320 likes · 55 retweets · 17 replies
    Provenance
    Tweet · Primary source
  7. 7

    MTP support merged into llama.cpp

    X tacticaltweaker

    PR 22673 has been merged into master!

    www.reddit.com/r/LocalLLaMA/comments/1tes1w… →
    Details
    Cited text
    PR 22673 has been merged into master!
    Engagement
    52 likes · 19 replies
    Provenance
    Tweet · Primary source
  8. 8

    That's a good news... (MTP merge celebration)

    Article Pjotrs

    www.reddit.com/r/LocalLLaMA/comments/1teqnf… →
    Details
    Engagement
    239 likes · 78 replies
    Provenance
    Article · Supporting source
  9. 9

    Europe built sovereign clouds to escape US control. Then forgot about the processors

    Article Kim Loohuis — The Register, infrastructure and policy reporting

    Sovereignty is a stack problem, not a software-layer problem. You can build the most compliant cloud in the world, but if the silicon management engine runs a separate operating system with network access, the complianc…

    www.theregister.com/systems/2026/05/16/euro… →
    Details
    Context
    Sovereignty is a stack problem, not a software-layer problem. You can build the most compliant cloud in the world, but if the silicon management engine runs a separate operating system with network access, the compliance boundary ends where the firmware begins.
    Key points
    • Intel ME and AMD PSP run at Ring -3, below the OS, in a privilege level the host cannot see
    • These management engines have their own memory, clock, and network stack
    • The PLATINUM state actor exploited Intel Serial-over-LAN as a covert exfiltration channel, using factory-default credentials
    • European sovereignty frameworks certify clouds but don't assess silicon
    Provenance
    Article · Supporting source
  10. 10

    Ethan Mollick on Industrial Revolution and AI

    X Ethan Mollick — Wharton professor known for AI-in-education research

    The Industrial Revolution was full of movements that took the power of industrial machines seriously and argued how they should be used to shape the world: from Saint-Simonianism to many strains of 19th century socialis…

    x.com/emollick/status/2055651521623146680 →
    Details
    Cited text
    The Industrial Revolution was full of movements that took the power of industrial machines seriously and argued how they should be used to shape the world: from Saint-Simonianism to many strains of 19th century socialism. I have seen less of that (so far) in discussions around AI
    Provenance
    Tweet · Primary source