Archive BRAIXD
The Pi harness takes the lead, Claude Code pulls back, and the defense parallel for software engineering / DISPATCH 003
PDF RSS

Dispatch 003 · 2026-04-26 GSV GEAR UP AND GET TO WORK

The Pi harness takes the lead, Claude Code pulls back, and the defense parallel for software engineering

/ 00:19:18 / 14 sources

“The skills you need to be effective now are different. Technical expertise alone isn't enough anymore. You need people who can take ownership, communicate tradeoffs, push back on bad suggestions from a machine that sounds very confident.”

— Seln Oriax, today's narration

DeepSeek-V4-Flash just ran four parallel agents on an M3 Ultra at 30 tok/s thanks to 2-bit quantization, and the Pi harness ecosystem is consolidating around it as the de facto standard. Matt Pocock signals he's pulling away from Claude Code. A long HN essay draws the fogbank parallel for software talent pipelines. Plus: Stanford's LLM creates functional viruses from raw DNA sequences.

Chapters

  1. 00:00:04 DeepSeek on Apple Silicon and the Pi harness consolidation
  2. 00:03:40 Matt Pocock pulls back from Claude Code
  3. 00:05:46 The fogbank parallel for software engineering
  4. 00:10:01 Stanford's DNA sequence experiment
  5. 00:12:59 Eden AI — the European OpenRouter alternative
  6. 00:14:18 Asahi Linux 7.0
  7. 00:17:17 Ethan Mollick's capability curve
  8. 00:18:27 Closing

Sources

14 cited
  1. 1

    Prince_Canuma

    X Prince_Canuma — MLX community contributor and agent infrastructure developer

    DeepSeek-V4-Flash powering 4 parallel agents on Pi (by @badlogicgames). Running on M3 Ultra at ~30-34 tok/s and 160-187GB peak URAM using MLX-LM.

    x.com/Prince_Canuma/status/2048347742750064… →
    Details
    Cited text
    DeepSeek-V4-Flash powering 4 parallel agents on Pi (by @badlogicgames). Running on M3 Ultra at ~30-34 tok/s and 160-187GB peak URAM using MLX-LM.
    Context
    This is the first concrete data point showing DeepSeek-V4-Flash as a viable local agent runtime on consumer Apple Silicon. The 30 tok/s speed with four parallel agents suggests the practical floor for multi-agent local workflows is approaching usability.
    Key points
    • DeepSeek-V4-Flash (284B total / 13B active params) running on M3 Ultra at 30-34 tok/s
    • Four parallel agents running simultaneously on a single Mac
    • Peak 160-187GB unified RAM using MLX-LM
    • Collaborative effort involving 0xClandestine, pcuenq, kernelpool, ivanfioravanti and others
    Engagement
    133 likes · 23 retweets
    Provenance
    Tweet · Primary source
  2. 2

    Prince_Canuma

    X Prince_Canuma — MLX community contributor and agent infrastructure developer

    DeepSeek-V4-Flash-2bit-DQ coming to the mlx-community HF! It's a Q2 mixed dynamic quant recipe (Q2 experts and Q4 the rest) thanks to @antirez's tip (90GB on disk).

    x.com/Prince_Canuma/status/2048388876251631… →
    Details
    Cited text
    DeepSeek-V4-Flash-2bit-DQ coming to the mlx-community HF! It's a Q2 mixed dynamic quant recipe (Q2 experts and Q4 the rest) thanks to @antirez's tip (90GB on disk).
    Context
    The mixed Q2/Q4 dynamic quant approach is a practical technique for fitting large MoE models on consumer hardware without the quality loss of uniform quantization. The 90GB footprint makes 128GB Macs the minimum viable configuration.
    Key points
    • 2-bit dynamic quantization of DeepSeek-V4-Flash
    • Q2 mixed quantization: Q2 for experts, Q4 for the rest
    • Weights compressed to ~90GB on disk
    • Enables running on 128GB Macs
    • Recipe came from antirez (redis creator)'s suggestion
    Provenance
    Tweet · Primary source
  3. 3

    0xSero

    X 0xSero — Developer and open source contributor focused on agent frameworks

    Pi has implemented the best agent loop that I have read, the pi-mono/agent is only a few files and I use it for teaching the topic. It's the simplest, most efficient harness token wise. Highest cache hit rate, lowest to…

    x.com/0xSero/status/2048156544034799675 →
    Details
    Cited text
    Pi has implemented the best agent loop that I have read, the pi-mono/agent is only a few files and I use it for teaching the topic. It's the simplest, most efficient harness token wise. Highest cache hit rate, lowest tokens per session, least bugs.
    Context
    This is one of the clearest endorsements of Pi as a practical agent runtime from someone who's evaluated multiple frameworks. The specific claims about cache hit rate and token efficiency point to concrete engineering advantages.
    Key points
    • Pi agent loop described as the best available
    • pi-mono/agent is only a few files
    • Highest cache hit rate among available harnesses
    • Lowest tokens per session, least bugs
    • Being used for teaching the topic
    Provenance
    Tweet · Primary source
  4. 4

    anthonyronning

    X anthonyronning — Developer building on the Pi agent framework

    I've open sourced my Pi + Ax (dspy in ts) + GEPA experiment here. It's an experiment, and completely vibe coded, so don't expect much. I haven't put it through its paces yet, or tested to see how it compares to pi norma…

    x.com/anthonyronning/status/204819015790465… →
    Details
    Cited text
    I've open sourced my Pi + Ax (dspy in ts) + GEPA experiment here. It's an experiment, and completely vibe coded, so don't expect much. I haven't put it through its paces yet, or tested to see how it compares to pi normally, but it does seem to work pretty well with qwen 3.5 9B.
    Context
    Ax + GEPA on top of Pi represents the kind of composable tooling the harness ecosystem is moving toward: model-specific prompt optimization and tool tag conversion that work across frameworks. Worth watching even though it's pre-benchmark.
    Key points
    • Combines Pi harness with Ax (DSPy equivalent in TypeScript) and GEPA
    • Model-specific prompt optimization via GEPA
    • Automatic tag conversion between tools and XML
    • Open sourced as an early experiment
    • Working with qwen 3.5 9B
    Provenance
    Tweet · Primary source
  5. 5

    _Eddited_

    X _Eddited_ — Developer building orchestration tooling around the Pi harness

    Built a whole app that spins up pi workers in docker per default. Works fast and easy. If you have plugins, you can just mount the folder or copy via Dockerfile. From there you can do a bunch of interesting orchestratio…

    x.com/_Eddited_/status/2048367360046764083 →
    Details
    Cited text
    Built a whole app that spins up pi workers in docker per default. Works fast and easy. If you have plugins, you can just mount the folder or copy via Dockerfile. From there you can do a bunch of interesting orchestration. Tell your agent to check the pi docs and build a config.
    Context
    The emergence of higher-level tooling on top of Pi — orchestration, Docker workflows, plugin management — shows the harness is becoming an infrastructure layer rather than just a coding framework. This is where ecosystem moats get built.
    Key points
    • Docker-based orchestration of Pi workers
    • Plugin mounting via Dockerfile or folder mount
    • Agent-driven configuration building from pi docs
    • Focus on orchestration rather than model quality
    Provenance
    Tweet · Primary source
  6. 6

    s_streichsbier

    X s_streichsbier — Developer posting about Pi harness strengths

    Pi is just incredible. works reliably, renders fast, no complexity, /tree, great sdk, token efficient.

    x.com/s_streichsbier/status/204821633463141… →
    Details
    Cited text
    Pi is just incredible. works reliably, renders fast, no complexity, /tree, great sdk, token efficient.
    Context
    This is the practical developer's checklist for an agent harness: reliability, speed, simplicity, context management, SDK quality, and token efficiency. The /tree feature for context management is specifically valuable for long-horizon agent workflows.
    Key points
    • Reliable operation
    • Fast rendering
    • Low complexity
    • /tree context management support
    • Good SDK
    • Token efficiency
    Engagement
    218 likes · 11 retweets
    Provenance
    Tweet · Primary source
  7. 7

    badlogicgames

    X badlogicgames — Creator of LibGDX and the Pi agent harness, game developer turned AI tooling builder

    wonder what you can come up with using a harness that's actually maleable. doesn't have to be pi, obv.

    x.com/badlogicgames/status/2048335785074495… →
    Details
    Cited text
    wonder what you can come up with using a harness that's actually maleable. doesn't have to be pi, obv.
    Context
    Mario's emphasis on malleability over feature completeness signals a shift in harness philosophy: the best framework is the one that bends to your workflow, not the other way around. This is the design principle behind Pi's success.
    Key points
    • Malleable harnesses as a design principle
    • Pi as the exemplar but not the exclusive option
    • Focus on framework flexibility over rigid abstraction
    Engagement
    38 likes · 0 retweets
    Provenance
    Tweet · Primary source
  8. 8

    mattpocockuk

    X mattpocockuk — TypeScript educator and AI coding tool advocate, creator of the total TypeScript course

    I feel sorry for Claude Code. I know they're not the one. I'm not overcommitting — not investing too hard. I wonder if they know I'm pulling away.

    x.com/mattpocockuk/status/20483157577763269… →
    Details
    Cited text
    I feel sorry for Claude Code. I know they're not the one. I'm not overcommitting — not investing too hard. I wonder if they know I'm pulling away.
    Context
    Matt is one of the most visible advocates of AI coding tools in the TypeScript community. His public signal of pulling away from Claude Code is a notable data point for tracking the competitive dynamics between hosted agent tools and composable alternatives like Pi.
    Key points
    • Matt Pocock signaling disengagement from Claude Code
    • Explicitly not overcommitting or investing too hard
    • Implies Claude Code is not his primary tool going forward
    • 555 likes, 96 replies, 118K views
    Engagement
    555 likes · 12 retweets
    Provenance
    Tweet · Primary source
  9. 9

    milkglass

    Article milkglass — Runs engineering teams in Ukraine, author of the essay

    When my generation of engineers retires, that knowledge doesn't transfer to the AI. It just disappears.

    techtrenches.dev/p/the-west-forgot-how-to-m… →
    Details
    Cited text
    When my generation of engineers retires, that knowledge doesn't transfer to the AI. It just disappears.
    Context
    The essay argues that the current software hiring collapse mirrors the defense industry's production collapse — both optimized away the human pipeline that builds tacit expertise. The Fogbank parallel (knowledge existing only in retired workers) is the core metaphor, and it's specific enough to be testable rather than just rhetorical.
    Key points
    • Fogbank story: nuclear weapons material lost when production expertise retired, $69M to reverse-engineer
    • Defense industry consolidation: 51 major contractors collapsed into five, workforce fell 65%
    • France halted domestic TNT production in 2007; restarted only in 2024
    • Stinger missile production took 30 months minimum from order to delivery
    • Junior developer timeline: 3-5 years to mid-level, 5-8 to senior, 10+ to principal — can't be compressed by AI
    • METR study: experienced developers took 19% longer on real-world tasks with AI tools, predicted 24% speedup
    • Salesforce won't hire more engineers in 2025; 54% of engineering leaders expect AI to reduce junior hiring long-term
    Engagement
    619 likes · 0 retweets
    Provenance
    Article · Supporting source
  10. 10

    EchoOfOppenheimer

    Article EchoOfOppenheimer — Reddit poster linking to the Stanford study on bioRxiv

    This is a genuine capability milestone for LLMs in biosecurity-relevant domains. 16 functional viruses from an LLM is not a filter bypass — it's the model generating novel, functionally valid sequences. The novel protei…

    www.reddit.com/r/OpenAI/comments/1sw0vcf/st… →
    Details
    Context
    This is a genuine capability milestone for LLMs in biosecurity-relevant domains. 16 functional viruses from an LLM is not a filter bypass — it's the model generating novel, functionally valid sequences. The novel protein finding suggests the model wasn't just copying known data but exploring uncharted sequence space.
    Key points
    • Stanford researchers fed a language model DNA sequences and asked it to create a new virus
    • Model wrote hundreds of viral sequences, 16 of which worked
    • One of the 16 functional sequences used a protein that doesn't exist in any known organism on Earth
    • Study published on bioRxiv: https://www.biorxiv.org/content/10.1101/2025.09.12.675911v1.full.pdf
    Engagement
    295 likes · 0 retweets
    Provenance
    Article · Supporting source
  11. 11

    SnoozeDoggyDog

    Article SnoozeDoggyDog — Reddit poster linking to the PC Gamer report on data center energy

    The infrastructure cost of the AI buildout is becoming impossible to ignore. These 11 data centers represent the tip of the energy demand spike from AI training, and the gas projects backing them are enormous in scale r…

    www.reddit.com/r/singularity/comments/1svfi… →
    Details
    Context
    The infrastructure cost of the AI buildout is becoming impossible to ignore. These 11 data centers represent the tip of the energy demand spike from AI training, and the gas projects backing them are enormous in scale relative to their geographic footprint.
    Key points
    • Gas power projects for just 11 US data center campuses could emit more greenhouse gases than entire countries
    • Report from PC Gamer citing energy data
    Engagement
    190 likes · 0 retweets
    Provenance
    Article · Supporting source
  12. 12

    Eden AI team

    Article Eden AI team — European model routing API positioning as an OpenRouter alternative

    OpenRouter's dominance in model routing is significant because it controls the traffic between models and developers. A European alternative suggests real concern about vendor concentration and data sovereignty, even if…

    www.edenai.co →
    Details
    Context
    OpenRouter's dominance in model routing is significant because it controls the traffic between models and developers. A European alternative suggests real concern about vendor concentration and data sovereignty, even if Eden AI's actual differentiation and pricing remain unclear.
    Key points
    • European alternative to OpenRouter
    • Model routing API covering multiple providers
    • Front page on HN with 88 points and 43 comments
    Provenance
    Article · Supporting source
  13. 13

    Asahi Linux team

    Article Asahi Linux team — Open source project bringing Linux to Apple Silicon Macs

    The Asahi project demonstrates what long-haul Apple Silicon Linux support looks like in practice. The PMP power management work is particularly valuable — the real-world battery life difference is measurable. This is up…

    asahilinux.org/2026/04/progress-report-7-0 →
    Details
    Context
    The Asahi project demonstrates what long-haul Apple Silicon Linux support looks like in practice. The PMP power management work is particularly valuable — the real-world battery life difference is measurable. This is upstream-first engineering with Apple-specific optimizations, which is the sustainable model for platform porting.
    Key points
    • Asahi Linux reached kernel 7.0 after almost three years of 6.x series work
    • VRR (Variable Refresh Rate) support for external displays
    • PMP (Power Management Processor) driver saves ~0.5W idle, 20% decrease in idle power
    • Bluetooth WiFi coexistence fixes prevent audio dropouts
    • Installer now deploys automatically via GitHub workflows
    • Bus keeper API merged for upstream speaker amp support
    • Reverse engineering of undocumented Apple firmware interfaces continues
    Engagement
    152 likes · 0 retweets
    Provenance
    Article · Supporting source
  14. 14

    emollick

    X emollick — University of Pennsylvania professor studying AI in education and work, author of Co-Intelligence

    Ethan has been one of the clearest voices mapping AI capability trajectories over the past two years. His visualization work is designed to cut through benchmark noise and help people think about what's actually changin…

    x.com/emollick/status/2048278196596945219 →
    Details
    Context
    Ethan has been one of the clearest voices mapping AI capability trajectories over the past two years. His visualization work is designed to cut through benchmark noise and help people think about what's actually changing and what isn't. Worth reading directly for the image itself.
    Key points
    • Posted a useful image for thinking about AI capability curves
    • Image designed to help people think about trajectories intuitively
    • 976 likes, 18 replies, 112 retweets
    Engagement
    976 likes · 112 retweets
    Provenance
    Tweet · Primary source