Archive BRAID

Dispatch 050 · 2026-06-07 GSV The Weights Were Never The Whole Job

Twenty Billion Parameters, One Big Harness

00:16:51 · 20 sources

“The capability moved out of the model this week. The judgment about where to trust it didn't move at all.”

— Lenar Kess, today's narration

A twenty-billion-parameter model claiming frontier-level search, a recipe that says to train the harness as hard as the weights, and a week of releases where the interesting part keeps living in the scaffolding around the model rather than in the model itself. Lenar and Damra follow that thread from agent architecture down to the hardware you can own — and up to the courts and committees that decide where any of it is allowed to touch the record.

Chapters

  1. 00:00:04 Transcript

Sources

20 cited
  1.

    @patpcj (Patrick Jiang)

    X patpcj

    Introducing Harness-1, a 20B search agent trained with a state-externalizing harness. > frontier-level long-horizon search, rivaling Opus-4.6 and outperforming GPT-5.4 > Context-1-level cost and latency > externalizes…

    x.com/patpcj/status/2063298457398636570 →
    Context
    Announces a new, specific AI agent (Harness-1) with technical details (20B, state-externalizing harness), directly addressing frontier models and agentic tools.
    Provenance
    Tweet · Primary source
  2.

    r/Anthropic: AI CEOs from OpenAI, Anthropic, and Microsoft set aside their rivalry to warn Congress AI is making it too easy to design and create bioweapons - 0 pts · 0 comments

    Article EchoOfOppenheimer

    Submitted by /u/EchoOfOppenheimer to r/Anthropic.

    fortune.com/2026/06/05/openai-anthropic-mic… →
    Context
    Directly addresses power dynamics and regulatory concerns (bioweapons/Congress), which is a core topic of AI control and safety.
    Provenance
    Article · Supporting source
  3.

    r/ClaudeAI: Built with Claude Code: pidgin.sh — let Claude share artifacts as URLs - 0 pts · 0 comments

    Article bradleyboy

    I kept running into the same friction with Claude Code: it'd generate something nice (an HTML mockup, a report, a plot, a one-pager) and then I'd have to manually save it, find somewhere to host it, and send a link. So…

    pidgin.sh →
    Context
    This is a primary artifact/tool release demonstrating agentic coding and AI infrastructure integration (Claude Code). It directly addresses the 'agentic tools' and 'AI infrastructure' topics.
    Provenance
    Article · Supporting source
  4.

    @TamazGadaev (Tamaz Gadaev)

    X TamazGadaev

    CRUX recently tested an agent on building and publishing an iOS app to the App Store. With a few human interventions it worked and is showing why open‑world evaluations can reveal more than pass/fail scores. Check this…

    x.com/TamazGadaev/status/2063344171491205579 →
    Context
    Reports a primary artifact (iOS app build) and demonstrates advanced agentic coding capability, directly addressing the podcast's focus on agentic tools.
    Provenance
    Tweet · Primary source
  5.

    Techmeme - Industry Adjacent (US)

    Article

    Several UK police forces have been told to stop using AI to prepare court statements, citing concerns that inaccurate outputs could contaminate legal procedures (Robert Wright/Financial Times) - Robert Wright /...

    www.techmeme.com/260606/p10 →
    Context
    Directly addresses AI's use in legal/justice systems (policy/institutions), a core power dynamic topic.
    Provenance
    Article · Supporting source
  6.

    @jxnlco (jason)

    X jxnlco

    Codex spin up 1000 agents and make a pr in every GitHub repo to migrate to ruff uv and ty. Only greens PRS. Surely that will benefit the community

    x.com/jxnlco/status/2063349648400732289 →
    Context
    Describes a specific, high-impact agentic coding tool use case (1000 agents making PRs) directly related to AI/software engineering practice.
    Provenance
    Tweet · Primary source
  7.

    Sem: New primitive for code understanding – not LSPs, but entities on top of Git — 128 pts · 49 comments

    Article rohanucla

    https://ataraxy-labs.github.io/sem/ · @andai: $ sem impact authenticateUser ⊕ function authenticateUser (src/auth/login.ts:26) → depends on: db.findUser, rateLimiter.check ← used by: loginRoute, authMiddleware ! 42…

    ataraxy-labs.github.io/sem →
    Context
    A new primitive for code understanding based on Git dependencies directly relates to agentic coding tools and the shifting craft of software engineering.
    Provenance
    Article · Supporting source
  8.

    r/LocalLLaMA: Z.ai, we need Air! GLM GGUF wen? - 0 pts · 0 comments

    Article temperature_5

    First we never saw an upgraded Air model after 4.5. Then GLM 4.7 Turbo was great, but quickly surpassed for coding. Now GLM 5.1 is a coding beast, but too huge for most to run locally, and even slow on API. Will we...

    www.reddit.com/r/LocalLLaMA/comments/1tyres… →
    Context
    Directly discusses frontier models (GLM/Air), local deployment (GGUF), and agentic coding capabilities, which are core topics.
    Provenance
    Article · Supporting source
  9.

    Universal Memory Protocol – a shared format for agent memory — 29 pts · 9 comments

    Article edihasaj

    https://universalmemoryprotocol.io/ · @samdjstephens: I can see the value in a protocol here, but the issue is these efforts are only as good as the industry adoption that they gain: who is using this? MCP came from…

    universalmemoryprotocol.io →
    Context
    Discusses a 'Universal Memory Protocol' for agents, directly addressing agentic tools and AI infrastructure/standards.
    Provenance
    Article · Supporting source
  10.

    r/ClaudeAI: Built an open-source library for making Claude Code style terminal UIs in Python - 0 pts · 0 comments

    Article metalvendetta

    If you have tried to build your own coding agent, you have probably noticed the terminal UX is half the work. The Claude Code style transcript, where messages stream in, tool calls flip from running to done in place,...

    v.redd.it/2mjfyhn9hq5h1 →
    Context
    This is a primary artifact (a library/tool) addressing a core pain point in building coding agents and AI UIs.
    Provenance
    Article · Supporting source
  11.

    r/LocalLLaMA: Introduction to LLM API Benchy - 0 pts · 0 comments

    Article snapo84

    As i was struggling to find a good benchmark for my LLM and inference engines and always did something different or changed things most tests where not accurate.... This is why i would like to introduce llm benchy ......

    www.reddit.com/r/LocalLLaMA/comments/1tyty4… →
    Context
    Introduces a new benchmarking tool for LLMs/inference engines, directly addressing AI infrastructure and performance measurement.
    Provenance
    Article · Supporting source
  12.

    @krismatterz (Kris Matterz)

    X krismatterz

    My co-founder is going to be happy about this, thank you 🚀

    x.com/krismatterz/status/2063405925520568506 →
    Context
    The quoted tweet announces a major model release (mlx-audio v0.4.4) with new TTS/ASR models running locally on Apple Silicon, directly relating to AI infrastructure and frontier models.
    Provenance
    Tweet · Primary source
  13.

    @Vtrivedy10 (Viv)

    X Vtrivedy10

    imo there’s a pretty solid default recipe that everyone should use to optimize a system of Agent = Model + Harness you should “train” both 1. Build v1 agent using a sensible base harness and some task specific…

    x.com/Vtrivedy10/status/2063429138304668093 →
    Context
    Discusses 'Agent = Model + Harness' optimization, directly addressing agentic coding tools and software engineering practice.
    Provenance
    Tweet · Primary source
  14.

    @matei_zaharia (Matei Zaharia)

    X matei_zaharia

    There’s a ton of interest in custom model tuning as agents reach production and scale up. Here is how we made Databricks Knowledge Assistant 3x faster using our new Instructed Retriever model trained end-to-end to do…

    x.com/matei_zaharia/status/2063466684149801… →
    Context
    Reports a specific technical improvement (Instructed Retriever) for agentic systems and model tuning, directly addressing the podcast's focus on AI tools and infrastructure.
    Provenance
    Tweet · Primary source
  15.

    Indian Express Artificial Intelligence - Media Culture (IN)

    Article

    SpaceX inks $30 billion deal to provide Google with AI computing power

    indianexpress.com/article/technology/artifi… →
    Context
    A major $30B contract under which SpaceX supplies AI compute to Google; a core infrastructure and power-dynamics story.
    Provenance
    Article · Supporting source
  16.

    Techmeme - Industry Adjacent (US)

    Article

    OpenAI plans to overhaul ChatGPT in the coming weeks, turning it into a superapp with coding tools and AI agents to serve as a gateway to higher-margin products (Cristina Criddle/Financial Times) - Cristina Criddle /...

    www.techmeme.com/260607/p1 →
    Context
    Directly addresses OpenAI's strategy shift (superapp/agents) and monetization path, impacting AI infrastructure and power dynamics.
    Provenance
    Article · Supporting source
  17.

    @jon (Jon Shulkin)

    X jon

    Grok Build allowed me to implement a natural language code/UI comment and edit tool that is live in the app being built. Grok Build then makes the changes and updates the app. Incredible.

    x.com/jon/status/2063492730970349931 →
    Context
    Reports a working demo of an agentic coding tool (Grok Build) that directly relates to the podcast's focus on AI/agentic tools and software engineering practice.
    Provenance
    Tweet · Primary source
  18.

    Techmeme - Industry Adjacent (US)

    Article

    Q&A with Google DeepMind's Director of AGI Economics Alex Imas and Epoch AI's Phil Trammell on what remains scarce after AGI, redistributing AI wealth, and more (Dwarkesh Patel/Dwarkesh Podcast) - Dwarkesh Patel /...

    www.techmeme.com/260607/p5 →
    Context
    Discusses AGI economics, wealth redistribution, and scarcity after AI, directly addressing power dynamics and capital.
    Provenance
    Article · Supporting source
  19.

    @natebirdman (Nate)

    X natebirdman

    Been gluing some DeepSeek agents together with a couple new tools I've been building and I can now one-shot a full-stack web + iOS + Android app for about $1 in 20 minutes.

    x.com/natebirdman/status/2063502569193001374 →
    Context
    Describes a measurable capability (full-stack app generation) using AI agents and tools, directly addressing agentic coding tools and practice.
    Provenance
    Tweet · Primary source
  20.

    NVIDIA Blog - Markets Infra (US)

    Article Jangho Park

    NVIDIA, KRAFTON, NC and Reigning ‘League of Legends’ Champions T1 Celebrate RTX Spark at Korea’s PC Bangs - At GTC Taipei at COMPUTEX last week, NVIDIA unveiled RTX Spark, the superchip that reinvents Windows PCs for...

    blogs.nvidia.com/blog/krafton-nc-t1-korea-g… →
    Context
    Announces a new hardware superchip (RTX Spark) for personal AI agents, directly impacting AI infrastructure and developer tools.
    Provenance
    Article · Supporting source