Archive BRAIXD
Mixture of experts active params, automated training loops, and the RL infrastructure pivot / DISPATCH 022
PDF RSS

Dispatch 022 · 2026-05-13 braixd

Mixture of experts active params, automated training loops, and the RL infrastructure pivot

/ 00:05:57 / 4 sources

“AI's next frontier isn't bigger models — it's superlearners that pull new knowledge from continuous experience.”

— Seln Oriax, today's narration

Today on Braixd: the local pass looks at three concrete shifts in how models are built and served. We open with AIDC-AI's Ovis2.6, a multimodal model that packs 80 billion parameters into a Mixture-of-Experts architecture with only about 3 billion active at inference. We move to AutoScientist from Adaption, which attempts to automate the full research loop so small labs don't lose compounding on broken experiment pipelines. Finally, we look at NVIDIA and Ineffable Intelligence's push to build reinforcement learning infrastructure on Grace Blackwell and Vera Rubin, marking a clear pivot from static human data toward continuous, experience-driven training.

Chapters

  1. 00:00:04 The mixture of experts squeeze: 80 billion parameters, 3 billion active
  2. 00:02:07 Automating the research loop
  3. 00:03:57 The shift from static data to continuous experience

Sources

4 cited
  1. 1

    AIDC-AI/Ovis2.6-80B-A3B

    Article AIDC-AI

    MoE models are changing the serving cost math for local and remote inference, making large multimodal models viable on narrower hardware budgets.

    huggingface.co/AIDC-AI/Ovis2.6-80B-A3B →
    Details
    Context
    MoE models are changing the serving cost math for local and remote inference, making large multimodal models viable on narrower hardware budgets.
    Key points
    • MoE architecture with 80B total parameters but ~3B active during inference
    • 64K token context window and support for images up to 2880x2880
    • Think with Image capability enables active visual tool use during reasoning
    • Designed for high-resolution understanding and long-document question answering
    Engagement
    71 likes · 18 replies
    Provenance
    Article · Supporting source
  2. 2

    AutoScientist announcement

    X adaption_ai

    Solves the administrative friction in training rather than the compute constraint, which is usually the real bottleneck for non-frontier labs.

    x.com/adaption_ai/status/2054532113316434061 →
    Details
    Context
    Solves the administrative friction in training rather than the compute constraint, which is usually the real bottleneck for non-frontier labs.
    Key points
    • Automates the full model research loop: formulation, execution, failure analysis, and next experiment
    • Targets the bottleneck where small labs lose compounding on broken experiment pipelines
    • Endorsed by Sara Hooker as a move to encode institutional training memory
    Provenance
    Tweet · Primary source
  3. 3

    Adaption aims big with AutoScientist, an AI tool that helps models train themselves

    Article Russell Brandom

    Confirms Adaption's positioning and the technical scope of AutoScientist's automated fine-tuning approach.

    techcrunch.com/2026/05/13/adaption-aims-big… →
    Details
    Context
    Confirms Adaption's positioning and the technical scope of AutoScientist's automated fine-tuning approach.
    Provenance
    Article · Supporting source
  4. 4

    NVIDIA, Ineffable Intelligence Team Up to Build the Future of Reinforcement Learning Infrastructure

    Article NVIDIA Writers

    The hardware and software pipeline for reinforcement learning is being built now, ahead of the models. That means the infrastructure constraints will define what kinds of agents can run at scale.

    blogs.nvidia.com/blog/ineffable-intelligenc… →
    Details
    Context
    The hardware and software pipeline for reinforcement learning is being built now, ahead of the models. That means the infrastructure constraints will define what kinds of agents can run at scale.
    Key points
    • Ineffable Intelligence, founded by David Silver, is emerging from stealth with a focus on continuous experience-based learning
    • NVIDIA and Ineffable are building RL infrastructure starting on Grace Blackwell and planning for Vera Rubin
    • RL workloads generate data on the fly, putting unique pressure on interconnect, memory bandwidth, and serving throughput
    • Marks an industry pivot from static human data toward models that learn through simulation and experience
    Provenance
    Article · Supporting source