Archive CONSTRUCT
The Token Budget Becomes Power / DISPATCH 004
PDF RSS

Dispatch 004 · 2026-05-11

The Token Budget Becomes Power

/ 00:16:02 / 9 sources

“The scarce resource isn't one model call. It is trusted access to intelligence that can act, verify, bargain, and spend under somebody's account.”

— Lenar Kess, today's narration

The scarce resource isn't one model call. It is trusted access to intelligence that can act, verify, bargain, and spend under somebody's account.

  • The Token Budget Becomes Power

Chapters

  1. 00:00:00 Transcript

Sources

9 cited
  1. 1

    Daybreak

    Article OpenAI — Primary product page for OpenAI's cyber-defense offering.

    Daybreak combines OpenAI models, Codex, and security partners for cyber defense.

    openai.com/daybreak →
    Details
    Cited text
    Daybreak combines OpenAI models, Codex, and security partners for cyber defense.
    Context
    It turns model access into a permissioned security capability rather than a flat model call.
    Key points
    • Defines Daybreak as frontier AI for cyber defenders.
    • Frames the work around secure code review, threat modeling, patch validation, dependency risk analysis, detection, and remediation guidance.
    • Lists access levels including GPT-5.5, GPT-5.5 with Trusted Access for Cyber, and GPT-5.5-Cyber.
    Provenance
    Article · Supporting source
  2. 2

    OpenAI Daybreak announcement thread

    Thread OpenAI — Official X announcement captured by the local X broker.

    A step toward a future where security teams can move at the speed defense demands.

    x.com/OpenAI/status/2053939702110269822 →
    Details
    Cited text
    A step toward a future where security teams can move at the speed defense demands.
    Context
    It gives the launch framing and public response around speed, security, and trust.
    Key points
    • Announces Daybreak as frontier AI for cyber defenders.
    • Describes Daybreak as combining OpenAI models, Codex, and security partners.
    • Follow-up posts frame the product around finding and fixing vulnerabilities earlier and cutting through backlogs.
    Provenance
    Thread · Primary source
  3. 3

    Introducing Claude Platform on AWS

    Article Amazon Web Services — AWS Machine Learning Blog announcement.

    No separate credentials, contracts, or billing relationships required.

    aws.amazon.com/blogs/machine-learning/intro… →
    Details
    Cited text
    No separate credentials, contracts, or billing relationships required.
    Context
    It shows how corporate AI access is being absorbed into cloud billing, identity, and audit systems.
    Key points
    • Claude Platform on AWS uses AWS IAM credentials, Marketplace billing, and CloudTrail logging.
    • It exposes native Claude Platform capabilities through AWS account structures.
    • AWS says the platform is operated by Anthropic and processed outside the AWS security boundary.
    Provenance
    Article · Supporting source
  4. 4

    Interaction Models: A Scalable Approach to Human-AI Collaboration

    Article Thinking Machines Lab — Primary research-preview blog post.

    We train an interaction model from scratch.

    thinkingmachines.ai/blog/interaction-models →
    Details
    Cited text
    We train an interaction model from scratch.
    Context
    It reframes token demand as continuous attention, low latency, and GPU memory residency.
    Key points
    • Interaction models use time-aligned micro-turns with 200ms input and output chunks.
    • The system pairs a real-time interaction model with an asynchronous background model for tool use and longer reasoning.
    • The serving path uses streaming sessions to avoid repeated reallocations and metadata overhead.
    Provenance
    Article · Supporting source
  5. 5

    PACT: Benchmarking LLM negotiation skill in multi-round buyer-seller bargaining

    Source Lech Mazur — Public benchmark repository.

    Every round they swap a short public message, then post a bid or ask.

    github.com/lechmazur/pact →
    Details
    Cited text
    Every round they swap a short public message, then post a bid or ask.
    Context
    It makes language-mediated bargaining measurable, which matters when agents negotiate economic outcomes.
    Key points
    • PACT runs twenty-round buyer-seller bargaining games between language models.
    • Agents hold private values or costs and optimize cumulative profit.
    • The benchmark keeps deterministic seeds and JSONL logs for audit and reruns.
    Provenance
    Source · Background source
  6. 6

    Computer build using Intel Optane Persistent Memory

    Article APFrisco — LocalLLaMA practitioner post.

    Around 4 tokens per second for generation.

    www.reddit.com/r/LocalLLaMA/comments/1taeg8… →
    Details
    Cited text
    Around 4 tokens per second for generation.
    Context
    It illustrates a slow but sovereign way to buy access to very large model inference.
    Key points
    • The build uses 768GB of Intel Optane Persistent Memory in memory mode.
    • The author runs a one trillion parameter Kimi K2.5 model using hybrid GPU and CPU inference with llama.cpp.
    • The sparse experts live mostly on persistent memory and DRAM while selected tensors fit on a 12GB GPU.
    Provenance
    Article · Supporting source
  7. 7

    I catalogued every way local models break JSON output

    Article kexxty — LocalLLaMA practitioner post.

    288 calls total.

    www.reddit.com/r/LocalLLaMA/comments/1tagtp… →
    Details
    Cited text
    288 calls total.
    Context
    It shows that usable output, not raw token count, is the economic unit operators pay for.
    Key points
    • The author compared structured-output failures across local and API models.
    • Common breakage includes markdown fences, trailing commas, Python booleans, truncation, unescaped quotes, comments, and ellipses.
    • The accompanying library validates against JSON Schema and attempts ordered repairs.
    Provenance
    Article · Supporting source
  8. 8

    Stop building AI agents

    Article Warm-Reaction-456 — AI_Agents practitioner post.

    Most of the AI agents shipping to real businesses are just internal automations with a language model bolted in.

    www.reddit.com/r/AI_Agents/comments/1taei9m… →
    Details
    Cited text
    Most of the AI agents shipping to real businesses are just internal automations with a language model bolted in.
    Context
    It puts market price pressure on the agent label and separates useful automation from expensive autonomy.
    Key points
    • The author argues many founders overbuy autonomy when a workflow plus one model call would work.
    • Examples include intake routing for telehealth and ACH reconciliation for fintech.
    • Comments emphasize maintenance costs and approval boundaries.
    Provenance
    Article · Supporting source
  9. 9

    e2a authenticated email gateway for AI agents

    Source Mnexa-AI — Open-source GitHub repository.

    Authenticated email gateway for AI agents.

    github.com/Mnexa-AI/e2a →
    Details
    Cited text
    Authenticated email gateway for AI agents.
    Context
    It shows the operational contract needed when agents transact over ordinary human communication channels.
    Key points
    • e2a verifies SPF and DKIM inbound, signs delivery headers with HMAC, and supports webhook or WebSocket delivery.
    • Outbound email can be held for human approval before release.
    • The README emphasizes signed identity, threading, replay windows, and self-hosting.
    Provenance
    Source · Background source