Archive BRAID
Five Days to Root, Four Months in Exile / DISPATCH 027
PDF RSS

Dispatch 027 · 2026-05-15 GSV Bugmageddon Forecast

Five Days to Root, Four Months in Exile

/ 00:28:12 / 11 sources

“The shape of the work changes. The standards of craft don't.”

— Lenar Kess, today's narration

Five days for a small security team paired with Mythos Preview to land the first public macOS kernel exploit on Apple's M5 with Memory Integrity Enforcement turned on. Four months for Replit to claw back into the iOS App Store. In between: arXiv starts banning authors of LLM-error papers, Metabase explains why open-source security is being strip-mined this summer, NVIDIA squeezes the 5090, Uncle Bob switches from Claude to Codex, and a pure-OCaml protocol stack boots in low Earth orbit.

Chapters

  1. 00:00:04 Codex everywhere, Claude in the rearview
  2. 00:03:34 Five days to a kernel exploit on M5
  3. 00:07:04 The strip-mining era of open source security
  4. 00:10:25 arXiv bans authors of LLM-error papers
  5. 00:13:33 Replit out of the App Store wilderness
  6. 00:16:38 GDDR7 squeezes the 5090
  7. 00:20:22 The web's secret quirks file
  8. 00:24:06 OCaml in orbit

Sources

11 cited
  1. 1

    First public macOS kernel memory corruption exploit on Apple M5

    Article Calif (Bruce Dang, Dion Blazakis, Josh Maine) — Small security research firm that paired with Mythos Preview on the bug-finding and exploit-development workflow.

    Apple spent five years building hardware and software to make memory corruption exploits dramatically harder. Our engineers, working together with Mythos Preview, built a working exploit in five days.

    blog.calif.io/p/first-public-kernel-memory-… →
    Details
    Cited text
    Apple spent five years building hardware and software to make memory corruption exploits dramatically harder. Our engineers, working together with Mythos Preview, built a working exploit in five days.
    Context
    A concrete data point on how fast model-plus-expert pairs can collapse the calendar against new hardware mitigations, and a follow-up to yesterday's UK AI Security Institute coverage of accelerating autonomous cyber capability.
    Key points
    • Data-only kernel local privilege escalation chain on macOS 26.4.1 (build 25E253) targeting bare-metal M5 with kernel MIE enabled
    • Bugs found April 25 by Bruce Dang; full working exploit by May 1, paired with Mythos Preview
    • Mythos Preview generalizes within a known bug class but did not autonomously bypass MIE — human expertise closed the gap
    • Calif claims this is the first public macOS kernel exploit on MIE hardware; a 55-page report was hand-delivered to Apple Park
    • Frames the moment as the start of 'AI bugmageddon' for hardware mitigations like Memory Tagging Extension
    Provenance
    Article · Supporting source
  2. 2

    Welcome to the strip mining era of open source security

    Article Metabase team — Open-source BI vendor; their security inbox is a representative sample of what's hitting commercial OSS projects this spring.

    Historically, Metabase averaged 10 submissions per month. Starting in January, we've been averaging 10 submissions per week, and many of these are legit.

    www.metabase.com/blog/strip-mining-era-of-o… →
    Details
    Cited text
    Historically, Metabase averaged 10 submissions per month. Starting in January, we've been averaging 10 submissions per week, and many of these are legit.
    Context
    Reframes the economics of being open-source in 2026: the historical security advantage of public-eyes-on-code is dissipating as agents commoditize code review, and commercial OSS projects are starting to close their doors.
    Key points
    • Volume of vulnerability reports moved from ~10/month to ~10/week starting January 2026, most reading like LLM output
    • No single vendor or model is driving it — coding agents in general have crossed a code-reading threshold
    • Cal.com is going closed source as a direct response; more commercial OSS projects expected to follow
    • Maintainer playbook: assume every disclosed vuln is trivially discoverable, drop weekend plans, patch immediately
    • User-facing advice: pin dependencies, upgrade aggressively, defense in depth, least privilege, log everything
    Provenance
    Article · Supporting source
  3. 3

    arXiv implements 1-year ban for papers containing incontrovertible evidence of unchecked LLM-generated errors

    Source u/Nunki08 (paraphrasing Tom Dietterich) — Top-voted MachineLearning subreddit post quoting Tom Dietterich, the arXiv moderator for cs.LG, on a new enforcement policy.

    By signing your name as an author of a paper, each author takes full responsibility for all its contents, irrespective of how the content was generated.

    www.reddit.com/r/MachineLearning/comments/1… →
    Details
    Cited text
    By signing your name as an author of a paper, each author takes full responsibility for all its contents, irrespective of how the content was generated.
    Context
    Marks a turning point where the citation graph itself starts treating LLM-produced garbage as fraud rather than tolerable noise — and forces every other publication venue to decide whether to do the same.
    Key points
    • arXiv will impose a one-year submission ban on authors of papers with hallucinated references, hallucinated results, or leftover model output
    • The bar is 'incontrovertible evidence' — fake DOIs, citations to nonexistent papers, results referencing experiments that weren't run
    • Policy is driven by a sharp increase in volume across cs.LG, not isolated incidents
    • Open question: whether banned authors will be named publicly; the deterrent effect depends on it
    • Other preprint servers and venues are likely to follow
    Provenance
    Source · Background source
  4. 4

    Replit iOS app back on the App Store after four months

    X @amasad (Amjad Masad) — Founder and CEO of Replit, the highest-profile agentic-coding-for-everyone product on the market.

    We worked things out with Apple, and just published our app for the first time in 4 months.

    x.com/amasad/status/2055185058282226146 →
    Details
    Cited text
    We worked things out with Apple, and just published our app for the first time in 4 months.
    Context
    A concrete reminder that platform gatekeeping, not model capability, sets the actual ceiling for shipping agentic products to consumers — and the resolution path is opaque even for a well-funded company.
    Key points
    • Replit's iOS app was pulled or paused for four months; details of what Apple objected to have not been disclosed
    • Resolution announced May 15 with no public lesson-learned post
    • Replies frame app review as the real ceiling for agentic AI apps on mobile
    • Houman Asefi: 'App Store, cloud credits, GPUs, and payment rails are the actual choke points'
    • Robertus: 'Mobile is where agent workflows stop being a demo and start becoming something you check between errands'
    Provenance
    Tweet · Primary source
  5. 5

    Uncle Bob switches from Claude to Codex

    X @unclebobmartin (Robert C. Martin) — Long-time developer-craft author (Clean Code, Clean Architecture); audience skews senior practitioner.

    Less wordy. More down to earth. More direct. A bit less risk averse — which I consider to be an advantage because I am the guarantor, not it.

    x.com/unclebobmartin/status/205497032759204… →
    Details
    Cited text
    Less wordy. More down to earth. More direct. A bit less risk averse — which I consider to be an advantage because I am the guarantor, not it.
    Context
    A single anecdotal switch, but from someone whose default audience is senior developers; pairs with OpenAI's Codex mobile launch as a signal of where the developer-tooling axis is sitting this week.
    Key points
    • High-profile developer publicly cancels Claude account after weeks of using Codex exclusively
    • Cites tone, directness, and willingness to be 'adventurous' — calls it a vibe choice
    • Reports running 8–9 hour Codex sessions without hitting limits
    • Points to his own swarm-forge GitHub repo, a multi-agent coordinator
    • Engagement: 2,040 likes, 137,000 views in 20 hours
    Engagement
    2040 likes · 119 retweets · 161 replies
    Provenance
    Tweet · Primary source
  6. 6

    Arvind Narayanan on the verification challenge

    X @random_walker (Arvind Narayanan) — Princeton computer science professor; co-author of AI Snake Oil, frequent commenter on the gap between AI marketing and AI use.

    The harder AI companies try to make their products feel like magic genies, the steeper the learning curve gets.

    x.com/random_walker/status/2055271764662296… →
    Details
    Cited text
    The harder AI companies try to make their products feel like magic genies, the steeper the learning curve gets.
    Context
    Names the asymmetry behind every agentic-coding product decision this year: confident-sounding output raises the cost of catching the model's mistakes, and that cost is on the human.
    Key points
    • Frames the irony of magic-genie product design making real use harder, not easier
    • 'Prompt engineering may no longer be a thing, but the verification challenge isn't going away'
    • Verification requires practice and learning — it's not a UX problem you can paper over
    • Lands on the same point Bob Martin makes from the user side: the user is the guarantor
    Provenance
    Tweet · Primary source
  7. 7

    Codex for Everyday Work: AI Agents Beyond Coding

    Video OpenAI — OpenAI Forum conversation with Chris Nicholson (Global Affairs) and Thibault Sottiaux (Head of Codex), May 14, 2026.

    Codex began as a tool for developers. Today, people are using it for much more: research, planning, file organization, automation, data analysis, presentations.

    www.youtube.com/watch?v=DLP9CagE3dU →
    Details
    Cited text
    Codex began as a tool for developers. Today, people are using it for much more: research, planning, file organization, automation, data analysis, presentations.
    Context
    The mobile launch and the broader-than-coding pitch are happening in the same week; OpenAI is positioning Codex as the everyday-work surface, not the developer-only one.
    Key points
    • OpenAI is publicly broadening Codex's positioning from developer tool to knowledge-work agent
    • Companion mobile launch puts Codex in the ChatGPT app for iOS and Android
    • Sottiaux: users now start, steer, and review Codex jobs from a phone while compute runs on a remote machine
    • Frames Codex as the front-end to long-running agent work on shared infrastructure
    Provenance
    Video · Supporting source
  8. 8

    NVIDIA Reportedly Prepares RTX 5090 Price Hike Amid Rising GDDR7 Costs

    Article AleksandarK — TechPowerUp reporter; original report sourced to Chinese Board Channels, a supply-chain leak feed.

    A $300 (about 2,000 RMB) increase for NVIDIA's add-in card (AIC) partners, who purchase these GPUs from NVIDIA.

    www.techpowerup.com/349050/nvidia-reportedl… →
    Details
    Cited text
    A $300 (about 2,000 RMB) increase for NVIDIA's add-in card (AIC) partners, who purchase these GPUs from NVIDIA.
    Context
    Cost of the local-AI hobbyist's default card just stepped up at the same time hosted-model pricing tiers are rising; the all-in cost of doing agentic work yourself versus paying a vendor is being re-priced.
    Key points
    • NVIDIA passing a $300 GPU-kit price increase to add-in-card partners for RTX 5090 and 5090D V2
    • Driven by GDDR7 supply tightness; lead times running into weeks
    • MSRP nominally $1,999; street prices on Newegg regularly cross $4,000
    • Founders Edition restocks on NVIDIA's own marketplace remain the only path near MSRP
    • Hike will likely show up at retailers in days or weeks
    Provenance
    Article · Supporting source
  9. 9

    Browsers Treat Big Sites Differently

    Article Den Odell — Front-end engineer and writer; this piece is a tour through Firefox's about:compat and WebKit's Quirks.cpp source.

    Facebook, X (twitter), and Reddit will naively pause a video element that has scrolled out of the viewport, regardless of whether that element is currently in PiP mode.

    denodell.com/blog/browsers-treat-big-sites-… →
    Details
    Cited text
    Facebook, X (twitter), and Reddit will naively pause a video element that has scrolled out of the viewport, regardless of whether that element is currently in PiP mode.
    Context
    Working theory of modern web compatibility laid out with primary source code: Chrome sets the agenda, other engines maintain quirks files. Worth knowing if you ship to browsers.
    Key points
    • Safari and Firefox both ship domain-specific rendering overrides; Chrome doesn't
    • Firefox exposes its overrides as togglable interventions at about:compat
    • WebKit's Quirks.cpp ships verbatim user-agent strings impersonating Chrome for Amazon Prime Video and other sites
    • Specific quirks for TikTok, Netflix, Instagram, Zillow, SeatGuru, and Amazon product zoom
    • Chrome's market dominance makes its undocumented behaviors the de facto spec other engines must paper over
    Provenance
    Article · Supporting source
  10. 10

    O(x)Caml in Space

    Article Thomas Gazagnaire — Co-founder of Parsimoni (space software spinout from Tarides) and long-time OCaml/MirageOS contributor.

    Switching to OxCaml with exclave_ stack_ annotations drops p99.9 latency from 29 ns to 9 ns per packet on the dispatch hot path, and removes GC pressure entirely.

    gazagnaire.org/blog/2026-05-14-borealis.html →
    Details
    Cited text
    Switching to OxCaml with exclave_ stack_ annotations drops p99.9 latency from 29 ns to 9 ns per packet on the dispatch hot path, and removes GC pressure entirely.
    Context
    A working counter-example to the 'pick whatever Python ships with' default — small team, language-rigour-first stack, formally verified components, actual production hardware in actual orbit.
    Key points
    • Pure-OCaml CCSDS protocol stack 'Borealis' booted in low Earth orbit on April 23, 2026 inside DPhi Space's ClusterGate-2
    • End-to-end-encrypted command and control with post-quantum signing (ML-DSA-65) and over-the-air rekeying
    • Wire formats as typed schemas, GADT-encoded state machines, formally verified crypto primitives (libcrux, fiat-crypto)
    • OxCaml mode-system annotations (locality, uniqueness) drop p99.9 latency from 29 ns to 9 ns and eliminate GC pressure on the dispatch hot path
    • Five-to-ten-MB statically linked flight binary, FROM scratch Docker image, running on a four-core Cortex-A53 module
    Provenance
    Article · Supporting source
  11. 11

    LocalLLaMA discussion on the 5090 price hike

    Source u/panchovix (LocalLLaMA subreddit) — Top thread on LocalLLaMA reacting to the TechPowerUp report; representative of the local-inference hobbyist take.

    NVIDIA Reportedly Prepares RTX 5090 Price Hike Amid Rising GDDR7 Costs (maybe RTX 50 and PRO series as well)

    www.reddit.com/r/LocalLLaMA/comments/1td9eh… →
    Details
    Cited text
    NVIDIA Reportedly Prepares RTX 5090 Price Hike Amid Rising GDDR7 Costs (maybe RTX 50 and PRO series as well)
    Context
    Sentiment check from the people whose monthly Codex/Claude bills are being directly traded against GPU purchase decisions — the price hike accelerates conversations about running quantized models locally.
    Key points
    • 356 upvotes, 160 comments — surface-level frustration plus pragmatic 'glad I bought mine last year' chorus
    • Same subreddit is concurrently celebrating the RTX 5000 Pro 48GB as the new serious-hobbyist ceiling
    • Local-inference community treats the consumer-vs-datacenter GPU competition as a permanent state, not a temporary squeeze
    Provenance
    Source · Background source