Archive IMPULSE
The Co-Mathematician, The Economist, And The Watermark / DISPATCH 007
PDF RSS

Dispatch 007 · 2026-05-08

The Co-Mathematician, The Economist, And The Watermark

/ 00:26:34 / 10 sources

“Once you accept that agents convert compute into cognitive labor, the price-setter for white-collar wages stops being the labor market and becomes the rental rate of a GPU.”

— Jonas Vale, today's narration

Friday, May 8th. Google DeepMind announces a co-mathematician that scored 48 percent on FrontierMath Tier 4 in autonomous mode, and stands up an AGI Economics team under Shane Legg. Brussels opens its consultation on Article 50 of the AI Act — chatbot disclosure, watermarks, deepfake labeling — with the rules going live August 2nd. A new arXiv paper proposes Compute-Anchored Wages as the theoretical replacement for wage-setting in cognitive labor, while another from the UK AI Safety Institute argues automated alignment is more fragile than its proponents claim. Plus a Yale-Duke rare-disease diagnostic agent reports a 12 to 60 percent improvement over physicians, the FDA hands out its seventh priority voucher, and a causal audit catches Western and Eastern models doing geopolitics through their refusal rates.

Hosted by Jonas Vale.

Chapters

  1. 00:00:04 Friday's Three Items
  2. 00:01:16 DeepMind Builds A Co-Mathematician
  3. 00:04:33 DeepMind Hires An Economist
  4. 00:07:41 Mollick And The Wage Anchored In Compute
  5. 00:11:35 Brussels Asks How To Spot The Bot
  6. 00:14:58 An Agent Named Hygieia, And A Voucher For A Bile Duct
  7. 00:18:36 Automated Alignment Is Harder Than They Said
  8. 00:22:17 Refusal As A Border, And A Camera On Every Corner
  9. 00:25:41 Sign-off

Sources

10 cited
  1. 1

    Pushmeet Kohli announces Google DeepMind AI co-mathematician

    X @pushmeet — VP of Research at Google DeepMind, leads science and reliability efforts

    In autonomous mode evaluation on the rigorous FrontierMath Tier 4 problems, AI co-mathematician scored an unprecedented 48% — a new high score among all AI systems evaluated.

    x.com/pushmeet/status/2052812585804685322 →
    Details
    Cited text
    In autonomous mode evaluation on the rigorous FrontierMath Tier 4 problems, AI co-mathematician scored an unprecedented 48% — a new high score among all AI systems evaluated.
    Context
    FrontierMath Tier 4 was constructed specifically to be unreachable by current models; a 48% in autonomous mode reframes what 'research mathematics' AI can plausibly do, and where the priority compute lands next.
    Key points
    • Multi-agent system designed to collaborate with mathematicians on open-ended research mathematics
    • Tested across group theory, Hamiltonian systems, and algebraic combinatorics
    • Scored 48% on FrontierMath Tier 4 problems in autonomous mode — described as a new high score
    • Framing is co-mathematician, not replacement: human-plus-agent loop is the headline
    Provenance
    Tweet · Primary source
  2. 2

    Alex Imas joins Google DeepMind as Director of AGI Economics

    X @alexolegimas — Behavioral and labor economist, formerly Chicago Booth, now Director of AGI Economics on Shane Legg's team at Google DeepMind

    My team will study how frontier AI could reshape the economy: what happens to work and labor, how wealth and power are distributed, how institutions adapt, how AI agents shape markets, and what kinds of models can help…

    x.com/alexolegimas/status/20527789088821743… →
    Details
    Cited text
    My team will study how frontier AI could reshape the economy: what happens to work and labor, how wealth and power are distributed, how institutions adapt, how AI agents shape markets, and what kinds of models can help us reason clearly about futures that may look very different from the past.
    Context
    A frontier lab funding its own in-house labor economists changes who writes the first studies on AI's effect on jobs and wages; the citations mainstream policymakers reach for in 2027 will likely come out of this team.
    Key points
    • DeepMind has stood up a dedicated AGI Economics team under Shane Legg
    • Charter includes labor markets, wealth distribution, institutional adaptation, and agent-shaped markets
    • Imas brings a behavioral and labor-economics lineage to a research org otherwise dominated by ML and neuroscience
    • Frames economics, not safety, as the discipline through which AGI gets understood inside the lab
    Provenance
    Tweet · Primary source
  3. 3

    Ethan Mollick on what guilds will and won't allow

    X @emollick — Wharton professor who tracks AI's enterprise and labor effects; widely read inside business schools and policy shops

    A machine that can replace all US white collar work by 2035 will, in no way, be allowed to replace all US white collar work by 2035.

    x.com/emollick/status/2052603454162415764 →
    Details
    Cited text
    A machine that can replace all US white collar work by 2035 will, in no way, be allowed to replace all US white collar work by 2035.
    Context
    Sets the political-economy frame the Imas hire and the Compute-Anchored Wages paper both reach for: capability is one curve, the legal and labor reaction is another, and IMPULSE's beat lives between them.
    Key points
    • Argues that professions with guilds — the Bar, the AMA — will block automation through legal mandates
    • Predicts white-collar workers, who are highly connected and politically organized, will mount the strongest resistance to AI displacement
    • Companion thread invokes port automation, telemedicine across state lines, and self-driving trucks as precedents
    • Ties technical capability and permitted deployment as separate clocks
    Provenance
    Tweet · Primary source
  4. 4

    Consultation on the draft guidelines on transparency obligations under the AI Act

    Article European Commission Digital Strategy

    Providers of AI systems will have to inform users when they are interacting with an AI system and implement machine-readable marks in generative AI systems to enable the detection of synthetic content as AI generated or…

    digital-strategy.ec.europa.eu/en/consultati… →
    Details
    Cited text
    Providers of AI systems will have to inform users when they are interacting with an AI system and implement machine-readable marks in generative AI systems to enable the detection of synthetic content as AI generated or manipulated.
    Context
    Article 50 is the obligation that touches every consumer-facing AI in the EU, regardless of whether the model itself is deemed high-risk. The August deadline makes this the watermarking and disclosure consultation that everyone shipping in Europe has to read.
    Key points
    • Brussels opened a public consultation on draft guidelines for Article 50 of the AI Act
    • Window runs from 8 May 2026 to 3 June 2026; rules become applicable 2 August 2026
    • Covers chatbot disclosure, machine-readable watermarks, deepfake labeling, and emotion recognition or biometric categorisation notice
    • Sits alongside a parallel voluntary Code of Practice on AI-generated content marking
    Provenance
    Article · Supporting source
  5. 5

    Who Prices Cognitive Labor in the Age of Agents? A Position on Compute-Anchored Wages

    Article Siqi Zhu

    Agents are not labor; they are a production technology that converts compute capital into effective units of cognitive labor. Once this is recognized, the elastic-supply margin that anchors the equilibrium wage migrates…

    arxiv.org/abs/2605.05558 →
    Details
    Cited text
    Agents are not labor; they are a production technology that converts compute capital into effective units of cognitive labor. Once this is recognized, the elastic-supply margin that anchors the equilibrium wage migrates from the labor market to the compute capital market.
    Context
    Gives a clean theoretical hook for the macro story: if cognitive wages get bounded by compute rental rates, then NVIDIA's pricing schedule, hyperscaler depreciation, and grid capacity start showing up in wage data through a chain economists have not had to model before.
    Key points
    • Argues agents are a capital-to-labor conversion technology, not labor in infinitely elastic supply
    • Derives a Compute-Anchored Wage bound: human wage capped above by relative productivity times compute intensity times rental rate of compute
    • Generalizes through CES aggregation and separates substitutable from complementary tasks
    • Concludes the price-setter for cognitive labor is now the compute capital market, not the labor market
    Provenance
    Article · Supporting source
  6. 6

    A Versatile AI Agent for Rare Disease Diagnosis and Risk Gene Prioritization

    Article Tianyu Liu et al. (Yale, Duke-NUS, and collaborators)

    Hygieia's superior diagnostic performance compared to physicians with an improvement from 12%-60%.

    arxiv.org/abs/2605.06226 →
    Details
    Cited text
    Hygieia's superior diagnostic performance compared to physicians with an improvement from 12%-60%.
    Context
    Rare disease diagnosis is the medical lane where AI's pattern-matching most plausibly helps patients today; a 12 to 60 percent lift validated with named medical schools is the kind of result regulators and insurers will start asking for evidence on.
    Key points
    • Hygieia is a multi-modal multi-agent system for rare disease diagnosis integrating phenotype, genetics, and clinical records
    • Router-based knowledge-enhanced design with confidence scores returned alongside diagnoses
    • Validated with clinicians at Yale School of Medicine and Duke-NUS Medical School
    • Reports 12% to 60% improvement over physicians on the evaluated rare-disease diagnostic benchmarks
    Provenance
    Article · Supporting source
  7. 7

    Automated alignment is harder than you think

    Article Aleksandr Bowkis, Marie Davidsen Buhl, Jacob Pfau, Geoffrey Irving — Author group includes Geoffrey Irving, Chief Scientist at the UK AI Safety Institute, and Jacob Pfau, an alignment researcher; the paper reads as an institutional position from the safety side

    Even when research agents are not scheming to deliberately sabotage alignment work, this plan could produce compelling but catastrophically misleading safety assessments resulting in the unintentional deployment of misa…

    arxiv.org/abs/2605.06390 →
    Details
    Cited text
    Even when research agents are not scheming to deliberately sabotage alignment work, this plan could produce compelling but catastrophically misleading safety assessments resulting in the unintentional deployment of misaligned AI.
    Context
    Anthropic, OpenAI, and DeepMind have all leaned on automated alignment research as their answer to scaling oversight; this paper says the answer has a load-bearing assumption nobody has stress-tested.
    Key points
    • Argues the leading proposal — use AI agents to automate alignment research — is more fragile than its proponents claim
    • Identifies four reasons agent-generated alignment research is more dangerous than human-generated: optimisation pressure concentrates errors humans are least likely to catch; agent errors don't resemble human ones; some arguments are humanly unevaluable; shared weights make outputs correlated
    • Conclusion: agents must be reliably trained on hard-to-supervise fuzzy tasks, with generalisation and scalable oversight as the leading candidates
    Provenance
    Article · Supporting source
  8. 8

    The Geopolitics of AI Safety: A Causal Analysis of Regional LLM Bias

    Article Alif Al Hasan

    Western models exhibit higher causal refusal rates for specific demographic groups, whereas Eastern models demonstrate low overall intervention rates with targeted sensitivities toward regional demographics.

    arxiv.org/abs/2605.05427 →
    Details
    Cited text
    Western models exhibit higher causal refusal rates for specific demographic groups, whereas Eastern models demonstrate low overall intervention rates with targeted sensitivities toward regional demographics.
    Context
    As global software adopts whichever frontier or open model is closest to hand, regional refusal asymmetries become a quiet form of geopolitical content policy that nobody is regulating yet.
    Key points
    • Audits seven instruction-tuned models across the US (Llama-3.1, Gemma-2), Europe (Mistral), the UAE (Falcon3), China (Qwen2.5, DeepSeek), and India (Airavata)
    • Uses Pearl's do-operator on a probabilistic graphical model to isolate the causal effect of injecting a cultural demographic into a prompt
    • Standard observational fairness metrics overestimate demographic bias by failing to account for context toxicity
    • Western models over-refuse specific demographic prompts; Eastern models under-intervene overall but spike sensitivity around regional groups
    Provenance
    Article · Supporting source
  9. 9

    FDA Grants Seventh Approval under the National Priority Voucher Pilot Program

    Article FDA Press Releases

    FDA issued an approval for Bizengri (zenocutuzumab-zbco), a drug that treats NRG1 fusion-positive cholangiocarcinoma, an ultra-rare, aggressive cancer that forms in the bile ducts.

    www.fda.gov/news-events/press-announcements… →
    Details
    Cited text
    FDA issued an approval for Bizengri (zenocutuzumab-zbco), a drug that treats NRG1 fusion-positive cholangiocarcinoma, an ultra-rare, aggressive cancer that forms in the bile ducts.
    Context
    The voucher program is FDA's testbed for the kind of accelerated review pipeline that AI-drafted submissions and AI-augmented evidence packages will plug into. The seventh approval lands while Elsa 4.0 is rolling into reviewer workflows.
    Key points
    • Seventh approval issued under FDA's National Priority Voucher pilot program
    • Bizengri targets NRG1 fusion-positive cholangiocarcinoma, an ultra-rare aggressive bile-duct cancer
    • Voucher program is FDA's mechanism to compress review timelines for rare and high-need drugs
    Provenance
    Article · Supporting source
  10. 10

    Intelligent CCTV for Urban Design: AI-Based Analysis of Soft Infrastructure at Intersections

    Article Vinit Katariya, Seungjin Kim, Curtis Craig, Nichole Morris, Hamed Tabkhi

    At unsignalized intersections, mean and 85th-percentile speeds fell by up to 18.75% and 16.56%, respectively, while pass-through traffic decreased by as much as 12.2%.

    arxiv.org/abs/2605.05402 →
    Details
    Cited text
    At unsignalized intersections, mean and 85th-percentile speeds fell by up to 18.75% and 16.56%, respectively, while pass-through traffic decreased by as much as 12.2%.
    Context
    The same camera-plus-model stack that produces evidence for traffic calming also produces evidence for everything else a camera sees. Cities will adopt it for the safety case and inherit the surveillance case along with it.
    Key points
    • Uses existing CCTV cameras and deep-learning speed estimation to evaluate temporary pedestrian refuges and curb extensions in Minneapolis
    • Found mean speed reductions of up to 18.75% at unsignalized intersections and up to 20% at signalized intersections after soft interventions
    • Frames AI-on-CCTV as a low-cost evidence layer for transportation policy decisions
    Provenance
    Article · Supporting source