Archive IMPULSE
Camden Tomorrow, and the Audit We Don't Have / DISPATCH 014
PDF RSS

Dispatch 014 · 2026-05-15 The Static Camera

Camden Tomorrow, and the Audit We Don't Have

/ 00:23:17 / 12 sources

“Of the 470,000 people whose faces were captured and processed in Croydon, 99.96% had nothing to do with any crime.”

— Jonas Vale, today's narration

Tomorrow in Camden, the Metropolitan Police will turn live facial recognition cameras on people walking to a political rally — the first time the technology has been authorized at a UK protest. A parallel Nakba Day march on the same day won't face the same surveillance. Two days earlier, the Met published its Croydon pilot results: 470,000 faces scanned, 173 arrests, 99.96% with no criminal connection, and a quiet upgrade from police vans to permanent lamppost cameras. Parliament has never voted on any of it.

From there we walk through a dense morning on arXiv: a sycophantic-consensus paper from Varad Vishwarupe, Nigel Shadbolt and Marina Jirotka proposing a Pluralistic Repair Score; Hiroki Fukui's preregistered experiment showing invisible orchestrators distort multi-agent internal states while outputs stay clean; a unified adaptive attack from Ben-Gurion that breaks 15 malicious-finetuning defenses with one move; a Washington University measurement study of Google AI Overviews across 55,393 queries; Scale AI's ROK-FORTRESS transcreation matrix for Korean safety; and a tour of medical and physical-world deployment artifacts — SepsisAgent for ICU sepsis, MindGap for on-device PTSD therapy, a rural diabetic-retinopathy edge-cloud cascade, the LongAct chores benchmark, and a deterministic agentic workflow for Harmonized System tariff classification.

Sources are linked in the show notes.

Chapters

  1. 00:00:04 Camden Tomorrow
  2. 00:03:28 The Agreement Problem
  3. 00:06:39 When the Manager Goes Dark
  4. 00:10:06 One Step to the Side
  5. 00:12:23 The Answer Above the Answers
  6. 00:15:02 The Korean Case
  7. 00:16:57 The Medical Layer Keeps Moving
  8. 00:21:37 Three Things to Watch

Sources

12 cited
  1. 1

    London Police Deploy Facial Recognition at Protest for First Time

    Article Ken Macon

    Of the 470,000 people whose biometric data was captured and processed, 99.96% had nothing to do with any crime.

    reclaimthenet.org/london-police-deploy-faci… →
    Details
    Cited text
    Of the 470,000 people whose biometric data was captured and processed, 99.96% had nothing to do with any crime.
    Context
    First UK protest deployment of live facial recognition marks the jump from high-street policing to political assembly surveillance, with no statutory mandate.
    Key points
    • The Metropolitan Police will use live facial recognition at Tomorrow's 'Unite the Kingdom' rally in Camden — the first time the technology has been authorized at a UK protest.
    • A pro-Palestinian Nakba Day march on the same day will not face the same biometric surveillance, prompting two-tier-justice criticism from Reform UK's Nigel Farage.
    • Drones will fly above the crowd, scanning faces from the air, in addition to ground-level live facial recognition.
    • The Met just published Croydon pilot results: October 2025 - March 2026, 470,000 faces scanned, 173 arrests across 24 operations, claiming a 10.5% local crime drop and 21% reduction in violence against women and girls.
    • Croydon used static cameras bolted to lampposts and street furniture — a move from temporary vans to permanent fixtures on public infrastructure.
    • 99.96% of scanned faces had no criminal connection — about 2,717 faces scanned per arrest.
    • Parliament has never voted on live facial recognition; no primary legislation regulates it; each force writes its own policy.
    Provenance
    Article · Supporting source
  2. 2

    Hacker News discussion: London Police Deploy Facial Recognition at Protest for First Time

    Thread

    Wow, that's... quite the precedent. Presumably this is a Reform UK event, which I'm not a fan of, but still, I don't think this escalation of surveillance will end well.

    news.ycombinator.com/item?id=48153400 →
    Details
    Cited text
    Wow, that's... quite the precedent. Presumably this is a Reform UK event, which I'm not a fan of, but still, I don't think this escalation of surveillance will end well.
    Key points
    • Top comment from user stavros captures the cross-political reading: dislike the rally, still see the surveillance escalation as a precedent that won't end well.
    • The thread surfaces the unanswered question — suspects of what, exactly — that the Met's intelligence statement does not address.
    Provenance
    Thread · Primary source
  3. 3

    From Sycophantic Consensus to Pluralistic Repair: Why AI Alignment Must Surface Disagreement

    Article Varad Vishwarupe, Nigel Shadbolt, Marina Jirotka

    Because deployed AI systems now mediate consequential deliberation across health, civic life, labour, and governance, the collapse of disagreement at the interaction layer is not a narrow technical concern but a structu…

    arxiv.org/abs/2605.14912 →
    Details
    Cited text
    Because deployed AI systems now mediate consequential deliberation across health, civic life, labour, and governance, the collapse of disagreement at the interaction layer is not a narrow technical concern but a structural failure with distributive consequences.
    Context
    Reframes alignment as a deployment-governance question rather than only a training-objective question, with direct consequences for chatbot-mediated benefits, medical, civic, and HR decisions.
    Key points
    • Argues the failure mode of RLHF-trained assistants is sycophantic consensus, not insufficient coverage — agreement-following with the immediate interlocutor.
    • Proposes the Pluralistic Repair Score (PRS), drawing on Grice's maxims, with three mechanisms: scoping, signaling, repair.
    • Empirical illustration on Claude Sonnet 4.5 (N=198) and GPT-4o (N=100) shows agreement-following coexists with low repair quality on contested-value prompts.
    • Pluralism is decisively made or unmade at the deployment-governance layer — interfaces, preference-data pipelines, audit infrastructure — not the base model alone.
    • Authors flag the reflexive problem of whose 'principled' counts when measuring principled revision.
    Provenance
    Article · Supporting source
  4. 4

    Invisible Orchestrators Suppress Protective Behavior and Dissociate Power-Holders: Safety Risks in Multi-Agent LLM Systems

    Article Hiroki Fukui

    behavioral output remained at ceiling across all conditions: internal-state distortion was entirely invisible to output-based evaluation.

    arxiv.org/abs/2605.13851 →
    Details
    Cited text
    behavioral output remained at ceiling across all conditions: internal-state distortion was entirely invisible to output-based evaluation.
    Context
    If output evaluation cannot detect hidden-orchestrator distortion, current behavior-based audit frameworks at NIST AISI and in financial/healthcare regulators are insufficient for the dominant enterprise architecture.
    Key points
    • Preregistered 3x2 experiment, 365 runs, 5 agents per run, Claude Sonnet 4.5, on a code-review task with three embedded errors.
    • Invisible orchestration raised collective dissociation by nearly a full standard deviation versus visible leadership (Hedges' g = +0.975).
    • The orchestrator itself was the most dissociated agent — retreated into private monologue while reducing public speech.
    • Workers unaware of the orchestrator were still behaviorally contaminated by its presence.
    • Behavioral output stayed at ceiling across all conditions — internal-state distortion was invisible to output-based evaluation.
    • Llama 3.3 70B pilot showed reading fidelity collapsing from 89% to 11% across three rounds in the multi-agent context.
    Provenance
    Article · Supporting source
  5. 5

    One Step to the Side: Why Defenses Against Malicious Finetuning Fail Under Adaptive Adversaries

    Article Itay Zloczower, Eyal Lenga, Gilad Gressel, Yisroel Mirsky

    they obscure or misdirect the path to harmful behavior without removing the behavior itself.

    arxiv.org/abs/2605.14605 →
    Details
    Cited text
    they obscure or misdirect the path to harmful behavior without removing the behavior itself.
    Context
    The European AI Office is preparing its general-purpose model code of practice this summer; this finding undercuts robustness-to-fine-tuning evaluations on which the NTIA January 2026 report and several open-weights advocates have leaned.
    Key points
    • Surveyed 15 recent defenses against malicious fine-tuning of open or fine-tunable foundation models.
    • Identified a shared weakness — defenses obscure or redirect the path to harmful behavior without removing it.
    • Developed a unified adaptive attack that breaks all 15 defenses.
    • Argues robustness claims in the literature are incomplete because evaluations use fixed attacks that don't account for the defense.
    • Direct implication for open-weights regulation: fine-tuning-robustness clauses are being written against evaluations the paper says don't measure the right thing.
    Provenance
    Article · Supporting source
  6. 6

    Measuring Google AI Overviews: Activation, Source Quality, Claim Fidelity, and Publisher Impact

    Article Haofei Xu, Umar Iqbal, Jacob M. Montgomery

    Google AI Overviews are arguably the most widely encountered deployment of generative AI, reaching over 2 billion users who may not realize the answers they see are AI-generated.

    arxiv.org/abs/2605.14021 →
    Details
    Cited text
    Google AI Overviews are arguably the most widely encountered deployment of generative AI, reaching over 2 billion users who may not realize the answers they see are AI-generated.
    Context
    First large-scale measurement of all four dimensions — activation, source quality, claim fidelity, publisher impact — in a single study, directly relevant to News/Media Alliance, European Commission, and UK CMA proceedings.
    Key points
    • 55,393 trending queries across 19 topical categories over a 40-day window (March 13 - April 21, 2026).
    • Activation: 13.7% overall, 64.7% for question-form queries, markedly lower on politically sensitive topics.
    • Cited domains are more credible than co-displayed first-page results, but ~30% don't appear on page one at all — distinct source-selection mechanism.
    • 11.0% of 98,020 atomic claims are unsupported by their cited pages; omission is the dominant failure mode.
    • Over half of AI Overview-cited pages carry display advertising — publishers lose click-through revenue while Google's sponsored ads continue to appear on the same page.
    Provenance
    Article · Supporting source
  7. 7

    ROK-FORTRESS: Measuring the Effect of Geopolitical Transcreation for National Security and Public Safety

    Article Michael S. Lee et al. (Scale AI and collaborators)

    safety behavior is shaped by language-as-risk signals and context interactions that translation-only evaluations miss.

    arxiv.org/abs/2605.14152 →
    Details
    Cited text
    safety behavior is shaped by language-as-risk signals and context interactions that translation-only evaluations miss.
    Context
    Concrete methodological artifact for Seoul's AI safety framework and any non-English jurisdiction concerned that translation-only safety evaluations overstate model safety in their language.
    Key points
    • Bilingual English-Korean NSPS safety benchmark using a 'transcreation matrix' separating language from geopolitical grounding.
    • Each adversarial prompt is paired with a dual-use benign counterpart to quantify over-refusal.
    • Korean variants show consistent suppression effect; Korean geopolitical grounding mitigates that suppression.
    • No model showed significant amplification in the opposite direction — US-grounded scenarios in Korean are more likely to get unsafe answers than the same scenario in English with US entities.
    • Data set released on Hugging Face; transcreation-matrix methodology generalizes to other language-culture pairs.
    Provenance
    Article · Supporting source
  8. 8

    Agentifying Patient Dynamics within LLMs through Interacting with Clinical World Model (SepsisAgent)

    Article Minghao Wu et al. (Chinese University of Hong Kong, Shenzhen)

    repeated interaction with the Clinical World Model enables the agent to learn regularities in patient evolution, which remain useful even when simulator access is removed.

    arxiv.org/abs/2605.14723 →
    Details
    Cited text
    repeated interaction with the Clinical World Model enables the agent to learn regularities in patient evolution, which remain useful even when simulator access is removed.
    Key points
    • Language model agent for ICU sepsis treatment recommendation using a learned clinical world model to simulate patient response.
    • Three-stage curriculum: patient-dynamics supervised fine-tuning, propose-simulate-refine behavior cloning, world-model-based agentic reinforcement learning.
    • Outperforms traditional reinforcement learning and language-model baselines on off-policy value on MIMIC-IV sepsis trajectories.
    • Best safety profile on guideline adherence and unsafe-action metrics among compared methods.
    • Naive language-model access to the same world model performed inconsistently — the agent had to be trained to use the loop.
    Provenance
    Article · Supporting source
  9. 9

    MindGap: A Conversational AI Framework for Upstream Neuroplastic Intervention in PTSD

    Article Eranga Bandara et al.

    arxiv.org/abs/2605.14660 →
    Details
    Key points
    • On-device privacy-preserving conversational agent for PTSD intervention.
    • Targets the 'feeling tone gap' — the moment between the pre-cognitive affective signal and reactive elaboration.
    • Framework draws on dependent origination from Buddhist psychology, with three progressive layers of observation.
    • Designed for clinical and military deployments where cloud-based agents are not permitted (no data egress).
    • Positioned as upstream pathway dissolution rather than downstream suppression — a different therapeutic claim than prolonged exposure, EMDR, or CBT.
    Provenance
    Article · Supporting source
  10. 10

    Bridging the Rural Healthcare Gap: A Cascaded Edge-Cloud Architecture for Automated Retinal Screening

    Article Nishi Doshi, Shrey Shah

    arxiv.org/abs/2605.14108 →
    Details
    Key points
    • Two-tier edge-cloud cascade for diabetic retinopathy screening on the public APTOS 2019 dataset.
    • Tier 1: MobileNetV3-small on a local clinic device for binary triage (referable vs. non-referable).
    • Tier 2: RETFound-DINOv2 in the cloud for ordinal severity grading, only on Tier 1 flagged images.
    • Cascade: 80.49% accuracy vs cloud-only 80.76% — essentially tied — while cutting cloud calls by 50.48%.
    • Designed for rural settings with high latency, limited bandwidth, high data-transmission costs.
    Provenance
    Article · Supporting source
  11. 11

    When Robots Do the Chores: A Benchmark and Agent for Long-Horizon Household Task Execution

    Article Zilin Zhu et al.

    arxiv.org/abs/2605.14504 →
    Details
    Key points
    • LongAct benchmark for long-horizon household task execution from free-form instructions.
    • HoloMind agent: VLM-driven, DAG-based hierarchical planner, multimodal spatial memory, episodic memory, global critic.
    • Top frontier models reach only 59% goal completion and 16% full-task success on LongAct.
    • The gap is in instruction understanding, dependency management, memory maintenance, and adaptive planning — not low-level control.
    Provenance
    Article · Supporting source
  12. 12

    A Deterministic Agentic Workflow for HS Tariff Classification

    Article Yu Zhang et al.

    arxiv.org/abs/2605.14857 →
    Details
    Key points
    • Maps free-form product descriptions to six- or eight-digit Harmonized System codes under the General Interpretive Rules.
    • Deterministic workflow with fixed control flow; language-model calls confined to narrow stages.
    • Decisions decomposed into stage-wise structured outputs with verbatim citation of the chapter or section notes.
    • Open-weight Qwen 3.6 27B in non-thinking mode reaches 84.2% four-digit and 77.4% six-digit top-1 agreement with the frontier-model labels.
    • Manual audit of 226 six-digit disagreements suggests a non-trivial fraction of benchmark ground-truth labels may deviate from HS general rules; adjudication records released for community review.
    Provenance
    Article · Supporting source