Archive BRAID DAILY
'I don't write code anymore' — and fast models need slow developers
Subscribe

Braid Daily · 2026-05-23

'I don't write code anymore' — and fast models need slow developers

Pieter Levels says he hasn't written code in six months. Cerebras's counterpoint: faster generation makes discipline matter more, not less.

A lone maker on a couch with a laptop, surrounded by floating glowing browser tabs linked by thin yellow lines, captioned "I don't write code anymore."
The solo-maker workflow, mid-2026: tabs into your own sites, an agent open all day, the code itself out of frame.

The lead

1

Pieter Levels, the solo maker behind Nomad List and PhotoAI, posted his daily setup: browser tabs into his own sites on a VPS, mirrored to his phone, and Claude Code open all day. Then came the line people argued about: "I don't write code anymore. I haven't written code in I think 6 months? I think everyone is like this no?" A concrete data point on how far the solo-maker workflow has moved.

Read source

When typing stops being the bottleneck

3

Andreessen reposts the setup with one word

@pmarca

Marc Andreessen quote-tweeted Levels's screenshot of the all-day VPS-and-Claude-Code setup. His entire comment was one word, and that's what put "I don't write code anymore" in front of hundreds of thousands of people, flattened into a slogan on the way.

“Interesting.”

Read source

Fast Models Need Slow Developers — Sarah Chieng, Cerebras

AI Engineer (YouTube)

Codex Spark generates code at roughly 1,200 tokens per second, about 20x faster than Sonnet or Opus. Chieng's point is that the same speed produces bad code 20x faster too, so cheap validation and your own taste become the job. The loop below maps where that leaves the bottleneck.

“A lot of these bad habits that we had before that were generating maybe 50 tokens per second of bad code — unless we fix them, they're going to start generating 1,200 tokens per second of bad code.”

Read source

Lobster Trap: packaging an agent as a container image — Sally Ann O'Malley, Red Hat

AI Engineer (YouTube)

O'Malley packages an OpenClaw agent setup as a container image, so one person's config becomes a baseline a teammate can actually reproduce. Podman locally, a sub-agent in about two seconds, one flag to run the same image on Kubernetes.

“Sharing a good agent setup usually means handing someone a pile of markdown, config files, and YAML and hoping they reproduce what you have.”

Read source

Price and silicon

3

DeepSeek makes its 75% V4-Pro price cut permanent

The Next Web

The discount that floated as an April promo is now permanent — roughly a quarter of the original rate, paired with about a 90% cut to input cache-hit costs. That cache-hit discount rewards agent loops that re-send stable context every step.

“DeepSeek is making its 75% API discount permanent.”

Read source

NVIDIA drops 'Gaming' as a reporting category

Guru3D

NVIDIA folded standalone Gaming into a broader Edge Computing line — GeForce, AI PCs, workstations, consoles, robotics, automotive. That bucket booked about $6.4B last quarter against $81.6B in total revenue, with data center up 85% year over year. RTX cards keep shipping; gaming just isn't the headline.

Read source

BeeLlama v0.2.0: 4-5x speedups on a single RTX 3090

r/LocalLLaMA

On one 24GB RTX 3090, speculative decoding pushes Qwen 3.6, at 27 billion parameters, to 164 tokens per second, and Gemma 4, at 31 billion, to about 178 — roughly 4-5x the llama.cpp baseline. The top comment asks the obvious next test: does it hold up across a 200K-token agentic coding chat?

“Squeezing that 3090 like a lemon.”

Read source

Stakes, and the view from outside

3

Anthropic's Jack Clark: an AI-assisted Nobel discovery within a year

The Guardian

Clark's spread of bets at Oxford: an AI-assisted Nobel discovery within 12 months, AI-run companies earning millions within 18, models designing their own successors by the end of 2028. He still keeps a non-zero chance of catastrophe on the board, and his co-host pushed back on the risk of human cognitive atrophy. The falsifiable bets are the ones to hold him to.

“If we stand by and let synthetic intelligence multiply, then we'll eventually be forced into reactivity.”

Read source

Is AI viewed as 'evil' in non-tech communities?

r/singularity

A data engineer asks for a reality check after a hostile reaction to recommending AI to non-tech friends. The top reply, hundreds of upvotes deep, lays out the case without strawmanning it: people see AI shoved into everything by billionaires, livelihoods lost, creatives first.

“For a lot of people, there's limited upsides to AI right now... right now it's not serving most people and in many cases it's causing harm.”

Read source

Overheard in a Copenhagen park: world models and grounded video gen

@niloofar_mire

An AI researcher flew to Copenhagen to detach, burnt out, sat down in a random park, and caught the couple next to her talking shop. A light marker of how far past the conference halls this conversation has spread.

“I overheard the couple next to me talking about world models and grounded video gen LOL”

Read source

Companion episode

Fast models, slow developers — and the part of the job that stays yours

· 00:21:39

The same week held two readings of where this is going. Levels says he hasn't typed code in six months; over in the singularity subreddit, a data engineer got a hostile reaction just for suggesting AI to friends. As the tooling gets cheaper and faster — DeepSeek's permanent price cut, a four-year-old 3090 doing 170 tokens a second — those two stories keep moving apart, not together.