AGENT THREADS. How to SHIP like Boris Cherny. Ralph Wiggum in Claude Code.

Overview

The video introduces “thread-based engineering” as a framework for measuring and improving AI agent workflows. Agent threads are units of work where engineers prompt at the start, agents execute tool calls in the middle, and engineers review at the end. By thinking in terms of threads, engineers can systematically scale their output through parallelization, chaining, and automation.

Watch the Video

Key Takeaways

Measure agent progress through tool calls - your improvement as an AI engineer directly correlates to the number of tool calls your agents execute on your behalf
Scale through parallel threads - run multiple agents simultaneously in separate terminals or processes to multiply your compute power, like Boris Cherny running 5-15 Claude instances
Use fusion threads for higher confidence - send the same prompt to multiple agents, then combine or select the best results to reduce failure rates and increase trust
Chain threads for sensitive work - break complex production tasks into phases with human review checkpoints between each stage to maintain control over critical operations
Build toward zero-touch threads - the ultimate goal is maximum agent autonomy where you only show up for planning, with agents handling all execution and validation independently

Topics Covered

0:00 - Introduction to Thread-Based Engineering: Introduces the problem of measuring improvement in AI agent workflows and previews the thread-based framework
2:00 - Base Thread Fundamentals: Defines a thread as a unit of work with human prompt/review bookends and agent tool calls in the middle
4:00 - Parallel Threads (P-threads): How to run multiple agents simultaneously to scale compute, with Boris Cherny’s setup as example
9:00 - Chained Threads (C-threads): Breaking large tasks into phases with human checkpoints for sensitive production work
11:30 - Fusion Threads (F-threads): Running same prompts across multiple agents then combining results for higher confidence
15:30 - Big Threads (B-threads): Meta-structure where agents prompt other agents, creating nested workflows and sub-agents
19:00 - Long Threads (L-threads): Extended autonomous agent work running for hours with hundreds of tool calls
22:30 - Four Ways to Improve: Framework for measuring progress: more threads, longer threads, thicker threads, fewer checkpoints
26:30 - Zero-Touch Threads (Z-threads): The ultimate goal of maximum agent trust where human review becomes unnecessary