Red/Green TDD

Details

Also known as: Red-Green-Refactor, Classical TDD Cycle, Test-First Development

Core Concepts:

Red phase: Write a failing test first, before any production code. The test must actually run and fail for the right reason — compilation errors don’t count as red.
Green phase: Write the minimum production code required to make the failing test pass. Resist the urge to add functionality the test does not demand.
Refactor phase: With tests passing, improve the design — rename, extract, deduplicate — confident the test suite will catch regressions.
Small steps: Each cycle is deliberately small (seconds to minutes), not hours. Large leaps break the feedback loop.
Watch it fail: Verifying that the test actually fails before implementation catches tautological tests and typos in the test itself.
Test names describe behavior: Not method names. "Returns empty list when no users exist" tells you what the code does; "test getUsers" does not.
Minimum to pass: Fake it ('return 42'), hardcode, or copy-paste — whatever makes the test pass fastest. Generalization happens in the refactor phase via triangulation.
One test at a time: Only one failing test in the suite at any moment. Never add a second failing test before the first is green.
Key Proponents: Kent Beck ("Test-Driven Development: By Example", 2002); popularized for LLM coding agents by Simon Willison

Instructing LLM coding agents that default to implementing features first and writing tests afterwards
New production code where requirements can be expressed as concrete examples
Bug fixes — write a failing test that reproduces the bug, then fix
Refactoring legacy code that lacks test coverage (start by characterizing behavior)
Teaching disciplined incremental development

TDD Chicago School - State-based classical TDD focused on real objects over mocks
TDD London School - Mock-heavy, interaction-based, outside-in variant
BDD Given-When-Then - Behavior-driven complement for higher-level acceptance tests

David Heinemeier Hansson, "TDD is dead. Long live testing." (2014) — test-first as dogma leads to "test-induced design damage"; the follow-up "Is TDD Dead?" conversation series with Kent Beck and Martin Fowler maps the whole debate
James O. Coplien, "Why Most Unit Testing is Waste" (2013) — most low-level unit tests provide little assurance relative to their maintenance cost
Empirical results are mixed: replication studies (e.g. Fucci et al., "A Dissection of the Test-Driven Development Process", IEEE TSE 2017) suggest observed benefits stem from working in small iterative steps rather than from writing the test first