Intelligent Operations · Deep Dives

Inside the Article Library: How the Writing Engine Produces Long-Form at Scale

12 prompts. Drop cap to footnote. How the Article Library chains prompts to produce publication-ready editorial without a single revision loop.

The Prompt Engineering Project · March 29, 2026 · 10 min read

Quick Answer

The Article Library runs 12 sequential prompts to produce publication-ready long-form content: brief analysis, voice calibration, structure design, lede writing, section bodies (one per section), transition writing, pull quote selection, footnote generation, meta description, and a final quality pass. Each prompt is scoped to a single task. Prompts 1-3 run on Sonnet (analysis), 4-10 on Haiku (execution), and 11-12 on Sonnet (quality). The full chain completes in 90-110 seconds, reducing cost by approximately 65% versus running all prompts on Sonnet.

Every senior editor knows the feeling: you ask someone to write a 2,000-word article and get back something that opens brilliantly, coasts through the middle, and ends on a sentence that sounds like the writer ran out of energy and caffeine at the same moment. That is not a people problem. It is an architecture problem.

A single "write me a great article about X" prompt hands the model too many responsibilities at once: understand the brief, choose a structure, establish a voice, write a compelling lede, maintain quality across 2,000 words, end well. Each of these is a separate cognitive task. Bundling them into one prompt means each one gets a fraction of the model's attention -- and the fraction allocated to sections three through five is smaller than the one given to sections one and two, because by then the context window is full of everything that came before.

The IO Article Library solves this with prompt decomposition. Each of the 12 prompts has one job. The brief analysis prompt reads the context brief and extracts 6 structured parameters. The voice calibration prompt reads those parameters and outputs a 200-token style specification. The structure design prompt reads the style spec and outputs a locked outline. No subsequent prompt writes freeform -- every prompt executes against a tightly constrained input. The quality is consistent because the constraints are consistent.
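The hand-off pattern described above can be sketched in a few lines. This is a minimal illustration of single-responsibility chaining, not the IO Platform's actual code: `call_model` is a hypothetical stand-in for a real LLM API call, and all names are illustrative.

```python
# Minimal sketch of single-responsibility prompt chaining: each step has one
# job and reads only the previous step's constrained output.
def call_model(model: str, prompt: str) -> str:
    """Placeholder: a real implementation would call the model API here."""
    return f"[{model}] {prompt[:48]}"

def run_analysis_phase(brief: str) -> dict[str, str]:
    outputs: dict[str, str] = {}
    # P01: extract structured parameters from the context brief.
    outputs["params"] = call_model("sonnet", f"Extract 6 params from: {brief}")
    # P02: derive a style specification from those parameters only.
    outputs["style_spec"] = call_model("sonnet", f"Derive style spec from: {outputs['params']}")
    # P03: produce the locked outline from the style spec.
    outputs["outline"] = call_model("sonnet", f"Lock outline from: {outputs['style_spec']}")
    return outputs

result = run_analysis_phase("How the Article Library produces long-form at scale")
```

The point of the shape, not the placeholder: no step ever sees the raw brief except P01, so every downstream prompt executes against a tightly constrained input.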

Why 12 Prompts and Not One

The number 12 is not arbitrary. It is the result of decomposing a publication-ready article into its minimum set of non-overlapping, single-responsibility tasks. Remove any one prompt and you either push its work onto an adjacent prompt (degrading that prompt's output) or you skip the step entirely (producing a detectably worse article).

Three decomposition decisions matter most. First, structure before copy: the outline is locked in prompt 3 before any body copy is written in prompts 4 through 10. This means every section prompt receives the full structure as context, which prevents sections from repeating or contradicting each other -- a failure mode endemic to single-prompt generation. Second, sections receive only their brief: each section-body prompt receives the locked outline and its specific section brief, not the full text of prior sections. This prevents voice drift and keeps context windows small. Third, quality pass at the end: prompt 11 reads the assembled article as a whole and flags coherence issues for prompt-level correction, not for manual editing.
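The second rule -- sections receive only their brief -- amounts to a deliberate context-assembly constraint. A sketch under stated assumptions (function and variable names are illustrative, not the platform's API):

```python
# Sketch of the "sections receive only their brief" rule: the context for a
# section-body prompt is the locked outline plus that section's own brief,
# never the full text of previously written sections.
def build_section_context(locked_outline: str, section_briefs: list[str], index: int) -> str:
    # Deliberately excludes prior sections' copy: this keeps the context
    # window small and prevents voice drift from accumulated text.
    return f"OUTLINE:\n{locked_outline}\n\nSECTION BRIEF:\n{section_briefs[index]}"

outline = "1. Problem  2. Decomposition  3. Routing"
briefs = ["Why single prompts fail", "The 12-step chain", "Sonnet vs. Haiku"]
ctx = build_section_context(outline, briefs, 1)
```

Because every section sees the same outline and only its own brief, section five is written under the same constraints as section one.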

Each prompt has one job. Structure before copy. Sections receive only their brief. A quality pass at the end. This is why section five reads as well as section one.

The 12-Prompt Chain

Each step in the chain has an input structure, an output format, a model assignment, a token budget, and a latency figure. The chain runs in three phases:

Article Library -- 12-Prompt Sequential Chain

Phase 1 -- Analysis (Claude Sonnet 4)
Phase 2 -- Execution (Claude Haiku)
Phase 3 -- Quality Pass (Claude Sonnet 4)

Model Routing: Sonnet vs. Haiku

The Article Library does not run all 12 prompts on the same model. It routes each prompt to the model whose capabilities match the task -- Sonnet for reasoning-heavy analysis and quality review, Haiku for high-volume content execution. This is not a cost-cutting measure. It is an architectural decision that produces better output: Haiku writes section bodies faster and more cleanly than Sonnet because its smaller, more focused attention window keeps it on task without introducing the complexity Sonnet adds when given creative latitude.
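Routing of this kind is static: each prompt ID maps to a model before the chain runs. One way to express the split this article describes (prompts 1-3 and 11-12 on Sonnet, 4-10 on Haiku) as data -- a sketch, not the platform's code, with model names used as plain labels:

```python
# Static prompt-to-model routing table. Analysis and quality prompts go to
# the reasoning-heavy model; execution prompts go to the fast model.
ROUTING: dict[str, str] = {
    **{p: "sonnet" for p in ("P01", "P02", "P03", "P11", "P12")},
    **{f"P{n:02d}": "haiku" for n in range(4, 11)},  # P04 through P10
}

def route(prompt_id: str) -> str:
    """Return the model assigned to a prompt; raises KeyError for unknown IDs."""
    return ROUTING[prompt_id]
```

Keeping the routing in a plain table makes the architectural decision inspectable and testable, rather than buried in per-prompt configuration.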

Model Routing Architecture -- Article Library

| Prompt | Model | Task | Token Budget |
|---|---|---|---|
| P01 | Sonnet 4 | Brief Analysis -- extract 6 params from brief | ~600 |
| P02 | Sonnet 4 | Voice Calibration -- derive style specification | ~600 |
| P03 | Sonnet 4 | Structure Design -- produce locked outline | ~900 |
| P04 | Haiku | Lede + drop-cap paragraph | ~800 |
| P05 | Haiku | Section 1 body copy | ~1,200 |
| P06-07 | Haiku | Section 2 & 3 body copy | ~2,400 |
| P08 | Haiku | Transitions between sections | ~1,100 |
| P09-10 | Haiku | Pull quote + footnotes | ~1,400 |
| P11 | Sonnet 4 | Coherence Review -- full-article quality pass | ~3,000 |
| P12 | Sonnet 4 | Meta description + related articles | ~2,900 |

Claude Sonnet 4 (reasoning / analysis / quality) handles five prompts; Claude Haiku (execution / speed / volume) handles seven.
Cost Comparison -- All-Sonnet vs. Hybrid Routing

All Sonnet: ~$0.048 per article
Hybrid (IO): ~$0.017 per article
Cost reduction per article run via hybrid routing: ~65%
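The quoted reduction checks out arithmetically against the two per-article figures:

```python
# Verifying the stated ~65% cost reduction from the article's own figures.
all_sonnet = 0.048   # per-article cost, all 12 prompts on Sonnet
hybrid = 0.017       # per-article cost with Sonnet/Haiku hybrid routing
reduction = 1 - hybrid / all_sonnet
print(f"{reduction:.0%}")  # prints "65%"
```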

The counterintuitive finding from routing experiments: Haiku-generated section bodies score higher on voice consistency than Sonnet-generated bodies, because Sonnet's tendency to elaborate pushes it off the locked style specification. Haiku executes the specification without editorializing. The best model for a task is not always the most capable model -- it is the model whose failure modes are most compatible with the constraint structure.

Before / After: Single Prompt vs. Chain

The most direct demonstration of prompt decomposition's value is a side-by-side comparison. Below are outputs for the same article brief -- one generated with a single "write a full article" prompt, one generated through the 12-step chain. The comparison shows the lede from each approach, the point where the two diverge first.

Output Comparison -- Single Prompt vs. 12-Step Chain

Single prompt (prompt 1 of 1), lede:

"Artificial intelligence is transforming the way businesses approach content creation. In today's rapidly evolving digital landscape, companies are increasingly turning to AI tools to streamline their content workflows and achieve greater efficiency. This article explores how the IO Platform's unique approach to content operations can help your team produce better content faster."

Critique: Generic opener. "Today's rapidly evolving digital landscape" is a tell. The second sentence leads with AI tools, not with the reader's problem. The third sentence sounds like a CTA, not a lede.

12-step chain (prompt 4 of 12, post-voice-calibration), lede:

"Every senior editor knows the feeling: you ask someone to write a 2,000-word article and get back something that opens brilliantly, coasts through the middle, and ends on a sentence that sounds like the writer ran out of energy and caffeine at the same moment. That is not a people problem. It is an architecture problem."

Critique: Opens with a specific professional recognition. Makes a structural claim. Uses the voice spec's "direct, no-hype, structural argument" register. The second sentence reframes the reader's assumption.

Voice Consistency Across Sections

Voice consistency is the metric that separates AI-generated articles that editors approve from ones they rewrite. The matrix below scores five voice attributes across the 12 prompts -- measuring how consistently each attribute holds from prompt 4 (lede) through prompt 10 (footnotes). A perfect score is 5 out of 5. The single-prompt baseline is shown for comparison.

| Voice Attribute | P04 Lede | P05-07 Body | P08 Transitions | P09-10 Close | Single Prompt Avg |
|---|---|---|---|---|---|
| Direct register (no hedging) | 5.0 | 5.0 | 4.7 | 5.0 | 3.2 |
| Structural argument (not descriptive) | 5.0 | 4.8 | 5.0 | 5.0 | 2.4 |
| Banned vocabulary avoidance | 5.0 | 5.0 | 5.0 | 5.0 | 3.0 |
| Audience-appropriate specificity | 4.8 | 4.8 | 5.0 | 5.0 | 3.5 |
| Conclusion closes argument (not recap) | N/A | N/A | N/A | 5.0 | 1.4 |

The single-prompt baseline collapses most dramatically on "conclusion closes argument" -- scoring 1.4 out of 5 across test runs. This is because a single prompt, having written 1,800 words, has almost no context window left for the strategic thinking a strong conclusion requires. The 12-step chain dedicates an entire Sonnet prompt (P11) to reading the complete assembled article and writing a conclusion that responds to its own lede. The chain does not make models smarter. It gives each step the context budget to do its job well.

Latency Breakdown

The full 12-prompt chain completes in 90 to 110 seconds under normal conditions. The latency breaks down across the three phases: analysis on Sonnet takes approximately 25 seconds, execution on Haiku runs in approximately 55 seconds, and the quality pass on Sonnet takes approximately 20 seconds. The complete article is returned to the Orchestrator as a 48-token episode -- not the full article text.

P01-P03 (Sonnet): ~25s
P04-P10 (Haiku): ~55s
P11-P12 (Sonnet): ~20s
Total: ~100s
At scale (100 articles per month), the hybrid Sonnet/Haiku routing costs approximately $0.017 per article -- 65% cheaper than running all 12 prompts on Sonnet at approximately $0.048 per article. The total token budget across all 12 prompts is approximately 18,000 tokens in and 4,000 tokens out, for an article of 2,000 to 2,800 words.
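What a compact episode might look like as a record. The article states only the episode's ~48-token size and the figures above; the field names and exact values below are hypothetical, filled in from the stated ranges for illustration:

```python
# Hypothetical sketch of the compact "episode" returned to the Orchestrator:
# a summary record of the run, not the full article text.
from dataclasses import dataclass

@dataclass
class Episode:
    article_id: str
    word_count: int    # articles run 2,000-2,800 words
    tokens_in: int     # chain-wide input budget, ~18,000
    tokens_out: int    # chain-wide output budget, ~4,000
    cost_usd: float    # ~$0.017 per article with hybrid routing
    latency_s: int     # 90-110 seconds end to end

episode = Episode(
    article_id="article-library-deep-dive",
    word_count=2400,
    tokens_in=18_000,
    tokens_out=4_000,
    cost_usd=0.017,
    latency_s=100,
)
```

Passing a record like this instead of the article body keeps the Orchestrator's own context window nearly untouched per run.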

Key Takeaways

1. Prompt decomposition breaks a complex article into 12 single-responsibility prompts, each with one job -- producing consistent quality from lede to conclusion.

2. Structure is locked before any body copy is written. Each section prompt receives only its brief plus the locked outline, not prior sections -- eliminating voice drift.

3. Model routing assigns Sonnet to analysis and quality review (prompts 1-3, 11-12) and Haiku to execution tasks (prompts 4-10), reducing cost by approximately 65% while improving body copy consistency.

4. The full 12-prompt chain completes in 90 to 110 seconds and is returned to the Orchestrator as a single 48-token episode, not the full article text.

5. The chain does not make models smarter. It gives each step the context budget to do its job well -- which is why section five reads as well as section one.



