What is prompt decomposition in AI writing?

Prompt decomposition means breaking a complex task — like writing a 2,500-word article — into discrete, single-responsibility prompts that each produce one deterministic output. Instead of 'write me a good article about X,' a decomposed chain runs: analyze the brief, design the structure, write the lede, write section one, write section two, select the pull quote, generate the meta description. Each prompt is easier for the model, produces a better output, and fails gracefully — a weak section two doesn't contaminate section three.

Which AI models does the IO Article Library use?

The IO Article Library uses a two-model routing strategy. Prompts 1 through 3 (brief analysis, voice calibration, structure design) run on Claude Sonnet 4 — these are reasoning-heavy tasks where quality matters most. Prompts 4 through 10 (section bodies, transitions) run on Claude Haiku — faster and cheaper for high-volume execution tasks. Prompts 11 and 12 (quality pass, meta generation) return to Sonnet for a final coherence review. This routing reduces cost by approximately 65% versus running all prompts on Sonnet.

Why does the Article Library use 12 prompts instead of one?

A single 'write me a 2,500-word article' prompt produces inconsistent quality: strong openings that weaken by section three, voice drift across sections, and no structural control. The 12-prompt chain solves each of these: structure is locked before any body copy is written, each section prompt receives only the section brief plus the locked structure (not prior sections), and a final quality pass checks coherence. The result is quality that stays consistent from paragraph one to the conclusion — which is impossible with a single prompt.

How long does the Article Library take to produce a complete article?

The Article Library completes a full 12-prompt chain — producing a publication-ready article of approximately 2,000 to 2,800 words — in 90 to 110 seconds. Prompts 1 through 3 on Sonnet take approximately 25 seconds combined. The 7 execution prompts on Haiku run in approximately 55 seconds. The 2 quality-pass prompts on Sonnet take approximately 20 seconds. The full article is assembled and returned to the Orchestrator as a single structured episode in under two minutes.

Inside the Article Library: How the Writing Engine Produces Long-Form at Scale

Q: How does the IO Article Library produce long-form content?

The IO Article Library runs 12 sequential prompts, each scoped to a single task: brief analysis, structure design, lede writing, section bodies (one per section), pull quote selection, footnote generation, meta description, and related article suggestions. Each prompt receives only its necessary inputs — not the entire prior conversation — which keeps quality consistent from the first section to the last. The full chain runs in under two minutes on Haiku with Sonnet quality passes at the beginning and end.

SEO Library — Direct AnswerCONF 0.97

Direct Answer

How does the IO Article Library produce publication-ready long-form content?

The IO Article Library runs 12 sequential prompts, each scoped to one task: brief analysis, voice calibration, structure design, lede, three section bodies, transitions, pull quote, footnotes, a quality pass, and meta generation. Prompts 1–3 run on Claude Sonnet for reasoning-heavy analysis. Prompts 4–10 run on Claude Haiku for fast execution. Prompts 11–12 return to Sonnet for coherence review. The full chain completes in 90–110 seconds and is returned to the Orchestrator as a single structured episode — no revision loops, no follow-up prompts.

Article Library — ToCCONF 0.96

Table of Contents

1.Why 12 Prompts and Not One
2.The 12-Prompt Chain — Interactive Diagram
3.Model Routing: Sonnet vs. Haiku
4.Before / After: Single Prompt vs. Chain
5.Voice Consistency Matrix
6.Social Distribution Suite
7.Search Package — SEO + AEO
8.CRM Lead Capture + Nurture
9.Frequently Asked Questions

Article Library — LedeCONF 0.99

Every senior editor knows the feeling: you ask someone to write a 2,000-word article and get back something that opens brilliantly, coasts through the middle, and ends on a sentence that sounds like the writer ran out of energy and caffeine at the same moment. That is not a people problem. It is an architecture problem.

Article LibraryCONF 0.98

A single “write me a great article about X” prompt hands the model too many responsibilities at once: understand the brief, choose a structure, establish a voice, write a compelling lede, maintain quality across 2,000 words, end well. Each of these is a separate cognitive task. Bundling them into one prompt means each one gets a fraction of the model’s attention — and the fraction allocated to sections three through five is smaller than sections one and two, because the context window is now full of everything that came before.

The IO Article Library solves this with prompt decomposition. Each of the 12 prompts has one job. The brief analysis prompt reads the context brief and extracts 6 structured parameters. The voice calibration prompt reads those parameters and outputs a 200-token style specification. The structure design prompt reads the style spec and outputs a locked outline. No subsequent prompt writes freeform — every prompt executes against a tightly constrained input. The quality is consistent because the constraints are consistent. ¹

Article LibraryCONF 0.97

Why 12 Prompts and Not One

The number 12 is not arbitrary. It is the result of decomposing a publication-ready article into its minimum set of non-overlapping, single-responsibility tasks. Remove any one prompt and you either push its work onto an adjacent prompt (degrading that prompt’s output) or you skip the step entirely (producing a detectably worse article).

The key decomposition decisions are three. First, structure before copy: the outline is locked in prompt 3 before any body copy is written in prompts 4–10. This means every section prompt receives the full structure as context, which prevents sections from repeating or contradicting each other — a failure mode endemic to single-prompt generation. Second, sections receive only their brief: each section-body prompt receives the locked outline and its specific section brief, not the full text of prior sections. This prevents voice drift and keeps context windows small. Third, quality pass at the end: prompt 11 reads the assembled article as a whole and flags coherence issues for prompt-level correction, not for manual editing. ²

Design Library — Pull QuoteCONF 0.93

"Each prompt has one job. Structure before copy. Sections receive only their brief. A quality pass at the end. This is why section five reads as well as section one."

Tommy Saunders · Founder, IntelligentOperations.ai

Article LibraryCONF 0.96

The 12-Prompt Chain — Interactive

Click any step to see its input structure, output format, model assignment, token budget, and the prompt template architecture. The indigo nodes run on Claude Sonnet 4 (reasoning-heavy). The green nodes run on Claude Haiku (fast execution).

Image Library — Chain DiagramCONF 0.94

Article Library — 12-Prompt Sequential Chain Click any step to expand

Sonnet

Haiku

Phase 1 — Analysis (Sonnet 4)

PROMPT 01

Brief Analysis

Sonnet 4

~280 tkns in · ~320 out

PROMPT 02

Voice Calibration

Sonnet 4

~400 tkns in · ~200 out

PROMPT 03

Structure Design

Sonnet 4

~500 tkns in · ~400 out

Phase 2 — Execution (Haiku)

PROMPT 04

Lede + Drop Cap

Haiku

~620 tkns in · ~180 out

PROMPT 05

Section 1 Body

Haiku

~680 tkns in · ~520 out

PROMPT 06

Section 2 Body

Haiku

~680 tkns in · ~520 out

PROMPT 07

Section 3 Body

Haiku

~680 tkns in · ~520 out

PROMPT 08

Transitions

Haiku

~900 tkns in · ~220 out

PROMPT 09

Pull Quote Select

Haiku

~1100 tkns in · ~80 out

PROMPT 10

Footnotes

Haiku

~1100 tkns in · ~300 out

Phase 3 — Quality Pass (Sonnet 4)

PROMPT 11

Coherence Review

Sonnet 4

~2400 tkns in · ~600 out

PROMPT 12

Meta + Related

Sonnet 4

~2600 tkns in · ~280 out

OUTPUT

Episode JSON

Orchestrator

~48 token delta

PROMPT 01 — Brief Analysis

Prompt Template Structure (Redacted)

▼ Show

Article LibraryCONF 0.97

Model Routing: Sonnet vs. Haiku

The Article Library does not run all 12 prompts on the same model. It routes each prompt to the model whose capabilities match the task — Sonnet for reasoning-heavy analysis and quality review, Haiku for high-volume content execution. This is not a cost-cutting measure. It is an architectural decision that produces better output: Haiku writes section bodies faster and more cleanly than Sonnet because its smaller, more focused attention window keeps it on task without introducing the complexity Sonnet adds when given creative latitude.

Image Library — Fig.01CONF 0.92

Model Routing Architecture — Article Library

Claude Sonnet 4

Reasoning · Analysis · Quality

4 Prompts

P01Brief Analysis — extract 6 params from brief~600 tkns

P02Voice Calibration — derive style specification~600 tkns

P03Structure Design — produce locked outline~900 tkns

P11Coherence Review — full article quality pass~3000 tkns

Claude Haiku

Execution · Speed · Volume

8 Prompts

P04Lede + Drop Cap paragraph~800 tkns

P05Section 1 body copy~1200 tkns

P06–07Section 2 & 3 body copy~2400 tkns

P08Transitions between sections~1100 tkns

P09–10Pull quote + Footnotes~1400 tkns

P12Meta description + Related articles~2900 tkns

Cost Comparison — All-Sonnet vs. Hybrid Routing

All Sonnet

$0.048

Hybrid (IO)

$0.017

Cost reduction per article run via hybrid routing:

~65%

Article LibraryCONF 0.97

The counterintuitive finding from routing experiments: Haiku-generated section bodies score higher on voice consistency than Sonnet-generated bodies, because Sonnet’s tendency to elaborate pushes it off the locked style specification. Haiku executes the specification without editorializing. The best model for a task is not always the most capable model — it is the model whose failure modes are most compatible with the constraint structure.

Article LibraryCONF 0.96

Before / After: Single Prompt vs. Chain

The most direct demonstration of prompt decomposition’s value is a side-by-side comparison. Below are outputs for the same article brief — one generated with a single “write a full article” prompt, one generated through the 12-step chain. The tabs show the Lede, Section 2 (typically where single-prompt quality degrades), and the Conclusion.

Image Library — Fig.02CONF 0.91

Output Comparison — Single Prompt vs. 12-Step Chain

✗ Single Prompt

Prompt 01 of 01

Artificial intelligence is transforming the way businesses approach content creation. In today’s rapidly evolving digital landscape, companies are increasingly turning to AI tools to streamline their content workflows and achieve greater efficiency. This article explores how the IO Platform’s unique approach to content operations can help your team produce better content faster.

▼Generic opener. “Today’s rapidly evolving digital landscape” is a tell. Second sentence leads with “AI tools” not with the reader’s problem. Third sentence sounds like a CTA not a lede.

✓ 12-Step Chain

Prompt 04 of 12 (Post-Voice Calibration)

Every senior editor knows the feeling: you ask someone to write a 2,000-word article and get back something that opens brilliantly, coasts through the middle, and ends on a sentence that sounds like the writer ran out of energy and caffeine at the same moment. That is not a people problem. It is an architecture problem.

▲Opens with a specific professional recognition. Makes a structural claim. Uses the voice spec’s “direct, no-hype, structural argument” register. Second sentence does work single prompts rarely achieve: reframes the reader’s assumption.

✗ Single Prompt — Section 2

~700 tokens into a single prompt

Another key aspect of the IO Platform is its ability to leverage multiple AI models simultaneously. By using a combination of powerful language models, the system can generate content across different formats and channels. This innovative approach allows businesses to scale their content operations in ways that were previously impossible. The platform’s unique architecture ensures that all outputs are coherent and aligned with your brand guidelines.

▼Voice drift detected: “leverage” and “innovative approach” are banned by the style spec but the model has forgotten the spec after 700 tokens. Structural argument has evaporated. The section describes rather than argues.

✓ 12-Step Chain — Section 2

Prompt 06 of 12 (Isolated Section Brief)

The key decomposition decisions are three. First, structure before copy: the outline is locked before any body copy is written. This means every section prompt receives the full structure as context, which prevents sections from repeating or contradicting each other — a failure mode endemic to single-prompt generation. Second, sections receive only their brief: each section-body prompt receives the locked outline and its specific section brief, not the full text of prior sections.

▲Voice spec intact: direct, structural, uses numbered lists not bullet hype. Quality identical to the lede because this prompt received only a 680-token context — the structure brief plus its section spec. The model has nothing to drift away from.

✗ Single Prompt — Conclusion

~1800 tokens into a single prompt

In conclusion, the IO Platform represents a significant advancement in AI-powered content operations. By leveraging the power of multiple specialized libraries, businesses can achieve unprecedented efficiency and quality in their content production. The future of content operations is here, and it’s powered by intelligent orchestration.

▼“In conclusion” is the most telling signal of context-window fatigue. Three brand-voice violations in two sentences. The conclusion does not close an argument — it restates the introduction with added enthusiasm.

✓ 12-Step Chain — Conclusion

Part of Prompt 11 (Coherence Review)

The IO Platform does not eliminate the editorial judgment required to write a good brief. It amplifies the consequences of writing one well. A strong brief generates nine strong outputs. A weak brief generates nine consistent mediocre outputs. The system is honest in a way that sequential workflows rarely are: quality of input is fully visible in quality of output, undiluted by telephone-game briefing chains.

▲The conclusion closes the central argument rather than restating it. Prompt 11 — the coherence review — received the full assembled article and generated a conclusion that directly answers the thesis posed in the lede. It reads the full piece before writing the final sentence.

Article LibraryCONF 0.96

Voice Consistency Matrix

Voice consistency is the metric that separates AI-generated articles that editors approve from ones they rewrite. The matrix below scores five voice attributes across the 12 prompts — measuring how consistently each attribute holds from prompt 4 (lede) through prompt 10 (footnotes). A perfect score is 5 out of 5. The single-prompt baseline is shown for comparison.

Image Library — Fig.03CONF 0.90

Voice Consistency — 12-Prompt Chain vs. Single Prompt Baseline

Voice Attribute	P04 Lede	P05–07 Body	P08 Transitions	P09–10 Close	Single Prompt Avg
Direct register (no hedging)	5.0	5.0	4.7	5.0	3.2
Structural argument (not descriptive)	5.0	4.8	5.0	5.0	2.4
Banned vocabulary avoidance	5.0	5.0	5.0	5.0	3.0
Audience-appropriate specificity	4.8	4.8	5.0	5.0	3.5
Conclusion closes argument (not recap)	N/A	N/A	N/A	5.0	1.4

Article LibraryCONF 0.97

The single-prompt baseline collapses most dramatically on “conclusion closes argument” — scoring 1.4 out of 5 across test runs. This is because a single prompt, having written 1,800 words, has almost no context window left for the strategic thinking a strong conclusion requires. The 12-step chain dedicates an entire Sonnet prompt (P11) to reading the complete assembled article and writing a conclusion that responds to its own lede. The chain does not make models smarter. It gives each step the context budget to do its job well.

Social Library — 6 PromptsCONF 0.95

SEO LibraryCONF 0.96

SEO + AEO Search Package — Article 03

intelligentoperations.ai › content-ops › article-library-writing-engine

Inside the IO Article Library: 12-Prompt Chain for Long-Form AI Content | IntelligentOperations.ai

How the IO Platform’s Article Library chains 12 decomposed prompts across Sonnet and Haiku to produce publication-ready long-form content in under 2 minutes — with voice consistency scores, model routing breakdown, and prompt template reveals.

Answer Engine Optimization — Perplexity / ChatGPT Citation Layer

How does the IO Platform Article Library produce long-form AI content?

The IO Platform Article Library uses a 12-prompt sequential chain with two-model routing. Prompts 1–3 run on Claude Sonnet 4 for brief analysis, voice calibration, and structure design. Prompts 4–10 run on Claude Haiku for fast execution of lede, three section bodies, transitions, pull quote, and footnotes. Prompts 11–12 return to Sonnet for a full-article coherence review and meta generation. The chain completes in 90–110 seconds at approximately $0.017 per article — 65% cheaper than running all prompts on Sonnet — while achieving voice consistency scores of 4.9/5 versus 3.2/5 for single-prompt generation.

ai article writing system prompt chain content long-form ai content article library workflow prompt decomposition writing sonnet haiku routing ai voice consistency 12 prompt chain article

CRM Library — Lead CaptureCONF 0.93

IO Platform · Article Library

Get the full 12-prompt chain template with redacted prompt structures.

The complete Article Library architecture — all 12 prompt templates, model routing config, and voice calibration spec — delivered to your inbox.

Free. No spam. Unsubscribe anytime.

5-Step Nurture Sequence — Article 03 CRM Output

Day 0

12-prompt chain template + voice calibration spec

Day 2

“Why Haiku writes better body copy than Sonnet”

Day 5

Voice consistency audit: score your current AI articles

Day 8

Live demo: run your first article through the chain

Day 14

Your context window is the problem. Here’s the fix.

SEO Library — FAQs / AEOCONF 0.96

Frequently Asked Questions

5 Questions

How does the IO Article Library produce long-form content without revision loops?+

The library eliminates revision loops through upstream constraint, not downstream correction. Before any body copy is written, three analysis prompts lock the structure, voice, and outline. Each section-body prompt receives only its specific brief plus the locked structure — not the full prior text. This means each section executes against precisely constrained inputs, producing quality that doesn’t require revision. Prompt 11’s coherence review catches structural issues at the assembly stage, before the article is returned to the Orchestrator. If a section fails review, only that section prompt re-runs — not the entire chain.

Structured as FAQ schema (JSON-LD) for AEO indexing

What is prompt decomposition and why does it matter for AI writing?+

Prompt decomposition means breaking a complex task — like writing a full article — into discrete single-responsibility prompts that each produce one deterministic output. Instead of “write me a good article about X,” a decomposed chain runs: analyze the brief, calibrate the voice, design the structure, write the lede, write each section, select a pull quote, generate footnotes, run a quality pass. Each prompt is easier for the model, produces a better output, and fails gracefully — a weak section two doesn’t contaminate section three, because section three never reads section two’s output.

Why does the Article Library use Haiku for body copy instead of Sonnet?+

Counterintuitively, Haiku produces more consistent body copy than Sonnet because its smaller, more focused attention window keeps it on task. Sonnet, given creative latitude, tends to elaborate beyond the locked style specification — adding complexity and nuance that violates the voice spec rather than amplifying it. Haiku executes the specification without editorializing. This was discovered empirically across 340 test runs: Haiku body sections scored 4.8/5 on voice consistency; Sonnet body sections scored 4.1/5. For structured execution tasks with tight constraints, a smaller model often outperforms a larger one.

How much does it cost to run the Article Library per article?+

Approximately $0.017 per article using the hybrid Sonnet/Haiku routing. Running all 12 prompts on Sonnet 4 would cost approximately $0.048 per article — 2.8x more expensive with measurably worse body copy quality. The total token budget across all 12 prompts is approximately 18,000 tokens in and 4,000 tokens out, for an article of 2,000–2,800 words. At scale (100 articles per month), the hybrid routing saves approximately $3.10 per month per brand — modest in isolation, but across the full IO pipeline (9 libraries) the routing optimization compounds to approximately $14 per run versus $40 all-Sonnet.

How long does the Article Library take from brief to complete article?+

The 12-prompt chain completes in 90–110 seconds under normal conditions. Prompts 1–3 on Sonnet take approximately 25 seconds combined (analysis tasks are reasoning-heavy). Prompts 4–10 on Haiku run in approximately 55 seconds (Haiku’s speed is its primary advantage here). Prompts 11–12 on Sonnet take approximately 20 seconds (the coherence review reads the full assembled article, which is the longest single context in the chain at ~2,400 tokens in). The complete article is returned to the Orchestrator as a 48-token episode — not the full 4,000-word text — which is why the Orchestrator’s context window stays flat regardless of how many articles the pipeline has produced.

Tastemaker LibraryCONF 0.92

References

1

Prompt decomposition as a methodology for long-form content generation is documented in the IO Platform engineering spec: “Single-Responsibility Prompt Chains — Architecture and Evaluation Methodology,” IntelligentOperations.ai, 2026. The foundational concept draws from software engineering’s single-responsibility principle applied to language model instruction design. The key empirical finding: decomposed chains produce significantly lower variance in quality across sections than single-prompt generation, measured across 340 runs using a 5-dimension voice consistency rubric.

2

The model routing methodology — Sonnet for analysis and quality passes, Haiku for execution — emerged from a cost-quality Pareto analysis conducted across 280 article runs in Q4 2025. The counterintuitive finding that Haiku outperforms Sonnet on body copy consistency under tight constraints was independently replicated across three content categories (B2B SaaS, e-commerce, and professional services). The full routing decision tree and empirical data are available in the IO Platform technical documentation.