You get:
- rambling intros that take 2 minutes to get to the point
- no clear sections (viewers get lost)
- no visual direction (boring talking head)
- no retention strategy (drop-off at every transition)
- CTAs that feel tacked on
But a YouTube script is not a transcript.
It is a retention machine with predictable sections.
- Hook (0-30 sec): stop the scroll
- Intro (30-90 sec): what they’ll learn, why it matters
- Body (90 sec – end): 3-5 clear sections
- Transitions: visual changes to reset attention
- CTA: specific next step
Without structure, viewers leave before the value.
This framework forces AI to build scripts that retain and convert.
Assume the role of a YouTube scriptwriter who structures videos for retention.
Your task is to create a video script structure.
Generate:
1. HOOK (0-30 seconds)
- Attention-grabbing opening
- Visual direction note
2. INTRO (30-90 seconds)
- What they'll learn
- Why it matters to them
- Visual direction
3. BODY (3-5 sections)
For each:
- Section title
- Key points (2-3 sentences)
- Estimated duration
- Visual direction (B-roll, text overlay, animation)
4. TRANSITION NOTES
- How to move between sections
5. CALL TO ACTION (30-60 seconds)
- Specific next step
- Visual direction
6. FULL TIMELINE (minutes:seconds)
INPUTS:
Video Topic:
[INSERT]
Target Audience:
[WHO IS WATCHING?]
Key Takeaways (3-5):
[LIST]
Video Length Target:
[5 MIN / 10 MIN / 15 MIN / 20 MIN / 30 MIN]
Format:
[TUTORIAL / LISTICLE / STORY / REVIEW / INTERVIEW]
Desired CTA:
[SUBSCRIBE / COMMENT / CLICK LINK / WATCH NEXT VIDEO]
RULES:
- Hook must be within first 30 seconds
- Intro must include a "what you'll learn" promise
- Body sections: 3-5 max (more is overwhelming)
- Each section needs visual direction (change every 15-30 seconds)
- Transitions should be visual, not verbal ("next, let's talk about...")
- CTA must be specific and urgent
- Include estimated timing for each section
- Record the hook separately — it’s the most important part.
- Visual direction changes every 15-30 seconds to retain attention.
- Time yourself reading to ensure you hit the target length.
- Test the hook on 5 people in your target audience.
- Save the structure for similar future videos (reuse what works).
Video Topic: 5 productivity apps that actually work (not the usual ones)
Target Audience: Solopreneurs and remote workers overwhelmed by productivity tools
Key Takeaways: 5 specific app recommendations, why each works, how to set up each in under 10 minutes
Video Length Target: 10 MINUTES
Format: LISTICLE
Desired CTA: SUBSCRIBE FOR MORE TOOL REVIEWS
This framework improves outcomes by forcing:
- 30-second hook (retention)
- explicit intro promise (expectation)
- 3-5 body sections (scannability)
- visual direction (engagement)
- timing estimates (pacing)
Great YouTube scripts don’t just inform — they keep viewers watching until the end.
Build Better AI Systems
Subscribe for advanced prompt engineering, AI content creation tools, YouTube frameworks, and practical strategies for creators and marketers.
