Editing workflow

Course 2 · Ch 6

Editing Workflow

Cutting for pace, music licensing, captions, transitions — from raw footage to upload-ready file

Knowing which software to use (Chapter 5) is different from knowing how to edit. This chapter covers the actual workflow — the repeatable process you run on every video, from dumping the SD card to hitting publish. Get this process right and editing stops feeling like a chore and starts feeling like a system.

The Editing Workflow — End to End

Ingest & organise

Copy, bin, label

Rough cut

Pull selects, structure

Fine cut

Pace, rhythm, b-roll

Audio mix

Levels, music, noise

Titles & captions

Lower thirds, subs

Colour grade

Grade locked picture

Export & review

Render, watch, upload

Never skip steps or reorder them. Colour grading before the edit is locked means re-grading every time you trim a clip. Adding captions before audio is mixed means captions drift when you adjust timing.

Step 1 — Ingest & Organise

Before you touch the timeline, set up your project properly. Five minutes of organisation saves forty minutes of searching mid-edit.

Copy footage to a dedicated project folder — never edit directly from the SD card. SD cards fail; they're also slow for sustained reads during playback.
Create a folder structure: /Camera, /B-Roll, /Music, /Graphics, /Export. Mirror this inside your NLE's media pool.
Label or colour-code clips in the media pool. DaVinci and Premiere both support clip flags. Mark your best takes green, maybes yellow, unusable red — before you open the timeline.
Back up immediately. If you only have the footage in one place, you don't have it. Copy to an external drive or cloud storage before the SD card gets reformatted.

Step 2 — Rough Cut

The rough cut is about structure, not perfection. Move fast. Decisions are cheap here — changing them later is expensive.

Drop your best takes onto the timeline in order. Don't trim precisely yet.
Cut out obvious dead air, false starts, and sections you already know you won't use.
Watch back at 1.5× speed. Anything that makes you reach for the skip button — cut it.
Don't add music, colour, or titles yet. That comes later.
Aim for a rough cut that is 20–30% longer than your target runtime. You'll tighten it in the fine cut.

Step 3 — Cutting for Pace

Pace is the single most important variable in viewer retention. A video with great content but poor pacing loses its audience. A tightly paced video with average content keeps people watching. Pace is not the same as speed — it's about the rhythm between cuts and the energy of each moment.

Cut on the action, not before or after it

When someone picks up an object, starts a sentence, or makes a gesture, cut at the moment the action begins — not before the reach, not after the grab. Action cuts are invisible. Static-to-static cuts feel jarring.

The jump cut — tool or crutch?

A jump cut removes time within a single camera angle. Used deliberately, it creates energy and compresses time (popular in tutorial and vlog content). Used carelessly, it makes the editor look lazy. Cover jump cuts with b-roll wherever possible. Leave them bare only when they serve a stylistic purpose — fast-paced tech channels often use bare jump cuts as a signature, but it's a deliberate aesthetic, not a default.

Cut lengths by content type

Tutorial / Education

15–30 sec talking head

Cut to b-roll or screen recording every 20–30 seconds. Longer single shots are fine if the content is visually active (demonstration, screen).

Vlog / Lifestyle

5–15 sec per cut

Faster cutting rhythm reflects the energy of movement. Cut on action, match audio across cuts, use music to drive the pace.

Gaming / Commentary

Driven by gameplay

Cut dead moments in gameplay ruthlessly. Keep the commentary continuous while cutting the game footage around it. Silence is your enemy.

Review / Product

8–20 sec per shot

Longer shots work when something is happening on screen (demonstrating a feature). Cut between B-roll detail shots and talking head frequently.

Interview / Podcast

Hold on speaker

Cut away only for reaction shots or to cover an edit. Long uncut runs feel authentic. Cut pauses but don't cut the rhythm of speech.

Shorts / Reels

2–6 sec per cut

Extremely fast. Hook in the first 0–3 seconds. No dead air. Every sentence should cut immediately after the last word lands.

The "remove silence" shortcut

DaVinci Resolve, Premiere Pro, and CapCut all have automatic silence-removal tools. They detect pauses in speech and cut them out in seconds. Run it on your rough cut to eliminate breath gaps, umms, and dead air — then review to restore any natural pauses that were cut too aggressively. This one tool alone cuts editing time by 30–40% for talking-head content.

Step 4 — Audio Mix

Mix audio before adding titles or grading. Any timing adjustment after titles are placed will require repositioning every title card manually.

Voice level

Your voice should sit at –12 to –6 dB peak on the meter. The integrated loudness (LUFS) of your finished export should target –14 LUFS — YouTube's normalisation target. Content exported louder gets turned down by YouTube's algorithms; content exported quieter gets turned up but sounds thin and compressed.

Audio Loudness Guide — Integrated LUFS Target

–40–30–20–14–8–30

Too quiet (<–20): sounds thin, YouTube boosts it

Target zone –16 to –12: ideal for most content

Too loud (>–8): YouTube turns it down, sounds compressed

Common audio fixes every creator needs

High-pass filter (HPF): Remove low-frequency rumble (air conditioning, traffic) by filtering everything below 80–100 Hz. Apply to your voice track as standard.
Noise reduction: DaVinci's Fairlight and Premiere's Audition both include noise reduction. Sample a clip of pure room noise (before you started speaking) and subtract it from the full track.
Compression: Reduces the gap between your loudest and quietest moments. A ratio of 3:1 with a threshold around –18 dB is a good starting point for voice. Makes the overall level more even.
De-esser: Reduces harsh 's' and 'sh' sibilance that condenser mics pick up aggressively. Most NLEs include one.
EQ: Boost presence around 2–5 kHz for clarity. Cut boxiness around 200–400 Hz if your voice sounds muddy. Roll off below 80 Hz always.

Music Licensing — Don't Risk Your Channel

Using commercially released music without a licence will trigger YouTube's Content ID system within minutes of uploading. Best case: the rights holder monetises your video (their ads, not yours). Worst case: your video is blocked, you receive a copyright strike, and three strikes delete your channel. This is not a risk worth taking.

Your music options

Source	Cost	Commercial use?	Content ID safe?	Notes
YouTube Audio Library	Free	✓	✓ (mostly)	Built into YouTube Studio. Large library. Some tracks require attribution in description.
Epidemic Sound	~£13/mo	✓	✓	The creator standard. Unlimited tracks, fully cleared. Cancel and keep videos up (but can't use tracks in new videos).
Artlist	~£25/mo	✓	✓	Unlimited downloads. Perpetual licence — keep using tracks even after you cancel. Higher cost but better terms.
Musicbed	~£17/mo	✓	✓	Higher quality, more cinematic catalogue. Popular with documentary and travel creators.
Pixabay / Free Music Archive	Free	Check per track	Varies	Creative Commons music. Free but quality is inconsistent. Read the licence on every individual track.
Commercially released music	Licence fees	Rarely permitted	✗	Do not use without explicit written permission. Content ID will find it. Not worth it.

Music level: how loud?

Background music should sit 15–20 dB below your voice level. If your voice peaks at –12 dB, music should peak around –28 to –32 dB. Listeners should feel the music is there without noticing it. The moment they can hear the lyrics clearly, it's competing with your voice.

Automate your music level — lower it during speech, let it rise slightly during b-roll sections without voiceover. Most NLEs call this a volume keyframe. It's one of the fastest production quality upgrades available.

Transitions — Less Is More

Transitions are one of the most abused features in editing software. Every template library contains dozens of them. Professional editors use almost none of them.

Hard cut (straight cut)

Use freely

One clip ends, the next begins instantly. The default. Used in virtually every professional production. When in doubt, use a hard cut.

Dissolve / cross-fade

Use selectively

Clips blend briefly. Signals a passage of time or a gentle scene change. Appropriate for montages, time-lapses, and emotional transitions. Keep it under 12 frames (0.5 sec) in most cases.

Fade to/from black

Use at scene breaks

Marks the start or end of a major section. Overused mid-video — each one tells the viewer "this is a pause." Use to open the video and to close it; rarely in between.

J-cut / L-cut

Use freely

Audio from the next clip begins before the cut (J-cut), or audio from the current clip continues into the next shot (L-cut). Makes edits invisible and conversation flow natural. The most professional technique here.

Whip pan / zoom

Use sparingly

Fast pan or zoom wipe between clips. Energetic, used in vlogs and fast-paced content. Fine as an occasional transition style — not as a template applied to every cut.

Match cut

Use when it lands

Cut on a visual, audio, or motion match between two shots. Two circles — one is a clock face, one is a planet. Elegant when it works, forced when it doesn't. Earns its place.

Glitch / flash / spin effects

Avoid

Template transitions from effect libraries. Instantly date a video. Scream "2016 gaming montage." The hard cut between the same clips is almost always better.

Star wipe / page peel

Never

These exist. They should not be used. If you find yourself reaching for these, step away from the effects panel and take a walk.

The J-cut and L-cut explained

These are the most used transitions in professional editing and the least taught to beginners. A J-cut: you see someone listening, but you already hear the next speaker's voice — then the picture cuts to them. Natural, invisible. An L-cut: the picture cuts away from the speaker, but their voice continues over the new shot — then their audio fades as the new scene takes over. Used constantly in documentaries and interviews. Master these two and you'll rarely need any other transition.

Captions — Reach, Retention, and Accessibility

85% of Facebook video is watched on mute. A large proportion of YouTube viewers watch with captions on — particularly on mobile, in public spaces, or for non-native speakers. Captions are no longer optional for serious creators.

YouTube's auto-captions

Free

YouTube generates captions automatically after upload. Available in 17+ languages. Accuracy has improved significantly — around 85–95% for clear speech in a quiet environment.

Acceptable as a fallback. Not suitable as your only caption strategy — errors go unfixed and look unprofessional. Review and correct in YouTube Studio.

Premiere Pro / Final Cut (built-in)

Included in subscription

Premiere Pro's text-based editing generates captions in the timeline. Final Cut Pro's built-in speech-to-text works on-device. Both allow you to style and correct captions before export.

Good workflow if you're already in Premiere or Final Cut. Lets you style captions to match your brand before exporting the SRT file.

CapCut auto-captions

Free (with watermark caveat)

CapCut's speech-to-text is fast and accurate. Popular for Shorts where styled, animated captions are part of the aesthetic. Exports captions baked into the video.

Best for short-form styled captions. Less suitable for long-form where you want an editable SRT file.

Descript

~£12/mo (free tier)

Transcribes your video into a word-processor document. Edit the text and the video edit updates automatically. Generates SRT and VTT caption files. Also removes filler words in one click.

Genuinely different workflow — text-based editing is excellent for long-form interview and podcast content.

Rev / Sonix

~£1.25/min (Rev human)

Human transcription services. Near-perfect accuracy. Useful for content where errors are unacceptable — legal, medical, accessibility-critical.

Overkill for most YouTubers. Use AI tools first; bring in human transcription only when accuracy is critical.

SRT file upload to YouTube

Free

Upload a corrected .srt file to YouTube Studio after publishing. YouTube displays these in preference to its auto-generated captions. Allows you to correct any errors post-upload.

Best practice: generate with any tool above, correct the errors, export SRT, upload to YouTube. Takes 10 minutes per video once you have the workflow.

Steps 6 & 7 — Grade, Export & Review

Colour grade only after the picture edit is locked (Step 3) and the audio is mixed (Step 4). Then export once, watch the full file, and upload.

Pre-upload checklist

Picture locked? No more cuts after this point.

Audio levels checked? Voice at –12 to –6 dB peak; music 15–20 dB below voice; integrated loudness –14 to –16 LUFS.

Colour graded? White balance corrected, exposure consistent across clips, grade applied.

Titles checked? Correct spelling, correct timing, no title overlapping a cut.

Captions ready? SRT file generated and corrected; or baked-in captions reviewed for accuracy.

Export settings correct? H.264 or H.265, 1080p minimum (4K if available), YouTube-recommended bitrate (8–12 Mbps for 1080p30).

Rendered file watched in full? Play the entire export before uploading. Render glitches, audio sync issues, and missing credits are invisible until you watch it.

Backed up? Project file and rendered export saved to at least two locations.

The one mistake that kills channels

Uploading without watching the export. A 30-second render glitch at minute 8, a missing audio track for the last 2 minutes, a wrong version of a clip — these are all discovered by your viewers before you. Watch the file. Always.

Chapter 6 Quick Reference

Workflow order: Ingest → Rough cut → Fine cut → Audio mix → Titles/captions → Grade → Export → Review → Upload
Never colour grade before picture is locked
Voice peak level: –12 to –6 dB · Integrated loudness: –14 to –16 LUFS
Music level: 15–20 dB below voice; automate it down under speech
Auto silence removal: Use it — saves 30–40% editing time on talking-head content
Best transitions: Hard cut (default) · J/L-cut (invisible) · Dissolve (scene changes only)
Avoid: Glitch, spin, star wipe, page peel — ever
Captions: Generate → correct errors → export SRT → upload to YouTube Studio
Music: Epidemic Sound or Artlist (paid) · YouTube Audio Library (free)
Export codec: H.264 or H.265 · Bitrate: 8–12 Mbps for 1080p30
Watch the export before uploading — every time