A closed caption is a time-synchronized text track that reproduces a video's spoken dialogue along with its non-speech audio (sound effects and music) and speaker labels, and that the viewer can switch on or off. "Closed" means hideable. On the web, closed captions are typically delivered as a separate file (SRT or VTT), and they exist primarily for deaf and hard-of-hearing access. With PlainScribe you can produce that file from any video at up to 99% accuracy for $0.067/min.
This is the precise, technical definition. For the plain-language meaning and everyday examples, start with the hub: what does closed caption mean.
A closed caption track consists of caption cues, each with three parts: a start time, an end time, and the text. The text carries the dialogue plus sound effects ([glass shatters]), music (♪ upbeat jazz ♪), and speaker labels when the speaker isn't obvious.
"Closed" distinguishes it from "open." A closed caption is decoded and displayed on demand by the player; an open caption is rendered into the video pixels and cannot be removed.
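To make the cue structure concrete, here is a minimal Python sketch that models a cue as a start time, an end time, and text, then renders a list of cues as SRT. The `Cue` class, the function names, and the sample lines (including the speaker label `MAYA`) are illustrative assumptions, not part of PlainScribe or any caption library.

```python
from dataclasses import dataclass


@dataclass
class Cue:
    """One caption cue (hypothetical model): timing plus the text to show."""
    start: float  # seconds from the start of the video
    end: float    # seconds from the start of the video
    text: str     # dialogue and/or bracketed sound description


def format_timestamp(seconds: float, separator: str = ",") -> str:
    """Format seconds as HH:MM:SS<sep>mmm (SRT uses ',', WebVTT uses '.')."""
    total_ms = round(seconds * 1000)
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, millis = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}{separator}{millis:03d}"


def to_srt(cues: list[Cue]) -> str:
    """Render cues as SRT: numbered blocks separated by blank lines."""
    blocks = []
    for index, cue in enumerate(cues, start=1):
        timing = f"{format_timestamp(cue.start)} --> {format_timestamp(cue.end)}"
        blocks.append(f"{index}\n{timing}\n{cue.text}")
    return "\n\n".join(blocks) + "\n"


# Made-up sample cues: one sound effect and one labeled line of dialogue.
cues = [
    Cue(1.0, 3.5, "[glass shatters]"),
    Cue(4.0, 6.25, "MAYA: Did you hear that?"),
]
print(to_srt(cues))
```

Passing `separator="."` to `format_timestamp` gives the timestamp style WebVTT expects, which is the main difference a cue sees between the two formats.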
| Term | Toggleable | Non-speech audio | Same language as audio? |
|------|-----------|------------------|-------------------------|
| Closed captions | Yes | Yes | Usually |
| Open captions | No (burned in) | Usually | Usually |
| Subtitles | Yes | No | Often translated |
| SDH | Yes | Yes | Often translated |
Verdict: closed captions are defined by two traits together: they are toggleable, and they include sound description. Drop the sound description and you have subtitles; remove the toggle and you have open captions. The full side-by-side is in closed vs open captions.
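The two-trait rule in the verdict can be written as a tiny decision function. This is only a sketch of the reasoning; the function name and return strings are made up for illustration. Note that SDH shares both traits with closed captions, so these two booleans alone cannot tell them apart: the difference lies in delivery format and translation.

```python
def classify_track(toggleable: bool, describes_sound: bool) -> str:
    """Name a text track from the two traits in the verdict (illustrative only)."""
    if not toggleable:
        return "open captions"    # burned into the pixels, cannot be hidden
    if describes_sound:
        return "closed captions"  # hideable and sound-inclusive (SDH also lands here)
    return "subtitles"            # hideable, dialogue only


print(classify_track(toggleable=True, describes_sound=False))
```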
PlainScribe exports the two formats you'll actually attach to web and uploaded video — SRT and VTT (plus TXT and CSV). If you need to choose between them, see SRT vs VTT.
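For simple files, the two formats are close enough that one can be converted to the other mechanically. The sketch below assumes a plain, well-formed SRT input with no styling tags: it adds the `WEBVTT` header that every WebVTT file starts with and swaps the comma in each timestamp for a period. The function name `srt_to_vtt` is hypothetical, and the SRT cue numbers are kept as-is, since WebVTT allows an identifier line before each timing line.

```python
import re

# Matches an SRT timing line such as "00:00:01,000 --> 00:00:03,500".
_TIMING = re.compile(
    r"^(\d{2,}:\d{2}:\d{2}),(\d{3}) --> (\d{2,}:\d{2}:\d{2}),(\d{3})$"
)


def srt_to_vtt(srt_text: str) -> str:
    """Convert simple SRT text to WebVTT (sketch; ignores styling and positioning)."""
    out = ["WEBVTT", ""]  # required header followed by a blank line
    for line in srt_text.strip().splitlines():
        match = _TIMING.match(line.strip())
        if match:
            start, start_ms, end, end_ms = match.groups()
            # WebVTT separates milliseconds with '.', SRT with ','.
            out.append(f"{start}.{start_ms} --> {end}.{end_ms}")
        else:
            out.append(line)  # cue numbers, cue text, and blank separators
    return "\n".join(out) + "\n"


srt = (
    "1\n00:00:01,000 --> 00:00:03,500\n[glass shatters]\n\n"
    "2\n00:00:04,000 --> 00:00:06,250\nDid you hear that?\n"
)
print(srt_to_vtt(srt))
```

Only the timing lines are rewritten, so commas inside the caption text itself are left untouched.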
SDH (Subtitles for the Deaf and Hard of Hearing) emerged because streaming platforms deliver subtitle files, not broadcast caption signals. SDH packages caption-style content — sound effects, speaker IDs, music notes — inside a subtitle delivery format, and is often offered in multiple languages. Practically, when you "turn on captions" on Netflix you're usually selecting an SDH track.
For the end-to-end version with platform steps, see how to add captions to a video.
What is the definition of a closed caption? A closed caption is a viewer-toggleable, time-synced text track that reproduces a video's dialogue and non-speech audio (sound effects, music, speaker labels), delivered as a separate file like SRT or VTT.
What is the difference between closed captions and SDH? Closed captions originated as a broadcast concept (CEA-608/708) carried in the video signal. SDH is the streaming-era equivalent: the same sound-inclusive content delivered in a subtitle file format, often in multiple languages.
What file formats are used for closed captions? On broadcast, CEA-608 and CEA-708. For web and uploaded video, WebVTT (.vtt) and SRT (.srt) are standard. PlainScribe exports SRT and VTT.
Are closed captions required by law? In many jurisdictions, yes — for broadcast and certain online content. Requirements vary by country and content type, so check your local accessibility regulations.
Can I make closed captions automatically? Yes. Automatic speech recognition produces the text and timing; you then add sound cues and review. PlainScribe automates the transcription at up to 99% accuracy.
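As a rough illustration of the "add sound cues and review" step, the Python sketch below merges hand-written sound cues into speech cues and flags overlapping neighbors for a human to check. The tuple layout `(start, end, text)`, the function names, and the sample data are assumptions for the example; they do not describe PlainScribe's actual output format.

```python
Cue = tuple[float, float, str]  # (start seconds, end seconds, text)


def add_sound_cues(speech_cues: list[Cue], sound_cues: list[Cue]) -> list[Cue]:
    """Merge hand-written sound cues into speech cues, ordered by start time."""
    return sorted(speech_cues + sound_cues, key=lambda cue: cue[0])


def find_overlaps(cues: list[Cue]) -> list[tuple[Cue, Cue]]:
    """Return consecutive cue pairs whose time ranges overlap, for manual review."""
    return [(a, b) for a, b in zip(cues, cues[1:]) if b[0] < a[1]]


# Made-up example: two transcribed lines plus two sound cues added by hand.
speech = [(4.0, 6.25, "Did you hear that?"), (8.0, 9.5, "It came from the kitchen.")]
sounds = [(1.0, 3.5, "[glass shatters]"), (6.0, 7.0, "[footsteps]")]

merged = add_sound_cues(speech, sounds)
for first, second in find_overlaps(merged):
    print("Review overlap:", first[2], "/", second[2])
```

Overlapping cues are not necessarily wrong, since both formats can display more than one cue at a time, but they are a sensible place to focus the review pass.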
Transcribe any video at up to 99% accuracy, add your sound cues, and export SRT/VTT — pay-as-you-go at $0.067/min, no subscription. Start free with 30 minutes, no credit card. See pricing or the tools page.