To add captions to a video, transcribe the audio into a timed caption file (SRT or VTT), then upload that file to your video player or burn it into the video. With PlainScribe you upload a file up to 200MB, get a transcript at up to 99% accuracy for $0.067/min ($4/hour), and export ready-to-use SRT/VTT in minutes — no subscription, no credit card for the first 30 minutes.
You need three things: the video file (or its extracted audio), a transcription tool, and a target player or editor. PlainScribe accepts MP4, MOV, WebM, MKV, AAC, M4A, MP3, WAV, FLAC and OGG up to 200MB on the web, so you usually upload the video itself — no separate audio extraction step.
Decide your caption type first, because it changes the last step:
.srt or .vtt). Viewers turn them on/off. Best for YouTube, Vimeo, web players, and accessibility compliance.<track> element pointing to your .vtt.For a subtitle-specific walkthrough, see the sibling guide on how to make subtitles.
| Method | Speed | Cost | Accuracy | Best for | |--------|-------|------|----------|----------| | Manual typing | Slowest | Your time | Highest (with effort) | Short clips, exact scripts | | YouTube auto-captions | Fast | Free | Mixed | Rough draft you'll edit | | PlainScribe (AI + your edit) | Fast | $0.067/min ($4/hr) | Up to 99% | Most creators, any platform | | Human service (e.g. Rev) | Slow | $1.50/min | Highest | Legal/medical, verbatim |
Verdict: for the vast majority of videos, AI transcription you lightly edit is the sweet spot — Rev's human service is ~22x the per-minute cost of PlainScribe, and free auto-captions usually need so much cleanup that you save little. Compare the full field on the pricing page and the comparison hub.
What file format do I need to add captions to a video? Use SRT or VTT. SRT (.srt) is supported by almost every platform and editor; VTT (.vtt) is the standard for HTML5 web players. PlainScribe exports both, plus TXT and CSV.
How long does it take to caption a video? Transcription itself takes a few minutes for a typical video; you're emailed when it's ready. Your manual review (fixing names and terms) usually takes a fraction of the video's runtime.
Can I add captions to a video for free? You can start free: PlainScribe gives 30 minutes with no credit card. YouTube's auto-captions are also free but usually need heavy editing. After the free minutes, PlainScribe is pure pay-as-you-go at $0.067/min.
How accurate are AI-generated captions? Up to 99% on clean audio. Background noise, heavy accents, and specialized vocabulary lower accuracy, which is why a quick human review before publishing is recommended.
Do captions help SEO? Yes. A caption track gives search engines readable text tied to your video, improving discoverability for the words spoken in it.
Upload your video, get a timestamped transcript at up to 99% accuracy, and export SRT/VTT — all pay-as-you-go with no subscription. Try PlainScribe free with 30 minutes, no credit card, and explore more in our tools and use cases.
Get started with 30 free minutes. No credit card required.