How to Transcribe an Interview: A Step-by-Step Guide

To transcribe an interview, upload your recording to an AI transcription tool, let it generate a draft, then review and clean it. With PlainScribe, a one-hour interview transcribes in minutes at up to 99% accuracy for $0.067 per minute ($4/hour) — far faster than the 4-6 hours manual transcription typically takes.

TL;DR

  • Fastest path: upload the audio, get an AI draft, then proofread — versus 4-6 hours of manual typing per recording hour.
  • Cost: $0.067/min ($4/hour), pay-as-you-go, no subscription. A 45-minute interview costs about $3.
  • Accuracy: up to 99% on clear audio; always verify names, acronyms, and technical terms.
  • Timestamps included: export SRT or VTT to jump back to any moment of the conversation.
  • 30 free minutes to test on a real interview, no credit card.

This is the hands-on how-to. For broader use-case context, see the academic transcription guide for researchers and the legal transcription guide for depositions.

Step 1: Record Clean Audio

Transcription accuracy starts at the microphone. Use a decent mic, record in a quiet room, and ask each speaker to introduce themselves so labels are clear. Clean audio directly lowers your editing time and cost.

Step 2: Prepare Context

Before transcribing, jot a short glossary of expected proper names, acronyms, and jargon from the interview. You will use it to fix the handful of terms AI tends to miss.

Step 3: Upload to a Transcription Tool

Drop your file (MP3, MP4, WAV, M4A, MOV, and other common formats, up to 200MB on web) into the PlainScribe dashboard. The language is auto-detected across 47 supported languages, and you will be emailed when the transcript is ready.

Step 4: Get the AI Draft

AI handles the heavy lifting in minutes, producing a first draft at up to 99% accuracy. This is where you save the most time: instead of typing from scratch, you start from a near-complete transcript.

Step 5: Edit and Clean Up

Decide on your style:

  • Verbatim — keep every "um," false start, and pause (required for some qualitative and legal work).
  • Clean/intelligent verbatim — remove fillers and stutters for a readable record.

Then break the text into paragraphs, confirm speaker labels, and fix anything from your glossary.

Step 6: Proofread and Fact-Check

Read the transcript while spot-checking the audio. Verify proper nouns, numbers, and technical terms — these are where errors hide and where they matter most.

Step 7: Add Timestamps and Export

Timestamps make it easy to cite or re-listen to a specific moment. PlainScribe exports TXT and CSV for quotes and analysis, and SRT or VTT (with timestamps) for subtitles. Need an AI summary of a long interview? Smart Notes condenses each transcript into key points.

Handling Tricky Interviews

Real interviews are rarely studio-clean. A few adjustments protect accuracy:

  • Multiple speakers: ask each person to state their name once at the start so labels are unambiguous when you proofread.
  • Accents and crosstalk: AI handles a single clear accent well; overlapping speech is where errors cluster, so flag those segments for a closer listen.
  • Jargon-heavy topics: keep your glossary open while editing and search-replace any consistently misheard term in one pass.
  • Phone or field recordings: noisy audio lowers accuracy, so trim dead air and reduce background hiss before uploading to cut both errors and cost.

Verbatim vs Clean: Which to Choose

The right transcription style depends on what you will do with the text. Verbatim captures every "um," repetition, and pause, which matters when speech patterns are part of the data — qualitative analysis, legal records, or discourse studies. Clean (or intelligent verbatim) strips fillers and false starts for a transcript that reads smoothly, which is what most journalists and publishers want. Because PlainScribe hands you the full draft, you can start verbatim and edit down to clean in a single review pass rather than transcribing twice.

Manual vs AI Transcription

| Approach | Time per audio hour | Cost | Best for | |---|---|---|---| | Manual typing | 4-6 hours | Your time | Tiny clips, total control | | PlainScribe (AI) | Minutes + review | $0.067/min ($4/hr) | Most interviews, fast turnaround | | Rev (AI) | Minutes + review | $0.25/min | One-off uploads | | Rev (human) | 24-48 hours | $1.50/min | Court-grade accuracy |

Verdict: For nearly every journalist or researcher, AI transcription plus a focused proofread is the clear winner — you trade a few minutes of review for hours saved, at $4 an hour.

FAQs

How long does it take to transcribe a one-hour interview? Manually, plan on 4-6 hours of typing per audio hour. With AI like PlainScribe, the draft is ready in minutes; you then spend roughly 30-60 minutes reviewing for accuracy.

How much does it cost to transcribe an interview? PlainScribe charges $0.067 per minute ($4 per audio hour) on a pay-as-you-go basis, so a 45-minute interview is about $3. There is no subscription; the minimum purchase is $10 (about 150 minutes).

Should I transcribe verbatim or clean? Use verbatim (every filler and pause) for qualitative research and legal records; use clean verbatim for readable journalism and publishing. PlainScribe gives you the full draft so you can edit to either style.

Can I transcribe an interview in another language? Yes. PlainScribe auto-detects and transcribes 47 languages and can translate the result to English.

How do I get timestamps for an interview transcript? Export your transcript as SRT or VTT, which include time codes you can use for subtitles or to reference exact moments.

Transcribe Your First Interview Free

Upload a real interview and use your 30 free minutes — no credit card — then pay just $4 an hour as you go. Start in the dashboard, check the pricing, or compare tools at PlainScribe vs Rev.

Transcribe, Translate & Summarize your files

Get started with 30 free minutes. No credit card required.