Question 1

What audio formats can I upload?

Accepted Answer

WAV, MP3, M4A, AAC, OGG, FLAC, and most common video containers (MP4, MOV, MKV, WebM). The audio is extracted server-side with ffmpeg before transcription runs.
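To illustrate the extraction step, here is a minimal sketch of how an ffmpeg command for it could be assembled. The exact flags the service uses are not documented here, so the 16 kHz mono PCM output and the `build_extract_command` helper are assumptions for illustration only.

```python
from typing import List


def build_extract_command(source: str, target: str, sample_rate: int = 16000) -> List[str]:
    """Return an ffmpeg argument list that drops video and writes mono PCM WAV.

    The output settings (mono, 16 kHz, 16-bit PCM) are assumed values,
    not the service's confirmed configuration.
    """
    return [
        "ffmpeg",
        "-y",                  # overwrite the target file if it already exists
        "-i", source,          # input file: any audio or video container
        "-vn",                 # ignore video streams
        "-ac", "1",            # downmix to a single channel
        "-ar", str(sample_rate),  # resample to the requested rate
        "-c:a", "pcm_s16le",   # encode as 16-bit little-endian PCM
        target,
    ]
```

Passing the returned list to `subprocess.run(cmd, check=True)` would run the extraction on a machine with ffmpeg installed.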

Question 2

How accurate is the transcript?

Accepted Answer

Word error rate depends on audio quality and accent. On clean studio interviews in major languages, word accuracy (one minus word error rate) lands in the high 90s percent; on noisy field recordings with strong regional accents, it drops into the 80s. The source audio is always aligned to the text, so you can quickly review and correct the transcript.
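If you want to check accuracy on your own recordings, word error rate can be computed by comparing a transcript against a hand-corrected reference. The sketch below uses the standard definition (word-level edit distance divided by the number of reference words); the `word_error_rate` function and its lowercase, whitespace-based tokenization are assumptions for illustration, not how the product scores itself.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # matching i reference words against nothing costs i deletions
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


# One dropped word out of four reference words gives a WER of 0.25.
print(word_error_rate("the quick brown fox", "the quick fox"))
```

Accuracy in the sense used above is then `1 - word_error_rate(...)`.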

Question 3

Can I edit the transcript after it lands?

Accepted Answer

Yes. The transcript opens in a three-panel editor next to the audio waveform. Click any line to seek the player, edit text inline, rename speakers, or merge identities. Changes save instantly.

Question 4

Does the transcript identify speakers?

Accepted Answer

Yes. Speaker turns are detected automatically. Identities are voice-printed and matched across every interview in the same project, so once you name a speaker, the change applies everywhere they appear.

Question 5

How long can a file be?

Accepted Answer

Single uploads are capped at a few hours of audio per file in the free tier. Longer interviews are usually split into reels during recording; you can upload them as a folder and the project handles them as one session.
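One way to picture several reels being handled as a single session is to give each reel a start offset on a shared timeline. The sketch below is only an illustration under stated assumptions: the `session_offsets` helper is hypothetical, reel order is taken from zero-padded file names, and each reel's duration is already known.

```python
from typing import List, Tuple


def session_offsets(reels: List[Tuple[str, float]]) -> List[Tuple[str, float]]:
    """Sort reels by file name and return (name, start_offset_seconds) pairs.

    Assumes names sort in recording order (for example reel_01, reel_02).
    """
    offsets = []
    elapsed = 0.0
    for name, duration in sorted(reels):
        offsets.append((name, elapsed))  # this reel starts where the last one ended
        elapsed += duration
    return offsets


# Two reels uploaded out of order, with durations in seconds.
print(session_offsets([("reel_02.wav", 1800.0), ("reel_01.wav", 3600.0)]))
```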

Audio transcription software built for interview rooms

