How Long Does It Take To Transcribe 30 Minutes of Audio

Wondering how long it takes to transcribe 30 minutes of audio? If you have a backlog of audio files waiting to be transcribed, you need a realistic time frame for the project so you can manage expectations.

Producing an accurate verbatim transcript requires skill and experience, especially for longer and more complex recordings. The time it takes to transcribe audio also varies based on several factors, all of which we cover in this article.

Let’s dive in.

The short answer is: It depends.

The industry standard is four hours for every audio hour. That means 30 minutes of audio will take approximately two hours to transcribe.
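As a rough sketch of that arithmetic, the snippet below applies the four-to-one rule of thumb to a recording of any length. The function name and its parameters are illustrative assumptions, not part of any transcription tool.

```python
def estimate_transcription_minutes(audio_minutes, ratio=4.0):
    """Estimate working time from audio length.

    ratio is hours of work per hour of audio; 4.0 reflects the
    industry rule of thumb quoted above (an assumption you can adjust).
    """
    if audio_minutes < 0:
        raise ValueError("audio_minutes must not be negative")
    return audio_minutes * ratio


# 30 minutes of audio at 4:1 comes to 120 minutes, i.e. two hours.
print(estimate_transcription_minutes(30))
```

Lowering the ratio models easier recordings, and raising it models harder ones.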

If your audio features a single-person narrative (e.g., a doctor doing an independent medical evaluation), it might take 45 minutes to an hour to transcribe 30 minutes of audio, provided the speaker dictates clearly. A two-person interview could take us as much as 90 minutes to complete.

However, if your recording has two people with poor grammar and background noise, transcribing it could take us as long as two hours. Three speakers with good-quality audio would also take about two hours. Finally, three speakers with bad-quality audio or poor grammar would take about 2.5 hours to complete.

Transcribing audio with four or more speakers takes about the same time as three speakers.

6 Factors That Affect Audio Transcription Speed

To help you better understand the time it would take for your audio recording to be transcribed, here are the most common factors that affect transcription speed.

Experience Level

A seasoned transcriptionist will be able to work more quickly than an amateur. Hiring a professional likely makes more sense if you are inexperienced and have plenty of audio files that need to be transcribed.

In most cases, expert transcriptionists with a fast typing speed can transcribe 20 to 30 minutes of audio in less than an hour. Note that this number will still vary depending on the factors described below. So if you’re pressed for time, hiring a transcription company like Ditto Transcripts might be ideal.

Audio Quality

An audio recording with lots of background noise can make important phrases, or even entire sections of a file, difficult or impossible to hear. That makes transcribing it just as challenging.

Also, audio recorded long ago or with a low-quality device can increase the time needed to transcribe a file, as deciphering what is being said will be more difficult.

The harder the audio is to understand, the longer the transcription will take.

Number of Speakers/Speech Patterns

An audio recording with a single speaker will take much less time to transcribe. But what if several people are talking over each other?

If multiple speakers are talking quickly and simultaneously, the transcriber has to pause and play the recording numerous times to understand what each person is saying. Repeated rewinding might even be necessary, further extending the turnaround time of the transcript.

Regional Accents

Thick regional accents can slow transcription, especially if the transcriber does not regularly work with foreign languages and accents.

Required Research

A transcriber unfamiliar with an audio file’s subject matter will often take much longer to complete a transcript.

If an audio recording contains a lot of industry jargon, a transcriber who is unfamiliar with that industry will need to do a lot of research to get the terminology right.

Transcription Speed: Average Person vs. Professional Transcriber

It takes the average person approximately two to three hours to transcribe 15 minutes of audio, provided it is clear and the speaker talks steadily. But if any of the factors mentioned above lowers the quality of the recording, it could take longer.

On the other hand, a professional transcriptionist can transcribe 15 minutes of audio in about one hour. Highly experienced transcriptionists can transcribe 20 to 30 minutes of audio in an hour, but that pace is an outlier achieved only by experts.

What Type of Audio Takes the Longest to Transcribe?

There is not one type of audio that takes the top spot, as each audio file is different. But to give you an idea, we’ve outlined the typical completion times for each audio type below:

  1. Single speaker (e.g., a doctor dictating patient notes, political speech, sermon, or a police officer dictating a patrol report…) – One to two hours per audio hour.
  2. Multiple speakers (e.g., corporate meetings, group interviews, conference presentations) – Three to four hours per audio hour.
  3. Complex recordings and requirements (e.g., specialist industry group meetings with many speakers, transcripts that need time stamps, speaker identification, closed captions) – More than an hour and a half per audio hour.
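
To apply these ranges to a recording of a specific length, a small lookup like the one below can help. It is only a sketch: the dictionary keys and function name are hypothetical labels. It covers only the first two categories, because the third is given with a lower bound only.

```python
# Hours of work per hour of audio, copied from the list above.
# The keys are hypothetical labels chosen for this illustration.
HOURS_PER_AUDIO_HOUR = {
    "single_speaker": (1.0, 2.0),
    "multiple_speakers": (3.0, 4.0),
}


def estimate_range_minutes(audio_minutes, audio_type):
    """Return (low, high) estimated working minutes for a recording."""
    low, high = HOURS_PER_AUDIO_HOUR[audio_type]
    return audio_minutes * low, audio_minutes * high


# A 30-minute group interview falls between 90 and 120 minutes of work.
print(estimate_range_minutes(30, "multiple_speakers"))
```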

How Much Does It Cost to Transcribe 30 Minutes of Audio?

