A useful feature in an unexpected location.
A variety of services will take an audio recording you upload, “listen” to it, and produce a text transcript of what was heard.
If you have a subscription to Microsoft Office, you have one at your fingertips — just not where you’d expect.
Transcriptions Using Word
Using the online version of Microsoft Word with an Office subscription, the “Dictate” main menu item includes a “Transcribe” option. Use that to upload audio files for Word to process into text, optionally including speaker identification and timestamps.
Creating a transcript using Word
The “trick”, such as it is, is to visit office.com on the web and use the online version of Microsoft Word to perform the task. The desktop version doesn’t appear to have this feature.
Start with a blank document and click on the small down-arrow next to the “Dictate” microphone icon. (If there’s no down-arrow, make sure you’re visiting the web version of Word, not running your installed desktop software.)
Click on Transcribe in the resulting menu; this will open a panel on the right-hand side.
You can click on Start recording to simply speak and have a transcription created. That seems of limited use, to be honest. More interesting is to click on Upload audio and upload an audio file.
Once you do so, the transcription will begin processing.
It may take some time, depending on the length and quality of the recording.
Once completed, the transcription, along with audio play controls, will appear in the panel.
In this interface, you can listen to the audio and correct the transcript at the same time. Automated transcription is always inaccurate to some degree, so it helps to play the audio one segment at a time and edit the text as you go. For example, I might replace the “?” with a period in the first sentence, since it’s a statement and not a question, and I’d replace “ask Leo. Com” with the correct “askleo.com”. You can also make these corrections later, after the transcript has been transferred to the document, but doing it here lets you listen to each segment as you edit it.
When you’re done, click on Add to document for a list of options.
As you can see from the transcript, the document can include three types of information:
- The transcript of the spoken words.
- An attempt to differentiate and identify the speaker.
- A timestamp identifying when in the recording the words were spoken.
You can choose to add the text to your document with or without the additional information — just click on your choice.
Click the “x” in the Transcribe pane to close it, and you can work with your transcript as you like.
If you look closely at the images above, you’ll see there’s a five-hour-per-month limit on uploaded files for transcription. And, as I said, you need a Microsoft Office subscription to access this feature.
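If you transcribe regularly, it can help to keep a running tally against that limit. Here is a minimal Python sketch of the arithmetic. The five-hour figure comes from the limit described above; the function names and the list of durations are made up for illustration, and Word does not expose any of this programmatically as far as I know.

```python
# Monthly upload allowance, taken from the five-hour limit described above.
MONTHLY_LIMIT_MINUTES = 5 * 60


def remaining_minutes(uploaded_minutes):
    """Minutes of upload allowance left after the given uploads (hypothetical helper)."""
    used = sum(uploaded_minutes)
    return max(0, MONTHLY_LIMIT_MINUTES - used)


def fits(uploaded_minutes, new_minutes):
    """True if a new recording of new_minutes still fits in this month's allowance."""
    return new_minutes <= remaining_minutes(uploaded_minutes)


if __name__ == "__main__":
    this_month = [90, 45.5]  # durations, in minutes, of files already uploaded
    print(remaining_minutes(this_month))
    print(fits(this_month, 120))
```

You would need to record the length of each upload yourself, for example in a spreadsheet or a text file.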
There are alternatives.
- There are paid services, such as HappyScribe, that produce automated transcriptions for a relatively low fee and offer higher-quality human-processed results for an additional fee.
- Google Recorder is an Android app (currently only available for Pixel phones) that produces transcriptions. While there’s no way to upload audio files, you can position your device in front of a speaker with the recorder running and play your audio into it to get a transcription. It seems a bit of a hack, but I’ve done this.
- YouTube can also be used. Convert the audio file to a video, upload it to YouTube as “unlisted”, and wait. Eventually, YouTube will analyze the audio for subtitles. Visit YouTube Studio, edit your video, and click on Subtitles on the far left. Click on the vertical ellipsis and “Download” will be an option.
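For the YouTube approach, the “convert the audio file to a video” step can be done in many ways. One possibility, assuming you have the free ffmpeg tool installed, is to pair the audio with a plain black picture. The Python sketch below only builds the ffmpeg command line. The function name, the file name interview.mp3, and the specific choices (a 1280x720 frame, H.264 video, AAC audio) are my assumptions for illustration, not anything YouTube requires.

```python
import shlex
from pathlib import Path


def audio_to_video_command(audio_path, video_path=None):
    """Build an ffmpeg command that pairs an audio file with a black video frame.

    Hypothetical helper: the output defaults to the audio file's name with .mp4.
    """
    audio = Path(audio_path)
    video = Path(video_path) if video_path else audio.with_suffix(".mp4")
    return [
        "ffmpeg",
        # Generate a black 1280x720 picture at 5 frames per second.
        "-f", "lavfi", "-i", "color=c=black:s=1280x720:r=5",
        # The audio recording to be transcribed.
        "-i", str(audio),
        # Encode video as H.264 and audio as AAC in widely supported formats.
        "-c:v", "libx264", "-pix_fmt", "yuv420p",
        "-c:a", "aac",
        # Stop when the shorter input (the audio) ends.
        "-shortest",
        str(video),
    ]


if __name__ == "__main__":
    cmd = audio_to_video_command("interview.mp3")
    # Print the command so it can be reviewed or pasted into a terminal.
    print(shlex.join(cmd))
    # To run it from Python instead: subprocess.run(cmd, check=True)
```

The script prints the command rather than running it, so you can check it before letting ffmpeg create the file.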
I’m sure there are more, but the Microsoft Word approach seems both useful and simple, if you have it available to you.