Subject Guides: Productivity Tools: Speech to Text

Otter

Otter listens to conversations and then transcribes them via automatic speech recognition (ASR). While Otter interfaces through iOS, Android and web apps, most of the processing takes place in the cloud. As the service transcribes the text, it also inserts punctuation and attempts to identify speakers.

Google Docs Voice Typing

Google Docs lets you use voice typing to dictate using your computer’s microphone.

Note: Voice Typing is only available for use in Google Docs and the speaker notes of Google Slides, and only if you’re using Google Chrome.

Microsoft Azure

Speech to text - Azure

The Microsoft's Azure Speech to Text feature is powered by deep neural network models and allows for real-time audio transcription that can be set up to handle multiple speakers. There are also customization options available to work better with different speech patterns, registers, and even background sounds. You can also modify settings to handle different specialist vocabularies, such as product names, technical information, and place names.

Zoom Audio Transcription

Use the Audio transcript option (under Cloud Recording) to automatically transcribe the audio of a meeting or webinar that you record to the cloud. After this transcript is processed, it appears as a separate .vtt text file in the list of recorded meetings.

IBM Watson

Automatically transcribe audio from 7 languages in real-time even from lower quality audio. It recognizes different speakers in the audio and spot specified keywords in real-time with high accuracy.