Transcribing audio files can be tedious, since it requires you to listen carefully and type quickly. It would be great if there were an app that could automate the whole process. Well, guess what: there is one, so let's see what it's all about!
Easy Speech2Text is a Windows app that converts MP3 recordings of speech to text. You can use it for free with some limitations: files can be no longer than 5 minutes, and each use produces no more than 500 words of transcribed text. If you want the app's full features, you can pay a one-time fee of $19.50.
After installing the app, you will have to create a Google Cloud account in order to obtain the JSON credentials file the app needs to operate. The account is free and comes with a $300 credit, which is the equivalent of 12,500 minutes of free transcription. After that, it costs only $1.44 for each hour of audio transcribed.
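The figures above can be checked with a little arithmetic. The following is a minimal sketch that uses only the prices quoted in this review ($1.44 per hour and a $300 credit); the function names are made up for illustration, and Google's actual pricing may differ or change over time.

```python
from decimal import Decimal

# Figures quoted in this review; treat them as assumptions, not current pricing.
PRICE_PER_HOUR_USD = Decimal("1.44")
FREE_CREDIT_USD = Decimal("300")


def price_per_minute(price_per_hour=PRICE_PER_HOUR_USD):
    """Price of one minute of transcription, derived from the hourly price."""
    return price_per_hour / 60


def free_minutes(credit=FREE_CREDIT_USD, price_per_hour=PRICE_PER_HOUR_USD):
    """Number of minutes the credit covers at the given hourly price."""
    return credit / price_per_minute(price_per_hour)


def transcription_cost(minutes, credit_left=Decimal("0"),
                       price_per_hour=PRICE_PER_HOUR_USD):
    """Amount still owed for `minutes` of audio after any remaining credit."""
    gross = Decimal(str(minutes)) * price_per_minute(price_per_hour)
    return max(Decimal("0"), gross - credit_left)
```

At $1.44 per hour, one minute costs $0.024, so a $300 credit covers 300 / 0.024 = 12,500 minutes, which matches the number quoted above. Once the credit is used up, a 60-minute recording would cost $1.44.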
Setting up the Google Cloud account and loading the JSON file may sound like extra work, but that is actually why the app's price is so low and why the app is subscription-free. After paying the one-time fee, you simply pay Google for the transcription you use.
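If the app refuses the JSON file, it may help to check the file before loading it. The sketch below assumes the file is a Google Cloud service account key, which is an assumption on our part, as the app's exact requirements are not covered here. The function name and the list of checked fields are chosen for illustration only.

```python
import json

# Some of the fields found in a Google Cloud service account key file.
# Assumption: the app expects this kind of key file.
REQUIRED_FIELDS = ("type", "project_id", "private_key", "client_email")


def check_key_file(text):
    """Return a list of problems found in the key file contents (empty if none)."""
    try:
        data = json.loads(text)
    except json.JSONDecodeError as exc:
        return [f"not valid JSON: {exc.msg}"]
    if not isinstance(data, dict):
        return ["top level is not a JSON object"]

    # Report every required field that is absent or empty.
    problems = [f"missing field: {name}"
                for name in REQUIRED_FIELDS if not data.get(name)]

    # A service account key declares its type explicitly.
    key_type = data.get("type")
    if key_type and key_type != "service_account":
        problems.append(f"unexpected type: {key_type!r}")
    return problems
```

To use it, read the downloaded file as text and pass the contents to check_key_file; an empty list means the basic structure looks right. It does not verify that the key is still active or that the transcription service is enabled for the project.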
Once everything is set up properly, the app works quite well. Transcriptions are accurate, although you will still need to revise the final document for punctuation and minor mistakes. Accuracy is better for more widely spoken languages, so transcribing English speech works much better than transcribing a less common language.
Besides speech-to-text, the app can also read written text out loud in various languages, and you can save the spoken audio on your computer. The app is very simple to use, and you will quickly find that you no longer have to pay high prices for manual transcription (or do it yourself), as the end product only needs a bit of editing and you're good to go! It would be great if the app supported more file formats besides MP3, since that would save users even more time.
• A low price for what you get • The app uses highly reliable Google data • Supports both speech-to-text and text-to-speech conversions • Plenty of supported languages
• Having to go through the process of acquiring the Google JSON credentials • The only supported audio format is MP3
• .NET 4.5 or later required • 512 MB RAM • 800 MHz CPU • 512 MB disk space