Using google's voice recognition to convert audio files to text

Google’s api for voice recognition is more accurate than most voice recognition software that I have tried, including Nuance’s Dragon. In addition, it recognizes a multitude of languages.

Is there any way I can exploit Google’s API to convert audio files into text? I want to upload lengthy audio files which would be then converted to text; not speak into a microphone.

You could try YouTube’s automatic captions feature.

Or you can try one of the following API demos:

But even if the underlying engine is great (and it is), you don’t get the full UI of something like Nuance. You can’t easily go back and correct mistakes, customize how it’s trained, etc.