Audio To Wav Via Ffmpeg
Convert audio to 1 channel wav for DeepSpeech
It is remarkable that recording sound is the hard part of DeepSpeech.
With deepspeech installed in a virtualenv, do the following.
Recording audio
macos: get a m4a format audio with quicktime
Use ffmpeg
- Get ffmpeg:
brew install ffmpegorapt install ffmpeg - Convert to single channel 192kbps:
$ ffmpeg -i ~/Desktop/test_audio.m4a -ac 1 -ab 192k ~/Desktop/test.wav - Run the demo:
deepspeech models/output_graph.pb ~/Desktop/test.wav models/alphabet.txt