Audio To Wav Via Ffmpeg
Convert audio to 1 channel wav for DeepSpeech
It is remarkable that recording sound is the hard part of DeepSpeech.
With deepspeech installed in a virtualenv
, do the following.
Recording audio
macos: get a m4a
format audio with quicktime
Use ffmpeg
- Get ffmpeg:
brew install ffmpeg
orapt install ffmpeg
- Convert to single channel 192kbps:
$ ffmpeg -i ~/Desktop/test_audio.m4a -ac 1 -ab 192k ~/Desktop/test.wav
- Run the demo:
deepspeech models/output_graph.pb ~/Desktop/test.wav models/alphabet.txt