Spoken Language Processing in Python Chapter 2
Python library
SPOKEN LANGUAGE PROCESSING IN PYTHON
Daniel Bourke
Machine Learning Engineer/YouTube Creator
Why the SpeechRecognition library?
Some existing Python libraries
CMU Sphinx
Kaldi
SpeechRecognition
Wav2letter++ by Facebook
recognize_google()
recognize_google_cloud()
recognize_wit()
Input: audio_file
The AudioFile class
import speech_recognition as sr
<class 'speech_recognition.AudioFile'>
<class 'speech_recognition.AudioData'>
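The two types printed above mark the two stages of reading a file: an AudioFile is only a handle on the file, and the recognizer's record() method turns it into the AudioData that the recognize_*() methods accept. A minimal sketch of that conversion follows. The helper name load_audio and the file path are hypothetical; the import is placed inside the function only so the sketch can be defined without the library installed.

```python
def load_audio(path, offset=None, duration=None):
    """Read an audio file into an AudioData object.

    `load_audio` is a hypothetical helper name, not part of the library.
    `offset` and `duration` are in seconds and are passed straight to
    Recognizer.record().
    """
    import speech_recognition as sr  # third-party: pip install SpeechRecognition

    recognizer = sr.Recognizer()
    clip = sr.AudioFile(path)        # AudioFile: a handle on the file
    with clip as source:             # the file is only open inside `with`
        audio = recognizer.record(source, offset=offset, duration=duration)
    return audio                     # AudioData: ready for recognize_google()
```

For example, load_audio("support_call.wav", offset=2, duration=10) would skip the first 2 seconds and read the next 10, assuming a file of that (made-up) name exists.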
What language?
# Create an instance of the Recognizer class
recognizer = sr.Recognizer()
Ohio gozaimasu
おはようございます
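recognize_google() assumes US English ("en-US") unless told otherwise, which is why Japanese speech can come back as English-sounding text. Passing the language argument changes that. The sketch below transcribes one clip under several language settings so the results can be compared side by side. The helper name transcribe_in_languages is hypothetical, and it assumes recognizer is an sr.Recognizer() and audio is an AudioData object that was already recorded from a file.

```python
def transcribe_in_languages(recognizer, audio, languages=("en-US", "ja")):
    """Transcribe the same audio once per language code.

    `transcribe_in_languages` is a hypothetical helper name.
    `recognizer` is expected to behave like sr.Recognizer(), and `audio`
    like the AudioData returned by recognizer.record().
    Returns a dict mapping each language code to its transcription.
    """
    results = {}
    for language in languages:
        # `language` is a real keyword argument of recognize_google()
        results[language] = recognizer.recognize_google(audio, language=language)
    return results
```

Each call goes to the web API, so a clip checked against two languages costs two requests.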
UnknownValueError:
[]
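When the audio contains no recognizable speech, recognize_google() raises UnknownValueError, and the empty list above is what the slides show when the raw response is requested with show_all=True instead. A minimal sketch of handling the exception follows. The helper name transcribe_or_none is hypothetical, and returning None for "no speech found" is one possible design choice, not the library's behavior.

```python
def transcribe_or_none(recognizer, audio):
    """Return the transcription, or None if no speech could be recognized.

    `transcribe_or_none` is a hypothetical helper name. `recognizer` is
    expected to be an sr.Recognizer() and `audio` an AudioData object.
    """
    import speech_recognition as sr  # third-party: pip install SpeechRecognition

    try:
        return recognizer.recognize_google(audio)
    except sr.UnknownValueError:
        # Raised when the API could not match the audio to any words,
        # for example for animal sounds or background noise.
        return None
```

Returning None keeps "nothing was said" distinct from an empty string, so the caller can decide whether to skip the clip or report it.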
Text from speaker 0: one of the limitations of the speech recognition library
Text from speaker 1: is that it doesn't recognise different speakers and voices
Text from speaker 2: it will just return it all as one block a text
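Because the library returns everything as one block of text, per-speaker output like the lines above is only possible if the audio has already been split into one clip per speaker. The sketch below assumes that split was done beforehand (by hand or with another tool) and simply transcribes each clip in turn. The helper name transcribe_speakers is hypothetical, recognizer is assumed to be an sr.Recognizer(), and clips a list of AudioData objects, one per speaker.

```python
def transcribe_speakers(recognizer, clips, language="en-US"):
    """Transcribe one pre-split clip per speaker and label each result.

    `transcribe_speakers` is a hypothetical helper name. The library does
    not separate speakers itself, so `clips` must already contain one
    AudioData object per speaker, in speaking order.
    """
    lines = []
    for i, audio in enumerate(clips):
        text = recognizer.recognize_google(audio, language=language)
        lines.append(f"Text from speaker {i}: {text}")
    return lines
```

Printing each returned line reproduces the labeled format shown above; the speaker numbers are just list positions, not identities the recognizer detected.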