0% found this document useful (0 votes)

58 views

Spoken Language Processing in Python Chapter3

Uploaded by

Fgpeqw

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

58 views

Spoken Language Processing in Python Chapter3

Uploaded by

Fgpeqw

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 26

Introduction to

PyDub
S P OK EN LAN GUAGE P ROCES S IN G IN P YTH ON

Daniel Bourke
Machine Learning Engineer/YouTube
Creator
Installing PyDub
$ pip install pydub

If using les other than .wav , install ffmpeg via ffmpeg.org

SPOKEN LANGUAGE PROCESSING IN PYTHON

PyDub's main class, AudioSegment
# Import PyDub main class
from pydub import AudioSegment

# Import an audio file

wav_file = AudioSegment.from_file(file="wav_file.wav", format="wav")

# Format parameter only for readability

wav_file = AudioSegment.from_file(file="wav_file.wav")

type(wav_file)

pydub.audio_segment.AudioSegment

SPOKEN LANGUAGE PROCESSING IN PYTHON

Playing an audio le
# Install simpleaudio for wav playback
$pip install simpleaudio

# Import play function

from pydub.playback import play

# Import audio file

wav_file = AudioSegment.from_file(file="wav_file.wav")

# Play audio file

play(wav_file)

SPOKEN LANGUAGE PROCESSING IN PYTHON

Audio parameters
# Import audio files
wav_file = AudioSegment.from_file(file="wav_file.wav")
two_speakers = AudioSegment.from_file(file="two_speakers.wav")

# Check number of channels

wav_file.channels, two_speakers.channels

1, 2

wav_file.frame_rate

480000

SPOKEN LANGUAGE PROCESSING IN PYTHON

Audio parameters
# Find the number of bytes per sample
wav_file.sample_width

# Find the max amplitude

wav_file.max

8488

SPOKEN LANGUAGE PROCESSING IN PYTHON

Audio parameters
# Duration of audio file in milliseconds
len(wav_file)

3284

SPOKEN LANGUAGE PROCESSING IN PYTHON

Changing audio parameters
# Change ATTRIBUTENAME of AudioSegment to x
changeed_audio_segment = audio_segment.set_ATTRIBUTENAME(x)

# Change sample width to 1

wav_file_width_1 = wav_file.sample_width(1)
wav_file_width_1.sample_width

SPOKEN LANGUAGE PROCESSING IN PYTHON

Changing audio parameters
# Change sample rate
wav_file_16k = wav_file.frame_rate(16000)
wav_file_16k.frame_rate

16000

# Change number of channels

wav_file_1_channel = wav_file.set_channels(1)
wav_file_1_channel.channels

SPOKEN LANGUAGE PROCESSING IN PYTHON

Let's practice!
S P OK EN LAN GUAGE P ROCES S IN G IN P YTH ON
Manipulating audio
les with PyDub
S P OK EN LAN GUAGE P ROCES S IN G IN P YTH ON

Daniel Bourke
Machine Learning Engineer/YouTube
Creator
Turning it down to 11
# Import audio file
wav_file = AudioSegment.from_file("wav_file.wav")
# Minus 60 dB
quiet_wav_file = wav_file - 60

# Try to recognize quiet audio

recognizer.recognize_google(quiet_wav_file)

UnknownValueError:

SPOKEN LANGUAGE PROCESSING IN PYTHON

Increasing the volume
# Increase the volume by 10 dB
louder_wav_file = wav_file + 10

# Try to recognize
recognizer.recognize_google(louder_wav_file)

this is a wav file

SPOKEN LANGUAGE PROCESSING IN PYTHON

This all sounds the same
# Import AudioSegment and normalize
from pydub import AudioSegment
from pydub.effects import normalize
from pydub.playback import play

# Import uneven sound audio file

loud_quiet = AudioSegment.from_file("loud_quiet.wav")
# Normalize the sound levels
normalized_loud_quiet = normalize(loud_quiet)

# Check the sound

play(normalized_loud_quiet)

SPOKEN LANGUAGE PROCESSING IN PYTHON

Remixing your audio les
# Import audio with static at start
static_at_start = AudioSegment.from_file("static_at_start.wav")

# Remove the static via slicing

no_static_at_start = static_at_start[5000:]

# Check the new sound

play(no_static_at_start)

SPOKEN LANGUAGE PROCESSING IN PYTHON

Remixing your audio les
# Import two audio files
wav_file_1 = AudioSegment.from_file("wav_file_1.wav")
wav_file_2 = AudioSegment.from_file("wav_file_2.wav")

# Combine the two audio files

wav_file_3 = wav_file_1 + wav_file_2

# Check the sound

play(wav_file_3)

# Combine two wav files and make the combination louder

louder_wav_file_3 = wav_file_1 + wav_file_2 + 10

SPOKEN LANGUAGE PROCESSING IN PYTHON

Splitting your audio
# Import phone call audio
phone_call = AudioSegment.from_file("phone_call.wav")
# Find number of channels
phone_call.channels

# Split stereo to mono

phone_call_channels = phone_call.split_to_mono()
phone_call_channels

[<pydub.audio_segment.AudioSegment, <pydub.audio_segment.AudioSegment>]

SPOKEN LANGUAGE PROCESSING IN PYTHON

Splitting your audio
# Find number of channels of first list item
phone_call_channels[0].channels

# Recognize the first channel

recognizer.recognize_google(phone_call_channel_1)

the pydub library is really useful

SPOKEN LANGUAGE PROCESSING IN PYTHON

Let's code!
S P OK EN LAN GUAGE P ROCES S IN G IN P YTH ON
Converting and
saving audio les
with PyDub
S P OK EN LAN GUAGE P ROCES S IN G IN P YTH ON

Daniel Bourke
Machine Learning Engineer/YouTube
Creator
Exporting audio les
from pydub import AudioSegment

# Import audio file

wav_file = AudioSegment.from_file("wav_file.wav")

# Increase by 10 decibels
louder_wav_file = wav_file + 10

# Export louder audio file

louder_wav_file.export(out_f="louder_wav_file.wav", format="wav")

<_io.BufferedRandom name='louder_wav_file.wav'>

SPOKEN LANGUAGE PROCESSING IN PYTHON

Reformatting and exporting multiple audio les
def make_wav(wrong_folder_path, right_folder_path):

# Loop through wrongly formatted files

for file in os.scandir(wrong_folder_path):

# Only work with files with audio extensions we're fixing

if file.path.endswith(".mp3") or file.path.endswith(".flac"):

# Create the new .wav filename

out_file = right_folder_path + os.path.splitext(os.path.basename(file.path))[0] + ".wav"

# Read in the audio file and export it in wav format

AudioSegment.from_file(file.path).export(out_file,
format="wav")

print(f"Creating {out_file}")

SPOKEN LANGUAGE PROCESSING IN PYTHON

Reformatting and exporting multiple audio les
# Call our new function
make_wav("data/wrong_formats/", "data/right_format/")

Creating data/right_types/wav_file.wav
Creating data/right_types/flac_file.wav
Creating data/right_types/mp3_file.wav

SPOKEN LANGUAGE PROCESSING IN PYTHON

Manipulating and exporting
def make_no_static_louder(static_quiet, louder_no_static):
# Loop through files with static and quiet (already in wav format)
for file in os.scandir(static_quiet_folder_path):

# Create new file path

out_file = louder_no_static + os.path.splitext(os.path.basename(file.path))[0] + ".wav"

# Read the audio file

audio_file = AudioSegment.from_file(file.path)

# Remove first three seconds and add 10 decibels and export

audio_file = (audio_file[3100:] + 10).export(out_file, format="wav")

print(f"Creating {out_file}")

SPOKEN LANGUAGE PROCESSING IN PYTHON

Manipulating and exporting
# Remove static and make louder
make_no_static_louder("data/static_quiet/", "data/louder_no_static/")

Creating data/louder_no_static/speech-recognition-services.wav
Creating data/louder_no_static/order-issue.wav
Creating data/louder_no_static/help-with-acount.wav

SPOKEN LANGUAGE PROCESSING IN PYTHON

Your turn!
S P OK EN LAN GUAGE P ROCES S IN G IN P YTH ON

Get Started With Databricks For Machine Learning
No ratings yet
Get Started With Databricks For Machine Learning
85 pages
Credit Risk Modeling in Python Chapter3
No ratings yet
Credit Risk Modeling in Python Chapter3
35 pages
iCEDQ Brochure - Product Datasheet
No ratings yet
iCEDQ Brochure - Product Datasheet
5 pages
Hefei Yifei Mac Fix: LAST - MODIFICATION Fri Dec 20 23:04:51 2019
No ratings yet
Hefei Yifei Mac Fix: LAST - MODIFICATION Fri Dec 20 23:04:51 2019
109 pages
Designing Machine Learning Workflows in Python Chapter2
No ratings yet
Designing Machine Learning Workflows in Python Chapter2
39 pages
Analyzing IoT Data in Python Chapter3
No ratings yet
Analyzing IoT Data in Python Chapter3
30 pages
Customer Segmentation in Python Chapter2
No ratings yet
Customer Segmentation in Python Chapter2
33 pages
Instructions Hcc02 Control Unit Analog Control Unit For Ice and Snow Removal
No ratings yet
Instructions Hcc02 Control Unit Analog Control Unit For Ice and Snow Removal
4 pages
Spoken Language Processing in Python Chapter2
No ratings yet
Spoken Language Processing in Python Chapter2
23 pages
Spoken Language Processing in Python Chapter4
No ratings yet
Spoken Language Processing in Python Chapter4
46 pages
Spoken Language Processing in Python Chapter1
No ratings yet
Spoken Language Processing in Python Chapter1
17 pages
Designing Machine Learning Workflows in Python Chapter4
No ratings yet
Designing Machine Learning Workflows in Python Chapter4
38 pages
Designing Machine Learning Workflows in Python Chapter1
No ratings yet
Designing Machine Learning Workflows in Python Chapter1
32 pages
Analyzing IoT Data in Python Chapter4
No ratings yet
Analyzing IoT Data in Python Chapter4
34 pages
Introduction To Data Visualization With Seaborn Chapter2
No ratings yet
Introduction To Data Visualization With Seaborn Chapter2
38 pages
Designing Machine Learning Workflows in Python Chapter3
No ratings yet
Designing Machine Learning Workflows in Python Chapter3
42 pages
Introduction To Data Visualization With Seaborn Chapter1
No ratings yet
Introduction To Data Visualization With Seaborn Chapter1
26 pages
Introduction To Data Visualization With Matplotlib Chapter2
No ratings yet
Introduction To Data Visualization With Matplotlib Chapter2
27 pages
Building Chatbots in Python Chapter2 PDF
No ratings yet
Building Chatbots in Python Chapter2 PDF
41 pages
Analyzing IoT Data in Python Chapter1
100% (1)
Analyzing IoT Data in Python Chapter1
27 pages
Introduction To Data Visualization With Seaborn Chapter3
100% (1)
Introduction To Data Visualization With Seaborn Chapter3
32 pages
Introduction To Data Visualization With Python
No ratings yet
Introduction To Data Visualization With Python
47 pages
Early Stopping in Practice
No ratings yet
Early Stopping in Practice
14 pages
Meeting DWH QA Challenges Part 2
No ratings yet
Meeting DWH QA Challenges Part 2
10 pages
Cloud Practitioner: Aws Certified
No ratings yet
Cloud Practitioner: Aws Certified
18 pages
Building Chatbots in Python Chapter4
No ratings yet
Building Chatbots in Python Chapter4
20 pages
Credit Risk - Predictive Modelling
No ratings yet
Credit Risk - Predictive Modelling
47 pages
Data Scientist Certification Study Guide
No ratings yet
Data Scientist Certification Study Guide
7 pages
Credit Score Validation
No ratings yet
Credit Score Validation
5 pages
Predictive Analytics I: Data Mining: Process, Methods, and Algorithms
No ratings yet
Predictive Analytics I: Data Mining: Process, Methods, and Algorithms
60 pages
Experiment No: 1 Introduction To Data Analytics and Python Fundamentals Page-1/11
No ratings yet
Experiment No: 1 Introduction To Data Analytics and Python Fundamentals Page-1/11
8 pages
Power BI Cheat Sheet
No ratings yet
Power BI Cheat Sheet
10 pages
Etl Cook Book PDF
No ratings yet
Etl Cook Book PDF
14 pages
Extraction, Transformation, and Load (ETL) Specification
No ratings yet
Extraction, Transformation, and Load (ETL) Specification
8 pages
Cleaning Data With PySpark Chapter3
No ratings yet
Cleaning Data With PySpark Chapter3
25 pages
Introduction To Python For Data Science - Syllabus
100% (1)
Introduction To Python For Data Science - Syllabus
5 pages
List Comprehension in Python
No ratings yet
List Comprehension in Python
8 pages
ETL Testing Concepts iCEDQ
No ratings yet
ETL Testing Concepts iCEDQ
20 pages
Stanford University CS224d - Deep Learning For Natural Language Processing - Syllabus
No ratings yet
Stanford University CS224d - Deep Learning For Natural Language Processing - Syllabus
3 pages
Bank Stress Testing and Comprehensive Capital Assessment and Review (CCAR)
No ratings yet
Bank Stress Testing and Comprehensive Capital Assessment and Review (CCAR)
34 pages
Accenture Counterparty Credit Risk Basel Framework Successful Implementation
No ratings yet
Accenture Counterparty Credit Risk Basel Framework Successful Implementation
17 pages
Customer Segmentation in Python Chapter3
No ratings yet
Customer Segmentation in Python Chapter3
25 pages
Lesson 07 Data Manipulation With Pandas
No ratings yet
Lesson 07 Data Manipulation With Pandas
82 pages
SQL Server To Aurora PostgreSQL Migration Playbook 1.0 Preliminary
No ratings yet
SQL Server To Aurora PostgreSQL Migration Playbook 1.0 Preliminary
456 pages
Banking, Finance and Insurance Domain
No ratings yet
Banking, Finance and Insurance Domain
14 pages
Time Series
100% (1)
Time Series
91 pages
Analyzing IoT Data in Python Chapter2
No ratings yet
Analyzing IoT Data in Python Chapter2
35 pages
Portfolio Management Report
No ratings yet
Portfolio Management Report
10 pages
Technologies For Handling Big Data: Prepared By: Saidatul Rahah Hamidi
No ratings yet
Technologies For Handling Big Data: Prepared By: Saidatul Rahah Hamidi
49 pages
Time Series
No ratings yet
Time Series
29 pages
DAX Cheat Sheet
No ratings yet
DAX Cheat Sheet
10 pages
Business Requirements Document /: Project Name Module Name
No ratings yet
Business Requirements Document /: Project Name Module Name
11 pages
Data-Science MUMBAI
100% (1)
Data-Science MUMBAI
149 pages
Python Basic
No ratings yet
Python Basic
34 pages
Chapter 5.3-Mulitple Linear Regression
No ratings yet
Chapter 5.3-Mulitple Linear Regression
26 pages
R Programming For NGS Data Analysis
No ratings yet
R Programming For NGS Data Analysis
5 pages
(Morton Lane) Alternative Risk Strategies
No ratings yet
(Morton Lane) Alternative Risk Strategies
725 pages
IICT - Data Science
No ratings yet
IICT - Data Science
22 pages
Usharani Bhimavarapu Jude D
100% (1)
Usharani Bhimavarapu Jude D
349 pages
Pydub
No ratings yet
Pydub
26 pages
SpeechRecognition
No ratings yet
SpeechRecognition
5 pages
Voice_Assistant_Report
No ratings yet
Voice_Assistant_Report
4 pages
Week-8 Nlp Lab Program
No ratings yet
Week-8 Nlp Lab Program
6 pages
Preparing Your Gures To Share With Others: Ariel Rokem
No ratings yet
Preparing Your Gures To Share With Others: Ariel Rokem
35 pages
Changing Plot Style and Color: Erin Case
No ratings yet
Changing Plot Style and Color: Erin Case
54 pages
Chapter3 PDF
No ratings yet
Chapter3 PDF
36 pages
Introduction To Data Visualization With Matplotlib: Ariel Rokem
No ratings yet
Introduction To Data Visualization With Matplotlib: Ariel Rokem
30 pages
Customer Segmentation in Python Chapter4
No ratings yet
Customer Segmentation in Python Chapter4
37 pages
Cleaning Data With PySpark Chapter2
100% (1)
Cleaning Data With PySpark Chapter2
25 pages
Credit Risk Modeling in Python Chapter4
100% (1)
Credit Risk Modeling in Python Chapter4
35 pages
Cleaning Data With PySpark Chapter1
0% (1)
Cleaning Data With PySpark Chapter1
20 pages
Cleaning Data With PySpark Chapter4
No ratings yet
Cleaning Data With PySpark Chapter4
23 pages
Advanced NLP With Spacy Chapter4
No ratings yet
Advanced NLP With Spacy Chapter4
26 pages
Manual Porsche Charge o Mat Pro EU
No ratings yet
Manual Porsche Charge o Mat Pro EU
122 pages
P9160-4F Experiment Manual (Extraction) ENG
No ratings yet
P9160-4F Experiment Manual (Extraction) ENG
6 pages
XLamp Lumen Maintenance
No ratings yet
XLamp Lumen Maintenance
7 pages
AVH-X491BHS OwnersManual070617
No ratings yet
AVH-X491BHS OwnersManual070617
208 pages
Current, Voltage & Resistance
No ratings yet
Current, Voltage & Resistance
48 pages
VIC1641DQ
No ratings yet
VIC1641DQ
9 pages
Canvas Re Amp Digital Manual
No ratings yet
Canvas Re Amp Digital Manual
6 pages
Yuva Samaj Sewak Jan Kalyan Sewa Samiti Pilibhit Uttar Pradesh
No ratings yet
Yuva Samaj Sewak Jan Kalyan Sewa Samiti Pilibhit Uttar Pradesh
2 pages
Style 390-1: Supplied Parts
No ratings yet
Style 390-1: Supplied Parts
1 page
II Puc All Mcqs Cs
No ratings yet
II Puc All Mcqs Cs
21 pages
Sony KV-25DXR Service Manual
No ratings yet
Sony KV-25DXR Service Manual
91 pages
An Improved Analog Waveforms Generation Technique Using Direct Digital Synthesizer
No ratings yet
An Improved Analog Waveforms Generation Technique Using Direct Digital Synthesizer
4 pages
8051 microcontroller
No ratings yet
8051 microcontroller
46 pages
Distribution System Modeling Amd Analysis
100% (2)
Distribution System Modeling Amd Analysis
61 pages
JD - Analog Design Engineer - Automotive - TI
No ratings yet
JD - Analog Design Engineer - Automotive - TI
2 pages
Vehicle Theft Intimation11
No ratings yet
Vehicle Theft Intimation11
2 pages
M.B.M. University: S.no Index P.no Sign
No ratings yet
M.B.M. University: S.no Index P.no Sign
4 pages
Education Kit For Embedded Systems Practice
No ratings yet
Education Kit For Embedded Systems Practice
41 pages
Short-Circuit Modeling of A Wind Power Plant
No ratings yet
Short-Circuit Modeling of A Wind Power Plant
12 pages
Renesas-601M-02ILFT-datasheet
No ratings yet
Renesas-601M-02ILFT-datasheet
9 pages
BC846W
No ratings yet
BC846W
8 pages
Operation Manual: Line Tension Transducer Series
No ratings yet
Operation Manual: Line Tension Transducer Series
16 pages
A6V11456174 - Touch Screen Flush-Mount Room Thermostats With KNX - en
No ratings yet
A6V11456174 - Touch Screen Flush-Mount Room Thermostats With KNX - en
26 pages
Piher PC16
No ratings yet
Piher PC16
6 pages
[FREE PDF sample] Optoelectronics 1st Edition Emmanuel Rosencher ebooks
100% (1)
[FREE PDF sample] Optoelectronics 1st Edition Emmanuel Rosencher ebooks
77 pages
Carrier 30AW USER INTERFACE Installation Manual 2
100% (1)
Carrier 30AW USER INTERFACE Installation Manual 2
19 pages
12v Battery Charger Circuit With Auto Cut Off - Circuits Gallery
0% (1)
12v Battery Charger Circuit With Auto Cut Off - Circuits Gallery
40 pages
Diagnosa RS Kalmar
100% (3)
Diagnosa RS Kalmar
41 pages

Spoken Language Processing in Python Chapter3

Uploaded by

Spoken Language Processing in Python Chapter3

Uploaded by

Introduction to

If using les other than .wav , install ffmpeg via ffmpeg.org

SPOKEN LANGUAGE PROCESSING IN PYTHON

# Import an audio file

# Format parameter only for readability

SPOKEN LANGUAGE PROCESSING IN PYTHON

# Import play function

# Import audio file

# Play audio file

SPOKEN LANGUAGE PROCESSING IN PYTHON

# Check number of channels

SPOKEN LANGUAGE PROCESSING IN PYTHON

# Find the max amplitude

SPOKEN LANGUAGE PROCESSING IN PYTHON

SPOKEN LANGUAGE PROCESSING IN PYTHON

# Change sample width to 1

SPOKEN LANGUAGE PROCESSING IN PYTHON

# Change number of channels

SPOKEN LANGUAGE PROCESSING IN PYTHON

# Try to recognize quiet audio

SPOKEN LANGUAGE PROCESSING IN PYTHON

this is a wav file

SPOKEN LANGUAGE PROCESSING IN PYTHON

# Import uneven sound audio file

# Check the sound

SPOKEN LANGUAGE PROCESSING IN PYTHON

# Remove the static via slicing

# Check the new sound

SPOKEN LANGUAGE PROCESSING IN PYTHON

# Combine the two audio files

# Check the sound

# Combine two wav files and make the combination louder

SPOKEN LANGUAGE PROCESSING IN PYTHON

# Split stereo to mono

SPOKEN LANGUAGE PROCESSING IN PYTHON

# Recognize the first channel

the pydub library is really useful

SPOKEN LANGUAGE PROCESSING IN PYTHON

# Import audio file

# Export louder audio file

SPOKEN LANGUAGE PROCESSING IN PYTHON

# Loop through wrongly formatted files

# Only work with files with audio extensions we're fixing

# Create the new .wav filename

# Read in the audio file and export it in wav format

SPOKEN LANGUAGE PROCESSING IN PYTHON

SPOKEN LANGUAGE PROCESSING IN PYTHON

# Create new file path

# Read the audio file

# Remove first three seconds and add 10 decibels and export

SPOKEN LANGUAGE PROCESSING IN PYTHON

SPOKEN LANGUAGE PROCESSING IN PYTHON

You might also like