Continuous Wavelet Transform for Severity-Level Classification of Dysarthria

Published: 14 November 2022

Abstract

Dysarthria is a neuro-motor speech disorder that renders speech unintelligible and is often imperceptible to human listeners at various severity levels. Dysarthric speech classification is used as a diagnostic method to assess the progression of a patient's condition, as well as to aid automatic dysarthric speech recognition systems (an important assistive speech technology). This study investigates the significance of Generalized Morse Wavelet (GMW)-based scalogram features for capturing the discriminative acoustic cues of dysarthric severity-level classification in the low-frequency regions, using a Convolutional Neural Network (CNN). The performance of scalogram-based features is compared with Short-Time Fourier Transform (STFT)-based and Mel spectrogram-based features. Compared to the STFT-based baseline features, which give a classification accuracy of 91.76%, the proposed Continuous Wavelet Transform (CWT)-based scalogram features achieve a significantly improved classification accuracy of 95.17% on the standard and statistically meaningful UA-Speech corpus. These improved results indicate that the information in the low-frequency regions is more discriminative for dysarthric severity-level classification, because the proposed CWT-based time-frequency representation (scalogram) has high frequency resolution at low frequencies. STFT-based representations, by contrast, have constant resolution across all frequency bands and are therefore not as well suited for dysarthric severity-level classification as the proposed Morse wavelet-based CWT features. We also perform experiments on the Mel spectrogram to show that, even though it likewise has high frequency resolution at low frequencies, it reaches a classification accuracy of only 92.65%, so the proposed system remains better suited. The proposed system improves classification accuracy by 3.41% and 2.52% over the STFT and the Mel spectrogram, respectively. To that effect, the performance of the STFT, Mel spectrogram, and scalogram features is analyzed using F1-Score, Matthews Correlation Coefficient (MCC), Jaccard Index, Hamming Loss, and Linear Discriminant Analysis (LDA) scatter plots.
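To make the feature-extraction idea concrete, the following is a minimal NumPy sketch of a GMW-based scalogram computed in the frequency domain, where the wavelet family is Ψ_{β,γ}(ω) ∝ ω^β e^{−ω^γ} for ω > 0. The parameter values β = 20, γ = 3, the peak normalization, and the log-spaced 50 Hz–4 kHz analysis grid are illustrative assumptions, not the paper's settings; in practice a library implementation (e.g. MATLAB's `cwt`, which uses Morse wavelets by default) would likely be used, and this sketch only makes the construction explicit.

```python
import numpy as np

def gmw_scalogram(x, fs, freqs, beta=20.0, gamma=3.0):
    """|CWT| of x using frequency-domain Generalized Morse Wavelets.

    x     : 1-D signal
    fs    : sampling rate in Hz
    freqs : analysis frequencies in Hz (dense at the low end for speech)
    """
    n = len(x)
    X = np.fft.fft(x)
    omega = 2.0 * np.pi * np.fft.fftfreq(n)        # angular frequency, rad/sample
    peak = (beta / gamma) ** (1.0 / gamma)         # wavelet peak frequency omega_p
    scalo = np.empty((len(freqs), n))
    for i, f in enumerate(freqs):
        s = peak * fs / (2.0 * np.pi * f)          # scale placing the peak at f Hz
        w = s * omega
        psi = np.zeros(n)
        pos = w > 0                                # analytic wavelet: positive freqs only
        # peak-normalised GMW: 2 * (w/peak)^beta * exp(peak^gamma - w^gamma)
        psi[pos] = 2.0 * np.exp(beta * np.log(w[pos] / peak)
                                + peak ** gamma - w[pos] ** gamma)
        scalo[i] = np.abs(np.fft.ifft(X * psi))    # one CWT row per frequency
    return scalo

# Illustrative usage: a log-spaced grid emphasises low frequencies; the
# log-compressed scalogram can then be fed to a CNN as a 1-channel image.
fs = 16000
t = np.arange(fs) / fs
x = np.sin(2 * np.pi * 220 * t)                    # stand-in for a speech signal
freqs = np.logspace(np.log10(50), np.log10(4000), 128)
S = np.log1p(gmw_scalogram(x, fs, freqs))
print(S.shape)                                     # (128, 16000)
```

The log-spaced grid is what gives the scalogram its high frequency resolution at low frequencies, which the abstract identifies as the property that the constant-resolution STFT lacks.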

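The evaluation metrics listed at the end of the abstract all have direct scikit-learn counterparts. The sketch below is a minimal mapping, assuming integer severity labels; the macro averaging mode and the helper names `severity_metrics` / `lda_projection` are assumptions for illustration, not the authors' code.

```python
import numpy as np
from sklearn.metrics import (f1_score, matthews_corrcoef,
                             jaccard_score, hamming_loss)
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis

def severity_metrics(y_true, y_pred):
    """Scalar scores for a multi-class severity-level classifier."""
    return {
        "F1 (macro)": f1_score(y_true, y_pred, average="macro"),
        "MCC": matthews_corrcoef(y_true, y_pred),
        "Jaccard (macro)": jaccard_score(y_true, y_pred, average="macro"),
        "Hamming loss": hamming_loss(y_true, y_pred),
    }

def lda_projection(features, labels, n_components=2):
    """Project features (e.g. flattened scalograms) onto at most
    n_classes - 1 discriminant axes for LDA scatter plots."""
    lda = LinearDiscriminantAnalysis(n_components=n_components)
    return lda.fit_transform(features, labels)

# Toy usage with four severity levels:
y_true = np.array([0, 1, 2, 3, 2, 1, 0, 3])
y_pred = np.array([0, 1, 2, 2, 2, 1, 0, 3])
print(severity_metrics(y_true, y_pred))
```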

Published In

Speech and Computer: 24th International Conference, SPECOM 2022, Gurugram, India, November 14–16, 2022, Proceedings
Springer-Verlag, Berlin, Heidelberg, November 2022, 736 pages
ISBN: 978-3-031-20979-6
DOI: 10.1007/978-3-031-20980-2 (chapter DOI: 10.1007/978-3-031-20980-2_27)

Author Tags

1. Wavelet transform
2. Dysarthria
3. UA-Speech corpus
4. Morse wavelet
5. CNN
