Pitch Estimation Based on the Cepstrum Analysis by the Multi Scale Product of Clean and Noisy Speech

  • Chapter
Recent Advances in Nonlinear Speech Processing

Part of the book series: Smart Innovation, Systems and Technologies ((SIST,volume 48))

Abstract

In this paper we propose a new method (CAMP) for estimating the pitch of a speech signal. It analyses the real cepstrum with the multiscale product (MP), using a continuous wavelet transform (WTC) whose wavelet has one vanishing moment. The approach consists of three steps: first, we frame the voiced signal; second, we calculate the real cepstrum of each frame; finally, we compute the MP of the cepstrum, defined as the product of the WTC at three scales. The method is evaluated on the Keele database under clean and noisy conditions. Experimental results indicate that its gross pitch errors (GPE) are lower than those of the compared methods under both clean and noisy conditions.
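
The three steps in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not specify the wavelet, the scales, the window, the search range, or the peak-picking rule. The sketch assumes a first derivative of a Gaussian as the wavelet with one vanishing moment, the scales 1, 2 and 4 samples, a Hamming window, a 60 to 400 Hz search range, and a rule that places the pitch period midway between the positive and negative lobes of the MP. All function names are hypothetical. The GPE helper assumes the common 20% deviation threshold, which the abstract does not state.

```python
import numpy as np


def frame_signal(signal, frame_len, hop):
    """Split a 1-D signal into overlapping frames (one frame per row)."""
    signal = np.asarray(signal, dtype=float)
    starts = range(0, len(signal) - frame_len + 1, hop)
    return np.array([signal[s:s + frame_len] for s in starts])


def real_cepstrum(frame):
    """Real cepstrum: inverse FFT of the log magnitude spectrum."""
    windowed = frame * np.hamming(len(frame))  # assumption: Hamming window
    log_mag = np.log(np.abs(np.fft.fft(windowed)) + 1e-12)  # floor avoids log(0)
    return np.fft.ifft(log_mag).real


def gaussian_derivative(scale, half_width=4.0):
    """Sampled first derivative of a Gaussian (assumed wavelet, one vanishing moment)."""
    n = int(np.ceil(half_width * scale))
    t = np.arange(-n, n + 1) / scale
    # Sign chosen so that a peak gives a positive lobe followed by a negative lobe.
    return -t * np.exp(-0.5 * t ** 2) / np.sqrt(scale)


def multiscale_product(x, scales=(1.0, 2.0, 4.0)):
    """Pointwise product of the wavelet coefficients of x at the given scales."""
    mp = np.ones(len(x))
    for s in scales:
        mp *= np.convolve(x, gaussian_derivative(s), mode="same")
    return mp


def estimate_pitch(frame, fs, fmin=60.0, fmax=400.0, scales=(1.0, 2.0, 4.0)):
    """Estimate f0 (Hz) of one voiced frame from the MP of its real cepstrum."""
    cep = real_cepstrum(frame)
    mp = multiscale_product(cep[: len(cep) // 2], scales)
    qmin = int(fs / fmax)                      # shortest period searched (samples)
    qmax = min(int(fs / fmin), len(mp) - 1)    # longest period searched (samples)
    seg = mp[qmin:qmax + 1]
    i_pos = int(np.argmax(seg))                # positive lobe, just before the peak
    reach = int(4 * max(scales)) + 1           # look a few samples ahead
    i_neg = i_pos + int(np.argmin(seg[i_pos:i_pos + reach]))  # negative lobe
    period = qmin + 0.5 * (i_pos + i_neg)      # assumed rule: midpoint of the lobes
    return fs / period


def gross_pitch_error(f0_est, f0_ref, tol=0.2):
    """Percentage of frames whose estimate deviates from the reference by more than tol."""
    f0_est = np.asarray(f0_est, dtype=float)
    f0_ref = np.asarray(f0_ref, dtype=float)
    return 100.0 * np.mean(np.abs(f0_est - f0_ref) > tol * f0_ref)
```

Because the product uses an odd number of scales, the sign pattern around a cepstral peak is preserved, while values that are not consistent across scales tend to shrink in the product. In practice each voiced frame from `frame_signal` would be passed to `estimate_pitch`, and the resulting track compared with reference pitch values using `gross_pitch_error`.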

Author information

Corresponding author

Correspondence to Wided Jlassi.

Copyright information

© 2016 Springer International Publishing Switzerland

About this chapter

Cite this chapter

Jlassi, W., Bouzid, A., Ellouze, N. (2016). Pitch Estimation Based on the Cepstrum Analysis by the Multi Scale Product of Clean and Noisy Speech. In: Esposito, A., et al. Recent Advances in Nonlinear Speech Processing. Smart Innovation, Systems and Technologies, vol 48. Springer, Cham. https://doi.org/10.1007/978-3-319-28109-4_22

  • DOI: https://doi.org/10.1007/978-3-319-28109-4_22

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-28107-0

  • Online ISBN: 978-3-319-28109-4

  • eBook Packages: Engineering, Engineering (R0)
