Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

A subband excitation substitute based scheme for narrowband speech watermarking

  • Published:
Frontiers of Information Technology & Electronic Engineering Aims and scope Submit manuscript

Abstract

We propose a new narrowband speech watermarking scheme by replacing part of the speech with a scaled and spectrally shaped hidden signal. Theoretically, it is proved that if a small amount of host speech is modified, then not only an ideal channel model for hidden communication can be established, but also high imperceptibility and good intelligibility can be achieved. Furthermore, a practical system implementation is proposed. At the embedder, the power normalization criterion is first imposed on a passband watermark signal by forcing its power level to be the same as the original passband excitation of the cover speech, and a synthesis filter is then used to spectrally shape the scaled watermark signal. At the extractor, a bandpass filter is first used to get rid of the out-of-band signal, and an analysis filter is then employed to compensate for the distortion introduced by the synthesis filter. Experimental results show that the data rate is as high as 400 bits/s with better bandwidth efficiency, and good imperceptibility is achieved. Moreover, this method is robust against various attacks existing in real applications.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Cai, L.B., Tu, R.H., Zhao, J.Y., et al., 2007. Speech quality evaluation: a new application of digital watermarking. IEEE Trans. Instrum. Meas., 56(1):45–55. http://dx.doi.org/10.1109/TIM.2006.887773

    Article  Google Scholar 

  • Chen, S., Leung, H., 2006. Concurrent data transmission through PSTN by CDMA. IEEE Int. Symp. on Circuits and Systems, p.3001–3004. http://dx.doi.org/10.1109/ISCAS.2006.1693256

    Google Scholar 

  • Chen, S., Leung, H., Ding, H., 2007. Telephony speech enhancement by data hiding. IEEE Trans. Instrum. Meas., 56(1):63–74. http://dx.doi.org/10.1109/TIM.2006.887409

    Article  Google Scholar 

  • Chen, Z., Zhao, C., Geng, G., et al., 2013. An audio watermark-based speech bandwidth extension method.

  • EURASIP J. Audio Speech Music Process., 2013(1):1–8. http://dx.doi.org/10.1186/1687-4722-2013-10

  • Cheng, Q., Sorensen, J., 2001. Spread spectrum signaling for speech watermarking. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, p.1337–1340. http://dx.doi.org/10.1109/ICASSP.2001.941175

    Google Scholar 

  • Eslami, R., Deller, J.R.Jr, Radha, H., 2006. On the detection of multiplicative watermarks for speech signals in the wavelet and DCT domains. IEEE Int. Conf. on Multimedia and Expo, p.1369–1372. http://dx.doi.org/10.1109/ICME.2006.262793

    Google Scholar 

  • Fan, M.Q., Liu, P.P., Wang, H.X., et al., 2013. A semi-fragile watermarking scheme for authenticating audio signal based on dual-tree complex wavelet transform and discrete cosine transform. Int. J. Comput. Math., 90(12):2588–2602. http://dx.doi.org/10.1080/00207160.2013.805752

    Article  Google Scholar 

  • Faundez-Zanuy, M., Hagmü ller, M., Kubin, G., 2006. Speaker verification security improvement by means of speech watermarking. Speech Commun., 48(12):1608–1619. http://dx.doi.org/10.1016/j.specom.2006.06.010

    Article  Google Scholar 

  • Faundez-Zanuy, M., Hagmüller, M., Kubin, G., 2007. Speaker identification security improvement by means of speech watermarking. Patt. Recogn., 40(11):3027–3034. http://dx.doi.org/10.1016/j.patcog.2007.02.016

    Article  MATH  Google Scholar 

  • Faundez-Zanuy, M., Lucena-Molina, J.J., Hagmü ller, M., 2010. Speech watermarking: an approach for the forensic analysis of digital telephonic recordings. J. Forens. Sci., 55(4):1080–1087. http://dx.doi.org/10.1111/j.1556-4029.2010.01395.x

    Article  Google Scholar 

  • Malepati, H., 2010. Digital Media Processing: DSP Algorithms Using C. Elsevier, Burlington, USA, p.416–431. http://dx.doi.org/10.1016/B978-1-85617-678-1.00008-9

    Google Scholar 

  • Hofbauer, K., Hering, H., 2007. Noise robust speech watermarking with bit synchronisation for the aeronautical radio. LNCS, 4567:252–266. http://dx.doi.org/10.1007/978-3-540-77370-2_17

    Google Scholar 

  • Hofbauer, K., Kubin, G., 2006. High-rate data embedding in unvoiced speech. INTERSPEECH, p.241–244.

    Google Scholar 

  • Hofbauer, K., Hering, H., Kubin, G., 2005. Speech watermarking for the VHF radio channel. EUROCONTROL Innovative Research Workshop and Exhibition: Envisioning the Future, p.215–220.

    Google Scholar 

  • Hofbauer, K., Kubin, G., Kleijn, W.B., 2009. Speech watermarking for analog flat-fading bandpass channels. IEEE Trans. Audio Speech Lang. Process., 17(8):1624–1637. http://dx.doi.org/10.1109/TASL.2009.2021543

    Article  Google Scholar 

  • Nematollahi, M.A., Al-Haddad, S.A.R., 2013. An overview of digital speech watermarking. Int. J. Speech Technol., 16(4):471–488. http://dx.doi.org/10.1007/s10772-013-9192-6

    Article  Google Scholar 

  • Nematollahi, M.A., Gamboa-Rosales, H., Akhaee, M.A., et al., 2015a. Robust digital speech watermarking for online speaker recognition. Math. Probl. Eng., 2015:372398. http://dx.doi.org/10.1155/2015/372398

    Article  Google Scholar 

  • Nematollahi, M.A., Akhaee, M.A., Al-Haddad, S.A.R., et al., 2015b. Semi-fragile digital speech watermarking for online speaker recognition. EURASIP J. Audio Speech Music Process., 2015(1):1–15. http://dx.doi.org/10.1186/s13636-015-0074-5

    Article  Google Scholar 

  • Nematollahi, M.A., Vorakulpipat, C., Rosales, H.G., 2017. Digital Watermarking: Techniques and Trends. Springer, Singapore, p.39–51. http://dx.doi.org/10.1007/978-981-10-2095-7

    Google Scholar 

  • Park, C.M., Thapa, D., Wang, G.N., 2007. Speech authentication system using digital watermarking and pattern recovery. Patt. Recogn. Lett., 28(8):931–938. http://dx.doi.org/10.1016/j.patrec.2006.12.010

    Article  Google Scholar 

  • Sarreshtedari, S., Akhaee, M.A., Abbasfar, A., 2015. A watermarking method for digital speech self-recovery. IEEE/ACM Trans. Audio Speech Lang. Process., 23(11):1917–1925. http://dx.doi.org/10.1109/TASLP.2015.2456431

    Google Scholar 

  • Suzuki, J., Hingdi, B., Yashima, H., 1997. Transmission of data on analog speech channel by spread spectrum modulation. IEEE Pacific Rim Conf. on Communications, Computers and Signal Processing, p.697–700. http://dx.doi.org/10.1109/PACRIM.1997.620355

    Google Scholar 

  • Wang, S.B., Unoki, M., 2015. Speech watermarking method based on formant tuning. IEICE Trans. Inform. Syst., E98D(1):29–37. http://dx.doi.org/10.1587/TRANSINF.2014MUP0009

    Article  Google Scholar 

  • Yan, B., Guo, Y.J., 2013. Speech authentication by semi-fragile speech watermarking utilizing analysis by synthesis and spectral distortion optimization. Multim. Tools Appl., 67(2):383–405. http://dx.doi.org/10.1007/s11042-011-0861-7

    Article  MathSciNet  Google Scholar 

  • Zamani, M., Manaf, A.B.A., 2015. Genetic algorithm for fragile audio watermarking. Telecommun. Syst., 59(3):291–304. http://dx.doi.org/10.1007/s11235-014-9936-x

    Article  Google Scholar 

  • Zheng, W.X., 2005. Fast identification of autoregressive signals from noisy observations. IEEE Trans. Circ. Syst. II, 52(1):43–48. http://dx.doi.org/10.1109/TCSII.2004.838435

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ai-qun Hu.

Additional information

Project supported by the National Natural Science Foundation of China (No. 61571110)

ORCID: Wei LIU, http://orcid.org/0000-0002-7930-1943

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Liu, W., Hu, Aq. A subband excitation substitute based scheme for narrowband speech watermarking. Frontiers Inf Technol Electronic Eng 18, 627–643 (2017). https://doi.org/10.1631/FITEE.1601503

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1631/FITEE.1601503

Key words

CLC number