article

Robust audio fingerprinting using peak-pair-based hash of non-repeating foreground audio in a real environment

Authors:

Hyoung-Gook Kim,

Jin Young KimAuthors Info & Claims

Cluster Computing, Volume 19, Issue 1

Pages 315 - 323

https://doi.org/10.1007/s10586-015-0523-z

Published: 01 March 2016 Publication History

Abstract

In this paper, we propose a high-performance audio fingerprinting system used in real-world query-by-example applications for acoustic audio-based content identification, especially for use in heterogeneous portable consumer devices or on-line audio distributed system. In the proposed method, audio fingerprints are generated using a modulated complex lapped transform-based non-repeating foreground audio extraction and an adaptive thresholding method for prominent peak detection. Effective matching is performed using a robust peak-pair-based hash function of non-repeating foreground audio to protect against noise, echo, artifacts from pitch-shifting, time-stretching, resampling, equalization, or compression. Experimental results confirm that the proposed method is quite robust in various distorted conditions and achieves preliminarily promising accuracy results.

References

[1]

Cano, P., Batlle, E., Kalker, T., Haitsma, J.: A review of algorithms for audio fingerprinting. In: International Workshop on Multimedia Signal Processing, pp. 169---173 (2002)

[2]

Li, W., Xiao, C., Liu, Y.: Low-order auditory Zernike moment: a novel approach for robust music identification in the compressed domain. EURASIP J. Adv. Sig. Process. 1, 1---15 (2013)

[3]

Sinitsyn, A.: Duplicate song detection using audio fingerprinting for consumer electronics devices. In: IEEE International Symposium on Consumer Electronics (ISCE06), St. Petersburg, Russia, pp. 1---6 (2006)

[4]

Cerquides, J.: A real time audio fingerprinting system for advertisement tracking and reporting in FM radio. In: 17th International Conference on Radioelektronika, Brno, Czech, pp. 1---4 (2007)

[5]

Haitsma, J., Kalker, T.: A highly robust audio fingerprinting system. In: 3rd International Society for Music Information Retrieval Conference (ISMIR), Paris, France, pp. 107---115 (2002)

[6]

Liu, Y., Yun, H.-S., Kim, N.S.: Audio fingerprinting based on multiple hashing in DCT domain. IEEE Sig. Process. Lett. 6(6), 525---528 (2009)

[7]

Chandrasekhar, V., Sharifi, M., Ross, D.A.: Survey and evaluation of audio fingerprinting schemes for mobile query-by-example applications. In: 12th International Society for Music Information Retrieval Conference (ISMIR), Miami, USA, pp. 801---806 (2011)

[8]

Pan, X., Yu, X., Deng, J., Yang, W., Wang, H.: Audio fingerprinting based on local energy centroid. In: IET International Communication Conference on Wireless Mobile and Computing (CCWMC), Shanghai, China, pp. 351---354 (2011)

[9]

Baluja, S., Covel, M.: Audio fingerprinting: combining computer vision and data-stream processing. In: International Conference on Acoustic, Speech, and Signal Processing (ICASSP), Honolulu, Hawaii, pp. 2:213---2:216 (2007)

[10]

Anguera, X., Garzon, A., Adamek, T.: MASK: robust local feature for audio fingerprinting. In: International Conference on Multimedia and Expo (ICME), pp. 455---460 (2012)

Digital Library

[11]

Wang, A.: An industrial strength audio search algorithm. In: 4th International Society for Music Information Retrieval Conference (ISMIR), Baltimore, pp. 7---13 (2003)

[12]

Kim, H.-G., Kim, J.Y.: Robust audio fingerprinting method using prominent peak pair based on modulated complex lapped transform. ETRI J. 36(6), 999---1007 (2014)

[13]

Fenet, S., Richard, G., Grenier, Y.: A scalable audio fingerprint method with robustness to pitch-shifting. In: 12th International Society for Music Information Retrieval Conference, Taipei, Taiwan, pp. 121---126 (2011)

[14]

Malvar, H.: Fast algorithm for the modulated complex lapped transform. IEEE Sig. Process. Lett. 10(1), 8---10 (2003)

[15]

Rafii, Z., Pardo, B.: Repeating pattern extraction technique (REPET): a simple method for music/voice separation. EEE Trans. Audio Speech Lang. Process. 21(1), 73---84 (2013)

Digital Library

[16]

Liutkus, A., Rafii, Z., Badeau, R., Pardo, B., Richard, G.: Adaptive filtering for music/voice separation exploiting the repeating musical structure. In: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Kyoto, Japan, pp. 53---56 (2012)

Cited By

Chen THuang YPu XYan SZhang Q(2022)Encrypted speech Biohashing authentication algorithm based on 4D hyperchaotic Bao system and feature fusionMultimedia Tools and Applications10.1007/s11042-022-13933-682:11(16767-16792)Online publication date: 8-Oct-2022
https://dl.acm.org/doi/10.1007/s11042-022-13933-6
Huang YYuan-Zhang Chen TYan SZhang Q(2022)Speech BioHashing security authentication algorithm based on CNN hyperchaotic mapMultimedia Tools and Applications10.1007/s11042-022-12985-y81:26(37953-37979)Online publication date: 1-Nov-2022
https://dl.acm.org/doi/10.1007/s11042-022-12985-y
Huang YHou HChen TLi HZhang Q(2022)Long sequence biometric hashing authentication based on 2D-SIMM and CQCC cosine valuesMultimedia Tools and Applications10.1007/s11042-021-11708-z81:2(2873-2899)Online publication date: 1-Jan-2022
https://dl.acm.org/doi/10.1007/s11042-021-11708-z
Show More Cited By

Index Terms

Robust audio fingerprinting using peak-pair-based hash of non-repeating foreground audio in a real environment
1. Applied computing
  1. Arts and humanities
    1. Sound and music computing
2. Hardware
  1. Communication hardware, interfaces and storage
    1. Signal processing systems

Index terms have been assigned to the content through auto-classification.

Recommendations

Robust quad-based audio fingerprinting

We propose an audio fingerprinting method that adapts findings from the field of blind astrometry to define simple, efficiently representable characteristic feature combinations called quads. Based on these, an audio identification algorithm is ...
A novel audio fingerprinting method robust to time scale modification and pitch shifting
MM '10: Proceedings of the 18th ACM international conference on Multimedia

A novel audio fingerprinting method that is highly robust to Time Scale Modification (TSM) and pitch shifting is proposed. Instead of simply employing spectral or tempo-related features, our system is based on computer-vision techniques. We transform ...
Frequency Filtering for a Highly Robust Audio Fingerprinting Scheme in a Real-Noise Environment

The noise robustness of an audio fingerprinting system is one of the most important issues in music information retrieval by the content-based audio identification technique. In a real environment, sound recordings are commonly distorted by channel and ...

Comments

Information & Contributors

Information

Published In

cover image Cluster Computing

Cluster Computing Volume 19, Issue 1

March 2016

545 pages

ISSN:1386-7857

Issue’s Table of Contents

Copyright © Copyright © 2016 Springer Science+Business Media New York.

Publisher

Kluwer Academic Publishers

United States

Publication History

Published: 01 March 2016

Author Tags

Qualifiers

Article

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

7
Total Citations
View Citations
0
Total Downloads

Downloads (Last 12 months)0
Downloads (Last 6 weeks)0

Reflects downloads up to 11 Aug 2024

Other Metrics

View Author Metrics

Citations

Cited By

Chen THuang YPu XYan SZhang Q(2022)Encrypted speech Biohashing authentication algorithm based on 4D hyperchaotic Bao system and feature fusionMultimedia Tools and Applications10.1007/s11042-022-13933-682:11(16767-16792)Online publication date: 8-Oct-2022
https://dl.acm.org/doi/10.1007/s11042-022-13933-6
Huang YYuan-Zhang Chen TYan SZhang Q(2022)Speech BioHashing security authentication algorithm based on CNN hyperchaotic mapMultimedia Tools and Applications10.1007/s11042-022-12985-y81:26(37953-37979)Online publication date: 1-Nov-2022
https://dl.acm.org/doi/10.1007/s11042-022-12985-y
Huang YHou HChen TLi HZhang Q(2022)Long sequence biometric hashing authentication based on 2D-SIMM and CQCC cosine valuesMultimedia Tools and Applications10.1007/s11042-021-11708-z81:2(2873-2899)Online publication date: 1-Jan-2022
https://dl.acm.org/doi/10.1007/s11042-021-11708-z
Tang ZZhang SChen ZZhang X(2021)Robust Video Hashing Based on Multidimensional Scaling and Ordinal MeasuresSecurity and Communication Networks10.1155/2021/99306732021Online publication date: 1-Jan-2021
https://dl.acm.org/doi/10.1155/2021/9930673
Huang YWang YZhang QZhang WFan M(2020)Multi-format speech BioHashing based on spectrogramMultimedia Tools and Applications10.1007/s11042-020-09211-y79:33-34(24889-24909)Online publication date: 1-Sep-2020
https://dl.acm.org/doi/10.1007/s11042-020-09211-y
McCallum M(2018)Foreground Harmonic Noise Reduction for Robust Audio Fingerprinting2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)10.1109/ICASSP.2018.8462636(3146-3150)Online publication date: 15-Apr-2018
https://dl.acm.org/doi/10.1109/ICASSP.2018.8462636
Zhang QQiao SHuang YZhang T(2018)A high-performance speech perceptual hashing authentication algorithm based on discrete wavelet transform and measurement matrixMultimedia Tools and Applications10.1007/s11042-018-5613-577:16(21653-21669)Online publication date: 1-Aug-2018
https://dl.acm.org/doi/10.1007/s11042-018-5613-5

View Options

View options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Media

Figures

Other

Tables

View Issue’s Table of Contents