Robust audio fingerprinting using peak-pair-based hash of non-repeating foreground audio in a real environment

Published: 01 March 2016

    Abstract

    In this paper, we propose a high-performance audio fingerprinting system for real-world query-by-example applications that perform acoustic audio-based content identification, intended especially for heterogeneous portable consumer devices and online audio distribution systems. In the proposed method, audio fingerprints are generated using non-repeating foreground audio extraction based on the modulated complex lapped transform (MCLT), together with an adaptive thresholding method for prominent peak detection. Matching uses a robust peak-pair-based hash function of the non-repeating foreground audio, which protects against noise, echo, and artifacts from pitch-shifting, time-stretching, resampling, equalization, or compression. Experimental results indicate that the proposed method is robust under a variety of distortion conditions and achieves promising preliminary accuracy.
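
The abstract names the pipeline stages but gives no parameters, so the following is only a rough sketch of the general peak-pair idea, not the authors' implementation. It assumes a precomputed magnitude spectrogram (a list of frames) in place of the MCLT, skips the non-repeating foreground extraction entirely, and uses a per-frame mean scaled by `k` as a stand-in for the adaptive threshold. The function names, the 10/10/6-bit hash layout, and the `fan_out`, `max_dt`, and `max_df` values are all hypothetical.

```python
from collections import Counter


def find_peaks(spec, k=1.5, radius=1):
    """Return (frame, bin) local maxima above k times the frame's mean magnitude.

    The scaled per-frame mean is an assumed stand-in for an adaptive threshold.
    """
    peaks = []
    for t, frame in enumerate(spec):
        threshold = k * sum(frame) / len(frame)
        for f, mag in enumerate(frame):
            if mag <= threshold:
                continue
            lo, hi = max(0, f - radius), min(len(frame), f + radius + 1)
            if mag == max(frame[lo:hi]):  # local maximum along frequency
                peaks.append((t, f))
    return peaks


def pair_hashes(peaks, max_dt=8, max_df=32, fan_out=3):
    """Pair each anchor peak with up to fan_out later peaks.

    Returns (hash, anchor_frame) tuples. The hash packs f1, f2, and the frame
    gap dt into one integer (assumed layout: 10 + 10 + 6 bits, so bins must be
    below 1024 and dt below 64).
    """
    peaks = sorted(peaks)
    hashes = []
    for i, (t1, f1) in enumerate(peaks):
        paired = 0
        for t2, f2 in peaks[i + 1:]:
            dt = t2 - t1
            if dt > max_dt:
                break  # peaks are sorted by frame, so later ones are farther away
            if dt == 0 or abs(f2 - f1) > max_df:
                continue
            hashes.append(((f1 << 16) | (f2 << 6) | dt, t1))
            paired += 1
            if paired == fan_out:
                break
    return hashes


def best_offset(query_hashes, ref_hashes):
    """Vote on the frame offset (reference minus query) shared by equal hashes."""
    index = {}
    for h, t in ref_hashes:
        index.setdefault(h, []).append(t)
    votes = Counter(
        t_ref - t_q for h, t_q in query_hashes for t_ref in index.get(h, [])
    )
    return votes.most_common(1)[0] if votes else None


# Toy usage: a flat spectrogram with a few planted peaks; the query is frames 5..15.
ref = [[1.0] * 16 for _ in range(20)]
for t, f in [(2, 3), (4, 10), (6, 5), (7, 12), (9, 2), (11, 8), (13, 14), (15, 6), (17, 9)]:
    ref[t][f] = 10.0
query = ref[5:16]
match = best_offset(pair_hashes(find_peaks(query)), pair_hashes(find_peaks(ref)))
print(match)  # (offset in frames, number of agreeing hashes)
```

Because each hash encodes only two frequency bins and a time gap, a query excerpt can be matched wherever it starts in the reference: agreeing hashes all vote for the same offset. A hash built this way would not by itself survive pitch-shifting or time-stretching, so the robustness claimed in the abstract presumably comes from parts of the method that this sketch does not model.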

        Information

        Published In

        Cluster Computing, Volume 19, Issue 1
        March 2016
        545 pages

        Publisher

        Kluwer Academic Publishers

        United States

        Author Tags

        1. Audio fingerprinting
        2. Modulated complex lapped transform
        3. Peak detection
        4. Robust hash function

        Qualifiers

        • Article
