Abstract
This paper presents a tunable content-based music retrieval (CBMR) system suitable the for retrieval of music audio clips. The audio clips are represented as extracted feature vectors. The CBMR system is expert-tunable by altering the feature space. The feature space is tuned according to the expert-specified similarity criteria expressed in terms of clusters of similar audio clips. The main goal of tuning the feature space is to improve retrieval performance, since some features may have more impact on perceived similarity than others. The tuning process utilizes our genetic algorithm. The R-tree index for efficient retrieval of audio clips is based on the clustering of feature vectors. For each cluster a minimal bounding rectangle (MBR) is formed, thus providing objects for indexing. Inserting new nodes into the R-tree is efficiently performed because of the chosen Quadratic Split algorithm. Our CBMR system implements the point query and the n-nearest neighbors query with the O(logn) time complexity. Different objective functions based on cluster similarity and dissimilarity measures are used for the genetic algorithm. We have found that all of them have similar impact on the retrieval performance in terms of precision and recall. The paper includes experimental results in measuring retrieval performance, reporting significant improvement over the untuned feature space.
Similar content being viewed by others
References
Baeza-Yates R, Ribeiro-Neto B (1999) Modern information retrieval. ACM/Addison-Wesley, New York
Brochu E, de Freitas N (2002) ‘name that song!’: a probabilistic approach to querying on music and text. In: Neural information processing systems: natural and synthetic
Casey MA, Slaney M (2006) Song intersection by approximate nearest neighbor search. In: ISMIR 2006, 7th international conference on music information retrieval, Proceedings, Victoria, 8–12 October 2006, pp 144–149
Cheung KL, Fu AW-C (1998) Enhanced nearest neighbour search on the r-tree. SIGMOD Record 27(3):16–21
Foote JT (1997) Content-based retrieval of music and audio. In: Proc. SPIE multimedia storage archiving systems II, vol 3229, pp 138–147
Ghias A, Logan J, Chamberlin D, Smith B (1995) Query by humming: musical information retrieval in an audio database. In: ACM international multimedia conference
Guo G, Li SZ (2003) Content-based audio classification and retrieval by support vector machines. IEEE Trans Neural Netw 14(1):209–215
Guttman A (1984) R-trees: a dynamic index structure for spatial searching. In: Proc. ACM SIGMOD intl. conf. on management of data, pp 47–57
Habich D, Lehner W, Hinneburg A, Kitzmantel P, Kimpl M (2005) Eyes4ears—more than a classical music retrieval system. In: Proceedings of 5th open workshop of musicnetwork—integration of music in multimedia applications
Haitsma J, Kalker T (2002) A highly robust audio fingerprinting system. In: 3rd int. symposium on music information retrieval (ISMIR)
Hoos H, Renz K, Gorg M (2001) GUIDO/MIR—an experimental musical information retrieval system based on guido music notation. In: Int. symposium on music information retrieval (ISMIR), p 4150
Hu N, Dannenberg R (2002) A comparison of melodic database retrieval techniques using sung queries. In: ACM joint conference on digital libraries
Jain AK, Murty MN, Flynn PJ (1999) Data clustering: a review. ACM Comput Surv 31(3):264–323
Jang J, Lee H, Yeh C (2001) Query by tapping: a new paradigm for content-based music retrieval from acoustic input. In IEEE pacific-rim conference on multimedia
Karydis I, Nanopoulos A, Papadopoulos AN, Manolopoulos Y (2005) Audio indexing for efficient music information retrieval. In: Proceedings of the 11th ieee international multimedia modelling conference (MMM’05)
Kovacevic A, Milosavljevic B, Konjovic Z (2006) Tuning the feature space for content-based music retrieval. In: Proceedings of the third starting AI reserchers’ symposium (STAIRS’06), pp 62–72
Kuo F-F, Shan M-K (2004) Looking for new, not known music only: music retrieval by melody style. In: Fourth ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL’04), Tuscon, 7–11 June 2004
Kurth F, Ribbrock A, Clausen M (2002) Identification of highly distorted audio material for querying large scale data bases. In: 112th convention of the audio engineering society
Lampropoulos AS, Lampropoulou PS, Tsihrintzis GA (2005) A middleware system for web-based digital music libraries. In: The 2005 IEEE/WIC/ACM intl. conf. on web intelligence, vol 0, pp 136–142
Li T, Ogihara M, Zhu S (2006) Integrating features from different sources for music information retrieval. In: Sixth intl. conf. on data mining, pp 372–381
Liu Z, Huang J, Wang Y, Chen T (1997) Audio feature extraction and analysis for scene classification. In: IEEE signal processing soc. workshop multimedia signal processing
Lo Y-L, Chen S-J (2002) The numeric indexing for music data. In: 22nd international conference on distributed computing systems workshops, p 258
Milosavljević B (2004) Models for extensible multimedia document retrieval. In: IEEE multimedia software engineering (MSE2004), pp 218–221
Milosavljević B, Konjović Z (2002) Design of an XML-based extensible multimedia information retrieval system. In: IEEE multimedia software engineering (MSE2002), pp 114–121
Ortega M, Rui Y, Chakrabarti K, Porkaew K, Mehrotra S, Huang TS (1998) Supporting ranked boolean similarity queries in MARS. IEEE Trans Knowl Data Eng 10(6):905–925
Pickens J (2001) A survey of feature selection techniques for music information retrieval. Technical Report, Center for Intelligent Information Retrieval, Departament of Computer Science, University of Massachussetts
Pickens J, Crawford T (2002) Harmonic models for polyphonic music retrieval. In: ACM international conference on information and knowledge management
Rho S, Hwang E, Kim M (2007) Music information retrieval using a GA-based relevance feedback. In: 2007 intl. conf. on multimedia and ubiquitous engineering, pp 739–744
Rui Y, Huang TS (1999) A novel relevance feedback technique in image retrieval. In: Seventh ACM international conference on multimedia (part 2), pp 67–70
Shalev-Shwartz S, Dubnov S, Friedman N, Singer Y (2002) Robust temporal and spectral modeling for query by melody. In: ACM SIGIR conference on research and development in information retrieval
Tokui N, Iba H (2000) Music composition with interactive evolutionary computation. In: Third international conference on generative art, Milan
Typke R, Giannopoulos P, Veltkamp R, Wiering F, van Oostrum R (2003) Using transportation distances for measuring melodic similarity. In: Int. symposium on music information retrieval (ISMIR), pp 107–114
Typke R, Wiering F, Veltkamp RC (2004) A survey of music information retrieval systems. http://mirsystems.info
Unehara M, Onisawa T (2003) Construction of music composition system with interactive genetic algorithm. J Asian Des Intern Conf
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Kovačević, A., Milosavljević, B., Konjović, Z. et al. Adaptive content-based music retrieval system. Multimed Tools Appl 47, 525–544 (2010). https://doi.org/10.1007/s11042-009-0336-2
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-009-0336-2