Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2348283.2348346acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
research-article

Modeling concept dynamics for large scale music search

Published: 12 August 2012 Publication History

Abstract

Continuing advances in data storage and communication technologies have led to an explosive growth in digital music collections. To cope with their increasing scale, we need effective Music Information Retrieval (MIR) capabilities like tagging, concept search and clustering. Integral to MIR is a framework for modelling music documents and generating discriminative signatures for them. In this paper, we introduce a multimodal, layered learning framework called DMCM. Distinguished from the existing approaches that encode music as an ensemble of order-less feature vectors, our framework extracts from each music document a variety of acoustic features, and translates them into low-level encodings over the temporal dimension. From them, DMCM elucidates the concept dynamics in the music document, representing them with a novel music signature scheme called Stochastic Music Concept Histogram (SMCH) that captures the probability distribution over all the concepts. Experiment results with two large music collections confirm the advantages of the proposed framework over existing methods on various MIR tasks.

References

[1]
Cal500 data set annotation, 2007. http://cosmal.ucsd.edu/cal/pubs/annotations.txt.
[2]
Nielsen company & billboard's 2011 music industry report. Business Wire, 5 January 2012.
[3]
F. Bach and M. I. Jordan. A Probabilistic Interpretation of Canonical Correlation Analysis. Technical Report 688, Department of Statistics, University of California, Berkeley, 2005.
[4]
F. R. Bach and M. I. Jordan. Kernel independent component analysis. Journal of Machine Learning Research, 3:1--48, 2002.
[5]
T. Bertin-Mahieux, D. Eck, F. Maillet, and P. Lamere. Autotagger: A model for predicting social tags from acoustic features on large music databases. Journal of New Music Research, 37(2), 2008.
[6]
S. Bhattacharjee, R. D. Gopal, K. Lertwachara, and J. R. Marsden. Consumer search and retailer strategies in the presence of online music sharing. J. of Management Information Systems, 23(1), 2006.
[7]
S. Bhattacharjee, R. D. Gopal, K. Lertwachara, J. R. Marsden, and R. Telang. The effect of digital sharing technologies on music markets: A survival analysis of albums on ranking charts. Management Science, 53(9), 2007.
[8]
E. Coviello, A. B. Chan, and G. Lanckriet. Time series models for semantic music annotation. IEEE Trans. on Audio, Speech & Language Processing, 19(5), 2011.
[9]
L. Daudet. Transients modeling by pruned wavelet trees. In Proc. of International Computer Music Conference, 2001.
[10]
G. Doretto, A. Chiuso, Y. N. Wu, and S. Soatto. Dynamic textures. International Journal of Computer Vision, 51(2), 2003.
[11]
R. Duda, P. Hart, and D. Stork. Pattern Classification. John Wiley and Sons, 2001.
[12]
D. Eck, P. Lamere, T. Bertin-Mahieux, and S. Green. Automatic generation of social tags for music recommendation. In Proc. of NIPS, 2007.
[13]
H. Green. Kissing off the big music labels. Businessweek, 2004.
[14]
A. Haghighi, P. Liang, T. Berg-Kirkpatrick, and D. Klein. Learning bilingual lexicons from monolingual corpora. In Proc. of ACL, 2008.
[15]
D. R. Hardoon, S. Szedmak, and J. Shawe-Taylor. Canonical Correlation Analysis; An Overview with Application to Learning Methods. Technical Report CSD-TR-03-02, Computer Science Dept. Royal Holloway, University of London, 2003.
[16]
T. Li, M. Ogihara, and Q. Li. A comparative study on content-based music genre classification. In Proc. of ACM SIGIR, 2003.
[17]
W. Li, Y. Liu, and X. Xue. Robust audio identification for mp3 popular music. In Proc. of ACM SIGIR, 2010.
[18]
B. Logan. Mel frequency cepstral coefficients for music modeling. In Proc. of ISMIR, 2000.
[19]
L. Lu, S. H. Li, and J. Zhang. Content-based audio segmentation using support vector machines. In Proc. of IEEE ICME, 2001.
[20]
L. Lu, D. Liu, and H. Zhang. Automatic mood detection and tracking of music audio signals. IEEE Trans. Acoust., Speech, Signal, 2006.
[21]
S. Mallat. A Wavelet Tour of Signal Processing. Acadamic PressAcademic Press, 3rd edition, 2008.
[22]
C. D. Manning, P. Raghavan, and H. Schütze. Introduction to Information Retrieval. Cambridge University Press, 2008.
[23]
R. Miotto and N. Orio. A probabilistic model to combine tags and acoustic similarity for music retrieval. ACM Trans. Inf. Syst., 30(2), May 2012.
[24]
U. Nam and J. Berger. Addressing the same but different-different but similar problem in automatic music classification. In Proc. of ISMIR, 2001.
[25]
C. Sanden and J. Zhang. Enhancing multi-label music genre classification through ensemble techniques. In Proc. of ACM SIGIR, 2011.
[26]
B. Scholkopf and A. J. Smola. Learning with Kernels. MIT Press, 2002.
[27]
A. Sheh and D. Ellis. Chord segmentation and recognition using em-trained hidden markov models. In Proc. of ISMIR, 2003.
[28]
J. Shen, B. Cui, J. Shepherd, and K. Tan. Towards efficient automated singer identification in large music databases. In Proc. of ACM SIGIR, 2006.
[29]
J. Shen, W. Meng, S. Yan, H. Pang, and X. Hua. Effective music tagging through advanced statistical modeling. In Proc. of ACM SIGIR, 2010.
[30]
J. Shen, J. Shepherd, and A. H. H. Ngu. Towards effective content-based music retrieval with multiple acoustic feature combination. IEEE Trans. on Multimedia, 8(6), 2006.
[31]
Y. Song and C. Zhang. Content-based information fusion for semi-supervised music genre classification. IEEE Trans. on Multimedia, 10(1), 2008.
[32]
D. Turnbull, L. Barrington, and G. Lanckriet. Modeling music and words using a multi-class naíve bayes approach. In Proc. of ISMIR, 2006.
[33]
D. Turnbull, L. Barrington, G. R. G. Lanckriet, and M. Yazdani. Combining audio content and social context for semantic music discovery. In Proc. of ACM SIGIR, 2009.
[34]
D. Turnbull, L. Barrington, D. Torres, and G. Lanckriet. Towards musical query-by-semantic-description using thetextscCAL500 data set. In Proc. of ACM SIGIR, 2007.
[35]
D. Turnbull, L. Barrington, D. Torres, and G. Lanckriet. Semantic annotation and retrieval of music and sound effects. IEEE Trans. on Audio, Speech & Language Processing, 16(2), 2008.
[36]
G. Tzanetakis and P. Cook. Musical genre classification of audio signals. IEEE Trans. on Speech and Audio Processing, 2002.
[37]
G. Tzanetakis, A. Ermolinskyi, and P. Cook. Pitch histograms in audio and symbolic music information retrieval. Journal of New Music Research, 2003.
[38]
L. G. Valiant. The complexity of computing the permanent. Theoretical Computer Science, 8, 1995.
[39]
B. Zhang, J. Shen, Q. Xiang, and Y. Wang. Compositemap: a novel framework for music similarity measure. In Proc. of ACM SIGIR, 2009.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGIR '12: Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
August 2012
1236 pages
ISBN:9781450314725
DOI:10.1145/2348283
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 12 August 2012

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. music concepts
  2. music information retrieval
  3. similarity measure

Qualifiers

  • Research-article

Conference

SIGIR '12
Sponsor:

Acceptance Rates

Overall Acceptance Rate 792 of 3,983 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)1
  • Downloads (Last 6 weeks)0
Reflects downloads up to 31 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2022)Personalized Song Recommendation System Based on Vocal CharacteristicsMathematical Problems in Engineering10.1155/2022/36057282022(1-10)Online publication date: 16-Mar-2022
  • (2021)MIMVOGUE: modeling Indian music using a variable order gapped HMMMultimedia Tools and Applications10.1007/s11042-020-10303-yOnline publication date: 30-Jan-2021
  • (2019)SATINMultimedia Tools and Applications10.1007/s11042-018-5797-878:3(2703-2718)Online publication date: 1-Feb-2019
  • (2017)Exploiting music play sequence for music recommendationProceedings of the 26th International Joint Conference on Artificial Intelligence10.5555/3172077.3172400(3654-3660)Online publication date: 19-Aug-2017
  • (2017)Exploring User-Specific Information in Music RetrievalProceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3077136.3080772(655-664)Online publication date: 7-Aug-2017
  • (2017)A Personalized Recommendation Method Considering Local and Global InfluencesIntelligent Information and Database Systems10.1007/978-3-319-54472-4_62(663-672)Online publication date: 26-Feb-2017
  • (2016)On Effective Personalized Music Retrieval by Exploring Online User BehaviorsProceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval10.1145/2911451.2911491(125-134)Online publication date: 7-Jul-2016
  • (2016)Importance of audio feature reduction in automatic music genre classificationMultimedia Tools and Applications10.1007/s11042-014-2418-z75:6(3013-3026)Online publication date: 1-Mar-2016
  • (2015)VenueMusicProceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/2766462.2767869(1029-1030)Online publication date: 9-Aug-2015
  • (2015)A Deep Neural Network for Modeling MusicProceedings of the 5th ACM on International Conference on Multimedia Retrieval10.1145/2671188.2749367(379-386)Online publication date: 22-Jun-2015
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media