Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

A Bag of Systems Representation for Music Auto-Tagging

Published: 01 December 2013 Publication History

Abstract

We present a content-based automatic tagging system for music that relies on a high-level, concise “Bag of Systems” (BoS) representation of the characteristics of a musical piece. The BoS representation leverages a rich dictionary of musical codewords, where each codeword is a generative model that captures timbral and temporal characteristics of music. Songs are represented as a BoS histogram over codewords, which allows for the use of traditional algorithms for text document retrieval to perform auto-tagging. Compared to estimating a single generative model to directly capture the musical characteristics of songs associated with a tag, the BoS approach offers the flexibility to combine different generative models at various time resolutions through the selection of the BoS codewords. Additionally, decoupling the modeling of audio characteristics from the modeling of tag-specific patterns makes BoS a more robust and rich representation of music. Experiments show that this leads to superior auto-tagging performance.

Cited By

View all
  • (2020)Music auto-tagging using scattering transform and convolutional neural network with self-attentionApplied Soft Computing10.1016/j.asoc.2020.10670296:COnline publication date: 1-Nov-2020
  • (2019)Music auto-tagging based on the unified latent semantic modelingMultimedia Tools and Applications10.1007/s11042-018-5632-278:1(161-176)Online publication date: 1-Jan-2019
  • (2018)Detection and Classification of Acoustic Scenes and EventsIEEE/ACM Transactions on Audio, Speech and Language Processing10.1109/TASLP.2017.277842326:2(379-393)Online publication date: 1-Feb-2018
  • Show More Cited By
  1. A Bag of Systems Representation for Music Auto-Tagging

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image IEEE Transactions on Audio, Speech, and Language Processing
    IEEE Transactions on Audio, Speech, and Language Processing  Volume 21, Issue 12
    December 2013
    170 pages

    Publisher

    IEEE Press

    Publication History

    Published: 01 December 2013

    Qualifiers

    • Research-article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 13 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2020)Music auto-tagging using scattering transform and convolutional neural network with self-attentionApplied Soft Computing10.1016/j.asoc.2020.10670296:COnline publication date: 1-Nov-2020
    • (2019)Music auto-tagging based on the unified latent semantic modelingMultimedia Tools and Applications10.1007/s11042-018-5632-278:1(161-176)Online publication date: 1-Jan-2019
    • (2018)Detection and Classification of Acoustic Scenes and EventsIEEE/ACM Transactions on Audio, Speech and Language Processing10.1109/TASLP.2017.277842326:2(379-393)Online publication date: 1-Feb-2018
    • (2017)Exploring User-Specific Information in Music RetrievalProceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3077136.3080772(655-664)Online publication date: 7-Aug-2017
    • (2015)A Scalable and Accurate Descriptor for Dynamic Textures Using Bag of System TreesIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2014.235943237:4(697-712)Online publication date: 1-Apr-2015
    • (2015)Bag of Class Posteriors, a new multivariate time series classifier applied to animal behaviour identificationExpert Systems with Applications: An International Journal10.1016/j.eswa.2014.11.03342:7(3774-3784)Online publication date: 1-May-2015
    • (2014)Codebook-based audio feature representation for music information retrievalIEEE/ACM Transactions on Audio, Speech and Language Processing10.1109/TASLP.2014.233784222:10(1483-1493)Online publication date: 1-Oct-2014

    View Options

    View options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media