Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2948992.2949002acmotherconferencesArticle/Chapter ViewAbstractPublication PagesuccsConference Proceedingsconference-collections
research-article

Using Smartphones to Classify Urban Sounds

Published: 20 July 2016 Publication History
  • Get Citation Alerts
  • Abstract

    The aim of this work is to develop an application for Android able to classifying urban sounds in a real life context. It also enables the collection and classification of new sounds. To train our classifier we use the UrbanSound8K data set available online. We have used a hybrid approach to obtain features, by combining SAX-based multiresolution motif discovery with Mel-Frequency Cepstral Coefficients (MFCC). We also describe different configurations of motif discovery for defining attributes and compare the use of Random Forest and SVM algorithms on this kind of data.

    References

    [1]
    http://www.euro.who.int/en/health-topics/environment-and-health/noise/data-and-statistics, 2011.
    [2]
    http://publish.illinois.edu/audioanalytics/, 2015.
    [3]
    https://serv.cusp.nyu.edu/projects/urbansounddataset, 2015.
    [4]
    https://github.com/jameslyons/python_speech_features, 2015.
    [5]
    https://www.sqlite.org/, 2015.
    [6]
    http://simpligility.github.io/ksoap2-android/index.html, 2015.
    [7]
    F. Beritelli and R. Grasso. A pattern recognition system for environmental sound classification based on mfccs and neural networks. In Signal Processing and Communication Systems, 2008. ICSPCS 2008. 2nd International Conference on, pages 1--4, Dec 2008.
    [8]
    B. E. Boser, I. M. Guyon, and V. N. Vapnik. A training algorithm for optimal margin classifiers. In D. Haussler, editor, Proceedings of the 5th Annual Workshop on Computational Learning Theory (COLT'92), pages 144--152, Pittsburgh, PA, USA, July 1992. ACM Press.
    [9]
    L. Breiman. Random forests. Machine Learning, 45(1):5--32, 2001.
    [10]
    N. Castro. Multiresolution motif discovery in time series website, http://www.di.uminho.pt/ castro/mrmotif.
    [11]
    N. Castro and P. J. Azevedo. Multiresolution Motif Discovery in Time Series. In SDM, pages 665--676, 2010.
    [12]
    S. Chaudhuri and B. Raj. Unsupervised hierarchical structure induction for deeper semantic analysis of audio. In Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on, pages 833--837, May 2013.
    [13]
    S. Davis and P. Mermelstein. Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. Acoustics, Speech and Signal Processing, IEEE Transactions on, 28(4):357--366, Aug 1980.
    [14]
    D. Ellis, X. Zeng, and J. McDermott. Classifying soundtracks with audio texture features. In Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on, pages 5880--5883, May 2011.
    [15]
    P. G. Ferreira, P. J. Azevedo, C. G. Silva, and R. M. M. Brito. Mining approximate motifs in time series. In Discovery Science, pages 89--101, 2006.
    [16]
    J. Geiger, B. Schuller, and G. Rigoll. Large-scale audio feature extraction and svm for acoustic scene classification. In Applications of Signal Processing to Audio and Acoustics (WASPAA), 2013 IEEE Workshop on, pages 1--4, Oct 2013.
    [17]
    E. F. Gomes and F. Batista. Using multiresolution time series motifs to classify urban sounds. International Journal of Software Engineering and Its Applications, 9(8):189--196, 2015.
    [18]
    E. F. Gomes, A. M. Jorge, and P. J. Azevedo. Classifying heart sounds using multiresolution time series motifs: an exploratory study. In Proceedings of the International C* Conference on Computer Science and Software Engineering, pages 23--30. ACM, 2013.
    [19]
    E. F. Gomes, A. M. Jorge, and P. J. Azevedo. Classifying heart sounds using sax motifs, random forests and text mining techniques. In Proceedings of the 18th International Database Engineering & Applications Symposium, pages 334--337. ACM, 2014.
    [20]
    T. Heittola, A. Mesaros, T. Virtanen, and A. Eronen. Sound event detection in multisource environments using source separation. Proc CHiME, pages 36--40, 2011.
    [21]
    J. Lin, E. Keogh, S. Lonardi, and P. Patel. Finding motifs in time series. In Proceedings of the 2nd Workshop on Temporal Data Mining, pages 53--68, 2002.
    [22]
    S. Ntalampiras, I. Potamitis, and N. Fakotakis. Automatic recognition of urban soundscenes. In New Directions in Intelligent Interactive Multimedia, pages 147--153. Springer, 2008.
    [23]
    G. Roma, W. Nogueira, P. Herrera, and R. de Boronat. Recurrence quantification analysis features for auditory scene classification. IEEE AASP Challenge on Detection and Classification of Acoustic Scenes and Events, 2013.
    [24]
    J. Salamon, C. Jacoby, and J. P. Bello. A dataset and taxonomy for urban sound research. In 22st ACM International Conference on Multimedia (ACM-MM'14), Orlando, FL, USA, Nov. 2014.
    [25]
    B. Uzkent, B. D. Barkana, and H. Cevikalp. Non-speech environmental sound classification using svms with a new set of features. International Journal of Innovative Computing, Information and Control, 8(5B):3511--3524, 2012.
    [26]
    X. Valero and F. Alías. Hierarchical classification of environmental noise sources considering the acoustic signature of vehicle pass-bys. Archives of Acoustics, 37(4):423--434, 2012.
    [27]
    I. H. Witten and E. Frank. Data Mining: Practical Machine Learning Tools and Techniques. 2005.
    [28]
    D. Yankov, E. J. Keogh, J. Medina, B. Y. chi Chiu, and V. B. Zordan. Detecting time series motifs under uniform scaling. In P. Berkhin, R. Caruana, and X. Wu, editors, KDD, pages 844--853. ACM, 2007.

    Cited By

    View all
    • (2022)For Your Voice Only: Exploiting Side Channels in Voice Messaging for Environment DetectionComputer Security – ESORICS 202210.1007/978-3-031-17143-7_29(595-613)Online publication date: 24-Sep-2022
    • (2019)Recognition of Activities of Daily Living and Environments Using Acoustic Sensors Embedded on Mobile DevicesElectronics10.3390/electronics81214998:12(1499)Online publication date: 7-Dec-2019
    • (2019)Environmental Audio Scene and Sound Event Recognition for Autonomous SurveillanceACM Computing Surveys10.1145/332224052:3(1-34)Online publication date: 18-Jun-2019

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    C3S2E '16: Proceedings of the Ninth International C* Conference on Computer Science & Software Engineering
    July 2016
    152 pages
    ISBN:9781450340755
    DOI:10.1145/2948992
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    In-Cooperation

    • BytePress
    • ISEP

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 20 July 2016

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. MFCC
    2. Random Forest
    3. SAX
    4. Urban sound classification
    5. mobile app
    6. motif discovery
    7. time series analysis

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    C3S2E '16

    Acceptance Rates

    Overall Acceptance Rate 12 of 42 submissions, 29%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 26 Jul 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2022)For Your Voice Only: Exploiting Side Channels in Voice Messaging for Environment DetectionComputer Security – ESORICS 202210.1007/978-3-031-17143-7_29(595-613)Online publication date: 24-Sep-2022
    • (2019)Recognition of Activities of Daily Living and Environments Using Acoustic Sensors Embedded on Mobile DevicesElectronics10.3390/electronics81214998:12(1499)Online publication date: 7-Dec-2019
    • (2019)Environmental Audio Scene and Sound Event Recognition for Autonomous SurveillanceACM Computing Surveys10.1145/332224052:3(1-34)Online publication date: 18-Jun-2019

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media