Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2733373.2806363acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
short-paper

"Clustering of Dancelets": Towards Video Recommendation Based on Dance Styles

Published: 13 October 2015 Publication History
  • Get Citation Alerts
  • Abstract

    Dance is a special and important type of action, composed of abundant and various action elements. However, the recommendation of dance videos on the web are still not well studied. It is hard to realize it in the way of traditional methods using associated texts or static features of video content. In this paper, we study the problem focusing on extraction and representation of action information in dances. We propose to recommend dance videos based on the automatically discovered ``Dance Styles'', which play a significant role in characterizing different types of dances. To bridge the semantic gap of video content and mid-level concept, style, we take advantage of a mid-level action representation method, and extract representative patches as ``Dancelets'', a sort of intermediation between videos and the concepts. Furthermore, we propose to employ Motion Boundaries as saliency priors and sparsely extract patches containing more representative information to generate a set of dancelet candidates. Dancelets are then discovered by Normalized-cut method, which is superior in grouping visually similar patterns into the same clusters. For the fast and effective recommendation, a random forest-based index is built, and the ranking results are derived according to the matching results in all the leaf notes. Extensive experiments validated on the web dance videos demonstrate the effectiveness of the proposed methods for dance style discovery and video recommendation based on styles.

    References

    [1]
    H. B and S. B. Determining optical flow. Artificial Intelligence, 17:185--203, 1981.
    [2]
    P. Cui, Z. Wang, and Z. Su. What videos are similar with you? learning a common attributed representation for video recommendation. In MM, 2014.
    [3]
    N. Dalal, B. Triggs, and C. Schmid. Human detection using oriented histograms of flowand appearance. In ECCV, 2006.
    [4]
    T. Han, H. Yao, Y. Zhang, and P. Xu. A spatial-temporal constraint-based action recognition method. In ICIP, pages 2767--2771, 2013.
    [5]
    I. Laptev, M. Marszalek, C. Schmid, and B. Rozenfeld. Learning realistic human actions from movies. In CVPR, pages 1--8, 2008.
    [6]
    T. Mei, B. Yang, X.-S. Huan, and S. Li. Contextual video recommendation by multimodal relevance and user feedback. ACM Transactions on Information Systems, 29:1--24, 2011.
    [7]
    X. Peng, Y. Qiao, and Q. Peng. Motion boundary based sampling and 3d co-occurrence descriptors for action recognition. Image and Vision Computing, 32:616--628, 2014.
    [8]
    S. Sadanand and J. Corso. Action bank: A high-level representation of activity in video. In CVPR, pages 1234--1241, 2012.
    [9]
    X. Sun, R. Ji, H. Yao, P. Xu, T. Liu, and X. Liu. Place retrieval with graph-based place-view model. In MIR, pages 268--275, 2012.
    [10]
    J. Uijlings, I. C. Duta, E. Sangineto, and N. Sebe. Video classification with densely extracted hog/hof/mbh features: an evaluation of the accuracy/computational efficiency trade-off. International Journal of Multimedia Information Retrieval, 4:33--44, 2015.
    [11]
    Wang, Heng, A. Klaser, C. Schmid, and C.-L. Liu. Dense trajectories and motion boundary descriptors for action recognition. International Journal of Computer Vision, 103(1):60--79, 2013.
    [12]
    G. Yu, J. Yuan, and Z. Liu. Unsupervised random forest indexing for fast action search. In CVPR, pages 865--872, 2011.
    [13]
    S. X. Yu and J. Shi. Multiclass spectral clustering. In ICCV, pages 313--319, 2003.
    [14]
    X. Zhao, J. Yuan, M. Wang, G. Li, R. Hong, Z. Li, and T.-S. Chua. Video recommendation over multiple information sources. Multimedia Systems, 19(1):3--15, 2013.

    Cited By

    View all
    • (2021)Design of embedded dance teaching control system based on FPGA and motion recognition processingMicroprocessors and Microsystems10.1016/j.micpro.2021.10399083(103990)Online publication date: Jun-2021
    • (2017)Dancelets Mining for Video Recommendation Based on Dance StylesIEEE Transactions on Multimedia10.1109/TMM.2016.263188119:4(712-724)Online publication date: 1-Apr-2017

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    MM '15: Proceedings of the 23rd ACM international conference on Multimedia
    October 2015
    1402 pages
    ISBN:9781450334594
    DOI:10.1145/2733373
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 13 October 2015

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. dance style
    2. dancelets mining
    3. normalized-cut
    4. random forest-based index
    5. video recommendation

    Qualifiers

    • Short-paper

    Funding Sources

    Conference

    MM '15
    Sponsor:
    MM '15: ACM Multimedia Conference
    October 26 - 30, 2015
    Brisbane, Australia

    Acceptance Rates

    MM '15 Paper Acceptance Rate 56 of 252 submissions, 22%;
    Overall Acceptance Rate 995 of 4,171 submissions, 24%

    Upcoming Conference

    MM '24
    The 32nd ACM International Conference on Multimedia
    October 28 - November 1, 2024
    Melbourne , VIC , Australia

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)5
    • Downloads (Last 6 weeks)0

    Other Metrics

    Citations

    Cited By

    View all
    • (2021)Design of embedded dance teaching control system based on FPGA and motion recognition processingMicroprocessors and Microsystems10.1016/j.micpro.2021.10399083(103990)Online publication date: Jun-2021
    • (2017)Dancelets Mining for Video Recommendation Based on Dance StylesIEEE Transactions on Multimedia10.1109/TMM.2016.263188119:4(712-724)Online publication date: 1-Apr-2017

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media