Dance is a special and important type of action, composed of abundant and various action elements. However, the recommendation of dance videos on the web are still not well studied. It is hard to realize it in the way of traditional methods using associated texts or static features of video content. In this paper, we study the problem focusing on extraction and representation of action information in dances. We propose to recommend dance videos based on the automatically discovered ``Dance Styles'', which play a significant role in characterizing different types of dances. To bridge the semantic gap of video content and mid-level concept, style, we take advantage of a mid-level action representation method, and extract representative patches as ``Dancelets'', a sort of intermediation between videos and the concepts. Furthermore, we propose to employ Motion Boundaries as saliency priors and sparsely extract patches containing more representative information to generate a set of dancelet candidates. Dancelets are then discovered by Normalized-cut method, which is superior in grouping visually similar patterns into the same clusters. For the fast and effective recommendation, a random forest-based index is built, and the ranking results are derived according to the matching results in all the leaf notes. Extensive experiments validated on the web dance videos demonstrate the effectiveness of the proposed methods for dance style discovery and video recommendation based on styles.

References

[1]

H. B and S. B. Determining optical flow. Artificial Intelligence, 17:185--203, 1981.

Digital Library

Google Scholar

[2]

P. Cui, Z. Wang, and Z. Su. What videos are similar with you? learning a common attributed representation for video recommendation. In MM, 2014.

Digital Library

Google Scholar

[3]

N. Dalal, B. Triggs, and C. Schmid. Human detection using oriented histograms of flowand appearance. In ECCV, 2006.

Digital Library

Google Scholar

[4]

T. Han, H. Yao, Y. Zhang, and P. Xu. A spatial-temporal constraint-based action recognition method. In ICIP, pages 2767--2771, 2013.

Crossref

Google Scholar

[5]

I. Laptev, M. Marszalek, C. Schmid, and B. Rozenfeld. Learning realistic human actions from movies. In CVPR, pages 1--8, 2008.

Crossref

Google Scholar

[6]

T. Mei, B. Yang, X.-S. Huan, and S. Li. Contextual video recommendation by multimodal relevance and user feedback. ACM Transactions on Information Systems, 29:1--24, 2011.

Digital Library

Google Scholar

[7]

X. Peng, Y. Qiao, and Q. Peng. Motion boundary based sampling and 3d co-occurrence descriptors for action recognition. Image and Vision Computing, 32:616--628, 2014.

Crossref

Google Scholar

[8]

S. Sadanand and J. Corso. Action bank: A high-level representation of activity in video. In CVPR, pages 1234--1241, 2012.

Digital Library

Google Scholar

[9]

X. Sun, R. Ji, H. Yao, P. Xu, T. Liu, and X. Liu. Place retrieval with graph-based place-view model. In MIR, pages 268--275, 2012.

Digital Library

Google Scholar

[10]

J. Uijlings, I. C. Duta, E. Sangineto, and N. Sebe. Video classification with densely extracted hog/hof/mbh features: an evaluation of the accuracy/computational efficiency trade-off. International Journal of Multimedia Information Retrieval, 4:33--44, 2015.

Crossref

Google Scholar

[11]

Wang, Heng, A. Klaser, C. Schmid, and C.-L. Liu. Dense trajectories and motion boundary descriptors for action recognition. International Journal of Computer Vision, 103(1):60--79, 2013.

Crossref

Google Scholar

[12]

G. Yu, J. Yuan, and Z. Liu. Unsupervised random forest indexing for fast action search. In CVPR, pages 865--872, 2011.

Digital Library

Google Scholar

[13]

S. X. Yu and J. Shi. Multiclass spectral clustering. In ICCV, pages 313--319, 2003.

Digital Library

Google Scholar

[14]

X. Zhao, J. Yuan, M. Wang, G. Li, R. Hong, Z. Li, and T.-S. Chua. Video recommendation over multiple information sources. Multimedia Systems, 19(1):3--15, 2013.

Digital Library

Google Scholar

Cited By

View all

Wu H(2021)Design of embedded dance teaching control system based on FPGA and motion recognition processingMicroprocessors and Microsystems10.1016/j.micpro.2021.10399083(103990)Online publication date: Jun-2021
https://doi.org/10.1016/j.micpro.2021.103990
Han TYao HXu CSun XZhang YCorso J(2017)Dancelets Mining for Video Recommendation Based on Dance StylesIEEE Transactions on Multimedia10.1109/TMM.2016.263188119:4(712-724)Online publication date: 1-Apr-2017
https://dl.acm.org/doi/10.1109/TMM.2016.2631881

Index Terms

"Clustering of Dancelets": Towards Video Recommendation Based on Dance Styles
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision problems
  2. Computer graphics
    1. Image manipulation
2. Information systems
  1. Information retrieval

Recommendations

Beyond Labels: Leveraging Deep Learning and LLMs for Content Metadata
RecSys '23: Proceedings of the 17th ACM Conference on Recommender Systems

Content metadata plays a very important role in movie recommender systems as it provides valuable information about various aspects of a movie such as genre, cast, plot synopsis, box office summary, etc. Analyzing the metadata can help understand the ...
Read More
Video recommendation based on multi-modal information and multiple kernel

Collaborative Filter (CF) algorithms often suffer from data sparsity and item cold start problem, for the user-item matrix is insufficient and extremely sparse especially when new item is added to recommendation system. These two problems also exist in ...
Read More
Real-time Video Recommendation Exploration
SIGMOD '16: Proceedings of the 2016 International Conference on Management of Data

Video recommendation has attracted growing attention in recent years. However, conventional techniques have limitations in real-time processing, accuracy or scalability for the large-scale video data. To address the deficiencies of current ...
Read More

Comments

Information & Contributors

Information

Published In

MM '15: Proceedings of the 23rd ACM international conference on Multimedia

October 2015

1402 pages

ISBN:9781450334594

DOI:10.1145/2733373

General Chairs:
Xiaofang Zhou
The University of Queensland, Australia
,
Alan F. Smeaton
Dublin City University, Ireland
,
Qi Tian
The University of Texas at San Antonio, USA
,
Program Chairs:
Dick C.A. Bulterman
FXPAL, USA
,
Heng Tao Shen
The University of Queensland, Australia
,
Ketan Mayer-Patel
The University of North Carolina, USA
,
Shuicheng Yan
National University of Singapore, Singapore

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 October 2015

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Short-paper

Funding Sources

National Natural Science Foundation of China
National Natural Science Foundation of China Key Program

Conference

MM '15

Sponsor:

SIGMM

MM '15: ACM Multimedia Conference

October 26 - 30, 2015

Brisbane, Australia

Acceptance Rates

MM '15 Paper Acceptance Rate 56 of 252 submissions, 22%;

Overall Acceptance Rate 995 of 4,171 submissions, 24%

Upcoming Conference

MM '24

Sponsor:
sigmm

The 32nd ACM International Conference on Multimedia

October 28 - November 1, 2024

Melbourne , VIC , Australia

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

2
Total Citations
View Citations
197
Total Downloads

Downloads (Last 12 months)5
Downloads (Last 6 weeks)0

Other Metrics

View Author Metrics

Citations

Cited By

View all

Wu H(2021)Design of embedded dance teaching control system based on FPGA and motion recognition processingMicroprocessors and Microsystems10.1016/j.micpro.2021.10399083(103990)Online publication date: Jun-2021
https://doi.org/10.1016/j.micpro.2021.103990
Han TYao HXu CSun XZhang YCorso J(2017)Dancelets Mining for Video Recommendation Based on Dance StylesIEEE Transactions on Multimedia10.1109/TMM.2016.263188119:4(712-724)Online publication date: 1-Apr-2017
https://dl.acm.org/doi/10.1109/TMM.2016.2631881

View Options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Cited By

Index Terms

Recommendations

Beyond Labels: Leveraging Deep Learning and LLMs for Content Metadata

Video recommendation based on multi-modal information and multiple kernel

Real-time Video Recommendation Exploration