Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2324796.2324829acmconferencesArticle/Chapter ViewAbstractPublication PagesicmrConference Proceedingsconference-collections
research-article

Making a scene: alignment of complete sets of clips based on pairwise audio match

Published: 05 June 2012 Publication History

Abstract

As the amount of social video content captured at physical-world events, and shared online, is rapidly increasing, there is a growing need for robust methods for organization and presentation of the captured content. In this work, we significantly extend prior work that examined automatic detection of videos from events that were captured at the same time, i.e. "overlapping". We go beyond finding pairwise matches between video clips and describe the construction of scenes, or sets of multiple overlapping videos, each scene presenting a coherent moment in the event. We test multiple strategies for scene construction, using a greedy algorithm to create a mapping of videos into scenes, and a clustering refinement step to increase the precision of each scene. We evaluate the strategies in multiple settings and show that a greedy and clustering approach results in best possible balance between recall and precision for all settings.

References

[1]
E. Amigó, J. Gonzalo, J. Artiles, and F. Verdejo. A comparison of extrinsic clustering evaluation metrics based on formal constraints. Information Retrieval, 2008.
[2]
L. Ballan, G. J. Brostow, J. Puwein, and M. Pollefeys. Unstructured video-based rendering: interactive exploration of casually captured videos. ACM Trans. Graph., 29:87:1--87:11, July 2010.
[3]
H. Becker, D. Iter, M. Naaman, and L. Gravano. Identifying content for planned events across social media sites. In Proceedings of the fourth ACM international conference on Web search and data mining, WSDM '12, New York, NY, USA, 2011. ACM.
[4]
A. Broder, S. Glassman, M. Manasse, and G. Zweig. Syntactic clustering of the web. Computer Networks and ISDN Systems, 29(8-13):1157--1166, 1997.
[5]
D. L. Davies and D. W. Bouldin. A Cluster Separation Measure. In IEEE Transactions on Pattern Analysis and Machine Intelligence, pages 224--227, April 1979.
[6]
D. Ellis. Robust Landmark-Based Audio Fingerprinting, web resource, http://labrosa.ee.Columbia.edu/matlab/fingerprint/, 2009.
[7]
J. Foo, J. Zobel, and R. Sinha. Clustering near-duplicate images in large collections. In Proceedings of the international workshop on Workshop on multimedia information retrieval, pages 21--30. ACM, 2007.
[8]
L. Kennedy and M. Naaman. Less talk, more rock: automated organization of community-contributed collections of concert videos. In WWW '09: Proceeding of the 18th international conference on World Wide Web, pages 311--320, New York, NY, USA, 2009. ACM.
[9]
X. Liu, R. Troncy, and B. Huet. Finding media illustrating events. In Proceedings of the 1st ACM International Conference on Multimedia Retrieval, ICMR '11, pages 58:1--58:8, New York, NY, USA, 2011. ACM.
[10]
C. D. Manning, P. Raghavan, and H. Schütze. Introduction to Information Retrieval. Cambridge Univ. Press, 2008.
[11]
A. Y. Ng, M. I. Jordan, and Y. Weiss. On spectral clustering: Analysis and an algorithm. In Advances In Neural Information Processing Systems, pages 849--856. MIT Press, 2001.
[12]
P. Shrestha, M. Barbieri, and H. Weda. Synchronization of multi-camera video recordings based on audio. In Proceedings of the 15th international conference on Multimedia, MM '07, pages 545--548. ACM Press, 2007.
[13]
P. Shrestha, P. H. de With, H. Weda, M. Barbieri, and E. H. Aarts. Automatic mashup generation from multiple-camera concert recordings. In Proceedings of the international conference on Multimedia, MM '10, pages 541--550, New York, NY, USA, 2010. ACM.
[14]
C. G. Snoek, B. Freiburg, J. Oomen, and R. Ordelman. Crowdsourcing rock n' roll multimedia retrieval. In Proceedings of the international conference on Multimedia, MM '10, pages 1535--1538, New York, NY, USA, 2010. ACM.
[15]
A. Strehl, J. Ghosh, and C. Cardie. Cluster ensembles-A knowledge reuse framework for combining multiple partitions. Journal of Machine Learning Research, 3:583--617, 2002.
[16]
A. Wang. An Industrial Strength Audio Search Algorithm. In Proceedings of the International Conference on Music Information Retrieval, 2003.
[17]
V. Zsombori, M. Frantzis, R. L. Guimaraes, M. F. Ursu, P. Cesar, I. Kegel, R. Craigie, and D. C. Bulterman. Automatic generation of video narratives from shared ugc. In Proceedings of the 22nd ACM conference on Hypertext and hypermedia, HT '11, pages 325--334, New York, NY, USA, 2011. ACM.

Cited By

View all
  • (2022)ARION: A Digital eLearning Educational Tool Library for Synchronization Composition & Orchestration of Learning Session DataApplied Sciences10.3390/app1217872212:17(8722)Online publication date: 31-Aug-2022
  • (2018)Automated Video Mashups: Research and ChallengesMediaSync10.1007/978-3-319-65840-7_6(167-190)Online publication date: 27-Mar-2018
  • (2017)Synchronization for multi-perspective videos in the wild2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)10.1109/ICASSP.2017.7952425(1592-1596)Online publication date: 5-Mar-2017
  • Show More Cited By

Index Terms

  1. Making a scene: alignment of complete sets of clips based on pairwise audio match

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    ICMR '12: Proceedings of the 2nd ACM International Conference on Multimedia Retrieval
    June 2012
    489 pages
    ISBN:9781450313292
    DOI:10.1145/2324796
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 05 June 2012

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. audio fingerprinting
    2. social media
    3. synchronization
    4. video

    Qualifiers

    • Research-article

    Conference

    ICMR '12
    Sponsor:

    Acceptance Rates

    ICMR '12 Paper Acceptance Rate 50 of 145 submissions, 34%;
    Overall Acceptance Rate 254 of 830 submissions, 31%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)3
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 08 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2022)ARION: A Digital eLearning Educational Tool Library for Synchronization Composition & Orchestration of Learning Session DataApplied Sciences10.3390/app1217872212:17(8722)Online publication date: 31-Aug-2022
    • (2018)Automated Video Mashups: Research and ChallengesMediaSync10.1007/978-3-319-65840-7_6(167-190)Online publication date: 27-Mar-2018
    • (2017)Synchronization for multi-perspective videos in the wild2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)10.1109/ICASSP.2017.7952425(1592-1596)Online publication date: 5-Mar-2017
    • (2016)Robust and efficient multiple alignment of unsynchronized meeting recordingsIEEE/ACM Transactions on Audio, Speech and Language Processing10.1109/TASLP.2016.252678724:5(833-845)Online publication date: 1-May-2016
    • (2015)Syncing Shared Multimedia through Audiovisual Bimodal SegmentationIEEE MultiMedia10.1109/MMUL.2015.3322:3(26-42)Online publication date: 1-Jul-2015
    • (2013)Socially-aware multimedia authoringACM Transactions on Multimedia Computing, Communications, and Applications10.1145/24918939:1s(1-23)Online publication date: 17-Oct-2013

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media