Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/342009.335436acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
Article
Free access

Efficient and cost-effective techniques for browsing and indexing large video databases

Published: 16 May 2000 Publication History

Abstract

We present in this paper a fully automatic content-based approach to organizing and indexing video data. Our methodology involves three steps:
Step 1: We segment each video into shots using a Camera-Tracking technique. This process also extracts the feature vector for each shot, which consists of two statistical variances VarBA and VarOA. These values capture how much things are changing in the background and foreground areas of the video shot.
Step 2: For each video, We apply a fully automatic method to build a browsing hierarchy using the shots identified in Step 1.
Step 3: Using the VarBA and VarOA values obtained in Step 1, we build an index table to support a variance-based video similarity model. That is, video scenes/shots are retrieved based on given values of VarBA and VarOA.
The above three inter-related techniques offer an integrated framework for modeling, browsing, and searching large video databases. Our experimental results indicate that they have many advantages over existing methods.

References

[1]
A. Elmagarmid, H. Jiang, A. Helal, A. Joshi, and M. Ahmed. Video Database Systems- Issues, Products, and Applications. Kulwer Academic Publishers, 1997.
[2]
R. Lienhart. Comparison of automatic shot boundary detection algorithms. In Proc. SPIE Vol. 3656, Storage and Retrieval for Image and Video Databases VII, pages 290-301, San Jose, CA, January 1999.
[3]
M. A. Smith and M. G. Christel. Automating the creation of a digital vidoe library. In Proc. of A CM Multimedia '95, pages 357-358, 1995.
[4]
R. Lienhart, S. Pfeiffer, and W. Effelsberg. The moca workbench: Support for creativity in movie content analysis. In Proc. of the IEEE Int'l Conf. on Multimedia Systems '96, June 1996.
[5]
M. Abdel-Mottaleb and N. Dimitrova. Conivas: Content-based image and video access system. In Proc. of A CM Int'l Conf. on Multimedia, pages 427-428, Boston, MA, November 1996.
[6]
H. Yu and W. Wolf. A visual search system for video and image databases. In Proc. IEEE Int'l Conf. on Multimedia Computing and Systems, pages 517-524, Ottawa, Canada, June 1997.
[7]
R. Zabih, J. Miller, and K. Mai. A featurebased algorithm for detecting and classifying scene breaks. In Proc. of A CM Multimedia '95, pages 189-200, San Francisco, CA, 1995.
[8]
P. Aigrain, P. Joly, and V. Longueville. Medium knowledge-based macro-segmentation of video into sequences. In IJCAI Workshop on Intelligent Multimedia Information Retrieval, pages 5-14, 1995.
[9]
H. Aoki, S. Shimotsuji, and O. Hori. A shot classification method of selecting effective keyframe for video browsing. In Proc. of ACM Int'l Conf. on Multimedia, pages 1-10, Boston, MA, November 1996.
[10]
M. M. Yeung, B. Yeo, and B. Liu. Extracting story units from long programs for video browsing and navigation. In Proc. of the IEEE Int'l Conf. on Multimedia Systems '96, pages 296-304, Hiroshima, Japan, June 1996.
[11]
D. Zhong, H. Zhang, and S-F Chang. Clustering methods for video browsing and annotation. Technical report, Columbia University, 1997.
[12]
R. Hjelsvold and R. Midtstraum. Modeling and querying video data. In Proc. of 20th Int'l Conf. on Very Large Database (VLDB '9#), 1994.
[13]
G. Davenport, T. Smith, and N. Pincever. Cinematic primitives for multimedia. In Proc. IEEE Computer Graphics # Applications, pages 67-74, July 1991.
[14]
R. Hamakawa and J. Rekimoto. Object composition and playback models for handling multimedia data. In Proc. of ACM Multimedia, pages 273-281, Anaheim, CA, August 1993.
[15]
R. Weiss, A. Duda, and D. Gifford. Contentbased access to algebraic video. In Proc. of IEEE Int'l Conf. Multimedia Computing and Systems, Los Alamitos, CA, 1994.
[16]
J. M. Corridoni, A. D. Bimbo, D. Lucarella, and H. Wenxue. Multi-perspective navigation of movies. Journal of Visual Languages and Computing, 7:445-466, July 1996.
[17]
H. Jiang and A. K. Elmagarmid. Wvtdb - a semantic content-based video database system on the world wide web. IEEE Transactions on Knowledge and Data Engineering, 10(6):947-966, 1998.
[18]
H. J. Zhang, S. W. Smoliar, and J. Wu. Contentbased video browsing tools. In Proc. of IS#T/SPIE Con. on Multimedia Computing and Networking, 1995.
[19]
D. Swanberg, C. F. Shu, and R. Jain. Knowledge guided parsing in video databases. In Proc. of SPIE Symposium on Electronic Imaging: Science and Technology, pages 13-24, February 1993.
[20]
H. Zhang and S. W. Smoliar. Developing power tools for video indexing and retrieval. In Proc. of SPIE Storage and Retrieval for Image and Video Database, San Jose, CA, Jan. 1994.
[21]
Y. Gong, H. Chua, and X. Guo. Image indexing and retrieval based on color histogram. In Proc. of Int'l Conf. Multimedia Modeling, pages 115-126, Singapore, Nov. 1995.
[22]
Y. Rui, T. S. Huang, and S. Mehratra. Constructing table-of-cont for videos. A CM Multimedia Systems, 7(5):359-368, 1999.
[23]
JungHwan Oh, Kien A. Hua, and Ning Liang. A content-based scene change detection and classification technique using background tracking. In SPIE Conf. on Multimedia Computing and Networking 2000, San Jose, CA, Jan. 2000.
[24]
P. J. Burt and E. H. Adelson. The laplacian pyramid as a compact image code. In IEEE Transactions on Communications V COM-31, pages 532- 540, April 1983.
[25]
Kien A. Hua, W. Tavanapong, and J. Wang. 2psm: An efficient framework for searching video information in a limited-bandwidth environment. ACM Multimedia Systems, 7(5):396-408, September 1999.
[26]
B. Taves, J. Hoffman, and K. Lund. The moving image genre-form guide. In Motion Picture/Broadcasting/Recoreded Sound Division Library of Congress, 1997.
[27]
W. B. Frakes and R. Baeza-Yates. Information Retrieval- Data Structures and Algorithms. Prentice Hall, Englewood Cliffs, 1992.
[28]
A. Hampapur, R. Jain, and T. Weymouth. Digital video segmentation. In Proc. of A CM Multimedia, pages 357-364, October 1994.
[29]
S. Chang, W. Chen, H. J. Meng, H. Sundaram, and D. Zhong. Videoq: An automated content based video search system using visual cues. In A CM Proc. of the conf. on Mutimedia '97, pages 313- 324, Seattle Washington, November 1997.
[30]
Y. Rui, T. S. Huang, and S. Mehratra. Exploring video structure beyond the shots. In Proc. of 98 IEEE Conf. on Multimedia Computing and Systems, pages 237-240, Austin Texas, June 1998.
[31]
H. D. Wactlar, M. G. Christel, Y. Gong, and A. G. Hauptmann. Lessons learned from building terabyte digital video library. Computer, pages 66- 73, February 1999.

Cited By

View all

Index Terms

  1. Efficient and cost-effective techniques for browsing and indexing large video databases

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image ACM Conferences
        SIGMOD '00: Proceedings of the 2000 ACM SIGMOD international conference on Management of data
        May 2000
        604 pages
        ISBN:1581132174
        DOI:10.1145/342009
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Sponsors

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 16 May 2000

        Permissions

        Request permissions for this article.

        Check for updates

        Author Tags

        1. shot detection
        2. video browsing
        3. video indexing
        4. video retrieval
        5. video similarity model

        Qualifiers

        • Article

        Conference

        SIGMOD/PODS00
        Sponsor:

        Acceptance Rates

        SIGMOD '00 Paper Acceptance Rate 42 of 248 submissions, 17%;
        Overall Acceptance Rate 785 of 4,003 submissions, 20%

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)323
        • Downloads (Last 6 weeks)7
        Reflects downloads up to 12 Nov 2024

        Other Metrics

        Citations

        Cited By

        View all
        • (2021)Top-K Deep Video AnalyticsProceedings of the 2021 International Conference on Management of Data10.1145/3448016.3452786(1037-1050)Online publication date: 9-Jun-2021
        • (2019)VStoreProceedings of the Fourteenth EuroSys Conference 201910.1145/3302424.3303971(1-17)Online publication date: 25-Mar-2019
        • (2015)Shot-HRProceedings of the 30th Annual ACM Symposium on Applied Computing10.1145/2695664.2695841(1257-1262)Online publication date: 13-Apr-2015
        • (2014)KS-SIFTProceedings of the 2014 IEEE International Symposium on Multimedia10.1109/ISM.2014.52(13-17)Online publication date: 10-Dec-2014
        • (2013)Video shot representation based on histogramsProceedings of the 28th Annual ACM Symposium on Applied Computing10.1145/2480362.2480547(961-966)Online publication date: 18-Mar-2013
        • (2012)Real-Time Query Processing on Live Videos in Networks of Distributed CamerasResearch, Practice, and Educational Advancements in Telecommunications and Networking10.4018/978-1-4666-0050-8.ch003(27-48)Online publication date: 2012
        • (2011)Adaptive clustering and interactive visualizations to support the selection of video clipsProceedings of the 1st ACM International Conference on Multimedia Retrieval10.1145/1991996.1992030(1-8)Online publication date: 18-Apr-2011
        • (2010)Real-Time Query Processing on Live Videos in Networks of Distributed CamerasInternational Journal of Interdisciplinary Telecommunications and Networking10.4018/jitn.20100101032:1(27-48)Online publication date: 1-Jan-2010
        • (2009)Large scale incremental web video categorizationProceedings of the 1st workshop on Web-scale multimedia corpus10.1145/1631135.1631142(33-40)Online publication date: 23-Oct-2009
        • (2009)Hierarchical Video Data Modeling and Indexing for Virtual Scene ConstructionProceedings of the 2009 Second International Conference on Machine Vision10.1109/ICMV.2009.15(69-73)Online publication date: 28-Dec-2009
        • Show More Cited By

        View Options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Get Access

        Login options

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media