Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2647868.2654942acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Scalable Visual Instance Mining with Threads of Features

Published: 03 November 2014 Publication History
  • Get Citation Alerts
  • Abstract

    We address the problem of visual instance mining, which is to extract frequently appearing visual instances automatically from a multimedia collection. We propose a scalable mining method by exploiting Thread of Features (ToF). Specifically, ToF, a compact representation that links consistent features across images, is extracted to reduce noises, discover patterns, and speed up processing. Various instances, especially small ones, can be discovered by exploiting correlated ToFs. Our approach is significantly more effective than other methods in mining small instances. At the same time, it is also more efficient by requiring much fewer hash tables. We compared with several state-of-the-art methods on two fully annotated datasets: MQA and Oxford, showing large performance gain in mining (especially small) visual instances. We also run our method on another Flickr dataset with one million images for scalability test. Two applications, instance search and multimedia summarization, are developed from the novel perspective of instance mining, showing great potential of our method in multimedia analysis.

    References

    [1]
    R. Agrawal and R. Srikant. Fast algorithms for mining association rules in large databases. In Proc. VLDB, pages 487--499, 1994.
    [2]
    D. M. Blei, A. Y. Ng, and M. I. Jordan. Latent dirichlet allocation. Journal of Machine Learning Research, pages 993--1022, 2003.
    [3]
    A. Broder. On the resemblance and containment of documents. In SEQUENCES, 1997.
    [4]
    O. Chum and J. Matas. Large-scale discovery of spatially related images. IEEE Trans. PAMI, 32:371--377, 2010.
    [5]
    O. Chum, M. Perdoch, and J. Matas. Geometric min-hashing: Finding a (thick) needle in a haystack. Proc. CVPR, pages 17--24, 2009.
    [6]
    O. Chum, J. Philbin, M. Isard, and A. Zisserman. Scalable near identical image and shot detection. In Proc. CIVR, 2007.
    [7]
    J. Han, J. Pei, and Y. Yin. Mining frequent patterns without candidate generation. In Proc. SIGMOD, pages 1--12, 2000.
    [8]
    T. Hofmann. Probabilistic latent semantic indexing. Proc. SIGIR, pages 50--57, 1999.
    [9]
    H. Jégou, M. Douze, and C. Schmid. Improving bag-of-features for large scale image search. IJCV, 87(3):192--212, May 2010.
    [10]
    P. Letessier, O. Buisson, and A. Joly. Scalable mining of small visual objects. In ACM Multimedia, 2012.
    [11]
    H. Liu and S. Yan. Common visual pattern discovery via spatially coherent correspondences. In Proc. CVPR, pages 1609--1616, 2010.
    [12]
    D. Nister and H. Stewenius. Scalable recognition with a vocabulary tree. In Proc. CVPR, pages 2161--2168, 2006.
    [13]
    J. Philbin, O. Chum, M. Isard, J. Sivic, and A. Zisserman. Object retrieval with large vocabularies and fast spatial matching. In Proc. CVPR, 2007.
    [14]
    J. Philbin, J. Sivic, and A. Zisserman. Geometric LDA: A generative model for particular object discovery. In Proc. BMVC, 2008.
    [15]
    J. Philbin and A. Zisserman. Object mining using a matching graph on very large image collections. In Proc. ICVGIP, pages 738--745, 2008.
    [16]
    G. F. Pineda, H. Koga, and T. Watanabe. Scalable object discovery: A hash-based approach to clustering co-occurring visual words. IEICE Trans. on Information and Systems, pages 2024--2035, 2011.
    [17]
    T. Quack, V. Ferrari, and L. V. Gool. Video mining with frequent itemset configurations. In Proc. CIVR, pages 360--369, 2006.
    [18]
    B. C. Russell, W. T. Freeman, A. A. Efros, J. Sivic, and A. Zisserman. Using multiple segmentations to discover objects and their extent in image collections. In Proc. CVPR, pages 1605--1614, 2006.
    [19]
    J. Sivic and A. Zisserman. Video Google: A text retrieval approach to object matching in videos. In Proc. ICCV, 2003.
    [20]
    J. Sivic and A. Zisserman. Video data mining using configurations of viewpoint invariant regions. In Proc. CVPR, 2004.
    [21]
    A. F. Smeaton, P. Over, and W. Kraaij. Evaluation campaigns and trecvid. In Proc. MIR, 2006.
    [22]
    H.-K. Tan and C.-W. Ngo. Localized matching using earth mover's distance towards discovery of common patterns from small image samples. Image Vision Computing, 27(10):1470--1483, 2009.
    [23]
    G. Xin, L. Dong, J. Brendan, Z. Mojun, C. Anni, and S.-F. Chang. Robust object co-detection. In Proc. CVPR, 2013.
    [24]
    J. Yuan and Y. Wu. Spatial random partition for common visual pattern discovery. In Proc. ICCV, 2007.
    [25]
    M. J. Zaki. Scalable algorithms for association mining. IEEE Trans. on KDE, pages 372--390, 2000.
    [26]
    W. Zhang and C.-W. Ngo. Searching visual instances with topology checking and context modeling. In Proc. ICMR, 2012.
    [27]
    W. Zhang, L. Pang, and C. W. Ngo. Snap-and-ask: Answering multimodal question by naming visual instance. In ACM Multimedia, 2012.
    [28]
    C. Zhu and S. Satoh. Large vocabulary quantization for searching instances from videos. In Proc. ICMR, 2012.

    Cited By

    View all

    Index Terms

    1. Scalable Visual Instance Mining with Threads of Features

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      MM '14: Proceedings of the 22nd ACM international conference on Multimedia
      November 2014
      1310 pages
      ISBN:9781450330633
      DOI:10.1145/2647868
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 03 November 2014

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. clustering
      2. instance mining
      3. min-hash
      4. summarization
      5. thread of features

      Qualifiers

      • Research-article

      Funding Sources

      Conference

      MM '14
      Sponsor:
      MM '14: 2014 ACM Multimedia Conference
      November 3 - 7, 2014
      Florida, Orlando, USA

      Acceptance Rates

      MM '14 Paper Acceptance Rate 55 of 286 submissions, 19%;
      Overall Acceptance Rate 995 of 4,171 submissions, 24%

      Upcoming Conference

      MM '24
      The 32nd ACM International Conference on Multimedia
      October 28 - November 1, 2024
      Melbourne , VIC , Australia

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)0
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 10 Aug 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2020)Multi-Modal Deep Analysis for MultimediaIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2019.294064730:10(3740-3764)Online publication date: Oct-2020
      • (2019)Salient Time Slice Pruning and Boosting for Person-Scene Instance Search in TV SeriesProceedings of the ACM Multimedia Asia10.1145/3338533.3366594(1-6)Online publication date: 15-Dec-2019
      • (2019)Name-face association with web facial image supervisionMultimedia Systems10.1007/s00530-017-0544-y25:1(1-20)Online publication date: 1-Feb-2019
      • (2018)PatternNetProceedings of the 2018 ACM on International Conference on Multimedia Retrieval10.1145/3206025.3206039(291-299)Online publication date: 5-Jun-2018
      • (2018)Automatic visual pattern mining from categorical image datasetInternational Journal of Multimedia Information Retrieval10.1007/s13735-018-0163-18:1(35-45)Online publication date: 19-Dec-2018
      • (2018)Visual instance mining from the graph perspectiveMultimedia Systems10.1007/s00530-016-0533-624:2(147-162)Online publication date: 1-Mar-2018
      • (2018)Saliency-GD: A TF-IDF Analogy for Landmark Image MiningAdvances in Multimedia Information Processing – PCM 201710.1007/978-3-319-77380-3_45(477-486)Online publication date: 10-May-2018
      • (2017)Graph-based visual instance mining with geometric matching and nearest candidates selection2017 9th International Conference on Knowledge and Systems Engineering (KSE)10.1109/KSE.2017.8119469(263-268)Online publication date: Oct-2017
      • (2017)Binarized Mode Seeking for Scalable Visual Pattern Discovery2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)10.1109/CVPR.2017.722(6827-6835)Online publication date: Jul-2017
      • (2016)Event Specific Multimodal Pattern Mining for Knowledge Base ConstructionProceedings of the 24th ACM international conference on Multimedia10.1145/2964284.2964287(821-830)Online publication date: 1-Oct-2016
      • Show More Cited By

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media