research-article

Scalable Visual Instance Mining with Threads of Features

Authors:

Shih-Fu ChangAuthors Info & Claims

MM '14: Proceedings of the 22nd ACM international conference on Multimedia

Pages 297 - 306

https://doi.org/10.1145/2647868.2654942

Published: 03 November 2014 Publication History

Abstract

We address the problem of visual instance mining, which is to extract frequently appearing visual instances automatically from a multimedia collection. We propose a scalable mining method by exploiting Thread of Features (ToF). Specifically, ToF, a compact representation that links consistent features across images, is extracted to reduce noises, discover patterns, and speed up processing. Various instances, especially small ones, can be discovered by exploiting correlated ToFs. Our approach is significantly more effective than other methods in mining small instances. At the same time, it is also more efficient by requiring much fewer hash tables. We compared with several state-of-the-art methods on two fully annotated datasets: MQA and Oxford, showing large performance gain in mining (especially small) visual instances. We also run our method on another Flickr dataset with one million images for scalability test. Two applications, instance search and multimedia summarization, are developed from the novel perspective of instance mining, showing great potential of our method in multimedia analysis.

References

[1]

R. Agrawal and R. Srikant. Fast algorithms for mining association rules in large databases. In Proc. VLDB, pages 487--499, 1994.

Digital Library

[2]

D. M. Blei, A. Y. Ng, and M. I. Jordan. Latent dirichlet allocation. Journal of Machine Learning Research, pages 993--1022, 2003.

Digital Library

[3]

A. Broder. On the resemblance and containment of documents. In SEQUENCES, 1997.

Digital Library

[4]

O. Chum and J. Matas. Large-scale discovery of spatially related images. IEEE Trans. PAMI, 32:371--377, 2010.

Digital Library

[5]

O. Chum, M. Perdoch, and J. Matas. Geometric min-hashing: Finding a (thick) needle in a haystack. Proc. CVPR, pages 17--24, 2009.

[6]

O. Chum, J. Philbin, M. Isard, and A. Zisserman. Scalable near identical image and shot detection. In Proc. CIVR, 2007.

Digital Library

[7]

J. Han, J. Pei, and Y. Yin. Mining frequent patterns without candidate generation. In Proc. SIGMOD, pages 1--12, 2000.

Digital Library

[8]

T. Hofmann. Probabilistic latent semantic indexing. Proc. SIGIR, pages 50--57, 1999.

Digital Library

[9]

H. Jégou, M. Douze, and C. Schmid. Improving bag-of-features for large scale image search. IJCV, 87(3):192--212, May 2010.

Digital Library

[10]

P. Letessier, O. Buisson, and A. Joly. Scalable mining of small visual objects. In ACM Multimedia, 2012.

Digital Library

[11]

H. Liu and S. Yan. Common visual pattern discovery via spatially coherent correspondences. In Proc. CVPR, pages 1609--1616, 2010.

[12]

D. Nister and H. Stewenius. Scalable recognition with a vocabulary tree. In Proc. CVPR, pages 2161--2168, 2006.

Digital Library

[13]

J. Philbin, O. Chum, M. Isard, J. Sivic, and A. Zisserman. Object retrieval with large vocabularies and fast spatial matching. In Proc. CVPR, 2007.

[14]

J. Philbin, J. Sivic, and A. Zisserman. Geometric LDA: A generative model for particular object discovery. In Proc. BMVC, 2008.

[15]

J. Philbin and A. Zisserman. Object mining using a matching graph on very large image collections. In Proc. ICVGIP, pages 738--745, 2008.

Digital Library

[16]

G. F. Pineda, H. Koga, and T. Watanabe. Scalable object discovery: A hash-based approach to clustering co-occurring visual words. IEICE Trans. on Information and Systems, pages 2024--2035, 2011.

[17]

T. Quack, V. Ferrari, and L. V. Gool. Video mining with frequent itemset configurations. In Proc. CIVR, pages 360--369, 2006.

Digital Library

[18]

B. C. Russell, W. T. Freeman, A. A. Efros, J. Sivic, and A. Zisserman. Using multiple segmentations to discover objects and their extent in image collections. In Proc. CVPR, pages 1605--1614, 2006.

Digital Library

[19]

J. Sivic and A. Zisserman. Video Google: A text retrieval approach to object matching in videos. In Proc. ICCV, 2003.

Digital Library

[20]

J. Sivic and A. Zisserman. Video data mining using configurations of viewpoint invariant regions. In Proc. CVPR, 2004.

[21]

A. F. Smeaton, P. Over, and W. Kraaij. Evaluation campaigns and trecvid. In Proc. MIR, 2006.

Digital Library

[22]

H.-K. Tan and C.-W. Ngo. Localized matching using earth mover's distance towards discovery of common patterns from small image samples. Image Vision Computing, 27(10):1470--1483, 2009.

Digital Library

[23]

G. Xin, L. Dong, J. Brendan, Z. Mojun, C. Anni, and S.-F. Chang. Robust object co-detection. In Proc. CVPR, 2013.

Digital Library

[24]

J. Yuan and Y. Wu. Spatial random partition for common visual pattern discovery. In Proc. ICCV, 2007.

[25]

M. J. Zaki. Scalable algorithms for association mining. IEEE Trans. on KDE, pages 372--390, 2000.

Digital Library

[26]

W. Zhang and C.-W. Ngo. Searching visual instances with topology checking and context modeling. In Proc. ICMR, 2012.

Digital Library

[27]

W. Zhang, L. Pang, and C. W. Ngo. Snap-and-ask: Answering multimodal question by naming visual instance. In ACM Multimedia, 2012.

Digital Library

[28]

C. Zhu and S. Satoh. Large vocabulary quantization for searching instances from videos. In Proc. ICMR, 2012.

Digital Library

Cited By

Zhu WWang XLi H(2020)Multi-Modal Deep Analysis for MultimediaIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2019.294064730:10(3740-3764)Online publication date: Oct-2020
https://doi.org/10.1109/TCSVT.2019.2940647
Wang ZYang FSatoh S(2019)Salient Time Slice Pruning and Boosting for Person-Scene Instance Search in TV SeriesProceedings of the ACM Multimedia Asia10.1145/3338533.3366594(1-6)Online publication date: 15-Dec-2019
https://dl.acm.org/doi/10.1145/3338533.3366594
Chen ZZhang WDeng BXie HGu X(2019)Name-face association with web facial image supervisionMultimedia Systems10.1007/s00530-017-0544-y25:1(1-20)Online publication date: 1-Feb-2019
https://dl.acm.org/doi/10.1007/s00530-017-0544-y
Show More Cited By

Index Terms

Scalable Visual Instance Mining with Threads of Features
1. Information systems
  1. Information retrieval

Recommendations

Infrequent pattern mining in smart healthcare environment using data summarization

A summarization technique creates a concise version of large amount of data (big data!) which reduces the computational cost of analysis and decision-making. There are interesting data patterns, such as rare anomalies, which are more infrequent in ...
Mining summarization of high utility itemsets

Mining interesting itemsets from transaction databases has attracted a lot of research interests for decades. In recent years, high utility itemset (HUI) has emerged as a hot topic in this field. In real applications, the bottleneck of HUI mining is not ...
Supporting efficient and scalable frequent pattern mining

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

MM '14: Proceedings of the 22nd ACM international conference on Multimedia

November 2014

1310 pages

ISBN:9781450330633

DOI:10.1145/2647868

General Chairs:
Kien A. Hua
University of Central Florida, USA
,
Yong Rui
Microsoft Research, China
,
Ralf Steinmetz
Technische Universitt Darmstadt, Germany
,
Program Chairs:
Alan Hanjalic
Delft University of Technology, Netherlands
,
Apostol (Paul) Natsev
Google, USA
,
Wenwu Zhu
Tsinghua University, China

Copyright © 2014 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGMM: ACM Special Interest Group on Multimedia

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 03 November 2014

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

Research Grants Council, University Grants Committee, Hong Kong

Conference

MM '14

Sponsor:

SIGMM

MM '14: 2014 ACM Multimedia Conference

November 3 - 7, 2014

Florida, Orlando, USA

Acceptance Rates

MM '14 Paper Acceptance Rate 55 of 286 submissions, 19%;

Overall Acceptance Rate 995 of 4,171 submissions, 24%

Upcoming Conference

MM '24

Sponsor:
sigmm

The 32nd ACM International Conference on Multimedia

October 28 - November 1, 2024

Melbourne , VIC , Australia

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

13
Total Citations
View Citations
255
Total Downloads

Downloads (Last 12 months)0
Downloads (Last 6 weeks)0

Reflects downloads up to 10 Aug 2024

Other Metrics

View Author Metrics

Citations

Cited By

Zhu WWang XLi H(2020)Multi-Modal Deep Analysis for MultimediaIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2019.294064730:10(3740-3764)Online publication date: Oct-2020
https://doi.org/10.1109/TCSVT.2019.2940647
Wang ZYang FSatoh S(2019)Salient Time Slice Pruning and Boosting for Person-Scene Instance Search in TV SeriesProceedings of the ACM Multimedia Asia10.1145/3338533.3366594(1-6)Online publication date: 15-Dec-2019
https://dl.acm.org/doi/10.1145/3338533.3366594
Chen ZZhang WDeng BXie HGu X(2019)Name-face association with web facial image supervisionMultimedia Systems10.1007/s00530-017-0544-y25:1(1-20)Online publication date: 1-Feb-2019
https://dl.acm.org/doi/10.1007/s00530-017-0544-y
Li HEllis JZhang LChang SAizawa KLew MSatoh S(2018)PatternNetProceedings of the 2018 ACM on International Conference on Multimedia Retrieval10.1145/3206025.3206039(291-299)Online publication date: 5-Jun-2018
https://dl.acm.org/doi/10.1145/3206025.3206039
Li HEllis JZhang LChang S(2018)Automatic visual pattern mining from categorical image datasetInternational Journal of Multimedia Information Retrieval10.1007/s13735-018-0163-18:1(35-45)Online publication date: 19-Dec-2018
https://doi.org/10.1007/s13735-018-0163-1
Li WLi JWang CZhang LZhang B(2018)Visual instance mining from the graph perspectiveMultimedia Systems10.1007/s00530-016-0533-624:2(147-162)Online publication date: 1-Mar-2018
https://dl.acm.org/doi/10.1007/s00530-016-0533-6
Li WLi JZhang B(2018)Saliency-GD: A TF-IDF Analogy for Landmark Image MiningAdvances in Multimedia Information Processing – PCM 201710.1007/978-3-319-77380-3_45(477-486)Online publication date: 10-May-2018
https://doi.org/10.1007/978-3-319-77380-3_45
Nguyen NNguyen KVan CLe D(2017)Graph-based visual instance mining with geometric matching and nearest candidates selection2017 9th International Conference on Knowledge and Systems Engineering (KSE)10.1109/KSE.2017.8119469(263-268)Online publication date: Oct-2017
https://doi.org/10.1109/KSE.2017.8119469
Zhang WCao XWang RGuo YChen Z(2017)Binarized Mode Seeking for Scalable Visual Pattern Discovery2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)10.1109/CVPR.2017.722(6827-6835)Online publication date: Jul-2017
https://doi.org/10.1109/CVPR.2017.722
Li HEllis JJi HChang SHanjalic ASnoek CWorring MBulterman DHuet BKelliher AKompatsiaris YLi J(2016)Event Specific Multimodal Pattern Mining for Knowledge Base ConstructionProceedings of the 24th ACM international conference on Multimedia10.1145/2964284.2964287(821-830)Online publication date: 1-Oct-2016
https://dl.acm.org/doi/10.1145/2964284.2964287
Show More Cited By

View Options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents