Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1646396.1646410acmconferencesArticle/Chapter ViewAbstractPublication PagescivrConference Proceedingsconference-collections
keynote

Mining from large image sets

Published: 08 July 2009 Publication History

Abstract

So far, most image mining was based on interactive querying. Although such querying will remain important in the future, several applications need image mining at such wide scales that it has to run automatically. This adds an additional level to the problem, namely to apply appropriate further processing to different types of images, and to decide on such processing automatically as well. This paper touches on those issues in that we discuss the processing of landmark images and of images coming from webcams. The first part deals with the automated collection of images of landmarks, which are then also automatically annotated and enriched with Wikipedia information. The target application is that users photograph landmarks with their mobile phones or PDAs, and automatically get information about them. Similarly, users can get images in their photo albums annotated automatically. The object of interest can also be automatically delineated in the images. The pipeline we propose actually retrieves more images than manual keyword input would produce. The second part of the paper deals with an entirely different source of image data, but one that also produces massive amounts (although typically not archived): webcams. They produce images at a single location, but rather continuously and over extended periods of time. We propose an approach to summarize data coming from webcams. This data handling is quite different from that applied to the landmark images.

References

[1]
Time Square Webcam. http://images.earthcam.com/ec_metros/ourcams/mpstudiosouth.jpg, 2008/04/01--2008/05/20.
[2]
A. Adam, E. Rivlin, I. Shimshoni, and D. Reinitz. Robust real-time unusual event detection using multiple fixed-location monitors. Trans. PAMI, 30(3): 555--560, 2008.
[3]
R. Agrawal, T. Imielinski, and A. N. Swami. Mining association rules between sets of items in large databases. In SIGMOD, 1993.
[4]
K. Beyer, J. Goldstein, R. Ramakrishnan, and U. Shaft. When is "nearest neighbor" meaningful. In Int. Conf. on Database Theory, 1999.
[5]
M. Breitenstein, H. Grabner, and L. V. Gool. Hunting nessie -- real-time abnormality detection from webcams. Technical report, Computer Vision Laboratory, ETH Zürich, 2009.
[6]
N. Dalal and B. Triggs. Histograms of oriented gradients for human detection. In CVPR, 2005.
[7]
S. Gammeter, L. Bossard, T. Quack, and L. Van Gool. I know what you did last summer: object level auto-annotation of holiday snaps. In ICCV, 2009.
[8]
M. Goesele, N. Snavely, B. Curless, H. Hoppe, and S. M. Seitz. Multi-view stereo for community photo collections. In ICCV, pages 265--270, Rio de Janeiro, Brazil, 2007. IEEE.
[9]
I. D. Guedalia, M. London, and M. Werman. An on-line agglomerative clustering method for nonstationary data. Neural Computation, 11(2): 521--540, 1999.
[10]
W. Hu, X. Xiao, Z. Fu, D. Xie, F.-T. Tan, and S. Maybank. A system for learning statistical motion patterns. Trans. PAMI, 28(9): 1450--1464, 2006.
[11]
N. Jacobs, N. Roman, and R. Pless. Consistent temporal variations in many outdoor scenes. In CVPR, 2007.
[12]
N. Johnson and D. Hogg. Learning the distribution of object trajectories for event recognition. In BMVC, 1996.
[13]
J. Lalonde, S. Narasimhan, and A. Efros. What does the sky tell us about the camera? In ECCV, 2008.
[14]
J. Li, S. Gong, and T. Xiang. Scene segmentation for behaviour correlation. In ECCV, 2008.
[15]
D. Makris and T. Ellis. Learning semantic scene models from observing activity in visual surveillance. Trans. on Systems, Man, and Cybernetics, 35(3): 397--408, 2005.
[16]
N. Oliver, B. Rosario, and A. Pentland. A bayesian computer vision system for modeling human interactions. Trans. PAMI, 22(8): 831--843, 2000.
[17]
D. Omercevic, O. Drbohlav, and A. Leonardis. High-dimensional feature matching: Employing the concept of meaningful nearest neighbors. In ICCV, 2007.
[18]
M. Perdoch, O. Chum, and J. Matas. Efficient representation of local geometry for large scale object retrieval. In CVPR, 2009.
[19]
J. Philbin, O. Chum, M. Isard, J. Sivic, and A. Zisserman. Object retrieval with large vocabularies and fast spatial matching. In CVPR, 2007.
[20]
Y. Pritch, A. Rav-acha, A. Gutman, and S. Peleg. Webcam synopsis: Peeking around the world. In ICCV, 2007.
[21]
T. Quack, B. Leibe, and L. Van Gool. World-scale mining of objects and events from community photo collections. In CIVR, 2008.
[22]
J. Sivic, M. Everingham, and A. Zisserman. "Who are you?" -- Learning person specific classifiers from video. In CVPR, 2009.
[23]
J. Sivic and A. Zisserman. Video google: a text retrieval approach to object matching in videos. In ICCV, 2003.
[24]
N. Snavely, S. M. Seitz, and R. Szeliski. Photo tourism: Exploring photo collections in 3d. In SIGGRAPH, 2006.
[25]
C. Stauffer and W. E. L. Grimson. Learning patterns of activity using real-time tracking. Trans. PAMI, 22(8): 747--757, 2000.
[26]
K. Sunkavalli, F. Romeiro, W. Matusik, T. Zickler, and H. Pfister. What do color changes reveal about an outdoor scene? In CVPR, 2008.
[27]
M. Vergauwen and L. Van Gool. Web-based 3d reconstruction service. MVA, 17(6): 411--426, 2006.
[28]
X. Wang, K. Ma, G. Ng, and W. Grimson. Trajectory analysis and semantic region modeling using a nonparametric bayesian model. In CVPR, 2008.
[29]
Y. Weiss. Deriving intrinsic images from image sequences. In ICCV, 2001.
[30]
H. Zhong, J. Shi, and M. Visontai. Detecting unusual activity in video. In CVPR, 2004.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
CIVR '09: Proceedings of the ACM International Conference on Image and Video Retrieval
July 2009
383 pages
ISBN:9781605584805
DOI:10.1145/1646396
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 08 July 2009

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. computer vision
  2. image collection mining
  3. landmark recognition
  4. large-scale processing
  5. learning from continuous data streams
  6. vision for mobile phones
  7. webcam summarization

Qualifiers

  • Keynote

Funding Sources

Conference

CIVR '09
Sponsor:

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)1
  • Downloads (Last 6 weeks)0
Reflects downloads up to 16 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2019)Mining of Medical Trends Using Social NetworksHealthcare Policy and Reform10.4018/978-1-5225-6915-2.ch036(786-804)Online publication date: 2019
  • (2016)Mining of Medical Trends Using Social NetworksBiomedical Image Analysis and Mining Techniques for Improved Health Outcomes10.4018/978-1-4666-8811-7.ch008(164-182)Online publication date: 2016
  • (2015)Learning Algorithms for Anomaly Detection from ImagesInternational Journal of System Dynamics Applications10.4018/IJSDA.20150701034:3(43-69)Online publication date: 1-Jul-2015
  • (2014)Computer Graphics and Cultural Heritage, Part 2: Continuing Inspiration for Future ToolsIEEE Computer Graphics and Applications10.1109/MCG.2014.6534:4(70-79)Online publication date: Jul-2014
  • (2012)Robust Duplicate Detection of 2D and 3D ObjectsMethods and Innovations for Multimedia Database Content Management10.4018/978-1-4666-1791-9.ch007(96-117)Online publication date: 2012
  • (2011)The InfoAlbum image centric information collectionProceedings of the International Conference on Web Intelligence, Mining and Semantics10.1145/1988688.1988706(1-9)Online publication date: 25-May-2011
  • (2011)City-scale landmark identification on mobile devicesProceedings of the 2011 IEEE Conference on Computer Vision and Pattern Recognition10.1109/CVPR.2011.5995610(737-744)Online publication date: 20-Jun-2011
  • (2011)ReferencesMultimedia Semantics10.1002/9781119970231.refs(281-299)Online publication date: 8-Aug-2011
  • (2010)Robust Duplicate Detection of 2D and 3D ObjectsInternational Journal of Multimedia Data Engineering & Management10.4018/jmdem.20100701021:3(19-40)Online publication date: 1-Jul-2010
  • (2010)Real-time detection of unusual regions in image streamsProceedings of the 18th ACM international conference on Multimedia10.1145/1873951.1874208(1307-1310)Online publication date: 25-Oct-2010
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media