Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
article

Temporal event clustering for digital photo collections

Published: 01 August 2005 Publication History

Abstract

Organizing digital photograph collections according to events such as holiday gatherings or vacations is a common practice among photographers. To support photographers in this task, we present similarity-based methods to cluster digital photos by time and image content. The approach is general and unsupervised, and makes minimal assumptions regarding the structure or statistics of the photo collection. We present several variants of an automatic unsupervised algorithm to partition a collection of digital photographs based either on temporal similarity alone, or on temporal and content-based similarity. First, interphoto similarity is quantified at multiple temporal scales to identify likely event clusters. Second, the final clusters are determined according to one of three clustering goodness criteria. The clustering criteria trade off computational complexity and performance. We also describe a supervised clustering method based on learning vector quantization. Finally, we review the results of an experimental evaluation of the proposed algorithms and existing approaches on two test collections.

References

[1]
Boreczky, J. and Rowe, L. 1996. Comparison of video shot boundary detection techniques. In SPIE Storage and Retrieval for Image and Video Databases. SPIE, Press, Bellingham, WA, 170--179.
[2]
Cooper, M., Foote, J., Girgensohn, A., and Wilcox, L. 2003. Temporal event clustering for digital photo collections. In Proceedings of the 11th ACM International Conference on Multimedia. ACM Press, New York, NY, 364--373.
[3]
Duda, R. and Hart, P. 1973. Pattern Classification and Scene Analysis. Wiley-Interscience, New York, NY.
[4]
Foote, J. 2000. Automatic audio segmentation using a measure of audio novelty. In Proceedings of the IEEE International Conference on Multimedia and Expo. IEEE, Computer Society Press, Los Alamitos, CA, 452--55.
[5]
Frohlich, D., Kuchinsky, A., Pering, C., Don, A., and Ariss, S. 2002. Requirements for photoware. In Proceedings of the ACM Conference on CSCW. ACM Press, New York, NY, 166--175.
[6]
Gargi, U. 2003. Modeling and clustering of photo capture streams. In Proceedings of the 5th ACM SIGMM Workshop on Multimedia Information Retrieval. ACM Press, New York, NY, 47--54.
[7]
Girgensohn, A., Adcock, J., Cooper, M., Foote, J., and Wilcox, L. 2003. Simplifying the management of large photo collections. In Proceedings of Human-Computer Interaction INTERACT '03. IOS Press, Amsterdam, The Netherlands, 196--203.
[8]
Graham, A., Garcia-Molina, H., Paepcke, A., and Winograd, T. 2002. Time as the essence for photo browsing through personal digital libraries. In Proceedings of the Joint Conference on Digital Libraries. ACM Press, New York, NY, 326--35.
[9]
Hartigan, J. 1975. Clustering Algorithms. Wiley & Sons, New York, NY.
[10]
Jaimes, A., Benitez, A. B., Chang, S.-F., and Loui, A. C. 2000. Discovering recurrent visual semantics in consumer photographs. In IEEE International Conference on Image Processing, Vol. 2. IEEE Press, Los Alamitos, CA, 528--531.
[11]
Jeida. 1998. Digital Still Camera Image File Format Standard. Japan Electronic Industry Development Association, Tokyo, Japan.
[12]
Kohonen, T. 1989. Self-Organization and Associative Memory. Springer-Verlag, Berlin, Germany.
[13]
Kohonen, T., Kangas, J., Laaksonen, J., and Torkkola, K. 1992. Lvq pak: A program package for the correct application of learning vector quantization algorithms. In International Joint Conference on Neural Networks. ACM Press, New York, NY, 725--730.
[14]
Leung, Y., Zhang, J.-S., and Xu, Z.-B. 2000. Clustering by scale-space filtering. IEEE Trans. Patt. Anal. Mach. Intell. 22, 12, 1396--1410.
[15]
Lim, J.-H., Tian, Q., and Mulhelm, P. 2003. Home photo content modelling for personalized event-based retrieval. IEEE Multimed. 10, 4, 28--37.
[16]
Loui, A. and Savakis, A. 2003. Automatic event clustering and quality screening of comsumer pictures for digital albuming. IEEE Trans. Multimed. 5, 3, 390--402.
[17]
Mills, T., Pye, D., Sinclair, D., and Wood, K. 2000. Shoebox: A digital photo management system. In Technical Report 2000.10. AT&T Laboratories Cambridge, Cambridge, U.K.
[18]
Mojsilovic, A., Gomes, J., and Rogowitz, B. 2002. Isee: Perceptual features for image library navigation. In SPIE Human Vision and Electronic Imaging. SPIE Press, Bellingham, WA, 266--277.
[19]
Naaman, M., Song, Y. J., Paepcke, A., and Garcia-Molina, H. 2004. Automatic organization for digital photographs with geographic coordinates. In Proceedings of the Joint Conference on Digital Libraries. ACM Press, New York, NY, 326--35.
[20]
Platt, J., Czerwinski, M., and Field, B. 2003. Simplifying the management of large photo collections. In Fourth IEEE Pacific Rim Conference on Multimedia. IEEE Press, Los Alamitos, CA, 6--10.
[21]
Rodden, K. 2002. Evaluating similarity-based visualisations as interfaces for image browsing. Ph.D. dissertation. Univeristy of Cambridge, Cambridge, U.K.
[22]
Rodden, K. and Wood, K. 2003. How do people manage their digital photographs? In Proceedings of the ACM Conference on Human Factors in Computing Systems (CHI). ACM Press, New York, NY, 409--416.
[23]
Schwarz, G. 1978. Estimating the dimension of a model. Ann. Statist. 6, 461--64.
[24]
Slaney, M., Ponceleon, D., and Kaufman, J. 2001. Multimedia edges: Finding hierarchy in all dimensions. In ACM International Conference on Multimedia. ACM Press, New York, NY, 29--40.
[25]
Witkin, A. 1984. Scale-space filtering: A new approach to multi-scale description. In IEEE ICASSP, Vol. 9. IEEE Press, Los Alamitos, CA, 150--153.

Cited By

View all
  • (2023)Smart Multimedia Information RetrievalAnalytics10.3390/analytics20100112:1(198-224)Online publication date: 20-Feb-2023
  • (2022)A Survey of Data Representation for Multi-Modality Event Detection and EvolutionApplied Sciences10.3390/app1204220412:4(2204)Online publication date: 20-Feb-2022
  • (2021)Generating visual story graphs with application to photo album summarizationSignal Processing: Image Communication10.1016/j.image.2020.11603390(116033)Online publication date: Jan-2021
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Multimedia Computing, Communications, and Applications
ACM Transactions on Multimedia Computing, Communications, and Applications  Volume 1, Issue 3
August 2005
104 pages
ISSN:1551-6857
EISSN:1551-6865
DOI:10.1145/1083314
Issue’s Table of Contents

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 August 2005
Published in TOMM Volume 1, Issue 3

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Digital photo organization
  2. digital libraries
  3. temporal media indexing

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)22
  • Downloads (Last 6 weeks)3
Reflects downloads up to 01 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2023)Smart Multimedia Information RetrievalAnalytics10.3390/analytics20100112:1(198-224)Online publication date: 20-Feb-2023
  • (2022)A Survey of Data Representation for Multi-Modality Event Detection and EvolutionApplied Sciences10.3390/app1204220412:4(2204)Online publication date: 20-Feb-2022
  • (2021)Generating visual story graphs with application to photo album summarizationSignal Processing: Image Communication10.1016/j.image.2020.11603390(116033)Online publication date: Jan-2021
  • (2020)Supporting intergenerational memento storytelling for older adults through a tangible display: a case studyPersonal and Ubiquitous Computing10.1007/s00779-020-01364-926:3(625-649)Online publication date: 25-Jan-2020
  • (2019)Context in Photo AlbumsACM Transactions on Applied Perception10.1145/333361216:2(1-20)Online publication date: 13-Aug-2019
  • (2019)From crowdsourcing to crowdmining: using implicit human intelligence for better understanding of crowdsourced dataWorld Wide Web10.1007/s11280-019-00718-523:2(1101-1125)Online publication date: 31-Aug-2019
  • (2019)A new F-score gradient-based training rule for the linear modelPattern Analysis & Applications10.1007/s10044-017-0650-722:2(537-548)Online publication date: 1-May-2019
  • (2019)Survey on Social Networks Data AnalysisInnovations for Community Services10.1007/978-3-030-37484-6_6(100-119)Online publication date: 15-Dec-2019
  • (2018)Time-TurnerProceedings of the 2018 CHI Conference on Human Factors in Computing Systems10.1145/3173574.3173753(1-14)Online publication date: 21-Apr-2018
  • (2018)Care to Share?Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining10.1145/3159652.3159713(207-215)Online publication date: 2-Feb-2018
  • Show More Cited By

View Options

Get Access

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media