Digital libraries: Meeting place for high-level and low-level vision

Picard, Rosalind W.

doi:10.1007/3-540-60793-5_57

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 1035)

Included in the following conference series:

Asian Conference on Computer Vision

Abstract

The average person with a networked computer can now understand why computers should have vision: to search the world's collections of digital video and images and “retrieve a picture of ___.” Computer vision for intelligent browsing, querying, and retrieval of imagery is needed now, yet traditional approaches to computer vision remain far from a general solution to the scene understanding problem. In this paper I discuss the need for a solution that combines high-level and low-level vision and works in concert with input from a human user. The solution is based on: 1) learning from the user what is important visually, and 2) learning associations between text descriptions and visual data. I describe some recent results in these areas and give an overview of key challenges for future research in computer vision for digital libraries.

Author information

Authors and Affiliations

MIT Media Laboratory, 20 Ames Street, Cambridge, MA 02139, USA
Rosalind W. Picard

Editor information

Stan Z. Li, Dinesh P. Mital, Eam Khwang Teoh, Han Wang

About this paper

Cite this paper

Picard, R.W. (1996). Digital libraries: Meeting place for high-level and low-level vision. In: Li, S.Z., Mital, D.P., Teoh, E.K., Wang, H. (eds) Recent Developments in Computer Vision. ACCV 1995. Lecture Notes in Computer Science, vol 1035. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-60793-5_57

DOI: https://doi.org/10.1007/3-540-60793-5_57
Published: 02 June 2005
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-60793-9
Online ISBN: 978-3-540-49448-5
eBook Packages: Springer Book Archive
