Abstract
The average person with a networked computer can now understand why computers should have vision: to search the world's collections of digital video and images and “retrieve a picture of ___.” Computer vision for intelligent browsing, querying, and retrieval of imagery is needed now, yet traditional approaches to computer vision remain far from a general solution to the scene understanding problem. In this paper I discuss the need for a solution that combines high-level and low-level vision and works in concert with input from a human user. The solution is based on (1) learning from the user what is important visually, and (2) learning associations between text descriptions and visual data. I describe some recent results in these areas and outline key challenges for future research in computer vision for digital libraries.
Copyright information
© 1996 Springer-Verlag Berlin Heidelberg
Cite this paper
Picard, R.W. (1996). Digital libraries: Meeting place for high-level and low-level vision. In: Li, S.Z., Mital, D.P., Teoh, E.K., Wang, H. (eds) Recent Developments in Computer Vision. ACCV 1995. Lecture Notes in Computer Science, vol 1035. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-60793-5_57
DOI: https://doi.org/10.1007/3-540-60793-5_57
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-60793-9
Online ISBN: 978-3-540-49448-5
eBook Packages: Springer Book Archive