Abstract
The term ‘data-stream’ is an increasingly overloaded expression. It often means different things to different people, depending on domain, usage or operation. Harold (2003) draws the following analogy:
“A [stream] analogy might be a queue of people waiting to get on a ride at an amusement park. As people are processed at the front (i.e. get on the roller coaster) more are added at the back of the line. If it’s a slow day the roller coaster may catch up with the end of the line and have to wait for people to board. Other days there may always be people in line until the park closes...There’s always a definite number of people in line though this number may change from moment to moment as people enter at the back of the line and exit from the front of the line. Although all the people are discrete, you’ll sometimes have a family that must be put together in the same car. Thus although the individuals are discrete, they aren’t necessarily unrelated.”
For our purposes we define a data-stream as a series of data (e.g. credit card transactions arriving at a clearing office, cellular phone traffic or environmental data from satellites) arriving in real time, that have an initiation, a continuous ingest of data, but with no expectations on the amount, length, or end of the data flow. The data stream does not have a database or repository as an intrinsic part of its definition–it is a ‘one-look’ opportunity from the perspective of data stream analytics. We call each data element in the stream a token and the complexity of these tokens ranges from simple (e.g. characters in a sentence: “T H I S I S A S T R E A M...”) to extremely complex (e.g. a detailed transaction record). The volume of data-streams is usually massive, and while each individual token may be rather uninformative, taken as a whole they describe the nature of the changing phenomena over time.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Lyman, P., Varian, H.: How Much Information? A project report of the Regents of the University of California (2000), available at, http://www.sims.berkeley.edu/how-much-info
Gorton, I., Almquist, J., Cramer, N., Haack, J., Hoza, M.: An Efficient, Scalable Content-Based Messaging System. In: Proc. 7th IEEE International Enterprise Distributed Object Computing Conference (EDOC 2003), Brisbane, Australia, pp. 278–285 (September 2003)
Pacific Northwest National Laboratory’s IT Showcase, http://showcase.pnl.gov/show?it/triver-prod
Havre, S.L., Hetzler, B.G., Whitney, P.D., Nowell, L.T.: ThemeRiver: Visualizing Thematic Changes in Document Collections. IEEE Transactions on Visualization and Computer Graphics 8(1), 9–20 (2002)
Hoffman, D.D.: Visual Intelligence: How We Create What We See. W.W. Norton and Company, Inc., New York (1998)
Ware, C.: Information Visualization: Perception for Design. Academic Press, San Diego (2000)
May, R.: Hi-Space: A Next Generation Workspace Environment. Masters Thesis in Electrical Engineering Computer Science. Pullman, Washington, Washington State University
MacEachren, M.A., et al.: Visually-Enabled Geocollaboration to Support Data Exploration and Decision-Making. In: Proceedings of the 21st International Cartographic Conference, Durban, South Africa, pp. 10–16 (August 2003)
Ullmer, B., Ishii, H.: The metaDESK: Models and Prototypes for Tangible User Interfaces. In: UIST (1997)
Matsushita, N., Rekimoto, J.: HoloWall: Designing a Finger, Hand, Body, and Object Sensitive Wall. In: UIST (1997)
Krueger, M.W.: Artificial Reality II. Addison-Wesley Publishing Company, Reading (1991)
Wellner, P.: Interactions with Paper on the DigitalDesk. Communications of the ACM 36(7), 87–96 (1993)
Ohshima, T., Sato, K., et al.: AR2 Hockey; A Case Study of Collaborative Augmented Reality. In: VRAIS (1998)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Cowell, A.J., Havre, S., May, R., Sanfilippo, A. (2005). Scientific Discovery Within Data Streams. In: Cai, Y. (eds) Ambient Intelligence for Scientific Discovery. Lecture Notes in Computer Science(), vol 3345. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-32263-4_4
Download citation
DOI: https://doi.org/10.1007/978-3-540-32263-4_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-24466-0
Online ISBN: 978-3-540-32263-4
eBook Packages: Computer ScienceComputer Science (R0)