Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.5555/646838.759696guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Data Provenance: Some Basic Issues

Published: 13 December 2000 Publication History

Abstract

The ease with which one can copy and transform data on the Web, has made it increasingly difficult to determine the origins of a piece of data. We use the term data provenance to refer to the process of tracing and recording the origins of data and its movement between databases. Provenance is now an acute issue in scientific databases where it is central to the validation of data. In this paper we discuss some of the technical issues that have emerged in an initial exploration of the topic.

References

[1]
A. Woodruff and M. Stonebraker. Supporting fine-grained data lineage in a database visualization environment. In ICDE, pages 91-102, 1997.
[2]
Serge Abiteboul, Peter Buneman, and Dan Suciu. Data on the Web. From Relations to Semistructured Data and XML. Morgan Kaufman, 2000.
[3]
T. Barsalou, N. Siambela, A. Keller, and G Wiederhold. Updating relational databases through object-based views. In Proceedings ACM SIGMOD, May 1991.
[4]
Tim Bray, Jean Paoli, and C. M. Sperberg-McQueen. Extensible Markup Language (XML) 1.0. World Wide Web Consortium (W3C), Feb 1998. http://www.w3.org/TR/REC-xml.
[5]
P. Buneman, S. Davidson, M. Liberman, C. Overton, and V. Tannen. Data provenance. http://db.cis.upenn.edu/~wctan/DataProvenance/precis/index.html.
[6]
Peter Buneman, Susan Davidson, Carmem Hara, Wenfei Fan, and Wang-Chiew Tan. Keys for XML. Technical report, University of Pennsylvania, 2000. http://db.cis.upenn.edu.
[7]
Peter Buneman, Sanjeev Khanna, and Wang-Chiew Tan. Why and Where: A Characterization of Data Provenance. In International Conference on Database Theory, 2001. To appear, available at http://db.cis.upenn.edu.
[8]
James Clark and Steve DeRose. XML Path Language (XPath). W3C Working Draft, November 1999. http://www.w3.org/TR/xpath.
[9]
Y. Cui and J. Widom. Practical lineage tracing in data warehouses. In ICDE, pages 367-378, 2000.
[10]
Jon Doyle. A truth maintenance system. Artificial Intelligence, 12:231-272, 1979.
[11]
R. G. G. Cattell et al, editor. The Object Database Standard: Odmg 2.0. Morgan Kaufmann, 1997.
[12]
A. Gupta and I. Mumick. Maintenance of materialized views: Problems, techniques, and applications. IEEE Data Engineering Bulletin, Vol. 18, No. 2, June 1995., 1995.
[13]
Michael Lesk. Practical Digital Libraries: Books, Bytes and Bucks,. Morgan Kaufmann, July 1997.
[14]
Hartmut Liefke and Susan Davidson. View maintenance for hierarchical semistructured data. In International Conference on Data Warehousing and Knowledge Discovery, 2000.
[15]
Susan Davidson and Chris Overton and Peter Buneman. Challenges in Integrating Biological Data Sources. Journal of Computational Biology, 2(4):557-572, Winter 1995.
[16]
World Wide Web Consortium (W3C). XML Schema Part 0: Primer, 2000. http://www.w3.org/TR/xmlschema-0/.

Cited By

View all
  • (2018)Dac-ManProceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis10.5555/3291656.3291753(1-13)Online publication date: 11-Nov-2018
  • (2018)Dac-ManProceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis10.1109/SC.2018.00075(1-13)Online publication date: 11-Nov-2018
  • (2018)Theory and practice of data citationJournal of the Association for Information Science and Technology10.1002/asi.2391769:1(6-20)Online publication date: 1-Jan-2018
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Guide Proceedings
FST TCS 2000: Proceedings of the 20th Conference on Foundations of Software Technology and Theoretical Computer Science
December 2000
530 pages
ISBN:3540414134

Publisher

Springer-Verlag

Berlin, Heidelberg

Publication History

Published: 13 December 2000

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 04 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2018)Dac-ManProceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis10.5555/3291656.3291753(1-13)Online publication date: 11-Nov-2018
  • (2018)Dac-ManProceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis10.1109/SC.2018.00075(1-13)Online publication date: 11-Nov-2018
  • (2018)Theory and practice of data citationJournal of the Association for Information Science and Technology10.1002/asi.2391769:1(6-20)Online publication date: 1-Jan-2018
  • (2016)Efficient Multi-depth Querying on Provenance of Relational Queries Using Graph DatabaseProceedings of the 9th Annual ACM India Conference10.1145/2998476.2998480(11-20)Online publication date: 21-Oct-2016
  • (2016)CPACProceedings of the 32nd Annual Conference on Computer Security Applications10.1145/2991079.2991126(139-152)Online publication date: 5-Dec-2016
  • (2015)Data Provenance for Historical Queries in Relational DatabaseProceedings of the 8th Annual ACM India Conference10.1145/2835043.2835047(117-122)Online publication date: 29-Oct-2015
  • (2013)Attributing authorship of revisioned contentProceedings of the 22nd international conference on World Wide Web10.1145/2488388.2488419(343-354)Online publication date: 13-May-2013
  • (2013)Engineering access control policies for provenance-aware systemsProceedings of the third ACM conference on Data and application security and privacy10.1145/2435349.2435390(285-292)Online publication date: 18-Feb-2013
  • (2012)Reconstructing the software environment of an experiment with kameleonProceedings of the 5th ACM COMPUTE Conference: Intelligent & scalable system technologies10.1145/2459118.2459134(1-8)Online publication date: 23-Jan-2012
  • (2011)Data sharing in the sciencesAnnual Review of Information Science and Technology10.5555/2766865.276687845:1(247-294)Online publication date: 1-Jan-2011
  • Show More Cited By

View Options

View options

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media