Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Graceful database schema evolution: the PRISM workbench

Published: 01 August 2008 Publication History

Abstract

Supporting graceful schema evolution represents an unsolved problem for traditional information systems that is further exacerbated in web information systems, such as Wikipedia and public scientific databases: in these projects based on multiparty cooperation the frequency of database schema changes has increased while tolerance for downtimes has nearly disappeared. As of today, schema evolution remains an error-prone and time-consuming undertaking, because the DB Administrator (DBA) lacks the methods and tools needed to manage and automate this endeavor by (i) predicting and evaluating the effects of the proposed schema changes, (ii) rewriting queries and applications to operate on the new schema, and (iii) migrating the database.
Our PRISM system takes a big first step toward addressing this pressing need by providing: (i) a language of Schema Modification Operators to express concisely complex schema changes, (ii) tools that allow the DBA to evaluate the effects of such changes, (iii) optimized translation of old queries to work on the new schema version, (iv) automatic data migration, and (v) full documentation of intervened changes as needed to support data provenance, database flash back, and historical queries. PRISM solves these problems by integrating recent theoretical advances on mapping composition and invertibility, into a design that also achieves usability and scalability. Wikipedia and its 170+ schema versions provided an invaluable testbed for validating PRISM tools and their ability to support legacy queries.

References

[1]
R. B. Almeida, B. Mozafari, and J. Cho. On the evolution of wikipedia. In Int. Conf. on Weblogs and Social Media, March 2007.
[2]
D. Barbosa, J. Freire, and A. O. Mendelzon. Designing information-preserving mapping schemes for xml. In VLDB, pages 109--120, 2005.
[3]
P. A. Bernstein. Applying model management to classical meta data problems. In CIDR, 2003.
[4]
P. A. Bernstein, T. J. Green, S. Melnik, and A. Nash. Implementing mapping composition. VLDB J., 17(2):333--353, 2008.
[5]
P. A. Bernstein and E. Rahm. Data warehouse scenarios for model management. In ER, 2003.
[6]
H. Bounif and R. Pottinger. Schema repository for database schema evolution. DEXA, 0:647--651, 2006.
[7]
A. Cleve and J.-L. Hainaut. Co-transformations in database applications evolution. Generative and Transformational Techniques in Software Engineering, pages 409--421, 2006.
[8]
C. A. Curino, H. J. Moon, L. Tanca, and C. Zaniolo. Schema Evolution in Wikipedia: toward a Web Information System Benchmark. ICEIS, 2008.
[9]
C. A. Curino, H. J. Moon, and C. Zaniolo. Managing the history of metadata in support for db archiving and schema evolution. In ECDM, 2008.
[10]
DB2 development team. DB2 Change Management Expert. 2006.
[11]
A. Deutsch and V. Tannen. Optimization properties for classes of conjunctive regular path queries. In DBPL '01: Revised Papers from the 8th International Workshop on Database Programming Languages, pages 21--39, London, UK, 2002. Springer-Verlag.
[12]
A. Deutsch and V. Tannen. Mars: A system for publishing XML from mixed and redundant storage. In VLDB, 2003.
[13]
R. Fagin. Inverting schema mappings. ACM Trans. Database Syst., 32 (4): 25, 2007.
[14]
R. Fagin, P. G. Kolaitis, L. Popa, and W. C. Tan. Composing schema mappings: Second-order dependencies to the rescue. In PODS, pages 83--94, 2004.
[15]
R. Fagin, P. G. Kolaitis, L. Popa, and W.-C. Tan. Quasi-inverses of schema mappings. In PODS '07, pages 123--132, 2007.
[16]
R. d. M. Galante, C. S. dos Santos, N. Edelweiss, and A. F. Moreira. Temporal and versioning model for schema evolution in object-oriented databases. Data & Knowledge Engineering, 53(2):99--128, 2005.
[17]
M. Golfarelli, J. Lechtenbörger, S. Rizzi, and G. Vossen. Schema versioning in data warehouses. In ER (Workshops), pages 415--428, 2004.
[18]
J.-M. Hick and J.-L. Hainaut. Database application evolution: a transformational approach. Data Knowl. Eng., 59(3):534--558, 2006.
[19]
H. V. Jagadish, I. S. Mumick, and M. Rabinovich. Scalable versioning in distributed databases with commuting updates. In Conference on Data Engineering, pages 520--531, 1997.
[20]
T. Lemke and R. Manthey. The schema evolution assistant: Tool description, 1995.
[21]
J. Madhavan and A. Y. Halevy. Composing mappings among data sources. In VLDB, 2003.
[22]
S. Melnik, E. Rahm, and P. A. Bernstein. Rondo: A programming platform for generic model management. In SIGMOD, 2003.
[23]
H. J. Moon, C. A. Curino, A. D. C.-Y. Hou, and C. Zaniolo. Managing and querying transaction-time databases under schema evolution. In VLDB, 2008.
[24]
M. M. Moro, S. Malaika, and L. Lim. Preserving XML Queries during Schema Evolution. In WWW, pages 1341--1342, 2007.
[25]
A. Nash, P. A. Bernstein, and S. Melnik. Composition of mappings given by embedded dependencies. In PODS, 2005.
[26]
Oracle development team. Oracle database 10g online data reorganization and redefinition. 2005.
[27]
Y.-G. Ra. Relational schema evolution for program independency. Intelligent Information Technology, pages 273--281, 2005.
[28]
S. Rizzi and M. Golfarelli. X-time: Schema versioning and cross-version querying in data warehouses. In ICDE, pages 1471--1472, 2007.
[29]
J. Roddick. A Survey of Schema Versioning Issues for Database Systems. Information and Software Technology, 37(7):383--393, 1995.
[30]
J. Ullman. Principles of Database System., Computer Science Press, 1982.
[31]
Y. Velegrakis, R. J. Miller, and L. Popa. Mapping adaptation under evolving schemas. In VLDB, 2003.
[32]
Wikimedia Foundation. Mediawiki http://www.mediawiki.org, 2007. {Online}.
[33]
Wikimedia Foundation. Wikipedia http://en.wikipedia.org/, 2007. {Online}.
[34]
C. Yu and L. Popa. Semantic adaptation of schema mappings when schemas evolve. In VLDB, 2005.

Cited By

View all
  • (2024)Drag, Drop, Merge: A Tool for Streamlining Integration of Longitudinal Survey InstrumentsProceedings of the 2024 Workshop on Human-In-the-Loop Data Analytics10.1145/3665939.3665965(1-7)Online publication date: 14-Jun-2024
  • (2024)Nautilus: Implementation of an Evolution Approach for Graph DatabasesProceedings of the ACM/IEEE 27th International Conference on Model Driven Engineering Languages and Systems10.1145/3652620.3687781(11-15)Online publication date: 22-Sep-2024
  • (2023)OneProvenance: Efficient Extraction of Dynamic Coarse-Grained Provenance from Database Query Event LogsProceedings of the VLDB Endowment10.14778/3611540.361155516:12(3662-3675)Online publication date: 1-Aug-2023
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Proceedings of the VLDB Endowment
Proceedings of the VLDB Endowment  Volume 1, Issue 1
August 2008
1216 pages

Publisher

VLDB Endowment

Publication History

Published: 01 August 2008
Published in PVLDB Volume 1, Issue 1

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)42
  • Downloads (Last 6 weeks)3
Reflects downloads up to 13 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Drag, Drop, Merge: A Tool for Streamlining Integration of Longitudinal Survey InstrumentsProceedings of the 2024 Workshop on Human-In-the-Loop Data Analytics10.1145/3665939.3665965(1-7)Online publication date: 14-Jun-2024
  • (2024)Nautilus: Implementation of an Evolution Approach for Graph DatabasesProceedings of the ACM/IEEE 27th International Conference on Model Driven Engineering Languages and Systems10.1145/3652620.3687781(11-15)Online publication date: 22-Sep-2024
  • (2023)OneProvenance: Efficient Extraction of Dynamic Coarse-Grained Provenance from Database Query Event LogsProceedings of the VLDB Endowment10.14778/3611540.361155516:12(3662-3675)Online publication date: 1-Aug-2023
  • (2022)EvolveDBProceedings of the 25th International Conference on Model Driven Engineering Languages and Systems: Companion Proceedings10.1145/3550356.3559095(61-65)Online publication date: 23-Oct-2022
  • (2022)MIKADO: a smart city KPIs assessment modeling frameworkSoftware and Systems Modeling (SoSyM)10.1007/s10270-021-00907-921:1(281-309)Online publication date: 1-Feb-2022
  • (2021)Schema Evolution in Multiversion Data WarehousesInternational Journal of Data Warehousing and Mining10.4018/IJDWM.202110010117:4(1-28)Online publication date: 1-Oct-2021
  • (2021)Scooter & Sidecar: a domain-specific approach to writing secure database migrationsProceedings of the 42nd ACM SIGPLAN International Conference on Programming Language Design and Implementation10.1145/3453483.3454072(710-724)Online publication date: 19-Jun-2021
  • (2021)Reducing Ambiguity in Json Schema DiscoveryProceedings of the 2021 International Conference on Management of Data10.1145/3448016.3452801(1732-1744)Online publication date: 9-Jun-2021
  • (2021)Should Variation Be Encoded Explicitly in Databases?Proceedings of the 15th International Working Conference on Variability Modelling of Software-Intensive Systems10.1145/3442391.3442395(1-9)Online publication date: 9-Feb-2021
  • (2021)CHiSEL: a user-oriented framework for simplifing database evolutionDistributed and Parallel Databases10.1007/s10619-020-07314-x39:2(483-543)Online publication date: 1-Jun-2021
  • Show More Cited By

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media