Data exchange: computing cores in polynomial time
G Gottlob, A Nash - Proceedings of the twenty-fifth ACM SIGMOD …, 2006 - dl.acm.org
G Gottlob, A Nash
Proceedings of the twenty-fifth ACM SIGMOD-SIGACT-SIGART symposium on …, 2006•dl.acm.orgData exchange deals with inserting data from one database into another database having a
different schema. We study and solve a central computational problem of data exchange,
namely, computing the core of a universal solution to a data exchange problem. Fagin,
Kolaitis, and Popa [9], have shown that among the universal solutions of a solvable data
exchange problem, there exists a most compact one (up to isomorphism)," the core"(of any
universal solution), and have convincingly argued that this core should be the solution of …
different schema. We study and solve a central computational problem of data exchange,
namely, computing the core of a universal solution to a data exchange problem. Fagin,
Kolaitis, and Popa [9], have shown that among the universal solutions of a solvable data
exchange problem, there exists a most compact one (up to isomorphism)," the core"(of any
universal solution), and have convincingly argued that this core should be the solution of …
Data exchange deals with inserting data from one database into another database having a different schema. We study and solve a central computational problem of data exchange, namely, computing the core of a universal solution to a data exchange problem. Fagin, Kolaitis, and Popa [9], have shown that among the universal solutions of a solvable data exchange problem, there exists a most compact one (up to isomorphism), "the core" (of any universal solution), and have convincingly argued that this core should be the solution of choice. They stated as an important open problem whether the core of a universal solution can be computed in polynomial time in the general setting where the source-to-target constraints are arbitrary tuple generating dependencies (TGDs) and the target constraints consist of equation generating dependencies (EGDs) and weakly-acyclic TGDs. In this paper we solve this problem by developing new efficient methods for computing the core of a universal solution. This positive result shows that the core approach of Fagin, Kolaitis, and Popa is feasible and applicable in a very general setting and thus provides a further momentum to the use of cores in data exchange.
ACM Digital Library