Approximate Denial Constraints

Livshits, Ester; Heidari, Alireza; Ilyas, Ihab F.; Kimelfeld, Benny

Computer Science > Databases

arXiv:2005.08540v1 (cs)

[Submitted on 18 May 2020]

Title:Approximate Denial Constraints

Authors:Ester Livshits, Alireza Heidari, Ihab F. Ilyas, Benny Kimelfeld

View PDF

Abstract:The problem of mining integrity constraints from data has been extensively studied over the past two decades for commonly used types of constraints including the classic Functional Dependencies (FDs) and the more general Denial Constraints (DCs). In this paper, we investigate the problem of mining approximate DCs (i.e., DCs that are "almost" satisfied) from data. Considering approximate constraints allows us to discover more accurate constraints in inconsistent databases, detect rules that are generally correct but may have a few exceptions, as well as avoid overfitting and obtain more general and less contrived constraints. We introduce the algorithm ADCMiner for mining approximate DCs. An important feature of this algorithm is that it does not assume any specific definition of an approximate DC, but takes the semantics as input. Since there is more than one way to define an approximate DC and different definitions may produce very different results, we do not focus on one definition, but rather on a general family of approximation functions that satisfies some natural axioms defined in this paper and captures commonly used definitions of approximate constraints. We also show how our algorithm can be combined with sampling to return results with high accuracy while significantly reducing the running time.

Subjects:	Databases (cs.DB)
Cite as:	arXiv:2005.08540 [cs.DB]
	(or arXiv:2005.08540v1 [cs.DB] for this version)
	https://doi.org/10.48550/arXiv.2005.08540

Submission history

From: Ester Livshits [view email]
[v1] Mon, 18 May 2020 09:06:29 UTC (167 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.DB

< prev | next >

new | recent | 2020-05

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Ester Livshits
Alireza Heidari
Ihab F. Ilyas
Benny Kimelfeld

export BibTeX citation

Computer Science > Databases

Title:Approximate Denial Constraints

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Databases

Title:Approximate Denial Constraints

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators