Abstract
To detect errors in numeric data, this paper proposes numeric functional dependencies (NFDs), a class of dependencies that allow us to specify arithmetic relationships among numeric attributes. We show that NFDs subsume conditional functional dependencies (CFDs); hence, we can catch data inconsistencies, numeric or not, in a uniform logic framework by using NFDs as data quality rules. Better still, NFDs do not increase the complexity of reasoning about data quality rules. We show that the satisfiability and implication problems for NFDs remain NP-complete and coNP-complete, respectively, the same as their counterparts for CFDs. Moreover, NFDs can be implemented in SQL and hence, error detection can be readily supported by DBMS. In addition, we show that NFDs and CFDs can be extended across multiple tables, without increasing the complexity of static analyses and error detection.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
MySQL, http://dev.mysql.com/doc/refman/5.0/en/func-op-summary-ref.html
Abiteboul, S., Hull, R., Vianu, V.: Foundations of Databases. Addison-Wesley (1995)
Arenas, M., Bertossi, L.E., Chomicki, J.: Consistent query answers in inconsistent databases. TPLPÂ 3(4-5) (2003)
Bertossi, L.E., Bravo, L., Franconi, E., Lopatenko, A.: Complexity and approximation of fixing numerical attributes in databases under integrity constraints. Inf. Syst. 33(4-5), 407–434 (2008)
Eckerson, W.W.: Data Quality and the Bottom Line: Achieving Business Success through a Commitment to High Quality Data. In: The Data Warehousing Institute (2002)
Fan, W., Geerts, F.: Foundations of Data Quality Management. Morgan & Claypool Publishers (2012)
Fan, W., Geerts, F., Jia, X., Kementsietsidis, A.: Conditional functional dependencies for capturing data inconsistencies. TODSÂ 33(1) (2008)
Fan, W., Geerts, F., Li, J., Xiong, M.: Discovering conditional functional dependencies. TKDE 23(5), 683–698 (2011)
Flesca, S., Furfaro, F., Parisi, F.: Querying and repairing inconsistent numerical databases. TODSÂ 35(2) (2010)
Franconi, E., Palma, A.L., Leone, N., Perri, S., Scarcello, F.: Census data repair: A challenging application of disjunctive logic programming. In: Nieuwenhuis, R., Voronkov, A. (eds.) LPAR 2001. LNCS (LNAI), vol. 2250, pp. 561–578. Springer, Heidelberg (2001)
Golab, L., Karloff, H.J., Korn, F., Saha, A., Srivastava, D.: Sequential dependencies. In: PVLDB, vol. 21(1) (2009)
Jones, J.P.: Undecidable Diophantine equations. Bull. Amer. Math. Soc. 3(2), 859–862 (1980)
Koudas, N., Saha, A., Srivastava, D., Venkatasubramanian, S.: Metric functional dependencies. In: ICDE (2009)
Ma, S., Fan, W., Bravo, L.: Extending inclusion dependencies with conditions. TCS 515, 64–95 (2014)
Papadimitriou, C.H.: Computational Complexity. Addison Wesley (1994)
Ramakrishnan, R., Gehrke, J.: Database Management Systems. McGraw-Hill Higher Education (2000)
Rossa, K.A., Srivastava, D., Stuckeyc, P.J., Sudarshan, S.: Foundations of aggregation constraints. TCS 193(1-2), 149–179 (1998)
The New York Times. Articles about Jesica Santillan, http://topics.nytimes.com/topics/reference/timestopics/people/s/jesic_santillan/index.html
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Fan, G., Fan, W., Geerts, F. (2014). Detecting Errors in Numeric Attributes. In: Li, F., Li, G., Hwang, Sw., Yao, B., Zhang, Z. (eds) Web-Age Information Management. WAIM 2014. Lecture Notes in Computer Science, vol 8485. Springer, Cham. https://doi.org/10.1007/978-3-319-08010-9_15
Download citation
DOI: https://doi.org/10.1007/978-3-319-08010-9_15
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-08009-3
Online ISBN: 978-3-319-08010-9
eBook Packages: Computer ScienceComputer Science (R0)