Abstract
Schema matching, the problem of finding semantic correspondences between elements of source and warehouse schemas, plays a key role in data warehousing. Currently, schema mapping is largely determined manually by domain experts, thus a labor-intensive process. In this paper, we propose a structural matching algorithm based on semantic similarity propagation. Experimental results on several real-world domains are presented, and show that the algorithm discovers semantic mappings with a high degree of accuracy.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Doan, A., Domingos, P., Halevy, A.: Reconciling Schemas of Disparate Data Sources: A Machine-Learning approach. SIGMOD (2001)
Do, H., Rahm, E.: COMA – A System for flexible combination of schema matching approaches. In: VLDB 2002 (2002)
Embley, D.W., et al.: Multifaceted Exploitation of Metadata for attribute Match Discovery in information Integration. In: WIIW 2001 (2001)
Li, W.S.: SemInt: A Tool for Identifying Attribute Correspondences in Heterogeneous database Using Neural Network. Data & Knowledge Engineering (2001)
Madhavant, J., Bernstein, P.A., Rahm, E.: Generic Schema Matching with Cupid. In: VLDB 2001 (2001)
Melnik, S., Garcia-Molina, H., Rahm, E.: Similarity Flooding: A versatile graph matching Algorithm. In: ICDE 2002 (2002)
Mitra, P., Wiederhold, G., Jannink, J.: Semiautomatic integration of knowledge sources. In: FUSION 1999 (1999)
Rahm, E., Bernstein, P.A.: A survey of approaches to automatic schema matching. The VLDB Journal 10(4), 334–350 (2001)
WordNet: http://www.cogsci.princeton.edu/~wn/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Cheng, W., Sun, Y. (2005). GSMA: A Structural Matching Algorithm for Schema Matching in Data Warehousing. In: Wang, L., Jin, Y. (eds) Fuzzy Systems and Knowledge Discovery. FSKD 2005. Lecture Notes in Computer Science(), vol 3614. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11540007_50
Download citation
DOI: https://doi.org/10.1007/11540007_50
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28331-7
Online ISBN: 978-3-540-31828-6
eBook Packages: Computer ScienceComputer Science (R0)