Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1007/978-3-642-05151-7_27guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

LinksB2N: Automatic Data Integration for the Semantic Web

Published: 07 November 2009 Publication History
  • Get Citation Alerts
  • Abstract

    The ongoing trend towards open data embraced by the Semantic Web has started to produce a large number of data sources. These data sources are published using RDF vocabularies, and it is possible to navigate throughout the data due to their graph topology. This paper presents LinksB2N, an algorithm for discovering information overlaps in RDF data repositories and performing data integration with no human intervention over data sets that partially share the same domain.
    LinksB2N identifies equivalent RDF resources from different data sets with several degrees of confidence. The algorithm relies on a novel approach that uses clustering techniques to analyze the distribution of unique objects that contain overlapping information in different data graphs. Our contribution is illustrated in the context of the Market Blended Insight project by applying the LinksB2N algorithm to data sets in the order of hundreds of millions of RDF triples containing relevant information in the domain of business to business (B2B) marketing analysis.

    References

    [1]
    Alani, H., Dasmahapatra, S., Gibbins, N., Glaser, H., Harris, S., Kalfoglou, Y., O'Hara, K., Shadbolt, N.: Managing reference: Ensuring referential integrity of ontologies for the semantic web. In: Gómez-Pérez, A., Benjamins, V.R. (eds.) EKAW 2002. LNCS (LNAI), vol. 2473, pp. 317-334. Springer, Heidelberg (2002).
    [2]
    Arens, Y., Knoblock, C.A.: Sims: Retrieving and integrating information from multiple sources. In: SIGMOD Conference, pp. 562-563 (1993).
    [3]
    Correndo, G., Alani, H.: Collaborative support for community data sharing. In: The 2nd Workshop on Collective Intelligence in Semantic Web and Social Networks (December 2008).
    [4]
    Fellegi, I.P., Sunter, A.B.: A theory for record linkage. Journal of the American Statistical Association 64(328), 1183-1210 (1969).
    [5]
    Jaffri, A., Glaser, H., Millard, I.: Uri identity management for semantic web data integration and linkage. In: 3rd International Workshop On Scalable Semantic Web Knowledge Base Systems, Springer, Heidelberg (2007).
    [6]
    Kalfoglou, Y., Schorlemmer, M.: Ontology mapping: the state of the art. Knowledge Engineering Review 18(1), 1-31 (2003).
    [7]
    Levenshtein, V.I.: Binary codes capable of correcting deletions, insertions, and reversals. Technical Report 8 (1966).
    [8]
    Mena, E., Illarramendi, A., Kashyap, V., Sheth, A.P.: Observer: An approach for query processing in global information systems based on interoperation across preexisting ontologies. Distributed and Parallel Databases 8(2), 223-271 (2000).
    [9]
    Newcombe, H.B., Kennedy, J.M.: Record linkage: making maximum use of the discriminating power of identifying information. Commun. ACM 5(11), 563-566 (1962).
    [10]
    Preece, A.D., Hui, K.-y., Gray, W.A., Marti, P., Bench-Capon, T.J.M., Jones, D.M., Cui, Z.: The kraft architecture for knowledge fusion and transformation. Knowl.- Based Syst. 13(2-3), 113-120 (2000).
    [11]
    Salvadores, M., Zuo, L., Imtiaz, S.M.H., Darlington, J., Gibbins, N., Shadbolt, N., Dobree, J.: Market blended insight: Modeling propensity to buy with the semantic web. In: International Semantic Web Conference, pp. 777-789 (2008).
    [12]
    Volz, J., Bizer, C., Gaedke, M., Kobilarov, G.: Silk - A Link Discovery Framework for the Web of Data. In: 18th International World Wide Web Conference (2009).
    [13]
    Wiederhold, G.: Mediators in the architecture of future information systems. IEEE Computer 25(3), 38-49 (1992).

    Cited By

    View all
    • (2019)On indexing evidential dataInternational Journal of Approximate Reasoning10.1016/j.ijar.2018.12.015106:C(63-87)Online publication date: 1-Mar-2019
    • (2018)Scalable and distributed methods for entity matching, consolidation and disambiguation over linked data corporaWeb Semantics: Science, Services and Agents on the World Wide Web10.1016/j.websem.2011.11.00210(76-110)Online publication date: 20-Dec-2018
    • (2011)Distributed human computation framework for linked data co-reference resolutionProceedings of the 8th extended semantic web conference on The semantic web: research and applications - Volume Part I10.5555/2008892.2008896(32-46)Online publication date: 29-May-2011

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image Guide Proceedings
    OTM '09: Proceedings of the Confederated International Conferences, CoopIS, DOA, IS, and ODBASE 2009 on On the Move to Meaningful Internet Systems: Part II
    November 2009
    477 pages
    ISBN:9783642051500
    • Editors:
    • Robert Meersman,
    • Tharam Dillon,
    • Pilar Herrero

    Publisher

    Springer-Verlag

    Berlin, Heidelberg

    Publication History

    Published: 07 November 2009

    Qualifiers

    • Article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 12 Aug 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2019)On indexing evidential dataInternational Journal of Approximate Reasoning10.1016/j.ijar.2018.12.015106:C(63-87)Online publication date: 1-Mar-2019
    • (2018)Scalable and distributed methods for entity matching, consolidation and disambiguation over linked data corporaWeb Semantics: Science, Services and Agents on the World Wide Web10.1016/j.websem.2011.11.00210(76-110)Online publication date: 20-Dec-2018
    • (2011)Distributed human computation framework for linked data co-reference resolutionProceedings of the 8th extended semantic web conference on The semantic web: research and applications - Volume Part I10.5555/2008892.2008896(32-46)Online publication date: 29-May-2011

    View Options

    View options

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media