Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2660517.2660534acmotherconferencesArticle/Chapter ViewAbstractPublication PagesfseConference Proceedingsconference-collections
research-article

Toward matching the relation instantiation from DBpedia ontology to Wikipedia text: fusing FrameNet to Korean

Published: 04 September 2014 Publication History
  • Get Citation Alerts
  • Abstract

    Nowadays, there are many ongoing researches to construct knowledge bases from unstructured data. This process requires an ontology that includes enough properties to cover the various attributes of knowledge elements. As a huge encyclopedia, Wikipedia is a typical unstructured corpora of knowledge. DBpedia, a structured knowledge base constructed from Wikipedia, is based on DBpedia ontology which was created to represent knowledge in Wikipedia well. However, DBpedia ontology is a Wikipedia-Infobox-driven ontology. This means that although it is suitable to represent essential knowledge of Wikipedia, it does not cover all of the knowledge in Wikipedia text. In overcoming this problem, resources representing semantics or relations of words such as WordNet and FrameNet are considered useful. In this paper we determined whether DBpedia ontology is enough to cover a sufficient amount of natural language written knowledge in Wikipedia. We mainly focused on the Korean Wikipedia, and calculated the Korean Wikipedia coverage rate with two methods, by the DBpedia ontology and by FrameNet frames. To do this, we extracted sentences with extractable knowledge from Wikipedia text, and also extracted natural language predicates by Part-Of-Speech tagging. We generated Korean lexicons for DBpedia ontology properties and frame indexes, and used these lexicons to measure the Korean Wikipedia coverage ratio of the DBpedia ontology and frames. By our measurements, FrameNet frames cover 73.85% of the Korean Wikipedia sentences, which is a sufficient portion of Wikipedia text. We finally show the limitations of DBpedia and FrameNet briefly, and propose the outlook of constructing knowledge bases based on the experiment results.

    References

    [1]
    Blumberg, Robert, and Shaku Atre. 2003. The problem with unstructured data. DM REVIEW 13: 42--49
    [2]
    Gaag, Andreas, Andreas Kohn, and Udo Lindemann. 2009. Function-based solution retrieval and semantic search in mechanical engineering. In Proceedings of the 17th International Conference on Engineering Design.
    [3]
    Heath, Tom, and Christian Bizer. 2011. Linked data: Evolving the web into a global data space. Syn-thesis lectures on the semantic web: theory and technology 1.1: 1--136
    [4]
    Klyne, Graham, and Jeremy J. Carroll. 2006. Resource description framework (RDF): Concepts and abstract syntax.
    [5]
    C. Bizer, J. Lehmann, G. Kobilarov, S. Auer, C. Becker, R. Cyganiak, and S. Hellmann. 2009. DBpedia -A crystallization point for the Web of Data. Web Semantics: Science, Services and Agents on the World Wide Web, 7:154--165.
    [6]
    Gerber, Daniel, and A-C. Ngonga Ngomo. 2011. Bootstrapping the linked data web. 1st Workshop on Web Scale Knowledge Extraction@ ISWC.
    [7]
    Wolf, Florian, and Edward Gibson. 2005. Representing discourse coherence: A corpus-based study. Computational Linguistics 31.2: 249--287.
    [8]
    Redeker, Gisela. 2000. Coherence and structure in text and discourse. Abduction, Belief and Context in Dialogue: 233--263.
    [9]
    Taboada, Maite, and William C. Mann. 2006. Rhetorical structure theory: Looking back and moving ahead. Discourse studies 8.3: 423--459.
    [10]
    Chow, Ian C., and Jonathan J. Webster. 2007. Integration of linguistic resources for verb classification: FrameNet Frame, WordNet verb and suggested upper merged ontology. Computational Linguistics and Intelligent Text Processing. Springer Berlin Heidelberg, 2007. 1--11.
    [11]
    Laparra, Egoitz, and German Rigau. 2009. Integrating wordnet and framenet using a knowledge-based word sense disambiguation algorithm. In Proceedings of RANLP. Vol. 9.
    [12]
    Shi, Lei, and Rada Mihalcea. 2005. Putting pieces together: Combining FrameNet, VerbNet and WordNet for robust semantic parsing. Computational Linguistics and Intelligent Text Processing. Springer Berlin Heidelberg, 100--111.
    [13]
    Hartmann, Silvana, Iryna Gurevych, and Ubiquitous Knowledge Processing Lap. 2013. FrameNet on the Way to Babel: Creating a Bilingual FrameNet Using Wiktionary as Interlingual Connection. In Proceedings of the 51th Annual Meeting of the Association for Computational Linguistics, to appear. Sofia, Bulgaria.
    [14]
    Ovchinnikova, Ekaterina. 2012. Integration of world knowledge for natural language understanding. Vol. 3. Springer
    [15]
    Chiarcos, Christian. 2008. An ontology of linguistic annotations. LDV Forum. Vol. 23. No. 1.
    [16]
    Hellmann, Sebastian, et al. 2013. Integrating nlp using linked data." The Semantic Web--ISWC 2013. Springer Berlin Heidelberg. 98--113.
    [17]
    Hennett, H., and C. Fellbaum. 2006. Linking FrameNet to the suggested upper merged ontology. Formal Ontology in Information Systems: Proceedings of the Fourth International Conference. Vol. 150.
    [18]
    Mohit, Behrang, and Srini Narayanan. 2003. Semantic extraction with wide-coverage lexical resources. In Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology: companion volume of the Proceedings of HLT-NAACL 2003--short papers-Volume 2. Association for Computational Linguistics.
    [19]
    Orfan, Jansen, Rochester Edu, and James Allen. 2013. Toward Learning High-Level Semantic Frames from Definitions. In Proceedings of the Second Annual Conference on Advances in Cognitive Systems ACS. Vol. 125.
    [20]
    Christiana Unger, John McCrae, Sebastian Walter, Sara Winter, and Philipp Cimiano. 2013. A lemon lexicon for DBpedia. Proceedings of 1st International Workshop on NLP and DBpedia, co-located with the 12th International Semantic Web Conference (ISWC 2013), October 21--25, Sydney, Australia.
    [21]
    Sebastian Walter, Christina Unger, and Philipp Cimiano. 2013. A corpus-based approach for the induction of ontology lexica. Natural Language Processing and Information Systems. Springer Berlin Heidelberg. 102--113.
    [22]
    Declerck T., Buitelaar P., Wunner, T., McCrae, J., Montiel-Ponsoda, E., de Cea, A. G. 2010. lemon: An Ontology-Lexicon model for the Multilingual Semantic Web. W3C Workshop: The Multilingual Web - Where Are We?

    Cited By

    View all
    • (2019)FinnTransFrameLanguage Resources and Evaluation10.1007/s10579-018-9434-y53:1(141-171)Online publication date: 1-Mar-2019
    • (2017)FinnFN 1.0: The Finnish frame semantic databaseNordic Journal of Linguistics10.1017/S033258651700007540:3(287-311)Online publication date: 14-Aug-2017

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    SEM '14: Proceedings of the 10th International Conference on Semantic Systems
    September 2014
    161 pages
    ISBN:9781450329279
    DOI:10.1145/2660517
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    • St. Pölten University: St. Pölten University of Applied Sciences, Austria
    • University of Potsdam: University of Potsdam
    • PoolParty: PoolParty (Semantic Web Company GmbH)
    • University of Vienna: University of Vienna
    • Wolters Kluwer: Wolters Kluwer, Germany
    • Semantic Web Company: Semantic Web Company
    • STII: STI International
    • DBpedia Association: DBpedia Association

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 04 September 2014

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. DBpedia
    2. FrameNet
    3. Wikipedia
    4. knowledge representation
    5. ontology

    Qualifiers

    • Research-article

    Funding Sources

    Conference

    SEM '14
    Sponsor:
    • St. Pölten University
    • University of Potsdam
    • PoolParty
    • University of Vienna
    • Wolters Kluwer
    • Semantic Web Company
    • STII
    • DBpedia Association

    Acceptance Rates

    SEM '14 Paper Acceptance Rate 22 of 59 submissions, 37%;
    Overall Acceptance Rate 22 of 59 submissions, 37%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)1
    • Downloads (Last 6 weeks)0

    Other Metrics

    Citations

    Cited By

    View all
    • (2019)FinnTransFrameLanguage Resources and Evaluation10.1007/s10579-018-9434-y53:1(141-171)Online publication date: 1-Mar-2019
    • (2017)FinnFN 1.0: The Finnish frame semantic databaseNordic Journal of Linguistics10.1017/S033258651700007540:3(287-311)Online publication date: 14-Aug-2017

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media