DOI: 10.1145/988672.988734

Incremental formalization of document annotations through ontology-based paraphrasing

Published: 17 May 2004

Abstract

    For the manual semantic markup of documents to become widespread, users must be able to express annotations that conform to ontologies (or schemas) that have shared meaning. However, a typical user is unlikely to be familiar with the details of the terms as defined by the ontology authors. In addition, the idea to be expressed may not fit perfectly within a pre-defined ontology. The ideal tool should help users find a partial formalization that closely follows the ontology where possible but deviates from the formal representation where needed. We describe an implemented approach to help users create semi-structured semantic annotations for a document according to an extensible OWL ontology. In our approach, users enter a short sentence in free text to describe all or part of a document, and the system presents a set of potential paraphrases of the sentence that are generated from valid expressions in the ontology, from which the user chooses the closest match. We use a combination of off-the-shelf parsing tools and breadth-first search of expressions in the ontology to help users create valid annotations starting from free text. The user can also define new terms to augment the ontology, so the potential matches can improve over time.
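
    The breadth-first search over ontology expressions mentioned in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the toy ontology, the names enumerate_expressions, paraphrase and rank_paraphrases, and above all the word-overlap score, which merely stands in for the off-the-shelf parsing tools the paper uses. It is not the authors' implementation.

```python
from collections import deque

# Hypothetical toy ontology: class label -> list of (property label, range class).
# The paper works with an extensible OWL ontology; this dict is only a stand-in.
ONTOLOGY = {
    "document": [("describes", "event"), ("written by", "person")],
    "event": [("involves", "person"), ("held in", "place")],
    "person": [],
    "place": [],
}


def enumerate_expressions(root, max_depth):
    """Yield property paths starting at `root`, in breadth-first order.

    A path is a tuple (class, property, class, property, class, ...).
    """
    queue = deque([(root, (root,))])
    while queue:
        cls, path = queue.popleft()
        depth = (len(path) - 1) // 2  # number of properties on the path
        if depth > 0:
            yield path
        if depth < max_depth:
            for prop, rng in ONTOLOGY.get(cls, []):
                queue.append((rng, path + (prop, rng)))


def paraphrase(path):
    """Render a path as a crude English-like candidate paraphrase."""
    return " ".join(path)


def rank_paraphrases(sentence, root="document", max_depth=2, top_k=3):
    """Rank candidate paraphrases by word overlap with the user's sentence.

    Word overlap is an assumed placeholder for real parsing and matching.
    """
    words = set(sentence.lower().split())
    scored = []
    for path in enumerate_expressions(root, max_depth):
        text = paraphrase(path)
        overlap = len(words & set(text.split()))
        scored.append((overlap, text))
    # Stable sort: ties keep their breadth-first (shortest-first) order.
    scored.sort(key=lambda pair: -pair[0])
    return [text for _, text in scored[:top_k]]


if __name__ == "__main__":
    for candidate in rank_paraphrases("the document describes an event held in a place"):
        print(candidate)
```

    In this sketch the user would pick the closest candidate from the printed list. Adding a new entry to the ONTOLOGY dict plays the role of defining a new term, after which later searches can return it as part of a candidate.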



      Published In

      WWW '04: Proceedings of the 13th international conference on World Wide Web
      May 2004
      754 pages
      ISBN: 158113844X
      DOI: 10.1145/988672

      Publisher

      Association for Computing Machinery

      New York, NY, United States


      Author Tags

      1. document annotation
      2. knowledge acquisition
      3. semantic markup

      Qualifiers

      • Article

      Conference

      WWW04
      Acceptance Rates

      Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

      Cited By

      • (2018) RDF dataset profiling – a survey of features, methods, vocabularies and applications. Semantic Web 9:5 (677-705). DOI: 10.3233/SW-180294. Online publication date: 1-Jan-2018
      • (2015) Human Tutorial Instruction in the Raw. ACM Transactions on Interactive Intelligent Systems 5:1 (1-29). DOI: 10.1145/2531920. Online publication date: 25-Mar-2015
      • (2014) Challenges in Bridging Social Semantics and Formal Semantics on the Web. Enterprise Information Systems (3-15). DOI: 10.1007/978-3-319-09492-2_1. Online publication date: 25-Jul-2014
      • (2013) A review of argumentation for the Social Semantic Web. Semantic Web 4:2 (159-218). DOI: 10.5555/2590215.2590218. Online publication date: 1-Apr-2013
      • (2009) Learning to tag. Proceedings of the 18th international conference on World wide web (361-370). DOI: 10.1145/1526709.1526758. Online publication date: 20-Apr-2009
      • (2009) Semi-Automatic Annotation System for OWL-Based Semantic Search. 2009 International Conference on Complex, Intelligent and Software Intensive Systems (475-480). DOI: 10.1109/CISIS.2009.82. Online publication date: Mar-2009
      • (2008) Wishful search. Proceedings of the 17th international conference on World Wide Web (775-784). DOI: 10.1145/1367497.1367602. Online publication date: 21-Apr-2008
      • (2007) AASA. Journal of Information Science 33:4 (435-450). DOI: 10.1177/0165551506072164. Online publication date: 1-Aug-2007
      • (2007) Ontology based annotation of text segments. Proceedings of the 2007 ACM symposium on Applied computing (1362-1367). DOI: 10.1145/1244002.1244296. Online publication date: 11-Mar-2007
      • (2007) Semantic Annotation of Resources in the Semantic Web. Semantic Web Services (135-155). DOI: 10.1007/3-540-70894-4_5. Online publication date: 2007
