Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
article
Free access

Research toward the development of a lexical knowledge base for natural language processing

Published: 01 May 1989 Publication History

Abstract

This paper documents research toward building a complete lexicon containing all the words found in general newspaper text. It is intended to provide the reader with an understanding of the inherent limitations of existing vocabulary collection methods and the need for greater attention to multi-word phrases as the building blocks of text. Additionally, while traditional reference books define many proper nouns, they appear to be very limited in their coverage of the new proper nouns appearing daily in newspapers. Proper nouns appear to require a grammar and lexicon of components much the way general parsing of text requires syntactic rules and a lexicon of common nouns.

References

[1]
Amsler, Robert A. The Structure of the Merriam- Webster Pocket Dictionary. PhD thesis, The University of Texas at Austin, 1980.
[2]
Amsler, Robert A. Words and Worlds. In Proceeings of the Third Workshop on Theoretical Issues in Natural Language Processing (TINLAP3). New Mexico State University at Las Cruces, NM, January 7-9, 1987.
[3]
Amsler, Robert A.; White, John S. Development of a Computational Methodology for Deriving Natural Language Semantic Structures via Analysis of Machine-Readable Dictionaries. Technical Report, Linguistics Research Center, University of Texas at Austin, Austin, TX 78712, July, 1979. Final Report on NSF Project MCS77-01315.
[4]
Botha, Rudolph P. Morphological Mechanisms: Lexicalist Analysis of Synthetic Compounding. Pergammon Press, Oxford, England, 1984.
[5]
Bresnan, J. The Mental Representation of Grammatical Relations. MIT Press, Cambridge, MA, 1983.
[6]
Carroll, John M. What's in a Name. W.H. Freeman, New York, 1985.
[7]
Michiels, Archbal. Exploiting a Large Dictionary Data Base. PhD thesis, University of Liege, 1981.
[8]
Peterson, James L. Webster's Seventh New Collegiate Dictionary: A Computer- Readable File Format. Technical Report TR-196, Dept. of Computer Sciences, Univ. of Texas at Austin, Austin, TX 78712, May, 1982.
[9]
Peterson, James L. Webster's Seventh New Collegiate Dictionary: A Computer- Readable File Format. Technical Report, Microelectronics and Computer Technology Corporation, Austin, TX, 1987.
[10]
Quillian, Ross. Semantic Memory. Semantic Information Processing. MIT Press, Cambridge, MA, 1968, pages 227-270.
[11]
Reichert, Richard, Olney, John and Paris, James. Two Dictionary Transcripts and Programs for Processing Them. Volume !: The Encoding Scheme, PARSENT and CONIX. Technical Report TM-3978/001/00, System Development Corporation, Santa Monica, CA, 1969.
[12]
Robins, Gabriel. The NIKL Manual The Knowledge Representation Project, Information Sciences Institute, 4646 Admiralty Way, Marina Del Rey, CA, 1986.
[13]
Shapiro, Stuart C. The SNePS Semantic Network Processing System. Associative Networks: Representation and Use of Knowledge by Computers. Academic Press, New York, NY, 1979, pages 179-203.
[14]
Sherman, Donald. A New Computer Format for "Webster's Seventh Collegiate Dictionary". Computers and the Humanities 8:21-26, 1974.
[15]
Simmons, Robert F. Semantic Networks: Their Computation and Use for Understanding English Sentences. Computer Models of Thought and Language. W.H. Freeman & Co., San Francisco, CA, 1973, pages 63-113.
[16]
Sowa, John. Conceptual Structures: Information Processing in Mind and Machine. Addison-Wesley, Reading, MA, 1984.
[17]
Walker, Donald E. and Robert A. Amsler. The Use of Machine-Readable Dictionaries in Sublanguage Analysis. Analyzing Language in Restricted Domains: Sublanguage Description and Processing. Lawrence Erlbaum Associates, Publishers, Hillsdale, N J, 1986, pages 69-83, Chapter 5.
[18]
Wittenburg, Kent. Natural Language Parsing with Combinatory Categorial Grammars in a Graph-Unification- Based Formalism. PhD thesis, The University of Texas at Austin, 1986.

Cited By

View all
  • (2017)Ensemble Learning of Named Entity Recognition Algorithms using Multilayer Perceptron for the Multilingual Web of DataProceedings of the 9th Knowledge Capture Conference10.1145/3148011.3154471(1-4)Online publication date: 4-Dec-2017
  • (2014)Ensemble Learning for Named Entity RecognitionThe Semantic Web – ISWC 201410.1007/978-3-319-11964-9_33(519-534)Online publication date: 19-Oct-2014
  • (2014)Introduction to Linked Data and Its Lifecycle on the WebReasoning Web. Reasoning on the Web in the Big Data Era10.1007/978-3-319-10587-1_1(1-99)Online publication date: 2014
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM SIGIR Forum
ACM SIGIR Forum  Volume 23, Issue SI
Special issue: Proceedings of the 12th annual international ACMSIGIR conference on Research and development in information retrieval, N.J. Belkin and C.J. van Rijsbergen (Eds.), June 25-28, 1989, Cambridge, MA.
June 1989
243 pages
ISSN:0163-5840
DOI:10.1145/75335
Issue’s Table of Contents
  • cover image ACM Conferences
    SIGIR '89: Proceedings of the 12th annual international ACM SIGIR conference on Research and development in information retrieval
    May 1989
    257 pages
    ISBN:0897913213
    DOI:10.1145/75334
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 May 1989
Published in SIGIR Volume 23, Issue SI

Check for updates

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)64
  • Downloads (Last 6 weeks)13
Reflects downloads up to 28 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2017)Ensemble Learning of Named Entity Recognition Algorithms using Multilayer Perceptron for the Multilingual Web of DataProceedings of the 9th Knowledge Capture Conference10.1145/3148011.3154471(1-4)Online publication date: 4-Dec-2017
  • (2014)Ensemble Learning for Named Entity RecognitionThe Semantic Web – ISWC 201410.1007/978-3-319-11964-9_33(519-534)Online publication date: 19-Oct-2014
  • (2014)Introduction to Linked Data and Its Lifecycle on the WebReasoning Web. Reasoning on the Web in the Big Data Era10.1007/978-3-319-10587-1_1(1-99)Online publication date: 2014
  • (2013)Introduction to linked data and its lifecycle on the webProceedings of the 9th international conference on Reasoning Web: semantic technologies for intelligent data access10.1007/978-3-642-39784-4_1(1-90)Online publication date: 30-Jul-2013
  • (2011)SCMSProceedings of the 10th international conference on The semantic web - Volume Part II10.5555/2063076.2063090(189-204)Online publication date: 23-Oct-2011
  • (2011)Introduction to linked data and its lifecycle on the webProceedings of the 7th international conference on Reasoning web: semantic technologies for the web of data10.5555/2033313.2033314(1-75)Online publication date: 23-Aug-2011
  • (2011)SCMS – Semantifying Content Management SystemsThe Semantic Web – ISWC 201110.1007/978-3-642-25093-4_13(189-204)Online publication date: 2011
  • (2011)Introduction to Linked Data and Its Lifecycle on the WebReasoning Web. Semantic Technologies for the Web of Data10.1007/978-3-642-23032-5_1(1-75)Online publication date: 2011
  • (1992)The Analysis and Acquisition of Proper Names for the Understanding of Free TextComputers and the Humanities10.1007/BF0013698526:5-6(441-456)Online publication date: Dec-1992
  • (1991)Experiments on linguistically based term associationsIntelligent Text and Image Handling - Volume 210.5555/3171004.3171006(528-566)Online publication date: 2-Apr-1991
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media