Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
article
Free access

Building a large annotated corpus of English: the penn treebank

Published: 01 June 1993 Publication History
First page of PDF

References

[1]
Brill, Eric (1991). "Discovering the lexical features of a language." In Proceedings, 29th Annual Meeting of the Association for Computational Linguistics. Berkeley CA.
[2]
Brill, Eric; Magerman, David; Marcus, Mitchell P.; and Santorini, Beatrice (1990). "Deducing linguistic structure from the statistics of large corpora." In Proceedings, DARPA Speech and Natural Language Workshop. June 1990, 275--282.
[3]
Church, Kenneth W. (1980). Memory limitations in natural language processing. Master's dissertation, Massachusetts Institute of Technology, Cambridge MA.
[4]
Church, Kenneth W. (1988). "A stochastic parts program and noun phrase parser for unrestricted text." In Proceedings, Second Conference on Applied Natural Language Processing. 136--143.
[5]
Francis, W. Nelson (1964). "A standard sample of present-day English for use with digital computers." Report to the U.S. Office of Education on Cooperative Research Project No. E-007. Brown University, Providence RI.
[6]
Francis, W. Nelson, and Kuĉera, Henry (1982). Frequency Analysis of English Usage: Lexicon and Grammar. Houghton Mifflin.
[7]
Garside, Roger; Leech, Geoffrey; and Sampson, Geoffrey (1987). The Computational Analysis of English: A Corpus-Based Approach. Longman.
[8]
Hindle, Donald (1983). "User manual for Fidditch." Technical memorandum 7590--142, Naval Research Laboratory.
[9]
Hindle, Donald (1989). "Acquiring disambiguation rules from text." In Proceedings, 27th Annual Meeting of the Association for Computational Linguistics.
[10]
Lewis, Bil; LaLiberte, Dan; and the GNU Manual Group (1990). The GNU Emacs Lisp reference manual. Free Software Foundation, Cambridge, MA.
[11]
Magerman, David, and Marcus, Mitchell P. (1990). "Parsing a natural language using mutual information statistics." In Proceedings of AAAI-90.
[12]
Meteer, Marie; Schwartz, Richard; and Weischedel, Ralph (1991). "Studies in part of speech labelling." In Proceedings, Fourth DARPA Speech and Natural Language Workshop. February 1991.
[13]
Niv, Michael (1991). "Syntactic disambiguation." In The Penn Review of Linguistics, 14, 120--126.
[14]
Pereira, Fernando, and Schabes, Yves (1992). "Inside-outside reestimation from partially bracketed corpora." In Proceedings, 30th Annual Meeting of the Association for Computational Linguistics.
[15]
Santorini, Beatrice (1990). "Part-of-speech tagging guidelines for the Penn Treebank Project." Technical report MS-CIS-90--47, Department of Computer and Information Science, University of Pennsylvania.
[16]
Santorini, Beatrice, and Marcinkiewicz, Mary Ann (1991). "Bracketing guidelines for the Penn Treebank Project." Unpublished manuscript, Department of Computer and Information Science, University of Pennsylvania.
[17]
Veilleux, N. M., and Ostendorf, Mari (1992). "Probabilistic parse scoring based on prosodic features." In Proceedings, Fifth DARPA Speech and Natural Language Workshop. February 1992.
[18]
Weischedel, Ralph; Ayuso, Damaris; Bobrow, R.; Boisen, Sean; Ingria, Robert; and Palmucci, Jeff (1991). "Partial parsing: a report of work in progress." In Proceedings, Fourth DARPA Speech and Natural Language Workshop. February 1991.

Cited By

View all
  • (2024)Automated Testing Linguistic Capabilities of NLP ModelsACM Transactions on Software Engineering and Methodology10.1145/367245533:7(1-33)Online publication date: 14-Jun-2024
  • (2024)A Catalog of Transformations to Remove Smells From Natural Language TestsProceedings of the 28th International Conference on Evaluation and Assessment in Software Engineering10.1145/3661167.3661225(7-16)Online publication date: 18-Jun-2024
  • (2024)Neural Optimizer Equation, Decay Function, and Learning Rate Schedule Joint EvolutionProceedings of the Genetic and Evolutionary Computation Conference10.1145/3638529.3654187(1100-1109)Online publication date: 14-Jul-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Computational Linguistics
Computational Linguistics  Volume 19, Issue 2
Special issue on using large corpora: II
June 1993
185 pages
ISSN:0891-2017
EISSN:1530-9312
Issue’s Table of Contents

Publisher

MIT Press

Cambridge, MA, United States

Publication History

Published: 01 June 1993
Published in COLI Volume 19, Issue 2

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)639
  • Downloads (Last 6 weeks)133
Reflects downloads up to 16 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Automated Testing Linguistic Capabilities of NLP ModelsACM Transactions on Software Engineering and Methodology10.1145/367245533:7(1-33)Online publication date: 14-Jun-2024
  • (2024)A Catalog of Transformations to Remove Smells From Natural Language TestsProceedings of the 28th International Conference on Evaluation and Assessment in Software Engineering10.1145/3661167.3661225(7-16)Online publication date: 18-Jun-2024
  • (2024)Neural Optimizer Equation, Decay Function, and Learning Rate Schedule Joint EvolutionProceedings of the Genetic and Evolutionary Computation Conference10.1145/3638529.3654187(1100-1109)Online publication date: 14-Jul-2024
  • (2024)Product Spam on YouTube: A Case StudyProceedings of the 2024 Conference on Human Information Interaction and Retrieval10.1145/3627508.3638303(358-363)Online publication date: 10-Mar-2024
  • (2024)Self-Supervised Intra-Modal and Cross-Modal Contrastive Learning for Point Cloud UnderstandingIEEE Transactions on Multimedia10.1109/TMM.2023.328459126(1626-1638)Online publication date: 1-Jan-2024
  • (2024)Computation and Communication Efficient Federated Learning With Adaptive Model PruningIEEE Transactions on Mobile Computing10.1109/TMC.2023.324779823:3(2003-2021)Online publication date: 1-Mar-2024
  • (2024)Early analysis of requirements using NLP and Petri-netsJournal of Systems and Software10.1016/j.jss.2023.111901208:COnline publication date: 1-Feb-2024
  • (2024)LEAF: A Less Expert Annotation Framework with Active LearningAdvances in Knowledge Discovery and Data Mining10.1007/978-981-97-2259-4_28(369-384)Online publication date: 7-May-2024
  • (2024)Semantics-Preserved Distortion for Personal Privacy Protection in Information ManagementArtificial Neural Networks and Machine Learning – ICANN 202410.1007/978-3-031-72344-5_26(386-401)Online publication date: 17-Sep-2024
  • (2023)Evaluating neuron interpretation methods of NLP modelsProceedings of the 37th International Conference on Neural Information Processing Systems10.5555/3666122.3669428(75644-75668)Online publication date: 10-Dec-2023
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Get Access

Login options

Full Access

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media