DOI: 10.3115/980691.980736

Machine aided error-correction environment for Korean morphological analysis and part-of-speech tagging

Published: 10 August 1998

Abstract

Statistical methods require very large, high-quality corpora, but building a large, error-free annotated corpus is very difficult. This paper proposes an efficient method for constructing a part-of-speech tagged corpus. A rule-based error correction method is proposed that finds and corrects errors semi-automatically using user-defined rules. The user's correction log is also used to incorporate feedback. Experiments were carried out to show the efficiency of the error correction process of this workbench. The results show that about 63.2% of tagging errors can be corrected.
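
The abstract does not give the rule format or the structure of the correction log, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes a hypothetical rule shape (retag a given word from one tag to another, optionally conditioned on the previous token's tag), a hypothetical log keyed by (word, old tag, new tag), and illustrative English tokens and tag names in place of Korean morphological analyses. Proposals are only applied after confirmation, which stands in for the semi-automatic step.

```python
from collections import Counter
from dataclasses import dataclass
from typing import List, Optional, Tuple

Token = Tuple[str, str]  # (word, tag); tag names here are illustrative only


@dataclass(frozen=True)
class Rule:
    """Hypothetical user-defined rule: retag `word` from `old_tag` to `new_tag`,
    optionally only when the previous token carries `prev_tag`."""
    word: str
    old_tag: str
    new_tag: str
    prev_tag: Optional[str] = None

    def matches(self, tokens: List[Token], i: int) -> bool:
        word, tag = tokens[i]
        if word != self.word or tag != self.old_tag:
            return False
        if self.prev_tag is None:
            return True
        return i > 0 and tokens[i - 1][1] == self.prev_tag


def propose_corrections(tokens: List[Token], rules: List[Rule]):
    """Find candidate errors; nothing is changed until the user confirms."""
    proposals = []
    for i in range(len(tokens)):
        for rule in rules:
            if rule.matches(tokens, i):
                proposals.append((i, rule))
                break  # first matching rule wins for this token
    return proposals


def apply_confirmed(tokens: List[Token], proposals, log: Counter) -> List[Token]:
    """Apply confirmed proposals to a copy and record each one in the log."""
    corrected = list(tokens)
    for i, rule in proposals:
        word, old_tag = corrected[i]
        corrected[i] = (word, rule.new_tag)
        log[(word, old_tag, rule.new_tag)] += 1
    return corrected


def rules_from_log(log: Counter, min_count: int = 2) -> List[Rule]:
    """Feedback step: promote corrections seen at least `min_count` times to rules."""
    return [Rule(w, old, new) for (w, old, new), n in log.items() if n >= min_count]
```

In use, a caller would pass the tagger output and the current rule set to `propose_corrections`, show the proposals to the annotator, hand the accepted ones to `apply_confirmed`, and periodically call `rules_from_log` to extend the rule set. The threshold `min_count` is an assumption chosen for illustration.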


      Published In

      ACL '98/COLING '98: Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics - Volume 2
      August 1998
      768 pages

      Sponsors

      • Government of Canada
      • Université de Montréal

      Publisher

      Association for Computational Linguistics

      United States

      Qualifiers

      • Article

      Acceptance Rates

      Overall Acceptance Rate 85 of 443 submissions, 19%
