Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.3115/981658.981684dlproceedingsArticle/Chapter ViewAbstractPublication PagesaclConference Proceedingsconference-collections
Article
Free access

Unsupervised word sense disambiguation rivaling supervised methods

Published: 26 June 1995 Publication History
  • Get Citation Alerts
  • Abstract

    This paper presents an unsupervised learning algorithm for sense disambiguation that, when trained on unannotated English text, rivals the performance of supervised techniques that require time-consuming hand annotations. The algorithm is based on two powerful constraints---that words tend to have one sense per discourse and one sense per collocation---exploited in an iterative bootstrapping procedure. Tested accuracy exceeds 96%.

    References

    [1]
    Baum, L. E., "An Inequality and Associated Maximization Technique in Statistical Estimation of Probabilistic Functions of a Markov Process," Inequalities, v 3, pp 1--8, 1972.
    [2]
    Black, Ezra, "An Experiment in Computational Discrimination of English Word Senses," in IBM Journal of Research and Development, v 232, pp 185--194, 1988.
    [3]
    Brill, Eric, "A Corpus-Based Approach to Language Learning," Ph.D. Thesis, University of Pennsylvania, 1993.
    [4]
    Brown, Peter, Stephen Della Pietra, Vincent Della Pietra, and Robert Mercer, "Word Sense Disambiguation using Statistical Methods," Proceedings of the 29th Annual Meeting of the Association for Computational Linguistics, pp 264--270, 1991.
    [5]
    Bruce, Rebecca and Janyce Wiebe, "Word-Sense Disambiguation Using Decomposable Models," in Proceedings of the 32nd Annual Meeting of the Association for Computational Linguistics, Las Cruces, NM, 1994.
    [6]
    Church, K. W., "A Stochastic Parts Program an Noun Phrase Parser for Unrestricted Text," in Proceeding, IEEE International Conference on Acoustics, Speech and Signal Processing, Glasgow, 1989.
    [7]
    Dagan, Ido and Alon Itai, "Word Sense Disambiguation Using a Second Language Monolingual Corpus", Computational Linguistics, v 20, pp 563--596, 1994.
    [8]
    Dempster, A. P., Laird, N. M., and Rubin, D. B., "Maximum Likelihood From Incomplete Data via the EM Algorithm," Journal of the Royal Statistical Society, v 39, pp 1--38, 1977.
    [9]
    Gale, W., K. Church, and D. Yarowsky, "A Method for Disambiguating Word Senses in a Large Corpus," Computers and the Humanities, 26, pp 415--439, 1992.
    [10]
    Gale, W., K. Church, and D. Yarowsky. "Discrimination Decisions for 100,000-Dimensional Spaces." In A. Zampoli, N. Calzolari and M. Palmer (eds.), Current Issues in Computational Linguistics: In Honour of Don Walker, Kluwer Academic Publishers, pp. 429--450, 1994.
    [11]
    Guthrie, J., L. Guthrie, Y. Wilks and H. Aidinejad, "Subject Dependent Co-occurrence and Word Sense Disambiguation," in Proceedings of the 29th Annual Meeting of the Association for Computational Linguistics, pp 146--152, 1991.
    [12]
    Hearst, Marti, "Noun Homograph Disambiguation Using Local Context in Large Text Corpora," in Using Corpora, University of Waterloo, Ontario, 1991.
    [13]
    Leacock, Claudia, Geoffrey Towell and Ellen Voorhees "Corpus-Based Statistical Sense Resolution," in Proceedings, ARPA Human Language Technology Workshop, 1993.
    [14]
    Lehman, Jill Fain, "Toward the Essential Nature of Statistical Knowledge in Sense Resolution", in Proceedings of the Twelfth National Conference on Artificial Intelligence, pp 734--471, 1994.
    [15]
    Lesk, Michael, "Automatic Sense Disambiguation: How to tell a Pine Cone from an Ice Cream Cone," Proceeding of the 1986 SIGDOC Conference, Association for Computing Machinery, New York, 1986.
    [16]
    Miller, George, "WordNet: An On-Line Lexical Database," International Journal of Lexicography, 3, 4, 1990.
    [17]
    Mosteller, Frederick, and David Wallace, Inference and Disputed Authorship: The Federalist, Addison-Wesley, Reading, Massachusetts, 1964.
    [18]
    Rivest, R. L., "Learning Decision Lists," in Machine Learning, 2, pp 229--246, 1987.
    [19]
    Schütze, Hinrich, "Dimensions of Meaning," in Proceedings of Supercomputing '92, 1992.
    [20]
    Slator, Brian, "Using Context for Sense Preference," in Text-Based Intelligent Systems: Current Research in Text Analysis, Information Extraction and Retrieval, P. S. Jacobs, ed., GE Research and Development Center, Schenectady, New York, 1990.
    [21]
    Veronis, Jean and Nancy Ide, "Word Sense Disambiguation with Very Large Neural Networks Extracted from Machine Readable Dictionaries," in Proceedings, COLING-90, pp 389--394, 1990.
    [22]
    Yarowsky, David "Word-Sense Disambiguation Using Statistical Models of Roget's Categories Trained on Large Corpora," in Proceedings, COLING-92, Nantes, France, 1992.
    [23]
    Yarowsky, David, "One Sense Per Collocation," in Proceedings, ARPA Human Language Technology Workshop, Princeton, 1993.
    [24]
    Yarowsky, David, "Decision Lists for Lexical Ambiguity Resolution: Application to Accent Restoration in Spanish and French," in Proceedings of the 32nd Annual Meeting of the Association for Computational Linguistics, Las Cruces, NM, 1994.
    [25]
    Yarowsky, David. "Homograph Disambiguation in Speech Synthesis." In J. Hirschberg, R. Sproat and J. van Santen (eds.), Progress in Speech Synthesis, Springer-Verlag, to appear.

    Cited By

    View all
    • (2024)Reducing the Impact of Time Evolution on Source Code Authorship Attribution via Domain AdaptationACM Transactions on Software Engineering and Methodology10.1145/365215133:6(1-27)Online publication date: 27-Jun-2024
    • (2023)ToolformerProceedings of the 37th International Conference on Neural Information Processing Systems10.5555/3666122.3669119(68539-68551)Online publication date: 10-Dec-2023
    • (2023)A unified approach to count-based weakly-supervised learningProceedings of the 37th International Conference on Neural Information Processing Systems10.5555/3666122.3667802(38709-38722)Online publication date: 10-Dec-2023
    • Show More Cited By

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image DL Hosted proceedings
    ACL '95: Proceedings of the 33rd annual meeting on Association for Computational Linguistics
    June 1995
    354 pages

    Publisher

    Association for Computational Linguistics

    United States

    Publication History

    Published: 26 June 1995

    Qualifiers

    • Article

    Acceptance Rates

    Overall Acceptance Rate 85 of 443 submissions, 19%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)504
    • Downloads (Last 6 weeks)61

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Reducing the Impact of Time Evolution on Source Code Authorship Attribution via Domain AdaptationACM Transactions on Software Engineering and Methodology10.1145/365215133:6(1-27)Online publication date: 27-Jun-2024
    • (2023)ToolformerProceedings of the 37th International Conference on Neural Information Processing Systems10.5555/3666122.3669119(68539-68551)Online publication date: 10-Dec-2023
    • (2023)A unified approach to count-based weakly-supervised learningProceedings of the 37th International Conference on Neural Information Processing Systems10.5555/3666122.3667802(38709-38722)Online publication date: 10-Dec-2023
    • (2023)Can semi-supervised learning use all the data effectively? a lower bound perspectiveProceedings of the 37th International Conference on Neural Information Processing Systems10.5555/3666122.3667085(21960-21982)Online publication date: 10-Dec-2023
    • (2023)Don't stop pretraining? make prompt-based fine-tuning powerful learnerProceedings of the 37th International Conference on Neural Information Processing Systems10.5555/3666122.3666378(5827-5849)Online publication date: 10-Dec-2023
    • (2023)Bidirectional adaptation for robust semi-supervised learning with inconsistent data distributionsProceedings of the 40th International Conference on Machine Learning10.5555/3618408.3619015(14886-14901)Online publication date: 23-Jul-2023
    • (2023)KESTProceedings of the Thirty-Second International Joint Conference on Artificial Intelligence10.24963/ijcai.2023/561(5049-5057)Online publication date: 19-Aug-2023
    • (2023)Enabling abductive learning to exploit knowledge graphProceedings of the Thirty-Second International Joint Conference on Artificial Intelligence10.24963/ijcai.2023/427(3839-3847)Online publication date: 19-Aug-2023
    • (2023)Weakly supervised 3D segmentation via receptive-driven pseudo label consistency and structural consistencyProceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence and Thirty-Fifth Conference on Innovative Applications of Artificial Intelligence and Thirteenth Symposium on Educational Advances in Artificial Intelligence10.1609/aaai.v37i1.25205(1222-1230)Online publication date: 7-Feb-2023
    • (2023)Semi-Supervised Classification of Malware Families Under Extreme Class Imbalance via Hierarchical Non-Negative Matrix Factorization with Automatic Model SelectionACM Transactions on Privacy and Security10.1145/362456726:4(1-27)Online publication date: 13-Nov-2023
    • Show More Cited By

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media