Abstract
This paper presents a method for semantic classification of unknown verbs including polysemies into Levin-style semantic classes. We propose a semi-supervised clustering, which is based on a graph-based unsupervised clustering technique. The algorithm detects the spin configuration that minimizes the energy of the spin glass. Comparing global and local minima of an energy function, called the Hamiltonian, allows for the detection of nodes with more than one cluster. We extended the algorithm so as to employ a small amount of labeled data to aid unsupervised learning, and applied the algorithm to cluster verbs including polysemies. The distributional similarity between verbs used to calculate the Hamiltonian is in the form of probability distributions over verb frames. The result obtained using 110 test polysemous verbs with labeled data of 10% showed 0.577 F-score.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Bar-Hillel, A., Hertz, T., Shental, N., Weinshall, D.: Learning Distance Functions using Equivalence Relations. In: Proc. of the 20th International Conference on Machine Learning, pp. 11–18 (2003)
Bilenko, M., Basu, S., Mooney, R.J.: Integrating Constraints and Metric Learning in Semi-Supervised Clustering. In: Proc. of the 21st International Conference on Machine Learning, pp. 81–88 (2004)
Bouraev, B., Briscoe, E.J., Carroll, J., Carter, D., Grover, C.: The Derivation of a Grammatically-Indexed Lexicon from the Longman Dictionary of Contemporary English. In: Proc. of the 25th Annual Meeting of the Association for Computational Linguistics, pp. 193–200 (1987)
Brew, C., Walde, S.S.: Spectral Clustering for German Verbs. In: Proc. of 2002 Conference on Empirical Methods in Natural Language Processing, pp. 117–123 (2002)
Briscoe, E.J., Carroll, J.: Automatic Extraction of Subcategorization from Corpora. In: Proc. of 5th ACL Conference on Applied Natural Language Processing, pp. 356-363 (1997)
Briscoe, E.J., Carroll, J.: Robust Accurate Statistical Annotaion of General Text. In: Proc. of 3rd International Conference on Language Resources and Evaluation, pp. 1499-1504 (2002)
Chen, K.J., Chen, C.J.: Automatic Semantic Classification for Chinese Unknown Compound Nouns. In: Proc. of 38th Annual Meeting of the Association for Computational Linguistics, pp. 125–130 (2000)
Dagan, I., Lee, L., Pereira, F.C.N.: Similarity-based Models of Word Cooccurrence Probabilities. Machine Learning 34(1-3), 43–69 (1999)
Dorr, B.: Large-scale Dictionary Construction for Foreign Language Tutoring and Interlingual Machine Translation. Machine Translation 128(4), 271–325 (1997)
Galley, M., McKeown, K.: Improving Word Sense Disambiguation in Lexical Chaining. In: Proc. of 19th International Joint Conference on Artificial Intelligence, pp. 1486–1488 (2003)
Grishman, R., Macleod, C., Meyers, A.: Complex Syntax: Building a Computational Lexicon. In: Proc. of International Conference on Computational Linguistics, pp. 268–272 (1994)
Hindle, D.: Noun Classification from Predicate-Argument Structures. In: Proc. of 28th Annual Meeting of the Association for Computational Linguistics, pp. 268–275 (1990)
Hughes, J.: Automatically Acquiring Classification of Words, Ph.D. thesis University of Leeds (1994)
Jannink, J., Wiederhold, G.: Thesaurus Entry Extraction from an Online Dictionary. In: Proc. of Fusion 1999 (1999)
Kermanidis, K., Maragoudakis, M., Fakotakis, N., Kokkinakis, G.K.: Natural Language Engineering. Learning Verb Complements for Modern Greek: Balancing the Noisy Dataset 14(1), 71–100 (2008)
Kudo, T., Matsumoto, Y.: Fast Methods for Kernel-based Text Analysis. In: Proc. of 41st Annual Meeting of the Association for Computational Linguistics, pp. 24–31 (2003)
Kirkpatrick, S., Gelatt Jr., C.D., Vecchi, M.P.: Optimization by Simulated Annealing. Science 220(4598), 671–680 (1983)
Korhonen, A.: Subcategorization Acquisition, Ph.D. thesis University of Cambridge (2002)
Korhonen, A., Krymolowski, Y., Marx, Z.: Clustering Polysemic Subcategorization Frame Distributions Semantically. In: Proc. of the 41st Annual Meeting of the Association for Computational Linguistics, pp. 64–71 (2003)
Korhonen, A., Krymolowski, Y., Briscoe, T.: A Large Subcategorization Lexicon for Natural Language Processing Applications. In: Proc. of the 5th International Conference on Language Resources and Evaluation (2006)
Lee, L.: Measures of Distributional Similarity. In: Proc. of the 37th Annual Meeting of the Association for Computational Linguistics, pp. 25–32 (1999)
Leech, G.: 100 Million Words of English: The British National Corpus. Language Research 28(1), 1–13 (1992)
Levin, B.: English Verb Classes and Alternations. Chicago University Press (1993)
Lin, D.: Automatic Retrieval and Clustering of Similar Words. In: Proc. of 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, pp. 768–773 (1998)
Matsuo, Y., Sakaki, T., Uchiyama, K., Ishizuka, M.: Graph-based Word Clustering using a Web Search Engine. In: Proc. of 2006 Conference on Empirical Methods in Natural Language Processing (EMNLP 2006), pp. 542–550 (2006)
Mihalcea, R.: Unsupervised Large Vocabulary Word Sense Disambiguation with Graph-based Algorithms for Sequence Data Labeling. In: Proc. of the Human Language Technology/Empirical Methods in Natural Language Processing Conference, pp. 411–418 (2005)
Muller, P., Hathout, N., Gaume, B.: Synonym Extraction Using a Semantic Distance on a Dictionary. In: Proc. of the Workshop on TextGraphs, pp. 65–72 (2006)
Navigli, R.: A Structural Approach to the Automatic Adjudication of Word Sense Disagreements. Natural Language Engineering 14(4), 547–573 (2008)
Navigli, R.: Word Sense Disambiguation: A Survey. ACM Computing Surveys 41(2), 1–69 (2009)
Ng, A.Y., Jordan, M.I., Weiss, Y.: On Spectral Clustering: Analysis and an Algorithm. In: Advances in Neural Information Processing Systems 14. MIT Press (2002)
Pereira, F., Tishby, N., Lee, L.: Distributional Clustering of English Words. In: Proc. of the 31st Annual Meeting of the Association for Computational Linguistics, pp. 183–190 (1993)
Reichardt, J., Bornholdt, S.: Detecting Fuzzy Community Structure in Complex Networks with a Potts Model. Physical Review Letters 93(21) (2004)
Reichardt, J., Bornholdt, S.: Statistical Mechanics of Community Detection. Physical Review E 74 (2006)
Reiter, E., Dale, R.: Building Natural Language Generation Systems. Cambridge University Press (2000)
Resnik, P.: Using Information Content to Evaluate Semantic Similarity in a Taxonomy. In: Proc. of 14th International Joint Conference on Artificial Intelligence, pp. 448–453 (1995)
Rooth, M.: Two-Dimensional Clusters in Grammatical Relations. In: Inducing Lexicons with the EM Algorithm, AIMS Report 4(3) (1998)
Rooth, M., Riezler, S., Prescher, D., Carroll, G., Beil, F.: Inducing a Semantically Annotated Lexicon via EM-Based Clustering. In: Proc. of the 37th Annual Meeting of the Association for Computational Linguistics (1999)
Schulte im Walde, S.: Clustering Verbs Semantically according to their Alternation Behaviour. In: Proc. of the 18th International Conference on Computational Linguistics, pp. 747–753 (2000)
Schulte im Walde, S., Brew, C.: Inducing German Semantic Verb Classes from Purely Syntactic Subcategorisation Information. In: Proc. of the 40th Annual Meeting of the Association for Computational Linguistics (2002)
Schulte im Walde, S.: Experiments on the Automatic Induction of German Semantic Verb Classes. Computational Linguistics 32(2), 159–194 (2006)
Schulte im Walde, S., Hying, C., Scheible, C., Schmid, H.: Combining EM Training and the MDL Principle for an Automatic Verb Classification Incorporating Selectional Preferences. In: Proc. of the 46th Annual Meeting of the Association for Computational Linguistics, pp. 496–504 (2008)
Sinha, R., Mihalcea, R.: Unsupervised Graph-based Word Sense Disambiguation Using Measures of Word Semantic Similarity. In: Proc. of the IEEE International Conference on Semantic Computing, pp. 46–54 (2007)
Stevenson, S., Joanis, E.: Semi-Supervised Verb-Class Discovery using Noisy Features. In: Proc. of the 7th Conference on Natural Language Learning at HLT-NAACL 2003, pp. 71–78 (2003)
Wagstaff, K., Cardie, C., Rogers, S., Schroedl, S.: Constrained K-Means Clustering with Background Knowledge. In: Proc. of 18th International Conference on Machine Learning, pp. 577–584 (2001)
D. Widdows and B. Dorow, A Graph Model for Unsupervised Lexical Acquisition. In: Proc. of 19th International Conference on Computational Linguistics (COLING 2002), pp. 1093-1099, (2002)
Witten, I.H., Bell, T.C.: The Zero-Frequency Problem: Estimating the Probabilities of Novel Events in Adaptive Text Compression. IEEE Transactions on Information Theory 37(4), 1085–1094 (1991)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Fukumoto, F., Suzuki, Y. (2013). Graph-Based Semi-supervised Clustering for Semantic Classification of Unknown Words. In: Fred, A., Dietz, J.L.G., Liu, K., Filipe, J. (eds) Knowledge Discovery, Knowledge Engineering and Knowledge Management. IC3K 2011. Communications in Computer and Information Science, vol 348. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37186-8_16
Download citation
DOI: https://doi.org/10.1007/978-3-642-37186-8_16
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-37185-1
Online ISBN: 978-3-642-37186-8
eBook Packages: Computer ScienceComputer Science (R0)