Abstract
Cohesion between components of collocations is already acknowledged measurable by means of the Web, and cohesion measurements are used for some applications and extraction of new collocations. Taking a specific cohesion criterion SCI, we performed massive evaluations of collocate cohesion in Oxford Collocations Dictionary. For three groups of modificative collocations (adjective—noun, adverb—adjective, and adverb— verb) SCI distributions proved to be one-peaked and compact, with rather close mean values and standard deviations. Thus we suggest a reliable numeric criterion for extraction of collocations from the Web.
Work done under partial support of Mexican Government (CONACyT, SNI) and CGEPI-IPN, Mexico.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Bolshakov, I.A., Bolshakova, E.I.: Measurements of lexico-syntactic cohesion by means of internet. In: Gelbukh, A., de Albornoz, Á., Terashima-Marín, H. (eds.) MICAI 2005. LNCS (LNAI), vol. 3789, pp. 790–799. Springer, Heidelberg (2005)
Keller, F., Lapata, M.: Using the Web to Obtain Frequencies for Unseen Bigram. Computational linguistics 29(3), 459–484 (2003)
Mitkov, R. (ed.): The Oxford Handbook of Computational Linguistics. OU Press (2003)
Mel’čuk, I.: Phrasemes in Language and Phraseology in Linguistics. In: Everaert, M., et al. (eds.) Idioms: Structural and Psychological Perspectives, pp. 169–252. Lawrence Erlbaum Associates Publ, Hillsdale (1995)
Oxford Collocations Dictionary for Students of English. OU Press (2003)
Seretan, V., Nerima, L., Wehrli, E.: Using the Web as a Corpus for the Syntactic-Based Collocation Identification. In: Proc. Int. Conf. Language Resources and Evaluation (LREC 2004), Lisbon, Portugal, pp. 1871–1874 (2004)
Smadja, F.: Retreiving Collocations from text: Xtract. Computational Linguistics 19(1), 143–177 (1990)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bolshakov, I.A., Galicia-Haro, S.N. (2006). Web-Based Measurements of Intra-collocational Cohesion in Oxford Collocations Dictionary. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2006. Lecture Notes in Computer Science, vol 3878. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11671299_10
Download citation
DOI: https://doi.org/10.1007/11671299_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-32205-4
Online ISBN: 978-3-540-32206-1
eBook Packages: Computer ScienceComputer Science (R0)