Sense Embedding Learning for Word Sense Induction

Song, Linfeng; Wang, Zhiguo; Mi, Haitao; Gildea, Daniel

Computer Science > Computation and Language

arXiv:1606.05409 (cs)

[Submitted on 17 Jun 2016 (v1), last revised 22 Jun 2016 (this version, v2)]

Title:Sense Embedding Learning for Word Sense Induction

Authors:Linfeng Song, Zhiguo Wang, Haitao Mi, Daniel Gildea

View PDF

Abstract:Conventional word sense induction (WSI) methods usually represent each instance with discrete linguistic features or cooccurrence features, and train a model for each polysemous word individually. In this work, we propose to learn sense embeddings for the WSI task. In the training stage, our method induces several sense centroids (embedding) for each polysemous word. In the testing stage, our method represents each instance as a contextual vector, and induces its sense by finding the nearest sense centroid in the embedding space. The advantages of our method are (1) distributed sense vectors are taken as the knowledge representations which are trained discriminatively, and usually have better performance than traditional count-based distributional models, and (2) a general model for the whole vocabulary is jointly trained to induce sense centroids under the mutlitask learning framework. Evaluated on SemEval-2010 WSI dataset, our method outperforms all participants and most of the recent state-of-the-art methods. We further verify the two advantages by comparing with carefully designed baselines.

Comments:	6 pages, no figures in *SEM 2016
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:1606.05409 [cs.CL]
	(or arXiv:1606.05409v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1606.05409

Submission history

From: Linfeng Song [view email]
[v1] Fri, 17 Jun 2016 02:49:52 UTC (247 KB)
[v2] Wed, 22 Jun 2016 04:59:08 UTC (23 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2016-06

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Linfeng Song
Zhiguo Wang
Haitao Mi
Daniel Gildea

export BibTeX citation

Computer Science > Computation and Language

Title:Sense Embedding Learning for Word Sense Induction

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Sense Embedding Learning for Word Sense Induction

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators