Character-Aware Neural Language Models

Kim, Yoon; Jernite, Yacine; Sontag, David; Rush, Alexander M.

arXiv:1508.06615v2 (cs)

[Submitted on 26 Aug 2015 (v1), revised 17 Sep 2015 (this version, v2), latest version 1 Dec 2015 (v4)]

Abstract: We describe a simple neural language model that relies only on character-level inputs. Predictions are still made at the word level. Our model employs a convolutional neural network (CNN) and a highway network over characters, whose output is given to a long short-term memory (LSTM) recurrent neural network language model (RNN-LM). On the English Penn Treebank the model is on par with the existing state of the art despite having 60% fewer parameters. On languages with rich morphology (Czech, German, French, Spanish, Russian), the model consistently outperforms a Kneser-Ney baseline and word-level/morpheme-level LSTM baselines, again with far fewer parameters. Our results suggest that on many languages, character inputs are sufficient for language modeling.
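The character pipeline named in the abstract (a CNN over characters followed by a highway network) can be sketched as below. This is a minimal illustration, not the authors' implementation: the abstract gives no equations, so the tanh convolution with max-over-time pooling, the ReLU highway transform, the filter widths, the embedding size, the zero padding of short words, and every name in the code are assumptions. The LSTM that consumes the resulting word vectors is omitted.

```python
import math
import random


def char_cnn_features(char_vecs, filters):
    """One max-over-time pooled feature per filter.

    char_vecs: one embedding (list of floats) per character of a non-empty word.
    filters:   each filter is a list of `width` weight vectors.
    Short words are zero-padded up to the filter width (an assumption).
    """
    dim = len(char_vecs[0])
    features = []
    for filt in filters:
        width = len(filt)
        padded = char_vecs + [[0.0] * dim] * max(0, width - len(char_vecs))
        responses = []
        for start in range(len(padded) - width + 1):
            window = padded[start:start + width]
            score = sum(w * x
                        for wv, xv in zip(filt, window)
                        for w, x in zip(wv, xv))
            responses.append(math.tanh(score))
        features.append(max(responses))  # max over character positions
    return features


def affine(W, b, x):
    """Compute W x + b for a row-major matrix W."""
    return [sum(w * xi for w, xi in zip(row, x)) + bi for row, bi in zip(W, b)]


def highway(x, W_h, b_h, W_t, b_t):
    """Standard highway layer: t * relu(W_h x + b_h) + (1 - t) * x."""
    h = [max(0.0, v) for v in affine(W_h, b_h, x)]
    t = [1.0 / (1.0 + math.exp(-v)) for v in affine(W_t, b_t, x)]
    return [ti * hi + (1.0 - ti) * xi for ti, hi, xi in zip(t, h, x)]


def encode_word(word, char_emb, filters, hw_params):
    """Map a word's characters to a fixed-size vector for a word-level LM."""
    char_vecs = [char_emb[c] for c in word]
    return highway(char_cnn_features(char_vecs, filters), *hw_params)


# Hypothetical toy parameters; a real model would learn all of these.
rng = random.Random(0)
DIM = 4  # character embedding size (assumed)


def rand_vec(n):
    return [rng.uniform(-0.5, 0.5) for _ in range(n)]


char_emb = {c: rand_vec(DIM) for c in "abcdefghijklmnopqrstuvwxyz"}
filters = [[rand_vec(DIM) for _ in range(width)]
           for width in (1, 2, 3) for _ in range(2)]
n = len(filters)
hw = ([rand_vec(n) for _ in range(n)], [0.0] * n,
      [rand_vec(n) for _ in range(n)], [-2.0] * n)  # negative gate bias favors carry

vec = encode_word("language", char_emb, filters, hw)
```

The output has one entry per filter regardless of word length, so a word-level recurrent model can consume it in place of a word-embedding lookup. Under these assumptions the parameters scale with the character vocabulary and filter count rather than with the word vocabulary, which is one plausible reading of the parameter savings the abstract reports.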

Subjects: Computation and Language (cs.CL); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
Cite as: arXiv:1508.06615 [cs.CL] (or arXiv:1508.06615v2 [cs.CL] for this version)
DOI: https://doi.org/10.48550/arXiv.1508.06615

Submission history

From: Yoon Kim
[v1] Wed, 26 Aug 2015 19:25:34 UTC (211 KB)
[v2] Thu, 17 Sep 2015 23:18:00 UTC (209 KB)
[v3] Fri, 16 Oct 2015 03:18:13 UTC (209 KB)
[v4] Tue, 1 Dec 2015 22:59:24 UTC (209 KB)
