Generating Sentiment Lexicons for German Twitter

Sidarenka, Uladzimir; Stede, Manfred

Computer Science > Computation and Language

arXiv:1610.09995 (cs)

[Submitted on 31 Oct 2016]

Title:Generating Sentiment Lexicons for German Twitter

Authors:Uladzimir Sidarenka, Manfred Stede

View PDF

Abstract:Despite a substantial progress made in developing new sentiment lexicon generation (SLG) methods for English, the task of transferring these approaches to other languages and domains in a sound way still remains open. In this paper, we contribute to the solution of this problem by systematically comparing semi-automatic translations of common English polarity lists with the results of the original automatic SLG algorithms, which were applied directly to German data. We evaluate these lexicons on a corpus of 7,992 manually annotated tweets. In addition to that, we also collate the results of dictionary- and corpus-based SLG methods in order to find out which of these paradigms is better suited for the inherently noisy domain of social media. Our experiments show that semi-automatic translations notably outperform automatic systems (reaching a macro-averaged F1-score of 0.589), and that dictionary-based techniques produce much better polarity lists as compared to corpus-based approaches (whose best F1-scores run up to 0.479 and 0.419 respectively) even for the non-standard Twitter genre.

Comments:	This paper is the first in a planned series of articles on an automatic generation of sentiment lexicons for non-English Twitter. It will be presented as a poster at the PEOPLES workshop (this https URL)
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:1610.09995 [cs.CL]
	(or arXiv:1610.09995v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1610.09995

Submission history

From: Wladimir Sidorenko [view email]
[v1] Mon, 31 Oct 2016 16:12:16 UTC (311 KB)

Computer Science > Computation and Language

Title:Generating Sentiment Lexicons for German Twitter

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Generating Sentiment Lexicons for German Twitter

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators