Bidirectional Encoder Representations from Transformers (BERT): A sentiment analysis odyssey

Alaparthi, Shivaji; Mishra, Manit

Computer Science > Computation and Language

arXiv:2007.01127 (cs)

[Submitted on 2 Jul 2020]

Title:Bidirectional Encoder Representations from Transformers (BERT): A sentiment analysis odyssey

Authors:Shivaji Alaparthi (Data Scientist, CenturyLink, Bengaluru, India), Manit Mishra (Associate Professor, International Management Institute Bhubaneswar, India)

View PDF

Abstract:The purpose of the study is to investigate the relative effectiveness of four different sentiment analysis techniques: (1) unsupervised lexicon-based model using Sent WordNet; (2) traditional supervised machine learning model using logistic regression; (3) supervised deep learning model using Long Short-Term Memory (LSTM); and, (4) advanced supervised deep learning models using Bidirectional Encoder Representations from Transformers (BERT). We use publicly available labeled corpora of 50,000 movie reviews originally posted on internet movie database (IMDB) for analysis using Sent WordNet lexicon, logistic regression, LSTM, and BERT. The first three models were run on CPU based system whereas BERT was run on GPU based system. The sentiment classification performance was evaluated based on accuracy, precision, recall, and F1 score. The study puts forth two key insights: (1) relative efficacy of four highly advanced and widely used sentiment analysis techniques; (2) undisputed superiority of pre-trained advanced supervised deep learning BERT model in sentiment analysis from text data. This study provides professionals in analytics industry and academicians working on text analysis key insight regarding comparative classification performance evaluation of key sentiment analysis techniques, including the recently developed BERT. This is the first research endeavor to compare the advanced pre-trained supervised deep learning model of BERT vis-à-vis other sentiment analysis models of LSTM, logistic regression, and Sent WordNet.

Comments:	15 pages, 1 table
Subjects:	Computation and Language (cs.CL); Machine Learning (stat.ML)
Cite as:	arXiv:2007.01127 [cs.CL]
	(or arXiv:2007.01127v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2007.01127

Submission history

From: Manit Mishra [view email]
[v1] Thu, 2 Jul 2020 14:23:57 UTC (207 KB)

Computer Science > Computation and Language

Title:Bidirectional Encoder Representations from Transformers (BERT): A sentiment analysis odyssey

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Bidirectional Encoder Representations from Transformers (BERT): A sentiment analysis odyssey

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators