Google Scholar

[PDF][PDF] Do convolutional networks need to be deep for text classification?

HT Le, C Cerisara, A Denis - Workshops at the Thirty-Second AAAI …, 2018 - cdn.aaai.org

Workshops at the Thirty-Second AAAI Conference on Artificial Intelligence, 2018•cdn.aaai.org

We study in this work the importance of depth in convolutional models for text classification,
either when character or word inputs are considered. We show on 5 standard text
classification and sentiment analysis tasks that deep models indeed give better
performances than shallow networks when the text input is represented as a sequence of
characters. However, a simple shallow-and-wide network outperforms deep models such as
DenseNet with word inputs. Our shallow word model further establishes new state-of-the-art …

Abstract

We study in this work the importance of depth in convolutional models for text classification, either when character or word inputs are considered. We show on 5 standard text classification and sentiment analysis tasks that deep models indeed give better performances than shallow networks when the text input is represented as a sequence of characters. However, a simple shallow-and-wide network outperforms deep models such as DenseNet with word inputs. Our shallow word model further establishes new state-of-the-art performances on two datasets: Yelp Binary (95.9%) and Yelp Full (64.9%).

cdn.aaai.org

Show moreShow less

Save Cite Cited by 142 Related articles All 10 versions View as HTML

Cite

Advanced search

Saved to My library

[PDF][PDF] Do convolutional networks need to be deep for text classification?