Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Paper The following article is Open access

A Word Elimination Strategy for Learning Document Representation

, , and

Published under licence by IOP Publishing Ltd
, , Citation Ying Liu et al 2018 IOP Conf. Ser.: Mater. Sci. Eng. 466 012091 DOI 10.1088/1757-899X/466/1/012091

1757-899X/466/1/012091

Abstract

Computing word vectors based on neural network has motivations on document representation. Word elimination can enhance the extract effective of the quality of valuable feature of a document. In this paper, we propose a model named PV-IDF to eliminate redundancy and refine features to improve the performance of the classify model, with which tokens that carry semantic information of a document. The results show that PV-IDF model achieves state-of-art performance, especially for short-length document representation.

Export citation and abstract BibTeX RIS

Content from this work may be used under the terms of the Creative Commons Attribution 3.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.

Please wait… references are loading.
10.1088/1757-899X/466/1/012091