Hierarchical Neural Network for Extracting Knowledgeable Snippets and Documents

Zhou, Ganbin; Cao, Rongyu; Ao, Xiang; Luo, Ping; Lin, Fen; Lin, Leyu; He, Qing

Computer Science > Computation and Language

arXiv:1808.07228 (cs)

[Submitted on 22 Aug 2018]

Title:Hierarchical Neural Network for Extracting Knowledgeable Snippets and Documents

Authors:Ganbin Zhou, Rongyu Cao, Xiang Ao, Ping Luo, Fen Lin, Leyu Lin, Qing He

View PDF

Abstract:In this study, we focus on extracting knowledgeable snippets and annotating knowledgeable documents from Web corpus, consisting of the documents from social media and We-media. Informally, knowledgeable snippets refer to the text describing concepts, properties of entities, or relations among entities, while knowledgeable documents are the ones with enough knowledgeable snippets. These knowledgeable snippets and documents could be helpful in multiple applications, such as knowledge base construction and knowledge-oriented service. Previous studies extracted the knowledgeable snippets using the pattern-based method. Here, we propose the semantic-based method for this task. Specifically, a CNN based model is developed to extract knowledgeable snippets and annotate knowledgeable documents simultaneously. Additionally, a "low-level sharing, high-level splitting" structure of CNN is designed to handle the documents from different content domains. Compared with building multiple domain-specific CNNs, this joint model not only critically saves the training time, but also improves the prediction accuracy visibly. The superiority of the proposed method is demonstrated in a real dataset from Wechat public platform.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
Cite as:	arXiv:1808.07228 [cs.CL]
	(or arXiv:1808.07228v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1808.07228

Submission history

From: Ganbin Zhou [view email]
[v1] Wed, 22 Aug 2018 05:57:13 UTC (2,222 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2018-08

Change to browse by:

cs
cs.AI
cs.IR

References & Citations

DBLP - CS Bibliography

listing | bibtex

Ganbin Zhou
Rongyu Cao
Xiang Ao
Ping Luo
Fen Lin

…

export BibTeX citation

Computer Science > Computation and Language

Title:Hierarchical Neural Network for Extracting Knowledgeable Snippets and Documents

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Hierarchical Neural Network for Extracting Knowledgeable Snippets and Documents

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators