research-article

Entity-centric summarization: generating text summaries for graph snippets

Author:

Shruti ChhabraAuthors Info & Claims

WWW '14 Companion: Proceedings of the 23rd International Conference on World Wide Web

Pages 33 - 38

https://doi.org/10.1145/2567948.2567959

Published: 07 April 2014 Publication History

Abstract

In recent times, focus of information retrieval community has shifted from traditional keyword-based retrieval to techniques utilizing the semantics in the text. Since such techniques require the understanding of relationships between entities, efforts are ongoing to organize the Web into large entity-relationship graphs. These graphs can be leveraged to answer complex relationship queries. However, most of the research has focused upon extracting structural information between entities such as a path, Steiner tree, or subgraphs. Little attention has been paid to the comprehension of these structural results, which is necessary for the user to understand relationships encapsulated in these structures. In this doctoral proposal, we pursue the idea of entity-centric summarization and propose a novel framework to produce entity-centric summaries which describe the relationships among input entities. We discuss the inherent challenges associated with each module in the framework and present an evaluation plan. Results from our preliminary experiments are encouraging and substantiate the feasibility of summarization problem.

References

[1]

Agrawal, R., Chakraborty, S., Gollapudi, S., Kannan, A., and Kenthapadi, K. Empowering authors to diagnose comprehension burden in textbooks. In KDD (2012), pp. 967--975.

Digital Library

[2]

Anyanwu, K., Maduko, A., and Sheth, A. Semrank: ranking complex relationship search results on the semantic web. In WWW (2005), pp. 117--127.

Digital Library

[3]

Anyanwu, K., and Sheth, A. ρ-queries: enabling querying for semantic associations on the semantic web. In WWW (2003), pp. 690--699.

Digital Library

[4]

Bizer, C., Lehmann, J., Kobilarov, G., Auer, S., Becker, C., Cyganiak, R., and Hellmann, S. Dbpedia - a crystallization point for the web of data. Web Semant. 7, 3 (2009), 154--165.

Digital Library

[5]

Blanco, R., and Zaragoza, H. Finding support sentences for entities. In SIGIR (2010), pp. 339--346.

Digital Library

[6]

Bollacker, K., Evans, C., Paritosh, P., Sturge, T., and Taylor, J. Freebase: a collaboratively created graph database for structuring human knowledge. In SIGMOD (2008), pp. 1247--1250.

Digital Library

[7]

Carlson, A., Betteridge, J., Kisiel, B., Settles, B., Jr., E. R. H., and Mitchell, T. M. Toward an architecture for never-ending language learning. In AAAI (2010).

Digital Library

[8]

Cheng, G., Tran, T., and Qu, Y. Relin: relatedness and informativeness-based centrality for entity summarization. In ISWC (2011), pp. 114--129.

Digital Library

[9]

Chhabra, S., and Bedathur, S. Generating text summaries of graph snippets. In COMAD (2013), pp. 121--124.

Digital Library

[10]

Chhabra, S., and Bedathur, S. Towards generating text summaries for entity chains. In ECIR (2014).

[11]

Cohen, T., Whitfield, G. K., Schvaneveldt, R. W., Mukund, K., and Rindflesch, T. Epiphanet: an interactive tool to support biomedical discoveries. JBDC 5 (2010), 21--49.

[12]

Dang, V., and Croft, W. B. Diversity by proportionality: an election-based approach to search result diversification. In SIGIR (2012), pp. 65--74.

Digital Library

[13]

Drosou, M., and Pitoura, E. Search result diversification. SIGMOD Record 39, 1 (2010), 41--47.

Digital Library

[14]

Etzioni, O., Fader, A., Christensen, J., Soderland, S., and Mausam, M. Open information extraction: the second generation. In IJCAI (2011), pp. 3--10.

Digital Library

[15]

Faloutsos, C., McCurley, K. S., and Tomkins, A. Fast discovery of connection subgraphs. In KDD (2004), pp. 118--127.

Digital Library

[16]

Fang, L., Sarma, A. D., Yu, C., and Bohannon, P. Rex: explaining relationships between entity pairs. VLDB Endowment 5, 3 (2011).

Digital Library

[17]

Filippova, K., Surdeanu, M., Ciaramita, M., and Zaragoza, H. Company-oriented Extractive Summarization of Financial News. In EACL (2009), pp. 246--254.

Digital Library

[18]

Gray, W. S., and Leary, B. E. What makes a book readable.

[19]

Halaschek, C., Aleman-Meza, B., Arpinar, I. B., and Sheth, A. P. Discovering and ranking semantic associations over a large rdf metabase. In VLDB Endowment (2004), pp. 1317--1320.

Digital Library

[20]

Hoffart, J., Suchanek, F. M., Berberich, K., Lewis-Kelham, E., de Melo, G., and Weikum, G. Yago2: exploring and querying world knowledge in time, space, context, and many languages. In WWW (2011), pp. 229--232.

Digital Library

[21]

Hristovski, D., Friedman, C., Rindflesch, T. C., and Peterlin, B. Exploiting semantic relations for literature-based discovery. In AMIA Annu Symp (2006), vol. 2006, pp. 349--353.

[22]

Hristovski, D., Kastrin, A., Peterlin, B., and Rindflesch, T. C. Combining semantic relations and dna microarray data for novel hypotheses generation. In BioLINK SIG. 2010, pp. 53--61.

Digital Library

[23]

Jin, W., Srihari, R. K., Ho, H. H., and Wu, X. Improving knowledge discovery in document collections through combining text retrieval and link analysis techniques. In ICDM (2007), pp. 193--202.

Digital Library

[24]

Kasneci, G. Searching and ranking in entity-relationship graphs.

[25]

Kasneci, G., Ramanath, M., Sozio, M., Suchanek, F. M., and Weikum, G. Star: Steiner-tree approximation in relationship graphs. In ICDE (2009), pp. 868--879.

Digital Library

[26]

Kintsch, W., and Van Dijk, T. A. Toward a model of text comprehension and production. Psychological review 85, 5 (1978), 363--394.

[27]

Landauer, T. K., Foltz, P. W., and Laham, D. An introduction to latent semantic analysis. Discourse processes 25, 2--3 (1998), 259--284.

[28]

Lin, C.-Y. Rouge: A package for automatic evaluation of summaries. In ACL (2004), pp. 74--81.

[29]

Liu, X., Nie, Z., Yu, N., and Wen, J.-R. Biosnowball: automated population of wikis. In KDD (2010), pp. 969--978.

Digital Library

[30]

Mani, I. Automatic summarization, vol. 3. John Benjamins Publishing, 2001.

[31]

Nakashole, N., Weikum, G., and Suchanek, F. Patty: a taxonomy of relational patterns with semantic types. In EMNLP-CoNLL (2012), pp. 1135--1145.

Digital Library

[32]

Nenkova, A., and McKeown, K. R. Automatic summarization. Now Publishers Inc, 2011.

[33]

Pitler, E., and Nenkova, A. Revisiting readability: A unified framework for predicting text quality. In EMNLP (2008), pp. 186--195.

Digital Library

[34]

Sauper, C., and Barzilay, R. Automatically generating wikipedia articles: a structure-aware approach. In ACL-IJCNLP (2009), pp. 208--216.

Digital Library

[35]

Srihari, R. K., Xu, L., and Saxena, T. Use of ranked cross document evidence trails for hypothesis generation. In KDD (2007), pp. 677--686.

Digital Library

[36]

Srinivasan, P. Text mining: generating hypotheses from medline. JASIST 55, 5 (2004), 396--413.

Digital Library

[37]

Swanson, D. R. Two medical literatures that are logically but not bibliographically connected. JASIS 38, 4 (1987), 228--233.

Cited By

Ponza MCeccarelli DFerragina PMeij EKothari SLewin-Eytan LCarmel DYom-Tov EAgichtein EGabrilovich E(2021)Contextualizing Trending Entities in News StoriesProceedings of the 14th ACM International Conference on Web Search and Data Mining10.1145/3437963.3441765(346-354)Online publication date: 8-Mar-2021
https://dl.acm.org/doi/10.1145/3437963.3441765
Thomas ASangeetha S(2021) Semi‐supervised, knowledge‐integrated pattern learning approach for fact extraction from judicial text Expert Systems10.1111/exsy.1265638:3Online publication date: 5-Jan-2021
https://doi.org/10.1111/exsy.12656
Thomas ASangeetha S(2019)An innovative hybrid approach for extracting named entities from unstructured text dataComputational Intelligence10.1111/coin.1221435:4(799-826)Online publication date: 25-Apr-2019
https://doi.org/10.1111/coin.12214
Show More Cited By

Index Terms

Entity-centric summarization: generating text summaries for graph snippets
1. Information systems
  1. Information retrieval

Recommendations

Entity-centric topic-oriented opinion summarization in twitter
KDD '12: Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining

Microblogging services, such as Twitter, have become popular channels for people to express their opinions towards a broad range of topics. Twitter generates a huge volume of instant messages (i.e. tweets) carrying users' sentiments and attitudes every ...
An entity-guided text summarization framework with relational heterogeneous graph neural network
Abstract
Two of the most crucial issues for text summarization to generate faithful summaries are to make use of knowledge beyond text and to make use of cross-sentence relations in text. Intuitive ways for the two issues are knowledge graph (KG) and graph ...
Intertopic information mining for query-based summarization

In this article, the authors address the problem of sentence ranking in summarization. Although most existing summarization approaches are concerned with the information embodied in a particular topic (including a set of documents and an associated ...

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences

WWW '14 Companion: Proceedings of the 23rd International Conference on World Wide Web

April 2014

1396 pages

ISBN:9781450327459

DOI:10.1145/2567948

General Chair:
Chin-Wan Chung
Korea Advanced Institute of Science and Technology, Korea
,
Program Chairs:
Andrei Broder
Google Inc., USA
,
Kyuseok Shim
Seoul National University, Korea
,
Torsten Suel
New York University, USA

Copyright © 2014 Copyright is held by the International World Wide Web Conference Committee (IW3C2).

Sponsors

IW3C2: International World Wide Web Conference Committee

In-Cooperation

SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 April 2014

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

TCS research fellowship

Conference

WWW '14

Sponsor:

IW3C2

WWW '14: 23rd International World Wide Web Conference

April 7 - 11, 2014

Seoul, Korea

Acceptance Rates

Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

4
Total Citations
View Citations
216
Total Downloads

Downloads (Last 12 months)4
Downloads (Last 6 weeks)0

Reflects downloads up to 23 Dec 2024

Other Metrics

View Author Metrics

Citations

Cited By

Ponza MCeccarelli DFerragina PMeij EKothari SLewin-Eytan LCarmel DYom-Tov EAgichtein EGabrilovich E(2021)Contextualizing Trending Entities in News StoriesProceedings of the 14th ACM International Conference on Web Search and Data Mining10.1145/3437963.3441765(346-354)Online publication date: 8-Mar-2021
https://dl.acm.org/doi/10.1145/3437963.3441765
Thomas ASangeetha S(2021) Semi‐supervised, knowledge‐integrated pattern learning approach for fact extraction from judicial text Expert Systems10.1111/exsy.1265638:3Online publication date: 5-Jan-2021
https://doi.org/10.1111/exsy.12656
Thomas ASangeetha S(2019)An innovative hybrid approach for extracting named entities from unstructured text dataComputational Intelligence10.1111/coin.1221435:4(799-826)Online publication date: 25-Apr-2019
https://doi.org/10.1111/coin.12214
Redondo García JRizzo GTroncy RBarker KGómez-Pérez J(2015)The Concentric Nature of News Semantic SnapshotsProceedings of the 8th International Conference on Knowledge Capture10.1145/2815833.2815836(1-8)Online publication date: 7-Oct-2015
https://dl.acm.org/doi/10.1145/2815833.2815836

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents