Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2567948.2567959acmotherconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
research-article

Entity-centric summarization: generating text summaries for graph snippets

Published: 07 April 2014 Publication History

Abstract

In recent times, focus of information retrieval community has shifted from traditional keyword-based retrieval to techniques utilizing the semantics in the text. Since such techniques require the understanding of relationships between entities, efforts are ongoing to organize the Web into large entity-relationship graphs. These graphs can be leveraged to answer complex relationship queries. However, most of the research has focused upon extracting structural information between entities such as a path, Steiner tree, or subgraphs. Little attention has been paid to the comprehension of these structural results, which is necessary for the user to understand relationships encapsulated in these structures. In this doctoral proposal, we pursue the idea of entity-centric summarization and propose a novel framework to produce entity-centric summaries which describe the relationships among input entities. We discuss the inherent challenges associated with each module in the framework and present an evaluation plan. Results from our preliminary experiments are encouraging and substantiate the feasibility of summarization problem.

References

[1]
Agrawal, R., Chakraborty, S., Gollapudi, S., Kannan, A., and Kenthapadi, K. Empowering authors to diagnose comprehension burden in textbooks. In KDD (2012), pp. 967--975.
[2]
Anyanwu, K., Maduko, A., and Sheth, A. Semrank: ranking complex relationship search results on the semantic web. In WWW (2005), pp. 117--127.
[3]
Anyanwu, K., and Sheth, A. ρ-queries: enabling querying for semantic associations on the semantic web. In WWW (2003), pp. 690--699.
[4]
Bizer, C., Lehmann, J., Kobilarov, G., Auer, S., Becker, C., Cyganiak, R., and Hellmann, S. Dbpedia - a crystallization point for the web of data. Web Semant. 7, 3 (2009), 154--165.
[5]
Blanco, R., and Zaragoza, H. Finding support sentences for entities. In SIGIR (2010), pp. 339--346.
[6]
Bollacker, K., Evans, C., Paritosh, P., Sturge, T., and Taylor, J. Freebase: a collaboratively created graph database for structuring human knowledge. In SIGMOD (2008), pp. 1247--1250.
[7]
Carlson, A., Betteridge, J., Kisiel, B., Settles, B., Jr., E. R. H., and Mitchell, T. M. Toward an architecture for never-ending language learning. In AAAI (2010).
[8]
Cheng, G., Tran, T., and Qu, Y. Relin: relatedness and informativeness-based centrality for entity summarization. In ISWC (2011), pp. 114--129.
[9]
Chhabra, S., and Bedathur, S. Generating text summaries of graph snippets. In COMAD (2013), pp. 121--124.
[10]
Chhabra, S., and Bedathur, S. Towards generating text summaries for entity chains. In ECIR (2014).
[11]
Cohen, T., Whitfield, G. K., Schvaneveldt, R. W., Mukund, K., and Rindflesch, T. Epiphanet: an interactive tool to support biomedical discoveries. JBDC 5 (2010), 21--49.
[12]
Dang, V., and Croft, W. B. Diversity by proportionality: an election-based approach to search result diversification. In SIGIR (2012), pp. 65--74.
[13]
Drosou, M., and Pitoura, E. Search result diversification. SIGMOD Record 39, 1 (2010), 41--47.
[14]
Etzioni, O., Fader, A., Christensen, J., Soderland, S., and Mausam, M. Open information extraction: the second generation. In IJCAI (2011), pp. 3--10.
[15]
Faloutsos, C., McCurley, K. S., and Tomkins, A. Fast discovery of connection subgraphs. In KDD (2004), pp. 118--127.
[16]
Fang, L., Sarma, A. D., Yu, C., and Bohannon, P. Rex: explaining relationships between entity pairs. VLDB Endowment 5, 3 (2011).
[17]
Filippova, K., Surdeanu, M., Ciaramita, M., and Zaragoza, H. Company-oriented Extractive Summarization of Financial News. In EACL (2009), pp. 246--254.
[18]
Gray, W. S., and Leary, B. E. What makes a book readable.
[19]
Halaschek, C., Aleman-Meza, B., Arpinar, I. B., and Sheth, A. P. Discovering and ranking semantic associations over a large rdf metabase. In VLDB Endowment (2004), pp. 1317--1320.
[20]
Hoffart, J., Suchanek, F. M., Berberich, K., Lewis-Kelham, E., de Melo, G., and Weikum, G. Yago2: exploring and querying world knowledge in time, space, context, and many languages. In WWW (2011), pp. 229--232.
[21]
Hristovski, D., Friedman, C., Rindflesch, T. C., and Peterlin, B. Exploiting semantic relations for literature-based discovery. In AMIA Annu Symp (2006), vol. 2006, pp. 349--353.
[22]
Hristovski, D., Kastrin, A., Peterlin, B., and Rindflesch, T. C. Combining semantic relations and dna microarray data for novel hypotheses generation. In BioLINK SIG. 2010, pp. 53--61.
[23]
Jin, W., Srihari, R. K., Ho, H. H., and Wu, X. Improving knowledge discovery in document collections through combining text retrieval and link analysis techniques. In ICDM (2007), pp. 193--202.
[24]
Kasneci, G. Searching and ranking in entity-relationship graphs.
[25]
Kasneci, G., Ramanath, M., Sozio, M., Suchanek, F. M., and Weikum, G. Star: Steiner-tree approximation in relationship graphs. In ICDE (2009), pp. 868--879.
[26]
Kintsch, W., and Van Dijk, T. A. Toward a model of text comprehension and production. Psychological review 85, 5 (1978), 363--394.
[27]
Landauer, T. K., Foltz, P. W., and Laham, D. An introduction to latent semantic analysis. Discourse processes 25, 2--3 (1998), 259--284.
[28]
Lin, C.-Y. Rouge: A package for automatic evaluation of summaries. In ACL (2004), pp. 74--81.
[29]
Liu, X., Nie, Z., Yu, N., and Wen, J.-R. Biosnowball: automated population of wikis. In KDD (2010), pp. 969--978.
[30]
Mani, I. Automatic summarization, vol. 3. John Benjamins Publishing, 2001.
[31]
Nakashole, N., Weikum, G., and Suchanek, F. Patty: a taxonomy of relational patterns with semantic types. In EMNLP-CoNLL (2012), pp. 1135--1145.
[32]
Nenkova, A., and McKeown, K. R. Automatic summarization. Now Publishers Inc, 2011.
[33]
Pitler, E., and Nenkova, A. Revisiting readability: A unified framework for predicting text quality. In EMNLP (2008), pp. 186--195.
[34]
Sauper, C., and Barzilay, R. Automatically generating wikipedia articles: a structure-aware approach. In ACL-IJCNLP (2009), pp. 208--216.
[35]
Srihari, R. K., Xu, L., and Saxena, T. Use of ranked cross document evidence trails for hypothesis generation. In KDD (2007), pp. 677--686.
[36]
Srinivasan, P. Text mining: generating hypotheses from medline. JASIST 55, 5 (2004), 396--413.
[37]
Swanson, D. R. Two medical literatures that are logically but not bibliographically connected. JASIS 38, 4 (1987), 228--233.

Cited By

View all
  • (2021)Contextualizing Trending Entities in News StoriesProceedings of the 14th ACM International Conference on Web Search and Data Mining10.1145/3437963.3441765(346-354)Online publication date: 8-Mar-2021
  • (2021) Semi‐supervised, knowledge‐integrated pattern learning approach for fact extraction from judicial text Expert Systems10.1111/exsy.1265638:3Online publication date: 5-Jan-2021
  • (2019)An innovative hybrid approach for extracting named entities from unstructured text dataComputational Intelligence10.1111/coin.1221435:4(799-826)Online publication date: 25-Apr-2019
  • Show More Cited By

Index Terms

  1. Entity-centric summarization: generating text summaries for graph snippets

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    WWW '14 Companion: Proceedings of the 23rd International Conference on World Wide Web
    April 2014
    1396 pages
    ISBN:9781450327459
    DOI:10.1145/2567948

    Sponsors

    • IW3C2: International World Wide Web Conference Committee

    In-Cooperation

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 07 April 2014

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. entity-relationship graph
    2. related entities
    3. relationship description
    4. summarization

    Qualifiers

    • Research-article

    Funding Sources

    • TCS research fellowship

    Conference

    WWW '14
    Sponsor:
    • IW3C2

    Acceptance Rates

    Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)4
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 23 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2021)Contextualizing Trending Entities in News StoriesProceedings of the 14th ACM International Conference on Web Search and Data Mining10.1145/3437963.3441765(346-354)Online publication date: 8-Mar-2021
    • (2021) Semi‐supervised, knowledge‐integrated pattern learning approach for fact extraction from judicial text Expert Systems10.1111/exsy.1265638:3Online publication date: 5-Jan-2021
    • (2019)An innovative hybrid approach for extracting named entities from unstructured text dataComputational Intelligence10.1111/coin.1221435:4(799-826)Online publication date: 25-Apr-2019
    • (2015)The Concentric Nature of News Semantic SnapshotsProceedings of the 8th International Conference on Knowledge Capture10.1145/2815833.2815836(1-8)Online publication date: 7-Oct-2015

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media