Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2872518.2891115acmotherconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
abstract

On the Retrieval of Wikipedia Articles Containing Claims on Controversial Topics

Published: 11 April 2016 Publication History

Abstract

This work presents a novel claim-oriented document retrieval task. For a given controversial topic, relevant articles containing claims that support or contest the topic are retrieved from a Wikipedia corpus. For that, a two-step retrieval approach is proposed. At the first step, an initial pool of articles that are relevant to the topic are retrieved using state-of-the-art retrieval methods. At the second step, articles in the initial pool are re-ranked according to their potential to contain as many relevant claims as possible using several claim discovery features. Hence, the second step aims at maximizing the overall claim recall of the retrieval system. Using a recently published claims benchmark, the proposed retrieval approach is demonstrated to provide more relevant claims compared to several other retrieval alternatives.

References

[1]
E. Aharoni, A. Polnarov, T. Lavee, D. Hershcovich, R. Levy, R. Rinott, D. Gutfreund, and N. Slonim. A benchmark dataset for automatic detection of claims and evidence in the context of controversial topics. In Proceedings of the First Workshop on Argumentation Mining, ACL '14, 2014.
[2]
P. Bellot, A. Doucet, S. Geva, S. Gurajada, J. Kamps, G. Kazai, M. Koolen, A. Mishra, V. Moriceau, J. Mothe, et al. Overview of inex 2013. In Information Access Evaluation. Multilinguality, Multimodality, and Visualization, pages 269--281. Springer, 2013.
[3]
E. Cabrio and S. Villata. Combining textual entailment and argumentation theory for supporting online debates interactions. In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Short Papers - Volume 2, ACL '12, pages 208--212, Stroudsburg, PA, USA, 2012. Association for Computational Linguistics.
[4]
J. P. Callan. Passage-level evidence in document retrieval. In Proceedings of the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '94, pages 302--310, New York, NY, USA, 1994. Springer-Verlag New York, Inc.
[5]
D. Carmel, E. Farchi, Y. Petruschka, and A. Soffer. Automatic query refinement using lexical affinities with maximal information gain. In Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '02, pages 283--290, New York, NY, USA, 2002. ACM.
[6]
S. Dori-Hacohen and J. Allan. Detecting controversy on the web. In Proceedings of the 22nd ACM international conference on Conference on information & knowledge management, CIKM '13, pages 1845--1848, New York, NY, USA, 2013. ACM.
[7]
A. Freeley and D. Steinberg. Argumentation and debate. Cengage Learning, 2013.
[8]
S. Geva, J. Kamps, M. Lethonen, R. Schenkel, J. A. Thom, and A. Trotman. Overview of the inex 2009 ad hoc track. focused retrieval and evaluation. In Focused retrieval and evaluation, pages 4--25. Springer, 2010.
[9]
O. Kolomiyets and M.-F. Moens. A survey on question answering technology from an information retrieval perspective. Information Sciences, 181(24):5412--5434, 2011.
[10]
M. Koolen, G. Kazai, M. Preminger, and A. Doucet. Overview of the inex 2013 social book search track. In In CLEF 2013 Evaluation Labs and Workshop, Online Working Notes, 2013.
[11]
R. Levy, Y. Bilu, D. Hershcovich, E. Aharoni, and N. Slonim. Context dependent claim detection. In Proceedings of the 25th International Conference on Computatinal Linguistics, COLIG '14, 2014.
[12]
B. Liu and L. Zhang. A survey of opinion mining and sentiment analysis. In Mining Text Data, pages 415--463. Springer, 2012.
[13]
O. Medelyan, D. Milne, C. Legg, and I. H. Witten. Mining meaning from wikipedia. Int. J. Hum.-Comput. Stud., 67(9):716--754, Sept. 2009.
[14]
R. M. Palau and M.-F. Moens. Argumentation mining: The detection, classification and structure of arguments in text. In Proceedings the 12th International Conference on Artificial Intelligence and Law, ICAIL '09, pages 98--107, New York, NY, USA, 2009. ACM.
[15]
J. Pehcevski, J. A. Thom, et al. Evaluating focused retrieval tasks. In SIGIR 2007 Workshop on Focused Retrieval, 2007.
[16]
S. Robertson, S. Walker, S. Jones, M. Hancock-Beaulieu, and M. Gatford. Okapi at trec-3. pages 109--126, 1996.
[17]
W. Song, Y. Zhang, Y. Xie, T. Liu, and S. Li. Query term ranking based on search results overlap. In Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '11, pages 1253--1254, New York, NY, USA, 2011. ACM.
[18]
S. Toulmin. The Uses of Argument. Cambridge University Press, 1958.
[19]
B.-Q. Vuong, E.-P. Lim, A. Sun, M.-T. Le, H. W. Lauw, and K. Chang. On ranking controversies in wikipedia: Models and evaluation. In Proceedings of the 2008 International Conference on Web Search and Data Mining, WSDM '08, pages 171--182, New York, NY, USA, 2008. ACM.
[20]
S. Wu. Data Fusion in Information Retrieval. Springer Publishing Company, Incorporated, 2012.

Cited By

View all
  • (2023)Towards Automated Claim Detection In Fact Checking2023 14th International Conference on Computing Communication and Networking Technologies (ICCCNT)10.1109/ICCCNT56998.2023.10308281(1-5)Online publication date: 6-Jul-2023
  • (2021)Studying effectiveness of Web search for fact checkingJournal of the Association for Information Science and Technology10.1002/asi.2457773:5(738-751)Online publication date: 14-Oct-2021
  • (2020)Mining an "anti-knowledge base" from Wikipedia updates with applications to fact checking and beyondProceedings of the VLDB Endowment10.14778/3372716.337272713:4(561-573)Online publication date: 6-Jan-2020
  • Show More Cited By

Index Terms

  1. On the Retrieval of Wikipedia Articles Containing Claims on Controversial Topics

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    WWW '16 Companion: Proceedings of the 25th International Conference Companion on World Wide Web
    April 2016
    1094 pages
    ISBN:9781450341448

    Sponsors

    • IW3C2: International World Wide Web Conference Committee

    In-Cooperation

    Publisher

    International World Wide Web Conferences Steering Committee

    Republic and Canton of Geneva, Switzerland

    Publication History

    Published: 11 April 2016

    Permissions

    Request permissions for this article.

    Check for updates

    Qualifiers

    • Abstract

    Conference

    WWW '16
    Sponsor:
    • IW3C2
    WWW '16: 25th International World Wide Web Conference
    April 11 - 15, 2016
    Québec, Montréal, Canada

    Acceptance Rates

    WWW '16 Companion Paper Acceptance Rate 115 of 727 submissions, 16%;
    Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)9
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 16 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)Towards Automated Claim Detection In Fact Checking2023 14th International Conference on Computing Communication and Networking Technologies (ICCCNT)10.1109/ICCCNT56998.2023.10308281(1-5)Online publication date: 6-Jul-2023
    • (2021)Studying effectiveness of Web search for fact checkingJournal of the Association for Information Science and Technology10.1002/asi.2457773:5(738-751)Online publication date: 14-Oct-2021
    • (2020)Mining an "anti-knowledge base" from Wikipedia updates with applications to fact checking and beyondProceedings of the VLDB Endowment10.14778/3372716.337272713:4(561-573)Online publication date: 6-Jan-2020
    • (2019)The evolution of argumentation miningInformation Processing and Management: an International Journal10.1016/j.ipm.2019.10205556:6Online publication date: 1-Nov-2019
    • (2019)Detecting pages to protect in Wikipedia across multiple languagesSocial Network Analysis and Mining10.1007/s13278-019-0555-09:1Online publication date: 14-Mar-2019
    • (2019)Learning to Rank Claim-Evidence Pairs to Assist Scientific-Based ArgumentationDigital Libraries for Open Knowledge10.1007/978-3-030-30760-8_4(41-55)Online publication date: 30-Aug-2019
    • (2018)Argumentation MiningSynthesis Lectures on Human Language Technologies10.2200/S00883ED1V01Y201811HLT04011:2(1-191)Online publication date: 20-Dec-2018
    • (2018)Claim Retrieval in TwitterWeb Information Systems Engineering – WISE 201810.1007/978-3-030-02922-7_20(297-307)Online publication date: 20-Oct-2018
    • (2017)Detecting Controversies in Online News MediaProceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3077136.3080723(1069-1072)Online publication date: 7-Aug-2017
    • (2016)DePPProceedings of the 25th ACM International on Conference on Information and Knowledge Management10.1145/2983323.2983914(2081-2084)Online publication date: 24-Oct-2016
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media