Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3529372.3533300acmconferencesArticle/Chapter ViewAbstractPublication PagesjcdlConference Proceedingsconference-collections
short-paper

HedgePeer: a dataset for uncertainty detection in peer reviews

Published: 20 June 2022 Publication History

Abstract

Uncertainty detection from text is essential in many applications in information retrieval (IR). Detecting textual uncertainties helps extract factual information instead of uncertain or non-factual information. To avoid overprecise commitment, people use linguistic devices like hedges (uncertain words or phrases). In peer reviews, reviewers often use hedges wherever they are unsure about their opinion or when facts do not back their opinions. Usage of hedges or uncertain words in writing can also indicate the reviewer's confidence or measure of conviction in their reviews. Reviewer confidence is important in the peer review process (especially to the editors or chairs) to judge the quality of evaluation of the paper under review. However, the self-annotated reviewer confidence score is often miscalibrated or biased and not an accurate representation of the reviewer's conviction of their judgment on the merit of the paper. Less confident reviewers sometimes speculate their observations. Here in this paper, we introduce HedgePeer, a new uncertainty detection dataset of peer review comments, which is more than five times larger than the existing datasets on hedge detection in other domains. We curate our dataset from the open-access reviews available in the open review platform and annotate the review comments in terms of the hedge cues and hedge spans. We also provide several baseline approaches, including a multitask learning model with sentiment intensity and parts-of-speech as scaffold tasks to predict hedge cues and spans. We make our dataset and baseline codes available at https://github.com/Tirthankar-Ghosal/HedgePeer-Dataset. Our dataset is motivated towards computationally estimating the reviewer's conviction from their review texts.

References

[1]
Iz Beltagy, Arman Cohan, and Kyle Lo. 2019. SciBERT: Pretrained Contextualized Embeddings for Scientific Text. CoRR abs/1903.10676 (2019). arXiv:1903.10676 http://arxiv.org/abs/1903.10676
[2]
Prabhat Kumar Bharti, Shashi Ranjan, Tirthankar Ghosal, Mayank Agrawal, and Asif Ekbal. 2021. PEERAssist: Leveraging on Paper-Review Interactions to Predict Peer Review Decisions. In Towards Open and Trustworthy Digital Societies - 23rd International Conference on Asia-Pacific Digital Libraries, ICADL 2021, Virtual Event, December 1--3, 2021, Proceedings (Lecture Notes in Computer Science, Vol. 13133), Hao-Ren Ke, Chei Sian Lee, and Kazunari Sugiyama (Eds.). Springer, 421--435.
[3]
Benita Kathleen Britto and Aditya Khandelwal. 2020. Resolving the Scope of Speculation and Negation using Transformer-Based Architectures. CoRR abs/2001.02885 (2020). arXiv:2001.02885 http://arxiv.org/abs/2001.02885
[4]
Jacob Cohen. 1960. A Coefficient of Agreement for Nominal Scales. Educational and Psychological Measurement 20, 1 (1960), 37--46. arXiv:https://doi.org/10.1177/001316446002000104
[5]
Noa P Cruz, Maite Taboada, and Ruslan Mitkov. 2016. A machine-learning approach to negation and speculation detection for sentiment analysis. Journal of the Association for Information Science and Technology 67, 9 (2016), 2118--2136.
[6]
Cüneyt Demir. 2018. Hedging and academic writing: an analysis of lexical hedges. Journal of language and linguistic studies 14, 4 (2018), 74--92.
[7]
Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). Association for Computational Linguistics, Minneapolis, Minnesota, 4171--4186.
[8]
Richárd Farkas, Veronika Vincze, György Móra, János Csirik, and György Szarvas. 2010. The CoNLL-2010 Shared Task: Learning to Detect Hedges and their Scope in Natural Language Text. In Proceedings of the Fourteenth Conference on Computational Natural Language Learning: Shared Task (15--16). Uppsala, Sweden, 1--12.
[9]
Tirthankar Ghosal. 2019. Exploring the Implications of Artificial Intelligence in Various Aspects of Scholarly Peer Review. Bull. IEEE Tech. Comm. Digit. Libr. 15, 1 (2019). https://bulletin.jcdl.org/Bulletin/v15n1/papers/ghosal.pdf
[10]
Tirthankar Ghosal. 2021. Studies in aspects of peer review: novelty, scope, research lineage, review significance, and peer review outcome. SIGIR Forum 55, 2 (2021), 26:1--26:2.
[11]
Tirthankar Ghosal, Sandeep Kumar, Prabhat Kumar Bharti, and Asif Ekbal. 2022. Peer review analyze: A novel benchmark resource for computational analysis of peer reviews. PLOS ONE 17, 1 (01 2022), 1--29.
[12]
Tirthankar Ghosal, Rajeev Verma, Asif Ekbal, and Pushpak Bhattacharyya. 2019. DeepSentiPeer: Harnessing Sentiment in Review Texts to Recommend Peer Review Decisions. In Proceedings of the 57th Conference of the Association for Computational Linguistics, ACL 2019, Florence, Italy, July 28- August 2, 2019, Volume 1: Long Papers, Anna Korhonen, David R. Traum, and Lluís Màrquez (Eds.). Association for Computational Linguistics, 1120--1130.
[13]
Tirthankar Ghosal, Rajeev Verma, Asif Ekbal, and Pushpak Bhattacharyya. 2019. A Sentiment Augmented Deep Architecture to Predict Peer Review Outcomes. In 19th ACM/IEEE Joint Conference on Digital Libraries, JCDL 2019, Champaign, IL, USA, June 2--6, 2019, Maria Bonn, Dan Wu, J. Stephen Downie, and Alain Martaus (Eds.). IEEE, 414--415.
[14]
Ken Hyland. 1998. Boosting, hedging and the negotiation of academic knowledge. Text & Talk 18, 3 (1998), 349--382.
[15]
Aditya Khandelwal and Suraj Sawant. 2019. Negbert: A transfer learning approach for negation detection and scope resolution. arXiv preprint arXiv:1911.04211 (2019).
[16]
Halil Kilicoglu and Sabine Bergler. 2010. A high-precision approach to detecting hedges and their scopes. In Proceedings of the fourteenth conference on computational natural language learning-Shared task. 70--77.
[17]
Asheesh Kumar, Tirthankar Ghosal, and Asif Ekbal. 2021. A Deep Neural Architecture for Decision-Aware Meta-Review Generation. In ACM/IEEE Joint Conference on Digital Libraries, JCDL 2021, Champaign, IL, USA, September 27--30, 2021, J. Stephen Downie, Dana McKay, Hussein Suleman, David M. Nichols, and Faryaneh Poursardar (Eds.). IEEE, 222--225.
[18]
Sandeep Kumar, Tirthankar Ghosal, Prabhat Kumar Bharti, and Asif Ekbal. 2021. Sharing is Caring! Joint Multitask Learning Helps Aspect-Category Extraction and Sentiment Detection in Scientific Peer Reviews. In ACM/IEEE Joint Conference on Digital Libraries, JCDL 2021, Champaign, IL, USA, September 27--30, 2021, J. Stephen Downie, Dana McKay, Hussein Suleman, David M. Nichols, and Faryaneh Poursardar (Eds.). IEEE, 270--273.
[19]
George Lakoff. 1970. Linguistics and natural logic. Synthese 22, 1 (1970), 151--271.
[20]
Marc Light, Xin Ying Qiu, and Padmini Srinivasan. 2004. The language of bioscience: Facts, speculations, and statements in between. In Proceedings of the HLT-NAACL 2004 Workshop : Biolink 2004, linking biological literature, ontologies and databases. 17--24.
[21]
Ben Medlock and Ted Briscoe. 2007. Weakly supervised learning for hedge classification in scientific literature. In ACL, Vol. 2007. 992--999.
[22]
Guillermo Moncecchi, Jean-Luc Minel, and Dina Wonsever. 2012. Improving speculative language detection using linguistic knowledge. In Proceedings of the Workshop on Extra-Propositional Aspects of Meaning in Computational Linguistics. 37--46.
[23]
Arzucan Özgür and Dragomir Radev. 2009. Detecting speculations and their scopes in scientific text. In Proceedings of the 2009 conference on empirical methods in natural language processing. 1398--1407.
[24]
Jiaxin Pei and David Jurgens. 2021. Measuring Sentence-Level and Aspect-Level (Un)certainty in Science Communications. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, Online and Punta Cana, Dominican Republic, 9959--10011.
[25]
Alec Radford, Rafal Józefowicz, and Ilya Sutskever. 2017. Learning to Generate Reviews and Discovering Sentiment. CoRR abs/1704.01444 (2017). arXiv:1704.01444 http://arxiv.org/abs/1704.01444
[26]
Kyle Siler, Kirby Lee, and Lisa Bero. 2015. Measuring the effectiveness of scientific gatekeeping. Proceedings of the National Academy of Sciences 112, 2 (2015), 360--365.
[27]
György Szarvas, Veronika Vincze, Richárd Farkas, and János Csirik. 2008. The BioScope corpus: annotation for negation, uncertainty and their scope in biomedical texts. In Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing. Association for Computational Linguistics, Columbus, Ohio, 38--45. https://aclanthology.org/W08-0606
[28]
Buzhou Tang, Xiaolong Wang, Xuan Wang, Bo Yuan, and Shixi Fan. 2010. A cascade method for detecting hedges and their scope in natural language text. In Proceedings of the Fourteenth Conference on Computational Natural Language Learning-Shared Task. 13--17.
[29]
Ann Taylor, Mitchell Marcus, and Beatrice Santorini. 2003. The Penn treebank: an overview. Treebanks (2003), 5--22.
[30]
Erik Velldal. 2011. Predicting speculation: a simple disambiguation approach to hedge detection in biomedical literature. Journal of Biomedical Semantics 2, 5 (2011), 1--14.
[31]
Erik Velldal, Lilja Øvrelid, and Stephan Oepen. 2010. Resolving speculation: MaxEnt cue classification and dependency-based scope rules. In Proceedings of the Fourteenth Conference on Computational Natural Language Learning-Shared Task. 48--55.
[32]
Erik Velldal, Lilja Øvrelid, Jonathon Read, and Stephan Oepen. 2012. Speculation and negation: Rules, rankers, and the role of syntax. Computational linguistics 38, 2 (2012), 369--410.
[33]
Zhilin Yang, Zihang Dai, Yiming Yang, Jaime Carbonell, Ruslan Salakhutdinov, and Quoc V. Le. 2019. XLNet: Generalized Autoregressive Pretraining for Language Understanding. Advances in neural information processing systems 32 (2019).

Cited By

View all
  • (2024)Are the confidence scores of reviewers consistent with the review content? Evidence from top conference proceedings in AIScientometrics10.1007/s11192-024-05070-8129:7(4109-4135)Online publication date: 20-Jun-2024
  • (2023)ReviVal: Towards Automatically Evaluating the Informativeness of Peer ReviewsProceedings of the Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in the Asia Pacific Region10.1145/3624918.3625341(95-103)Online publication date: 26-Nov-2023
  • (2023)What Is the Difference Between a Mountain and a Molehill? Quantifying Semantic Labeling of Visual Features in Line Charts2023 IEEE Visualization and Visual Analytics (VIS)10.1109/VIS54172.2023.00041(161-165)Online publication date: 21-Oct-2023
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
JCDL '22: Proceedings of the 22nd ACM/IEEE Joint Conference on Digital Libraries
June 2022
392 pages
ISBN:9781450393454
DOI:10.1145/3529372
  • General Chairs:
  • Akiko Aizawa,
  • Thomas Mandl,
  • Zeljko Carevic,
  • Program Chairs:
  • Annika Hinze,
  • Philipp Mayr,
  • Philipp Schaer
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

In-Cooperation

  • IEEE Technical Committee on Digital Libraries (TC DL)

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 20 June 2022

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. hedges
  2. peer reviews
  3. reviewer confidence
  4. uncertainty detection

Qualifiers

  • Short-paper

Funding Sources

  • Cactus Communications, India

Conference

JCDL '22
Sponsor:

Acceptance Rates

JCDL '22 Paper Acceptance Rate 35 of 132 submissions, 27%;
Overall Acceptance Rate 415 of 1,482 submissions, 28%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)61
  • Downloads (Last 6 weeks)7
Reflects downloads up to 12 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Are the confidence scores of reviewers consistent with the review content? Evidence from top conference proceedings in AIScientometrics10.1007/s11192-024-05070-8129:7(4109-4135)Online publication date: 20-Jun-2024
  • (2023)ReviVal: Towards Automatically Evaluating the Informativeness of Peer ReviewsProceedings of the Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in the Asia Pacific Region10.1145/3624918.3625341(95-103)Online publication date: 26-Nov-2023
  • (2023)What Is the Difference Between a Mountain and a Molehill? Quantifying Semantic Labeling of Visual Features in Line Charts2023 IEEE Visualization and Visual Analytics (VIS)10.1109/VIS54172.2023.00041(161-165)Online publication date: 21-Oct-2023
  • (2023)PEERRec: An AI-based approach to automatically generate recommendations and predict decisions in peer reviewInternational Journal on Digital Libraries10.1007/s00799-023-00375-025:1(55-72)Online publication date: 4-Jul-2023
  • (2023)Towards automated meta-review generation via an NLP/ML pipeline in different stages of the scholarly peer review processInternational Journal on Digital Libraries10.1007/s00799-023-00359-0Online publication date: 24-Apr-2023

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media