Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3383583.3398598acmconferencesArticle/Chapter ViewAbstractPublication PagesjcdlConference Proceedingsconference-collections
short-paper

Segmenting Scientific Abstracts into Discourse Categories: A Deep Learning-Based Approach for Sparse Labeled Data

Published: 01 August 2020 Publication History

Abstract

The abstract of a scientific paper distills the contents of the paper into a short paragraph. In the biomedical literature, it is customary to structure an abstract into discourse categories like BACKGROUND, OBJECTIVE, METHOD, RESULT, and CONCLUSION, but this segmentation is uncommon in other fields like computer science. Explicit categories could be helpful for more granular, that is, discourse-level search and recommendation. The sparsity of labeled data makes it challenging to construct supervised machine learning solutions for automatic discourse-level segmentation of abstracts in non-bio domains. In this paper, we address this problem using transfer learning. We define three discourse categories -- BACKGROUND, TECHNIQUE, and OBSERVATION -- for an abstract because these three categories are most common. We train a deep neural network on structured abstracts from PubMed, then fine-tune it on a small hand-labeled corpus of computer science papers. We observe an accuracy of 75% on the test corpus of computer science papers. We also perform an ablation study to highlight the roles of the different parts of the model. Our method appears to be a promising solution to the automatic segmentation of abstracts, where the labeled data is sparse.

References

[1]
Joel Chan, Joseph Chee Chang, Tom Hope, Dafna Shahaf, and Aniket Kittur. 2018. SOLVENT: A Mixed Initiative System for Finding Analogies between Research Papers. Proceedings of the ACM on Human-Computer Interaction, Vol. 2, CSCW (2018), 31.
[2]
Jacob Cohen. 1960. A coefficient of agreement for nominal scales. Educational and Psychological Measurement, Vol. 20, 1 (1960), 37--46.
[3]
Franck Dernoncourt. 2017. Sequential short-text classification with neural networks. Ph.D. Dissertation. Massachusetts Institute of Technology.
[4]
Franck Dernoncourt and Ji Young Lee. 2017. 200k RCT: a dataset for sequential sentence classification in medical abstracts. arXiv preprint arXiv:1710.06071 (2017).
[5]
Di Jin and Peter Szolovits. 2018. Hierarchical Neural Networks for Sequential Sentence Classification in Medical Scientific Abstracts. arXiv preprint arXiv:1808.06161 (2018).
[6]
Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).
[7]
Jeffrey Pennington, Richard Socher, and Christopher Manning. 2014. GloVe: Global vectors for word representation. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP). 1532--1543.
[8]
Anna M Ripple, James G Mork, John M Rozier, and Lou S Knecht. 2012. Structured abstracts in MEDLINE: Twenty-five years later. National Library of Medicine (2012).
[9]
Charles Sutton, Andrew McCallum, et al. 2012. An introduction to conditional random fields. Foundations and Trends® in Machine Learning, Vol. 4, 4 (2012), 267--373.
[10]
US National Library of Health. [n. d.]. Structured Abstracts. https://www.nlm.nih.gov/bsd/policy/structured_abstracts.html. (Accessed on May 20, 2020).

Cited By

View all
  • (2024)Sequential sentence classification in research papers using cross-domain multi-task learningInternational Journal on Digital Libraries10.1007/s00799-023-00392-z25:2(377-400)Online publication date: 1-Jun-2024
  • (2023)Sectioning biomedical abstracts using pointer networksProceedings of the 14th ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics10.1145/3584371.3612959(1-9)Online publication date: 3-Sep-2023
  • (2023)Label informed hierarchical transformers for sequential sentence classification in scientific abstractsExpert Systems10.1111/exsy.1323840:6Online publication date: 25-Jan-2023
  • Show More Cited By

Index Terms

  1. Segmenting Scientific Abstracts into Discourse Categories: A Deep Learning-Based Approach for Sparse Labeled Data

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      JCDL '20: Proceedings of the ACM/IEEE Joint Conference on Digital Libraries in 2020
      August 2020
      611 pages
      ISBN:9781450375856
      DOI:10.1145/3383583
      Publication rights licensed to ACM. ACM acknowledges that this contribution was authored or co-authored by an employee, contractor or affiliate of a national government. As such, the Government retains a nonexclusive, royalty-free right to publish or reproduce this article, or to allow others to do so, for Government purposes only.

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 01 August 2020

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. LSTM
      2. deep learning
      3. structured abstract
      4. transfer learning

      Qualifiers

      • Short-paper

      Funding Sources

      • National Digital Library of India Project sponsored by the Ministry of Human Resource Development, Government of India at IIT Kharagpur

      Conference

      JCDL '20
      Sponsor:

      Acceptance Rates

      Overall Acceptance Rate 415 of 1,482 submissions, 28%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)19
      • Downloads (Last 6 weeks)3
      Reflects downloads up to 20 Jan 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Sequential sentence classification in research papers using cross-domain multi-task learningInternational Journal on Digital Libraries10.1007/s00799-023-00392-z25:2(377-400)Online publication date: 1-Jun-2024
      • (2023)Sectioning biomedical abstracts using pointer networksProceedings of the 14th ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics10.1145/3584371.3612959(1-9)Online publication date: 3-Sep-2023
      • (2023)Label informed hierarchical transformers for sequential sentence classification in scientific abstractsExpert Systems10.1111/exsy.1323840:6Online publication date: 25-Jan-2023
      • (2023)Scientific document processing: challenges for modern learning methodsInternational Journal on Digital Libraries10.1007/s00799-023-00352-724:4(283-309)Online publication date: 24-Mar-2023
      • (2022)National digital library of IndiaCommunications of the ACM10.1145/355048065:11(58-61)Online publication date: 20-Oct-2022
      • (2022)Cross-domain multi-task learning for sequential sentence classification in research papersProceedings of the 22nd ACM/IEEE Joint Conference on Digital Libraries10.1145/3529372.3530922(1-13)Online publication date: 20-Jun-2022

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media