Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
survey

Biomedical Question Answering: A Survey of Approaches and Challenges

Published: 18 January 2022 Publication History

Abstract

Automatic Question Answering (QA) has been successfully applied in various domains such as search engines and chatbots. Biomedical QA (BQA), as an emerging QA task, enables innovative applications to effectively perceive, access, and understand complex biomedical knowledge. There have been tremendous developments of BQA in the past two decades, which we classify into five distinctive approaches: classic, information retrieval, machine reading comprehension, knowledge base, and question entailment approaches. In this survey, we introduce available datasets and representative methods of each BQA approach in detail. Despite the developments, BQA systems are still immature and rarely used in real-life settings. We identify and characterize several key challenges in BQA that might lead to this issue, and we discuss some potential future directions to explore.

References

[1]
Asma Ben Abacha, Eugene Agichtein, Yuval Pinter, and Dina Demner-Fushman. 2017. Overview of the medical question answering task at TREC 2017 LiveQA. In Proceedings of the Text Retrieval Conference (TREC).
[2]
Asma Ben Abacha and Dina Demner-Fushman. 2016. Recognizing question entailment for medical question answering. In AMIA Annual Symposium Proceedings, Vol. 2016. American Medical Informatics Association.
[3]
Asma Ben Abacha and Dina Demner-Fushman. 2019. A question-entailment approach to question answering. BMC Bioinform. 20, 1 (2019), 511.
[4]
Asma Ben Abacha, Sadid A. Hasan, Vivek V. Datla, Joey Liu, Dina Demner-Fushman, and Henning Müller. 2019. VQA-Med: Overview of the medical visual question answering task at ImageCLEF 2019. In CLEF 2019 Working Notes.
[5]
Asma Ben Abacha and Pierre Zweigenbaum. 2012. Medical question answering: translating medical questions into sparql queries. In Proceedings of the 2nd ACM SIGHIT International Health Informatics Symposium. 41–50.
[6]
Asma Ben Abacha and Pierre Zweigenbaum. 2015. MEANS: A medical question-answering system combining NLP techniques and semantic web technologies. Inf. Process. Manag. 51, 5 (2015), 570–594.
[7]
Emily Alsentzer, John Murphy, William Boag, Wei-Hung Weng, Di Jindi, Tristan Naumann, and Matthew McDermott. 2019. Publicly available clinical BERT embeddings. In Proceedings of the 2nd Clinical Natural Language Processing Workshop. Association for Computational Linguistics, 72–78. DOI: https://doi.org/10.18653/v1/W19-1909
[8]
Alan R. Aronson. 2001. Effective mapping of biomedical text to the UMLS Metathesaurus: The MetaMap program. In Proceedings of the AMIA Symposium. American Medical Informatics Association.
[9]
Alan R. Aronson, Dina Demner-Fushman, Susanne M. Humphrey, and Jimmy J. Lin. 2005. Fusion of knowledge-intensive and statistical approaches for retrieving and annotating textual genomics documents. In Proceedings of the Text Retrieval Conference (TREC).
[10]
Sofia J. Athenikos and Hyoil Han. 2010. Biomedical question answering: A survey. Comput. Meth. Prog. Biomed. 99, 1 (2010), 1–24.
[11]
Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. 2014. Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473 (2014).
[12]
Georgios Balikas, Ioannis Partalas, Axel-Cyrille Ngonga Ngomo, Anastasia Krithara, Eric Gaussier, and George Paliouras. 2014. Results of the BioASQ tasks of the question answering lab at CLEF 2014. In CLEF 2014 Working Notes.
[13]
Michael A. Bauer and Daniel Berleant. 2012. Usability survey of biomedical question answering systems. Hum. Genom. 6, 1 (2012), 17.
[14]
Iz Beltagy, Kyle Lo, and Arman Cohan. 2019. SciBERT: A pretrained language model for scientific text. In Proceedings of the Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). Association for Computational Linguistics, 3615–3620. DOI: https://doi.org/10.18653/v1/D19-1371
[15]
Asma Ben Abacha and Dina Demner-Fushman. 2019. On the summarization of consumer health questions. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, 2228–2234. DOI: https://doi.org/10.18653/v1/P19-1215
[16]
Asma Ben Abacha, Chaitanya Shivade, and Dina Demner-Fushman. 2019. Overview of the MEDIQA 2019 shared task on textual inference, question entailment and question answering. In Proceedings of the 18th BioNLP Workshop and Shared Task. Association for Computational Linguistics, 370–379. DOI: https://doi.org/10.18653/v1/W19-5039
[17]
Jonathan Berant, Vivek Srikumar, Pei-Chun Chen, Abby Van der Linden, Brittany Harding, Brad Huang, Peter Clark, and Christopher D. Manning. 2014. Modeling biological processes for reading comprehension. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP). Association for Computational Linguistics, 1499–1510. DOI: https://doi.org/10.3115/v1/D14-1159
[18]
Abhishek Bhandwaldar and Wlodek Zadrozny. 2018. UNCC QA: Biomedical Question Answering system. In Proceedings of the 6th BioASQ Workshop. Association for Computational Linguistics, 66–71. DOI: https://doi.org/10.18653/v1/W18-5308
[19]
Pinaki Bhaskar, Partha Pakray, Somnath Banerjee, Samadrita Banerjee, Sivaji Bandyopadhyay, and Alexander F. Gelbukh. 2012. Question answering system for QA4MRE@ CLEF 2012. In Proceedings of the CLEF Online Working Notes/Labs/Workshop.
[20]
Ludovic Bonnefoy, Romain Deveaud, and Patrice Bellot. 2012. Do social information help book search? In Workshop Pre-proceedings INEX’12.
[21]
Samuel R. Bowman, Gabor Angeli, Christopher Potts, and Christopher D. Manning. 2015. A large annotated corpus for learning natural language inference. In Proceedings of the Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 632–642. DOI: https://doi.org/10.18653/v1/D15-1075
[22]
George Brokos, Polyvios Liosis, Ryan McDonald, Dimitris Pappas, and Ion Androutsopoulos. 2018. AUEB at BioASQ 6: Document and snippet retrieval. In Proceedings of the 6th BioASQ Workshop. Association for Computational Linguistics, 30–39. DOI: https://doi.org/10.18653/v1/W18-5304
[23]
Brian L. Cairns, Rodney D. Nielsen, James J. Masanz, James H. Martin, Martha S. Palmer, Wayne H. Ward, and Guergana K. Savova. 2011. The MiPACQ clinical question answering system. In AMIA Annual Symposium Proceedings, Vol. 2011. American Medical Informatics Association.
[24]
David Campbell and Stephen Johnson. 2002. A transformational-based learner for dependency grammars in discharge summaries. In Proceedings of the ACL-02 Workshop on Natural Language Processing in the Biomedical Domain. Association for Computational Linguistics, 37–44. DOI: https://doi.org/10.3115/1118149.1118155
[25]
YongGang Cao, Feifan Liu, Pippa Simpson, Lamont Antieau, Andrew Bennett, James J. Cimino, John Ely, and Hong Yu. 2011. AskHERMES: An online question answering system for complex clinical questions. J. Biomed. Inform. 44, 2 (2011), 277–288.
[26]
Jaime Carbonell and Jade Goldstein. 1998. The use of MMR, diversity-based reranking for reordering documents and producing summaries. In Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. 335–336.
[27]
Souradip Chakraborty, Ekaba Bisong, Shweta Bhatt, Thomas Wagner, Riley Elliott, and Francesco Mosconi. 2020. BioMedBERT: A pre-trained biomedical language model for QA and IR. In Proceedings of the 28th International Conference on Computational Linguistics. International Committee on Computational Linguistics, 669–679. DOI: https://doi.org/10.18653/v1/2020.coling-main.59
[28]
Rishav Chakravarti, Anthony Ferritto, Bhavani Iyer, Lin Pan, Radu Florian, Salim Roukos, and Avi Sil. 2020. Towards building a robust industry-scale question answering system. In Proceedings of the 28th International Conference on Computational Linguistics: Industry Track. International Committee on Computational Linguistics, 90–101. Retrieved from https://www.aclweb.org/anthology/2020.coling-industry.9.
[29]
Khyathi Chandu, Aakanksha Naik, Aditya Chandrasekar, Zi Yang, Niloy Gupta, and Eric Nyberg. 2017. Tackling biomedical text summarization: OAQA at BioASQ 5B. In Proceedings of the BioNLP 2017. Association for Computational Linguistics, 58–66. DOI: https://doi.org/10.18653/v1/W17-2307
[30]
Danqi Chen, Adam Fisch, Jason Weston, and Antoine Bordes. 2017. Reading Wikipedia to answer open-domain questions. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, 1870–1879. DOI: https://doi.org/10.18653/v1/P17-1171
[31]
Jianpeng Cheng, Li Dong, and Mirella Lapata. 2016. Long short-term memory-networks for machine reading. In Proceedings of the Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 551–561. DOI: https://doi.org/10.18653/v1/D16-1053
[32]
Sungbin Choi. 2015. SNUMedinfo at CLEF QA track BioASQ 2015. In CLEF 2015 Working Notes.
[33]
Stéphane Clinchant and Eric Gaussier. 2009. Bridging language modeling and divergence from randomness models: A log-logistic model for IR. In Proceedings of the Conference on the Theory of Information Retrieval. Springer, 54–65.
[34]
Sarah Cruchet, Arnaud Gaudinat, and Célia Boyer. 2008. Supervised approach to recognize question type in a QA system for health. Stud. Health Technol. Inform. 136 (2008), 407.
[35]
Yiming Cui, Zhipeng Chen, Si Wei, Shijin Wang, Ting Liu, and Guoping Hu. 2017. Attention-over-Attention neural networks for reading comprehension. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, 593–602. DOI: https://doi.org/10.18653/v1/P17-1055
[36]
T. Delbecque, P. Jacquemart, and P. Zweigenbaum. 2005. Indexing UMLS semantic types for medical question-answering. Stud. Health Technol. and Inform. 116 (2005), 805–810.
[37]
Dina Demner-Fushman, S. Humphrey, Nicholas C. Ide, R. Loane, James G. Mork, P. Ruch, M. Ruiz, L. H. Smith, W. Wilbur, and A. Aronson. 2007. Combining resources to find answers to biomedical questions. In Proceedings of the Text Retrieval Conference (TREC).
[38]
Dina Demner-Fushman and Jimmy Lin. 2005. Knowledge Extraction for Clinical Question Answering: Preliminary Results. AAAI Workshop - Technical Report (01 2005).
[39]
Dina Demner-Fushman and Jimmy Lin. 2006. Answer extraction, semantic clustering, and extractive summarization for clinical question answering. In Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, 841–848. DOI: https://doi.org/10.3115/1220175.1220281
[40]
Dina Demner-Fushman and Jimmy Lin. 2007. Answering clinical questions with knowledge-based and statistical techniques. Comput. Ling. 33, 1 (2007), 63–103.
[41]
Dina Demner-Fushman, Yassine Mrabet, and Asma Ben Abacha. 2020. Consumer health information and question answering: Helping consumers find answers to their health-related information needs. J. Amer. Med. Inform. Assoc. 27, 2 (2020), 194–201.
[42]
Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). Association for Computational Linguistics, 4171–4186. DOI: https://doi.org/10.18653/v1/N19-1423
[43]
Bhuwan Dhingra, Danish Danish, and Dheeraj Rajagopal. 2018. Simple and effective semi-supervised question answering. In Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers). Association for Computational Linguistics, 582–587. DOI: https://doi.org/10.18653/v1/N18-2092
[44]
Bhuwan Dhingra, Hanxiao Liu, Zhilin Yang, William Cohen, and Ruslan Salakhutdinov. 2017. Gated-attention readers for text comprehension. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, 1832–1846. DOI: https://doi.org/10.18653/v1/P17-1168
[45]
Xinya Du, Junru Shao, and Claire Cardie. 2017. Learning to ask: Neural question generation for reading comprehension. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, 1342–1352. DOI: https://doi.org/10.18653/v1/P17-1123
[46]
Yongping Du, Wenyang Guo, and Yiliang Zhao. 2019. Hierarchical question-aware context learning with augmented data for biomedical question answering. In Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine (BIBM). IEEE, 370–375.
[47]
Yongping Du, Bingbing Pei, Xiaozheng Zhao, and Junzhong Ji. 2018. Hierarchical multi-layer transfer learning model for biomedical question answering. In Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine (BIBM). IEEE, 362–367.
[48]
John W. Ely, Jerome A. Osheroff, M. Lee Chambliss, Mark H. Ebell, and Marcy E. Rosenbaum. 2005. Answering physicians’ clinical questions: Obstacles and potential solutions. J. Amer. Med. Inform. Assoc. 12, 2 (2005), 217–224.
[49]
John W. Ely, Jerome A. Osheroff, Paul N. Gorman, Mark H. Ebell, M. Lee Chambliss, Eric A. Pifer, and P. Zoe Stavri. 2000. A taxonomy of generic clinical questions: classification study. Bmj 321, 7258 (2000), 429–432.
[50]
Günes Erkan and Dragomir R. Radev. 2004. LexRank: Graph-based lexical centrality as salience in text summarization. J. Artif. Intell. Res. 22 (2004), 457–479.
[51]
D. A. Ferrucci. 2012. Introduction to “This is Watson.” IBM J. Res. Devel. 56, 3.4 (2012), 1:1–1:15. DOI: https://doi.org/10.1147/JRD.2012.2184356
[52]
Katja Filippova, Enrique Alfonseca, Carlos A. Colmenares, Lukasz Kaiser, and Oriol Vinyals. 2015. Sentence compression by deletion with LSTMs. In Proceedings of the Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 360–368. DOI: https://doi.org/10.18653/v1/D15-1042
[53]
Chelsea Finn, Pieter Abbeel, and Sergey Levine. 2017. Model-agnostic meta-learning for fast adaptation of deep networks. arXiv preprint arXiv:1703.03400 (2017).
[54]
Susannah Fox and Maeve Duggan. 2012. Health Online 2013. Pew Res. Internet Proj. Rep. (01 2012).
[55]
Bin Fu, Yunqi Qiu, Chengguang Tang, Yang Li, Haiyang Yu, and Jian Sun. 2020. A survey on complex question answering over knowledge base: Recent advances and challenges. arXiv preprint arXiv:2007.13069 (2020).
[56]
Akira Fukui, Dong Huk Park, Daylen Yang, Anna Rohrbach, Trevor Darrell, and Marcus Rohrbach. 2016. Multimodal compact bilinear pooling for visual question answering and visual grounding. arXiv preprint arXiv:1606.01847 (2016).
[57]
Julien Gobeill, E. Patsche, D. Theodoro, A.-L. Veuthey, C. Lovis, and P. Ruch. 2009. Question answering for biology and medicine. In Proceedings of the 9th International Conference on Information Technology and Applications in Biomedicine. IEEE, 1–5.
[58]
Yu Gu, Robert Tinn, Hao Cheng, Michael Lucas, Naoto Usuyama, Xiaodong Liu, Tristan Naumann, Jianfeng Gao, and Hoifung Poon. 2020. Domain-specific language model pretraining for biomedical natural language processing. arXiv preprint arXiv:2007.15779 (2020).
[59]
Jiafeng Guo, Yixing Fan, Qingyao Ai, and W. Bruce Croft. 2016. A deep relevance matching model for ad hoc retrieval. In Proceedings of the 25th ACM International on Conference on Information and Knowledge Management (CIKM’16). Association for Computing Machinery, New York, NY, 55–64. DOI: https://doi.org/10.1145/2983323.2983769
[60]
Akshay Kumar Gupta. 2017. Survey of visual question answering: Datasets and techniques. arXiv preprint arXiv:1705.03865 (2017).
[61]
Thierry Hamon, Natalia Grabar, and Fleur Mougin. 2017. Querying biomedical linked data with natural language questions. Seman. Web 8, 4 (2017), 581–599.
[62]
Sanda Harabagiu and Andrew Hickl. 2006. Methods for using textual entailment in open-domain question answering. In Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, 905–912. DOI: https://doi.org/10.3115/1220175.1220289
[63]
Junqing He, Mingming Fu, and Manshu Tu. 2019. Applying deep matching networks to Chinese medical question answering: A study and a dataset. BMC Med. Inform. Decis.-mak. 19, 2 (2019), 52.
[64]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 770–778.
[65]
Xuehai He, Yichen Zhang, Luntian Mou, Eric Xing, and Pengtao Xie. 2020. PathVQA: 30000+ questions for medical visual question answering. arXiv preprint arXiv:2003.10286 (2020).
[66]
Yun He, Ziwei Zhu, Yin Zhang, Qin Chen, and James Caverlee. 2020. Infusing disease knowledge into BERT for health question answering, medical inference and disease name recognition. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP). Association for Computational Linguistics, 4604–4614. DOI: https://doi.org/10.18653/v1/2020.emnlp-main.372
[67]
Karl Moritz Hermann, Tomas Kocisky, Edward Grefenstette, Lasse Espeholt, Will Kay, Mustafa Suleyman, and Phil Blunsom. 2015. Teaching machines to read and comprehend. In Proceedings of the International Conference on Advances in Neural Information Processing Systems. 1693–1701.
[68]
William Hersh, Aaron Cohen, Lynn Ruslen, and Phoebe Roberts. 2007. TREC 2007 genomics track overview. In Proceedings of the Text Retrieval Conference (TREC).
[69]
William Hersh, Aaron M. Cohen, Phoebe Roberts, and Hari Krishna Rekapalli. 2006. TREC 2006 genomics track overview. In Proceedings of the Text Retrieval Conference (TREC).
[70]
William Hersh and Ellen Voorhees. 2009. TREC genomics special issue overview. Inf. Retr. 12, 1 (Feb. 2009), 1–15. DOI: https://doi.org/10.1007/s10791-008-9076-6
[71]
William R. Hersh, M. Katherine Crabtree, David H. Hickam, Lynetta Sacherek, Charles P. Friedman, Patricia Tidmarsh, Craig Mosbaek, and Dale Kraemer. 2002. Factors associated with success in searching MEDLINE and applying evidence to answer clinical questions. J. Amer. Med. Inform. Assoc. 9, 3 (2002), 283–293.
[72]
L. Hirschman and R. Gaizauskas. 2001. Natural language question answering: The view from here. Nat. Lang. Eng. 7, 4 (Dec. 2001), 275–300. DOI: https://doi.org/10.1017/S1351324901002807
[73]
Kexin Huang, Jaan Altosaar, and Rajesh Ranganath. 2019. ClinicalBERT: Modeling clinical notes and predicting hospital readmission. arXiv preprint arXiv:1904.05342 (2019).
[74]
Xiaoli Huang, Jimmy Lin, and Dina Demner-Fushman. 2006. Evaluation of PICO as a knowledge representation for clinical questions. In AMIA Annual Symposium Proceedings, Vol. 2006. American Medical Informatics Association.
[75]
Kai Hui, Andrew Yates, Klaus Berberich, and Gerard de Melo. 2017. PACRR: A position-aware neural IR model for relevance matching. In Proceedings of the Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 1049–1058. DOI: https://doi.org/10.18653/v1/D17-1110
[76]
Lijun Huo and Xiang Zhao. 2020. A sentence-based circular reasoning model in multi-hop reading comprehension. IEEE Access 8 (2020), 174255–174264.
[77]
P. Jacquemart and P. Zweigenbaum. 2003. Towards a medical question-answering system: A feasibility study. Stud. Health Technol. Inform. 95 (2003), 463.
[78]
Sarthak Jain and Byron C. Wallace. 2019. Attention is not explanation. In Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). Association for Computational Linguistics, 3543–3556. DOI: https://doi.org/10.18653/v1/N19-1357
[79]
Di Jin, Eileen Pan, Nassim Oufattole, Wei-Hung Weng, Hanyi Fang, and Peter Szolovits. 2020. What disease does this patient have? A large-scale open domain question answering dataset from medical exams. arXiv preprint arXiv:2009.13081 (2020).
[80]
Qiao Jin, Bhuwan Dhingra, William Cohen, and Xinghua Lu. 2019. Probing biomedical embeddings from language models. In Proceedings of the 3rd Workshop on Evaluating Vector Space Representations for NLP. 82–89.
[81]
Qiao Jin, Bhuwan Dhingra, Zhengping Liu, William Cohen, and Xinghua Lu. 2019. PubMedQA: A dataset for biomedical research question answering. In Proceedings of the Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). Association for Computational Linguistics, 2567–2577. DOI: https://doi.org/10.18653/v1/D19-1259
[82]
Zan-Xia Jin, Bo-Wen Zhang, Fan Fang, Le-Le Zhang, and Xu-Cheng Yin. 2017. A multi-strategy query processing approach for biomedical question answering: USTB_PRIR at BioASQ 2017 Task 5B. In Proceedings of the Biomedical Natural Language Processing Workshop (BioNLP). Association for Computational Linguistics, 373–380. DOI: https://doi.org/10.18653/v1/W17-2348
[83]
Mandar Joshi, Eunsol Choi, Daniel Weld, and Luke Zettlemoyer. 2017. TriviaQA: A large scale distantly supervised challenge dataset for reading comprehension. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, 1601–1611. DOI: https://doi.org/10.18653/v1/P17-1147
[84]
Z. Kaddari, Y. Mellah, J. Berrich, T. Bouchentouf, and M. G. Belkasmi. 2020. Biomedical question answering: A survey of methods and datasets. In Proceedings of the 4th International Conference On Intelligent Computing in Data Sciences (ICDS). 1–8. DOI: https://doi.org/10.1109/ICDS50568.2020.9268742
[85]
Rudolf Kadlec, Martin Schmid, Ondrej Bajgar, and Jan Kleindienst. 2016. Text understanding with the attention Sum Reader network. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, 908–918. DOI: https://doi.org/10.18653/v1/P16-1086
[86]
Aishwarya Kamath and Rajarshi Das. 2018. A survey on semantic parsing. arXiv preprint arXiv:1812.00978 (2018).
[87]
Maulik R. Kamdar and Mark A. Musen. 2020. An empirical meta-analysis of the life sciences (Linked?) open data on the web. arXiv preprint arXiv:2006.04161 (2020).
[88]
Jaewoo Kang. 2020. Transferability of natural language inference to biomedical question answering. arXiv preprint arXiv:2007.00217 (2020).
[89]
Daniel Khashabi, Tushar Khot, Ashish Sabharwal, Peter Clark, Oren Etzioni, and Dan Roth. 2016. Question answering via integer programming over semi-structured knowledge. arXiv preprint arXiv:1604.06076 (2016).
[90]
Jin-Dong Kim and K. Bretonnel Cohen. 2013. Natural language query processing for SPARQL generation: A prototype system for SNOMED CT. In Proceedings of Biolink, Vol. 32. Academia.
[91]
Seongsoon Kim, Donghyeon Park, Yonghwa Choi, Kyubum Lee, Byounggun Kim, Minji Jeon, Jihye Kim, Aik Choon Tan, and Jaewoo Kang. 2018. A pilot study of biomedical text comprehension using an attention-based deep neural reader: Design and experimental analysis. JMIR Medical Inform. 6, 1 (2018).
[92]
Milena Kraus, Julian Niedermeier, Marcel Jankrift, Sören Tietböhl, Toni Stachewicz, Hendrik Folkerts, Matthias Uflacker, and Mariana Neves. 2017. Olelo: A web application for intuitive exploration of biomedical literature. Nucleic Acids Res. 45, W1 (2017), W478–W483.
[93]
Keita Kurita, Nidhi Vyas, Ayush Pareek, Alan W. Black, and Yulia Tsvetkov. 2019. Measuring bias in contextualized word representations. In Proceedings of the 1st Workshop on Gender Bias in Natural Language Processing. Association for Computational Linguistics, 166–172. DOI: https://doi.org/10.18653/v1/W19-3823
[94]
Tom Kwiatkowski, Jennimaria Palomaki, Olivia Redfield, Michael Collins, Ankur Parikh, Chris Alberti, Danielle Epstein, Illia Polosukhin, Jacob Devlin, Kenton Lee, et al. 2019. Natural questions: A benchmark for question answering research. Trans. Assoc. Comput. Ling. 7 (2019), 453–466.
[95]
Guokun Lai, Qizhe Xie, Hanxiao Liu, Yiming Yang, and Eduard Hovy. 2017. RACE: Large-scale ReAding comprehension dataset from examinations. In Proceedings of the Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 785–794. DOI: https://doi.org/10.18653/v1/D17-1082
[96]
A. Lamurias, D. Sousa, and F. M. Couto. 2020. Generating biomedical question answering corpora from Q A Forums. IEEE Access 8 (2020), 161042–161051. DOI: https://doi.org/10.1109/ACCESS.2020.3020868
[97]
Jason J. Lau, Soumya Gayen, Asma Ben Abacha, and Dina Demner-Fushman. 2018. A dataset of clinically generated visual questions and answers about radiology images. Sci. Data 5, 1 (2018), 1–10.
[98]
Jinhyuk Lee, Wonjin Yoon, Sungdong Kim, Donghyeon Kim, Sunkyu Kim, Chan Ho So, and Jaewoo Kang. 2020. BioBERT: A pre-trained biomedical language representation model for biomedical text mining. Bioinformatics 36, 4 (2020), 1234–1240.
[99]
Minsuk Lee, James Cimino, Hai Ran Zhu, Carl Sable, Vijay Shanker, John Ely, and Hong Yu. 2006. Beyond information retrieval—Medical question answering. In AMIA Annual Symposium Proceedings, Vol. 2006. American Medical Informatics Association.
[100]
Mike Lewis, Yinhan Liu, Naman Goyal, Marjan Ghazvininejad, Abdelrahman Mohamed, Omer Levy, Veselin Stoyanov, and Luke Zettlemoyer. 2020. BART: Denoising sequence-to-sequence pre-training for natural language generation, translation, and comprehension. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, 7871–7880. DOI: https://doi.org/10.18653/v1/2020.acl-main.703
[101]
Dongfang Li, Baotian Hu, Qingcai Chen, Weihua Peng, and Anqi Wang. 2020. Towards medical machine reading comprehension with structural knowledge and plain text. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP). Association for Computational Linguistics, 1427–1438. DOI: https://doi.org/10.18653/v1/2020.emnlp-main.111
[102]
Guanqiao Li, Yangzhong Zhou, Junyi Ji, Xiaozhen Liu, Qiao Jin, and Linqi Zhang. 2020. Surging publications on the COVID-19 pandemic. Clin. Microbiol. Infect. 27, 3 (2020).
[103]
Liunian Harold Li, Mark Yatskar, Da Yin, Cho-Jui Hsieh, and Kai-Wei Chang. 2019. VisualBERT: A Simple and Performant Baseline for Vision and Language. arxiv:1908.03557 [cs.CV]
[104]
Jimmy Lin, Rodrigo Nogueira, and Andrew Yates. 2020. Pretrained transformers for text ranking: BERT and beyond. arXiv preprint arXiv:2010.06467 (2020).
[105]
Min Lin, Qiang Chen, and Shuicheng Yan. 2013. Network in network. arXiv preprint arXiv:1312.4400 (2013).
[106]
Ryan T. K. Lin, Justin Liang-Te Chiu, Hong-Jei Dai, Min-Yuh Day, Richard Tzong-Han Tsai, and Wen-Lian Hsu. 2008. Biological question answering with syntactic and semantic feature matching and an improved mean reciprocal ranking measurement. In Proceedings of the IEEE International Conference on Information Reuse and Integration. IEEE, 184–189.
[107]
Yifeng Liu. 2013. The University of Alberta participation in the BioASQ challenge: The Wishart system. In Proceedings of the 1st Workshop Bio-Medical Semantic Indexing Question Answering, Conference Labs Evaluation Forum. 1–4.
[108]
Ye Liu, Shaika Chowdhury, Chenwei Zhang, Cornelia Caragea, and Philip S. Yu. 2020. Interpretable multi-step reasoning with knowledge extraction on complex healthcare question answering. arXiv preprint arXiv:2008.02434 (2020).
[109]
Yinhan Liu, Myle Ott, Naman Goyal, Jingfei Du, Mandar Joshi, Danqi Chen, Omer Levy, Mike Lewis, Luke Zettlemoyer, and Veselin Stoyanov. 2019. Roberta: A robustly optimized BERT pretraining approach. arXiv preprint arXiv:1907.11692 (2019).
[110]
Jake Luo, Guo-Qiang Zhang, Susan Wentz, Licong Cui, and Rong Xu. 2015. SimQ: real-time retrieval of similar consumer health questions. J. Med. Internet Res. 17, 2 (2015).
[111]
Anca Marginean. 2017. Question answering over biomedical linked data with grammatical framework. Seman. Web 8, 4 (2017), 565–580.
[112]
Jonathan Masci, Ueli Meier, Dan Cireşan, and Jürgen Schmidhuber. 2011. Stacked convolutional auto-encoders for hierarchical feature extraction. In Proceedings of the International Conference on Artificial Neural Networks. Springer, 52–59.
[113]
Giuseppe M. Mazzeo and Carlo Zaniolo. 2016. Question answering on RDF KBs using controlled natural language and semantic autocompletion. Seman. Web 1 (2016), 1–5.
[114]
Gabor Melli, Yang Wang, Yudong Liu, Mehdi M. Kashani, Zhongmin Shi, Baohua Gu, Anoop Sarkar, and Fred Popowich. 2005. Description of SQUASH, the SFU question answering summary handler for the DUC-2005 summarization task. Safety 1 (2005), 14345754.
[115]
Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013. Efficient estimation of word representations in vector space. In 1st International Conference on Learning Representations, ICLR 2013, Scottsdale, Arizona, USA, May 2–4, 2013, Workshop Track Proceedings, Yoshua Bengio and Yann LeCun (Eds.). Retrieved from http://arxiv.org/abs/1301.3781.
[116]
Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S. Corrado, and Jeff Dean. 2013. Distributed representations of words and phrases and their compositionality. In Proceedings of the International Conference on Advances in Neural Information Processing Systems. 3111–3119.
[117]
Diego Mollá. 2017. Macquarie University at BioASQ 5b—Query-based summarisation techniques for selecting the ideal answers. In Proceedings of the Biomedical Natural Language Processing Workshop (BioNLP). Association for Computational Linguistics, 67–75. DOI: https://doi.org/10.18653/v1/W17-2308
[118]
Diego Mollá. 2018. Macquarie University at BioASQ 6b: Deep learning and deep reinforcement learning for query-based summarisation. In Proceedings of the 6th BioASQ Workshop. Association for Computational Linguistics, 22–29. DOI: https://doi.org/10.18653/v1/W18-5303
[119]
Diego Mollá and Christopher Jones. 2019. Classification betters regression in query-based multi-document summarisation techniques for question answering. In Proceedings of the Joint European Conference on Machine Learning and Knowledge Discovery in Databases. Springer, 624–635.
[120]
Diego Molla, Christopher Jones, and Vincent Nguyen. 2020. Query focused multi-document summarisation of biomedical texts. arXiv preprint arXiv:2008.11986 (2020).
[121]
Diego Molla and Maria Elena Santiago-Martinez. 2011. Development of a corpus for evidence based medicine summarisation. In Proceedings of the Australasian Language Technology Association Workshop. 86–94. Retrieved from https://www.aclweb.org/anthology/U11-1012.
[122]
Diego Mollá, María Elena Santiago-Martínez, Abeed Sarker, and Cécile Paris. 2016. A corpus for research in text processing for evidence based medicine. Lang. Resour. Eval. 50, 4 (2016), 705–727.
[123]
Diego Mollá, Rolf Schwitter, Michael Hess, and Rachel Fournier. 2000. ExtrAns, an answer extraction system. In T.A.L. 41, 2 (2000), 1–25.
[124]
Timo Moller, Anthony Reina, Raghavan Jayakumar, and Malte Pietsch. 2020. COVID-QA: A question answering dataset for COVID-19. Retrieved from https://openreview.net/forum?id=JENSKEEzsoU.
[125]
Roser Morante, Martin Krallinger, Alfonso Valencia, and Walter Daelemans. 2012. Machine reading of biomedical texts about Alzheimer’s disease. In CLEF 2012 Conference and Labs of the Evaluation Forum-question Answering For Machine Reading Evaluation (QA4MRE), J. Forner (Ed.). CEUR-WS, 1–14.
[126]
Vinod Nair and Geoffrey E. Hinton. 2010. Rectified linear units improve restricted Boltzmann machines. In Proceedings of the 27th International Conference on International Conference on Machine Learning. 807–814.
[127]
Preslav Nakov, Doris Hoogeveen, Lluís Màrquez, Alessandro Moschitti, Hamdy Mubarak, Timothy Baldwin, and Karin Verspoor. 2017. SemEval-2017 Task 3: Community question answering. In Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval). Association for Computational Linguistics, 27–48. DOI: https://doi.org/10.18653/v1/S17-2003
[128]
Preslav Nakov, Lluís Màrquez, Alessandro Moschitti, Walid Magdy, Hamdy Mubarak, Abed Alhakim Freihat, Jim Glass, and Bilal Randeree. 2016. SemEval-2016 Task 3: Community question answering. In Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval). Association for Computational Linguistics. 525–545. DOI: https://doi.org/10.18653/v1/S16-1083
[129]
Anastasios Nentidis, Anastasia Krithara, Konstantinos Bougiatiotis, Martin Krallinger, Carlos Rodriguez-Penagos, Marta Villegas, and Georgios Paliouras. 2020. Overview of BioASQ 2020: The eighth BioASQ challenge on large-scale biomedical semantic indexing and question answering. In Experimental IR Meets Multilinguality, Multimodality, and Interaction, Avi Arampatzis, Evangelos Kanoulas, Theodora Tsikrika, Stefanos Vrochidis, Hideo Joho, Christina Lioma, Carsten Eickhoff, Aurélie Névéol, Linda Cappellato, and Nicola Ferro (Eds.). Springer International Publishing, Cham, 194–214.
[130]
Mariana Neves and Ulf Leser. 2015. Question answering for biology. Methods 74 (2015), 36–46.
[131]
Binh D. Nguyen, Thanh-Toan Do, Binh X. Nguyen, Tuong Do, Erman Tjiputra, and Quang D. Tran. 2019. Overcoming data limitation in medical visual question answering. In Proceedings of the International Conference on Medical Image Computing and Computer-assisted Intervention. Springer, 522–530.
[132]
Vincent Nguyen. 2019. Question answering in the biomedical domain. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop. Association for Computational Linguistics, 54–63. DOI: https://doi.org/10.18653/v1/P19-2008
[133]
David N. Nicholson and Casey S. Greene. 2020. Constructing knowledge graphs and their biomedical applications. Comput. Struct. Biotechnol. J. 18 (2020), 1414.
[134]
Yun Niu, Graeme Hirst, Gregory McArthur, and Patricia Rodriguez-Gianolli. 2003. Answering clinical questions with role identification. In Proceedings of the ACL Workshop on Natural Language Processing in Biomedicine. Association for Computational Linguistics. 73–80. DOI: https://doi.org/10.3115/1118958.1118968
[135]
Rodrigo Nogueira, Zhiying Jiang, Ronak Pradeep, and Jimmy Lin. 2020. Document ranking with a pretrained sequence-to-sequence model. In Findings of the Association for Computational Linguistics: EMNLP 2020. Association for Computational Linguistics. 708–718. DOI: https://doi.org/10.18653/v1/2020.findings-emnlp.63
[136]
María-Dolores Olvera-Lobo and Juncal Gutiérrez-Artacho. 2011. Multilingual question-answering system in biomedical domain on the web: An evaluation. In Proceedings of the International Conference of the Cross-language Evaluation Forum for European Languages. Springer, 83–88.
[137]
Ibrahim Burak Ozyurt, Anita Bandrowski, and Jeffrey S. Grethe. 2020. Bio-AnswerFinder: A system to find answers to questions from biomedical texts. Database 2020 (2020).
[138]
Anusri Pampari, Preethi Raghavan, Jennifer Liang, and Jian Peng. 2018. emrQA: A large corpus for question answering on electronic medical records. In Proceedings of the Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 2357–2368. DOI: https://doi.org/10.18653/v1/D18-1258
[139]
Dimitris Pappas, Ion Androutsopoulos, and Haris Papageorgiou. 2018. BioRead: A new dataset for biomedical reading comprehension. In Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC). European Language Resources Association (ELRA). Retrieved from https://www.aclweb.org/anthology/L18-1439.
[140]
Dimitris Pappas, Ryan McDonald, Georgios-Ioannis Brokos, and Ion Androutsopoulos. 2019. AUEB at BioASQ 7: document and snippet retrieval. In Proceedings of the Joint European Conference on Machine Learning and Knowledge Discovery in Databases. Springer, 607–623.
[141]
Dimitris Pappas, Petros Stavropoulos, and Ion Androutsopoulos. 2020. AUEB-NLP at BioASQ 8: Biomedical document and snippet retrieval. In CLEF 2020 Working Notes.
[142]
Dimitris Pappas, Petros Stavropoulos, Ion Androutsopoulos, and Ryan McDonald. 2020. BioMRC: A dataset for biomedical machine reading comprehension. In Proceedings of the 19th SIGBioMed Workshop on Biomedical Language Processing. Association for Computational Linguistics, 140–149. Retrieved from https://www.aclweb.org/anthology/2020.bionlp-1.15.
[143]
Junwoo Park, Youngwoo Cho, Haneol Lee, Jaegul Choo, and Edward Choi. 2020. Knowledge graph-based question answering with electronic health records. arXiv preprint arXiv:2010.09394 (2020).
[144]
Ioannis Partalas, Eric Gaussier, Axel-Cyrille Ngonga Ngomo, et al. 2013. Results of the first BioASQ workshop. In BioASQ@CLEF 2013.
[145]
Anselmo Penas, Yusuke Miyao, Alvaro Rodrigo, Eduard H. Hovy, and Noriko Kando. 2014. Overview of CLEF QA entrance exams task 2014. In CLEF (Working Notes). CEUR-WS, 1194–1200.
[146]
Matthew Peters, Mark Neumann, Mohit Iyyer, Matt Gardner, Christopher Clark, Kenton Lee, and Luke Zettlemoyer. 2018. Deep contextualized word representations. In Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers). Association for Computational Linguistics, 2227–2237. DOI: https://doi.org/10.18653/v1/N18-1202
[147]
Mai Phuong Pham et al. 2020. Machine Comprehension for Clinical Case Reports. Ph.D. Dissertation. Massachusetts Institute of Technology.
[148]
Adam Poliak, Max Fleming, Cash Costello, Kenton W. Murray, Mahsa Yarmohammadi, Shivani Pandya, Darius Irani, Milind Agarwal, Udit Sharma, Shuo Sun, Nicola Ivanov, Lingxi Shang, Kaushik Srinivasan, Seolhwa Lee, Xu Han, Smisha Agarwal, and João Sedoc. 2020. Collecting verified COVID-19 question answer pairs. In Proceedings of the 1st Workshop on NLP for COVID-19 (Part 2) at EMNLP. Association for Computational Linguistics. DOI: https://doi.org/10.18653/v1/2020.nlpcovid19-2.31
[149]
Hemant Pugaliya, Karan Saxena, Shefali Garg, Sheetal Shalini, Prashant Gupta, Eric Nyberg, and Teruko Mitamura. 2019. Pentagon at MEDIQA 2019: Multi-task learning for filtering and re-ranking answers using language inference and question entailment. arXiv preprint arXiv:1907.01643 (2019).
[150]
Minghui Qiu, Feng-Lin Li, Siyu Wang, Xing Gao, Yan Chen, Weipeng Zhao, Haiqing Chen, Jun Huang, and Wei Chu. 2017. AliMe Chat: A sequence to sequence and rerank based chatbot engine. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers). Association for Computational Linguistics, 498–503. DOI: https://doi.org/10.18653/v1/P17-2079
[151]
Colin Raffel, Noam Shazeer, Adam Roberts, Katherine Lee, Sharan Narang, Michael Matena, Yanqi Zhou, Wei Li, and Peter J. Liu. 2020. Exploring the limits of transfer learning with a unified text-to-text transformer. J. Mach. Learn. Res. 21, 140 (2020), 1–67.
[152]
Preethi Raghavan, Siddharth Patwardhan, Jennifer J. Liang, and Murthy V. Devarakonda. 2018. Annotating electronic medical records for question answering. arXiv preprint arXiv:1805.06816 (2018).
[153]
Alvin Rajkomar, Michaela Hardt, Michael D. Howell, Greg Corrado, and Marshall H. Chin. 2018. Ensuring fairness in machine learning to advance health equity. Ann. Intern. Med. 169, 12 (2018), 866–872.
[154]
Pranav Rajpurkar, Robin Jia, and Percy Liang. 2018. Know what you don’t know: Unanswerable questions for SQuAD. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers). Association for Computational Linguistics, 784–789. DOI: https://doi.org/10.18653/v1/P18-2124
[155]
Pranav Rajpurkar, Jian Zhang, Konstantin Lopyrev, and Percy Liang. 2016. SQuAD: 100,000+ Questions for machine comprehension of text. In Proceedings of the Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics. 2383–2392. DOI: https://doi.org/10.18653/v1/D16-1264
[156]
Aarne Ranta, Ali El Dada, and Janna Khegai. 2009. The GF resource grammar library. Ling. Issues Lang. Technol. 2, 2 (2009), 1–63.
[157]
Revanth Gangi Reddy, Bhavani Iyer, Md Arafat Sultan, Rong Zhang, Avi Sil, Vittorio Castelli, Radu Florian, and Salim Roukos. 2020. End-to-end QA on COVID-19: Domain adaptation with synthetic training. arXiv preprint arXiv:2012.01414 (2020).
[158]
Fuji Ren and Yangyang Zhou. 2020. CGMVQA: A new classification and generative model for medical visual question answering. IEEE Access 8 (2020), 50626–50636.
[159]
Fabio Rinaldi, James Dowdall, Gerold Schneider, and Andreas Persidis. 2004. Answering questions in the genomics domain. In Proceedings of the Conference on Question Answering in Restricted Domains. Association for Computational Linguistics, 46–53. Retrieved from https://www.aclweb.org/anthology/W04-0508.
[160]
Kirk Roberts and Dina Demner-Fushman. 2016. Interactive use of online health resources: a comparison of consumer and professional questions. J. Amer. Med. Inform. Assoc. 23, 4 (2016), 802–811.
[161]
Kirk Roberts and Braja Gopal Patra. 2017. A semantic parsing method for mapping clinical questions to logical forms. In AMIA Annual Symposium Proceedings, Vol. 2017. American Medical Informatics Association.
[162]
Alexey Romanov and Chaitanya Shivade. 2018. Lessons from natural language inference in the clinical domain. In Proceedings of the Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics. 1586–1596. DOI: https://doi.org/10.18653/v1/D18-1187
[163]
Subendhu Rongali, Abhyuday Jagannatha, Bhanu Pratap Singh Rawat, and Hong Yu. 2020. Improved pretraining for domain-specific contextual embedding models. arXiv preprint arXiv:2004.02288 (2020).
[164]
Tony Russell-Rose and Jon Chamberlain. 2017. Expert search strategies: The information retrieval practices of healthcare information professionals. JMIR Med. Inform. 5, 4 (2017).
[165]
David L. Sackett. 1997. Evidence-based medicine. In Seminars in Perinatology, Vol. 21. Elsevier, 3–5.
[166]
Abeed Sarker, Diego Mollá, and Cécile Paris. 2013. An approach for query-focused text summarisation for evidence based medicine. In Artificial Intelligence in Medicine, Niels Peek, Roque Marín Morales, and Mor Peleg (Eds.). Springer Berlin, 295–304.
[167]
Max Savery, Asma Ben Abacha, Soumya Gayen, and Dina Demner-Fushman. 2020. Question-driven summarization of answers to consumer health questions. arXiv e-prints (May 2020). arxiv:2005.09067 [cs.CL].
[168]
Frederik Schulze and Mariana Neves. 2016. Entity-Supported summarization of biomedical abstracts. In Proceedings of the 5th Workshop on Building and Evaluating Resources for Biomedical Text Mining (BioTxtM). The COLING 2016 Organizing Committee, 40–49. Retrieved from https://www.aclweb.org/anthology/W16-5105.
[169]
Frederik Schulze, Ricarda Schüler, Tim Draeger, Daniel Dummer, Alexander Ernst, Pedro Flemming, Cindy Perscheid, and Mariana Neves. 2016. HPI question answering system in BioASQ 2016. In Proceedings of the 4th BioASQ Workshop. 38–44.
[170]
Abigail See, Peter J. Liu, and Christopher D. Manning. 2017. Get to the point: Summarization with pointer-generator networks. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, 1073–1083. DOI: https://doi.org/10.18653/v1/P17-1099
[171]
Minjoon Seo, Aniruddha Kembhavi, Ali Farhadi, and Hannaneh Hajishirzi. 2016. Bidirectional attention flow for machine comprehension. arXiv preprint arXiv:1611.01603 (2016).
[172]
Elaheh ShafieiBavani, Mohammad Ebrahimi, Raymond Wong, and Fang Chen. 2016. Appraising UMLS coverage for summarizing medical evidence. In Proceedings of the 26th International Conference on Computational Linguistics: Technical Papers. The COLING 2016 Organizing Committee, 513–524. Retrieved from https://www.aclweb.org/anthology/C16-1050.
[173]
Samrudhi Sharma, Huda Patanwala, Manthan Shah, and Khushali Deulkar. 2015. A survey of medical question answering systems. Int. J. Eng. Technic. Res. 3, 2 (2015), 2321–0869.
[174]
Vasu Sharma, Nitish Kulkarni, Srividya Pranavi, Gabriel Bayomi, Eric Nyberg, and Teruko Mitamura. 2018. BioAMA: Towards an end to end BioMedical question answering system. In Proceedings of the Biomedical Natural Language Processing Workshop (BioNLP). Association for Computational Linguistics. 109–117. DOI: https://doi.org/10.18653/v1/W18-2312
[175]
Zhongmin Shi, Gabor Melli, Yang Wang, Yudong Liu, Baohua Gu, Mehdi M. Kashani, Anoop Sarkar, and Fred Popowich. 2007. Question answering summarization of multiple biomedical documents. In Proceedings of the Conference of the Canadian Society for Computational Studies of Intelligence. Springer, 284–295.
[176]
Hideyuki Shibuki, Kotaro Sakamoto, Yoshinobu Kano, Teruko Mitamura, Madoka Ishioroshi, Kelly Y. Itakura, Di Wang, Tatsunori Mori, and Noriko Kando. 2014. Overview of the NTCIR-11 QA-Lab Task. In Proceedings of the NTCIR Conference.
[177]
Ana Claudia Sima, Tarcisio Mendes de Farias, Maria Anisimova, Christophe Dessimoz, Marc Robinson-Rechavi, Erich Zbinden, and Kurt Stockinger. 2021. Bio-SODA: Enabling natural language question answering over knowledge graphs without training data. arXiv preprint arXiv:2104.13744 (2021).
[178]
Karen Simonyan and Andrew Zisserman. 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014).
[179]
Sarvesh Soni, Meghana Gudala, Daisy Zhe Wang, and Kirk Roberts. 2019. Using FHIR to construct a corpus of clinical questions annotated with logical forms and answers. In AMIA Annual Symposium Proceedings, Vol. 2019. American Medical Informatics Association.
[180]
Sarvesh Soni and Kirk Roberts. 2019. A paraphrase generation system for EHR question answering. In Proceedings of the 18th BioNLP Workshop and Shared Task. 20–29.
[181]
Sarvesh Soni and Kirk Roberts. 2020. Paraphrasing to improve the performance of electronic health records question answering. AMIA Summ. Translat. Sci. Proc. 2020 (2020), 626.
[182]
Yash Srivastava, Vaishnav Murali, Shiv Ram Dubey, and Snehasis Mukherjee. 2019. Visual question answering using deep learning: A survey and performance analysis. arXiv preprint arXiv:1909.01860 (2019).
[183]
Michael Q. Stearns, Colin Price, Kent A. Spackman, and Amy Y. Wang. 2001. SNOMED clinical terms: Overview of the development process and project status. In Proceedings of the AMIA Symposium. American Medical Informatics Association.
[184]
Dan Su, Yan Xu, Genta Indra Winata, Peng Xu, Hyeondey Kim, Zihan Liu, and Pascale Fung. 2019. Generalizing question answering system with pre-trained language model fine-tuning. In Proceedings of the 2nd Workshop on Machine Reading for Question Answering. Association for Computational Linguistics, 203–211. DOI: https://doi.org/10.18653/v1/D19-5827
[185]
Dan Su, Yan Xu, Tiezheng Yu, Farhad Bin Siddique, Elham Barezi, and Pascale Fung. 2020. CAiRE-COVID: A question answering and query-focused multi-document summarization system for COVID-19 scholarly information management. In Proceedings of the 1st Workshop on NLP for COVID-19 (Part 2) at EMNLP 2020. Association for Computational Linguistics. DOI: https://doi.org/10.18653/v1/2020.nlpcovid19-2.14
[186]
Shuo Sun and João Sedoc. 2020. An analysis of BERT FAQ retrieval models for COVID-19 infobot. (2020).
[187]
Simon Šuster and Walter Daelemans. 2018. CliCR: A dataset of clinical case reports for machine reading comprehension. In Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers). Association for Computational Linguistics, 1551–1563. DOI: https://doi.org/10.18653/v1/N18-1140
[188]
Kouji Takahashi, Asako Koike, and Toshihisa Takagi. 2004. Question answering system in biomedical domain. In Proceedings of the 15th International Conference on Genome Informatics. Citeseer, 161–162.
[189]
Hao Tan and Mohit Bansal. 2019. LXMERT: Learning cross-modality encoder representations from transformers. In Proceedings of the Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). Association for Computational Linguistics, 5100–5111. DOI: https://doi.org/10.18653/v1/D19-1514
[190]
Raphael Tang, Rodrigo Nogueira, Edwin Zhang, Nikhil Gupta, Phuong Cam, Kyunghyun Cho, and Jimmy Lin. 2020. Rapidly bootstrapping a question answering dataset for COVID-19. arXiv preprint arXiv:2004.11339 (2020).
[191]
Rafael M. Terol, Patricio Martínez-Barco, and Manuel Palomar. 2007. A knowledge based method for the medical question answering problem. Comput. Biol. Med. 37, 10 (2007), 1511–1521.
[192]
Yuanhe Tian, Weicheng Ma, Fei Xia, and Yan Song. 2019. ChiMed: A Chinese medical corpus for question answering. In Proceedings of the 18th BioNLP Workshop and Shared Task. Association for Computational Linguistics, 250–260. DOI: https://doi.org/10.18653/v1/W19-5027
[193]
George Tsatsaronis, Georgios Balikas, Prodromos Malakasiotis, Ioannis Partalas, Matthias Zschunke, Michael R. Alvers, Dirk Weissenborn, Anastasia Krithara, Sergios Petridis, Dimitris Polychronopoulos, et al. 2015. An overview of the BIOASQ large-scale biomedical semantic indexing and question answering competition. BMC Bioinform. 16, 1 (2015), 138.
[194]
Christina Unger, Corina Forascu, Vanessa Lopez, Axel-Cyrille Ngonga Ngomo, Elena Cabrio, Philipp Cimiano, and Sebastian Walter. 2014. Question answering over linked data (QALD-4). In Working Notes for CLEF 2014 Conference. CEUR-WS.
[195]
Hadi Veisi and Hamed Fakour Shandi. 2020. A Persian medical question answering system. Int. J. Artif. Intell. Tools 29, 06 (2020), 2050019.
[196]
David Vilares and Carlos Gómez-Rodríguez. 2019. HEAD-QA: A healthcare dataset for complex reasoning. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, 960–966. DOI: https://doi.org/10.18653/v1/P19-1092
[197]
Ellen M. Voorhees. 2001. The TREC question answering track. Nat. Lang. Eng. 7, 4 (2001), 361–378. DOI: https://doi.org/10.1017/S1351324901002789
[198]
Di Wang and Eric Nyberg. 2017. CMU OAQA at TREC 2017 LiveQA: A neural dual entailment approach for question paraphrase identification. In Proceedings of the Text Retrieval Conference (TREC).
[199]
Lucy Lu Wang, Kyle Lo, Yoganand Chandrasekhar, Russell Reas, Jiangjiang Yang, Darrin Eide, K. Funk, Rodney Michael Kinney, Ziyang Liu, W. Merrill, P. Mooney, D. Murdick, Devvret Rishi, Jerry Sheehan, Zhihong Shen, B. Stilson, Alex D. Wade, Kuansan Wang, Christopher Wilhelm, Boya Xie, D. Raymond, Daniel S. Weld, Oren Etzioni, and Sebastian Kohlmeier. 2020. CORD-19: The Covid-19 open research dataset. ArXiv, arXiv:2004.10706v2.
[200]
Ping Wang, Tian Shi, and Chandan K. Reddy. 2020. Text-to-SQL generation for question answering on electronic medical records. In Proceedings of the Web Conference. Association for Computing Machinery, New York, NY, 350–361. DOI: https://doi.org/10.1145/3366423.3380120
[201]
Chih-Hsuan Wei, Hung-Yu Kao, and Zhiyong Lu. 2013. PubTator: a web-based text mining tool for assisting biocuration. Nucleic Acids Res. 41, W1 (2013), W518–W522.
[202]
Wang Weiming, Dawei Hu, Min Feng, and Liu Wenyin. 2007. Automatic clinical question answering based on UMLS relations. In Proceedings of the 3rd International Conference on Semantics, Knowledge and Grid (SKG). IEEE, 495–498.
[203]
Dirk Weissenborn, George Tsatsaronis, and Michael Schroeder. 2013. Answering factoid questions in the biomedical domain. (2013).
[204]
Dirk Weissenborn, Georg Wiese, and Laura Seiffe. 2017. Making neural QA as simple as possible but not simpler. In Proceedings of the 21st Conference on Computational Natural Language Learning (CoNLL). Association for Computational Linguistics, 271–280. DOI: https://doi.org/10.18653/v1/K17-1028
[205]
Johannes Welbl, Pontus Stenetorp, and Sebastian Riedel. 2018. Constructing datasets for multi-hop reading comprehension across documents. Trans. Assoc. Comput. Ling. 6 (2018), 287–302. DOI: https://doi.org/10.1162/tacl_a_00021
[206]
Sarah Wiegreffe and Yuval Pinter. 2019. Attention is not not explanation. In Proceedings of the Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). Association for Computational Linguistics, 11–20. DOI: https://doi.org/10.18653/v1/D19-1002
[207]
Georg Wiese, Dirk Weissenborn, and Mariana Neves. 2017. Neural domain adaptation for biomedical question answering. In Proceedings of the 21st Conference on Computational Natural Language Learning (CoNLL). Association for Computational Linguistics, 281–289. DOI: https://doi.org/10.18653/v1/K17-1029
[208]
Adina Williams, Nikita Nangia, and Samuel Bowman. 2018. A broad-coverage challenge corpus for sentence understanding through inference. In Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers). Association for Computational Linguistics, 1112–1122. DOI: https://doi.org/10.18653/v1/N18-1101
[209]
Qi Wu, Damien Teney, Peng Wang, Chunhua Shen, Anthony Dick, and Anton van den Hengel. 2017. Visual question answering: A survey of methods and datasets. Comput. Vis. Image Underst. 163 (2017), 21–40.
[210]
Ye Wu, Tak-Wah Lam, Hing-Fung Ting, and Ruibang Luo. 2021. BioNumQA-BERT: Answering biomedical questions using numerical facts with a deep language representation model. In Proceedings of the 12th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics.
[211]
Caiming Xiong, Victor Zhong, and Richard Socher. 2016. Dynamic coattention networks for question answering. arXiv preprint arXiv:1611.01604 (2016).
[212]
Xin Yan, Lin Li, Chulin Xie, Jun Xiao, and Lin Gu. 2019. Zhejiang university at ImageCLEF 2019 visual question answering in the medical domain. In CLEF (Working Notes).
[213]
Zi Yang, Niloy Gupta, Xiangyu Sun, Di Xu, Chi Zhang, and Eric Nyberg. 2015. Learning to answer biomedical factoid & list questions: OAQA at BioASQ 3B. (2015).
[214]
Zichao Yang, Xiaodong He, Jianfeng Gao, Li Deng, and Alex Smola. 2016. Stacked attention networks for image question answering. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 21–29.
[215]
Zhilin Yang, Peng Qi, Saizheng Zhang, Yoshua Bengio, William Cohen, Ruslan Salakhutdinov, and Christopher D. Manning. 2018. HotpotQA: A dataset for diverse, explainable multi-hop question answering. In Proceedings of the Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 2369–2380. DOI: https://doi.org/10.18653/v1/D18-1259
[216]
Zi Yang, Yue Zhou, and Eric Nyberg. 2016. Learning to answer biomedical questions: OAQA at BioASQ 4B. In Proceedings of the 4th BioASQ Workshop. Association for Computational Linguistics, 23–37. DOI: https://doi.org/10.18653/v1/W16-3104
[217]
Wenpeng Yin, Hinrich Schütze, Bing Xiang, and Bowen Zhou. 2016. ABCNN: Attention-based convolutional neural network for modeling sentence pairs. Trans. Assoc. Comput. Ling. 4 (2016), 259–272. DOI: https://doi.org/10.1162/tacl_a_00097
[218]
Wonjin Yoon, Jinhyuk Lee, Donghyeon Kim, Minbyul Jeong, and Jaewoo Kang. 2019. Pre-trained language model for biomedical question answering. In Proceedings of the Joint European Conference on Machine Learning and Knowledge Discovery in Databases. Springer, 727–740.
[219]
Hong Yu and Yong-gang Cao. 2008. Automatically extracting information needs from ad hoc clinical questions. In AMIA Annual Symposium Proceedings, Vol. 2008. American Medical Informatics Association.
[220]
Hong Yu, Minsuk Lee, David Kaufman, John Ely, Jerome A. Osheroff, George Hripcsak, and James Cimino. 2007. Development, implementation, and a cognitive evaluation of a definitional question answering system for physicians. J. Biomed. Inform. 40, 3 (2007), 236–251.
[221]
Zhou Yu, Jun Yu, Jianping Fan, and Dacheng Tao. 2017. Multi-modal factorized bilinear pooling with co-attention learning for visual question answering. In Proceedings of the IEEE International Conference on Computer Vision. 1821–1830.
[222]
Zhou Yu, Jun Yu, Chenchao Xiang, Jianping Fan, and Dacheng Tao. 2018. Beyond bilinear: Generalized multimodal factorized high-order pooling for visual question answering. IEEE Trans. Neural Netw. Learn. Sys. 29, 12 (2018), 5947–5959.
[223]
Zheng Yuan, Yijia Liu, Chuanqi Tan, Songfang Huang, and Fei Huang. 2021. Improving biomedical pretrained language models with knowledge. In Proceedings of the 20th Workshop on Biomedical Language Processing. Association for Computational Linguistics, 180–190. DOI: https://doi.org/10.18653/v1/2021.bionlp-1.20
[224]
Zheng Yuan, Zhengyun Zhao, Haixia Sun, Jiao Li, Fei Wang, and Sheng Yu. 2021. CODER: Knowledge infused cross-lingual medical term embedding for term normalization. arxiv:2011.02947 [cs.CL].
[225]
Xiang Yue, Bernal Jimenez Gutierrez, and Huan Sun. 2020. Clinical reading comprehension: a thorough analysis of the emrQA dataset. arXiv e-prints, Article arXiv:2005.00574 (May 2020).
[226]
Xiang Yue, Ziyu Yao, Simon Lin, Huan Sun, et al. 2020. CliniQG4QA: Generating diverse questions for domain adaptation of clinical question answering. arXiv preprint arXiv:2010.16021 (2020).
[227]
Li-Ming Zhan, Bo Liu, Lu Fan, Jiaxin Chen, and Xiao-Ming Wu. 2020. Medical visual question answering via conditional reasoning. In Proceedings of the 28th ACM International Conference on Multimedia. 2345–2354.
[228]
Sheng Zhang, Xin Zhang, Hui Wang, Jiajun Cheng, Pei Li, and Zhaoyun Ding. 2017. Chinese medical question answer matching using end-to-end character-level multi-scale CNNs. Appl. Sci. 7, 8 (2017), 767.
[229]
Sheng Zhang, Xin Zhang, Hui Wang, Lixiang Guo, and Shanshan Liu. 2018. Multi-scale attentive interaction networks for Chinese medical question answer selection. IEEE Access 6 (2018), 74061–74071.
[230]
Xiao Zhang, Ji Wu, Zhiyang He, Xien Liu, and Ying Su. 2018. Medical exam question answering with large-scale reading comprehension. In Proceedings of the 32nd AAAI Conference on Artificial Intelligence.
[231]
Xinliang Frederick Zhang, Heming Sun, Xiang Yue, Emmett Jesrani, Simon Lin, and Huan Sun. 2020. COUGH: A challenge dataset and models for COVID-19 FAQ retrieval. arXiv preprint arXiv:2010.12800 (2020).
[232]
Yuanzhe Zhang, Shizhu He, Kang Liu, and Jun Zhao. 2016. A joint model for question answering over multiple knowledge bases. In Proceedings of the AAAI Conference on Artificial Intelligence.
[233]
Yanchun Zhang, S. Peng, R. You, Z. Xie, B. Wang, and Shanfeng Zhu. 2015. The Fudan participation in the 2015 BioASQ challenge: Large-scale biomedical semantic indexing and question answering. In CEUR Workshop Proceedings, Vol. 1391. CEUR Workshop Proceedings.
[234]
Yingying Zhang, Shengsheng Qian, Quan Fang, and Changsheng Xu. 2019. Multi-modal knowledge-aware hierarchical attention network for explainable medical question answering. In Proceedings of the 27th ACM International Conference on Multimedia (MM). Association for Computing Machinery, New York, NY, 1089–1097. DOI: https://doi.org/10.1145/3343031.3351033
[235]
Nikita Zhiltsov, Alexander Kotov, and Fedor Nikolaev. 2015. Fielded sequential dependence model for ad hoc entity retrieval in the web of data. In Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval. 253–262.
[236]
Li Zhou, Jianfeng Gao, Di Li, and Heung-Yeung Shum. 2020. The design and implementation of XiaoIce, an empathetic social chatbot. Comput. Ling. 46, 1 (2020), 53–93.
[237]
Wei Zhou and Clement Yu. 2007. TREC genomics track at UIC. Resource 1 (2007), G2.
[238]
Ming Zhu, Aman Ahuja, Da-Cheng Juan, Wei Wei, and Chandan K. Reddy. 2020. Question answering with long multiple-span answers. In Findings of the Association for Computational Linguistics: EMNLP 2020. Association for Computational Linguistics, 3840–3849. DOI: https://doi.org/10.18653/v1/2020.findings-emnlp.342
[239]
Ming Zhu, Aman Ahuja, Wei Wei, and Chandan K. Reddy. 2019. A hierarchical attention retrieval model for healthcare question answering. In Proceedings of the World Wide Web Conference (WWW). Association for Computing Machinery, New York, NY, 2472–2482. DOI: https://doi.org/10.1145/3308558.3313699
[240]
Wei Zhu, Xiaofeng Zhou, K. Wang, X. Luo, Xiepeng Li, Y. Ni, and G. Xie. 2019. PANLP at MEDIQA 2019: Pre-trained language models, transfer learning and knowledge distillation. In Proceedings of the BioNLP@ACL Conference.
[241]
Pierre Zweigenbaum. 2003. Question answering in biomedicine. Nat. Lang. Process. Quest. Answer. 2005 (2003), 1–4.

Cited By

View all
  • (2024)Efficient Machine Reading Comprehension for Health Care Applications: Algorithm Development and Validation of a Context Extraction ApproachJMIR Formative Research10.2196/524828(e52482)Online publication date: 25-Mar-2024
  • (2024)Development and application of Chinese medical ontology for diabetes mellitusBMC Medical Informatics and Decision Making10.1186/s12911-023-02405-y24:1Online publication date: 19-Jan-2024
  • (2024)Effective and Reliable Domain-Specific Knowledge Question Answering2024 IEEE International Conference on e-Business Engineering (ICEBE)10.1109/ICEBE62490.2024.00044(238-243)Online publication date: 11-Oct-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Computing Surveys
ACM Computing Surveys  Volume 55, Issue 2
February 2023
803 pages
ISSN:0360-0300
EISSN:1557-7341
DOI:10.1145/3505209
Issue’s Table of Contents

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 18 January 2022
Accepted: 01 September 2021
Revised: 01 August 2021
Received: 01 March 2021
Published in CSUR Volume 55, Issue 2

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Question answering
  2. natural language processing
  3. machine learning
  4. biomedicine

Qualifiers

  • Survey
  • Refereed

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)734
  • Downloads (Last 6 weeks)52
Reflects downloads up to 01 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Efficient Machine Reading Comprehension for Health Care Applications: Algorithm Development and Validation of a Context Extraction ApproachJMIR Formative Research10.2196/524828(e52482)Online publication date: 25-Mar-2024
  • (2024)Development and application of Chinese medical ontology for diabetes mellitusBMC Medical Informatics and Decision Making10.1186/s12911-023-02405-y24:1Online publication date: 19-Jan-2024
  • (2024)Effective and Reliable Domain-Specific Knowledge Question Answering2024 IEEE International Conference on e-Business Engineering (ICEBE)10.1109/ICEBE62490.2024.00044(238-243)Online publication date: 11-Oct-2024
  • (2024)ZDDR: A Zero-Shot Defender for Adversarial Samples Detection and RestorationIEEE Access10.1109/ACCESS.2024.335656812(39081-39094)Online publication date: 2024
  • (2024)BioInstruct: instruction tuning of large language models for biomedical natural language processingJournal of the American Medical Informatics Association10.1093/jamia/ocae12231:9(1821-1832)Online publication date: 4-Jun-2024
  • (2024)Question answering systems for health professionals at the point of care—a systematic reviewJournal of the American Medical Informatics Association10.1093/jamia/ocae01531:4(1009-1024)Online publication date: 16-Feb-2024
  • (2024)GeneGPT: augmenting large language models with domain tools for improved access to biomedical informationBioinformatics10.1093/bioinformatics/btae07540:2Online publication date: 10-Feb-2024
  • (2024)Progress and opportunities of foundation models in bioinformaticsBriefings in Bioinformatics10.1093/bib/bbae54825:6Online publication date: 26-Oct-2024
  • (2024)Matching patients to clinical trials with large language modelsNature Communications10.1038/s41467-024-53081-z15:1Online publication date: 18-Nov-2024
  • (2024)Uniqorn: Unified question answering over RDF knowledge graphs and natural language textJournal of Web Semantics10.1016/j.websem.2024.10083383(100833)Online publication date: Dec-2024
  • Show More Cited By

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Full Text

View this article in Full Text.

Full Text

HTML Format

View this article in HTML Format.

HTML Format

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media