default search action
Yanjun Gao
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2025
- [j12]Wonjin Yoon, Shan Chen, Yanjun Gao, Zhanzhan Zhao, Dmitriy Dligach, Danielle S. Bitterman, Majid Afshar, Timothy A. Miller:
LCD benchmark: long clinical document benchmark on mortality prediction for language models. J. Am. Medical Informatics Assoc. 32(2): 285-295 (2025) - [j11]Skatje Myers, Timothy A. Miller, Yanjun Gao, Matthew M. Churpek, Anoop M. Mayampurath, Dmitriy Dligach, Majid Afshar:
Lessons learned on information retrieval in electronic health records: a comparison of embedding models and pooling strategies. J. Am. Medical Informatics Assoc. 32(2): 357-364 (2025) - 2024
- [j10]Yanjun Gao, Diwakar Mahajan, Özlem Uzuner, Meliha Yetisgen:
Clinical natural language processing for secondary uses. J. Biomed. Informatics 150: 104596 (2024) - [j9]Majid Afshar, Yanjun Gao, Deepak Gupta, Emma Croxford, Dina Demner-Fushman:
On the role of the UMLS in supporting diagnosis generation proposed by Large Language Models. J. Biomed. Informatics 157: 104707 (2024) - [c16]Xin Chen, Hanxian Huang, Yanjun Gao, Yi Wang, Jishen Zhao, Ke Ding:
Learning to Maximize Mutual Information for Chain-of-Thought Distillation. ACL (Findings) 2024: 6857-6868 - [c15]Yanjun Gao, Skatje Myers, Shan Chen, Dmitriy Dligach, Timothy A. Miller, Danielle S. Bitterman, Matthew M. Churpek, Majid Afshar:
When Raw Data Prevails: Are Large Language Model Embeddings Effective in Numerical Data Representation for Medical Machine Learning Applications? EMNLP (Findings) 2024: 5414-5428 - [c14]Li Yu, Yanjun Gao, Farhad Pakdaman, Moncef Gabbouj:
Panoramic Image Inpainting with Gated Convolution and Contextual Reconstruction Loss. ICASSP 2024: 4255-4259 - [i19]Li Yu, Yanjun Gao, Farhad Pakdaman, Moncef Gabbouj:
Panoramic Image Inpainting With Gated Convolution And Contextual Reconstruction Loss. CoRR abs/2402.02936 (2024) - [i18]Xin Chen, Hanxian Huang, Yanjun Gao, Yi Wang, Jishen Zhao, Ke Ding:
Learning to Maximize Mutual Information for Chain-of-Thought Distillation. CoRR abs/2403.03348 (2024) - [i17]Shan Chen, Jack Gallifant, Marco Guevara, Yanjun Gao, Majid Afshar, Timothy Miller, Dmitriy Dligach, Danielle S. Bitterman:
Improving Clinical NLP Performance through Language Model-Generated Synthetic Clinical Data. CoRR abs/2403.19511 (2024) - [i16]Ruizhe Li, Yanjun Gao:
Anchored Answers: Unravelling Positional Bias in GPT-2's Multiple-Choice Questions. CoRR abs/2405.03205 (2024) - [i15]Yanjun Gao, Skatje Myers, Shan Chen, Dmitriy Dligach, Timothy Miller, Danielle S. Bitterman, Matthew M. Churpek, Majid Afshar:
When Raw Data Prevails: Are Large Language Model Embeddings Effective in Numerical Data Representation for Medical Machine Learning Applications? CoRR abs/2408.11854 (2024) - [i14]Skatje Myers, Timothy A. Miller, Yanjun Gao, Matthew M. Churpek, Anoop M. Mayampurath, Dmitriy Dligach, Majid Afshar:
Lessons Learned on Information Retrieval in Electronic Health Records: A Comparison of Embedding Models and Pooling Strategies. CoRR abs/2409.15163 (2024) - [i13]Emma Croxford, Yanjun Gao, Nicholas Pellegrino, Karen K. Wong, Graham Wills, Elliot First, Frank J. Liao, Cherodeep Goswami, Brian W. Patterson, Majid Afshar:
Evaluation of Large Language Models for Summarization Tasks in the Medical Domain: A Narrative Review. CoRR abs/2409.18170 (2024) - [i12]Bingsheng Yao, Yao Du, Yue Fu, Xuhai Xu, Yanjun Gao, Hong Yu, Dakuo Wang:
Exploring Interdisciplinary Team Collaboration in Clinical NLP Projects Through the Lens of Activity Theory. CoRR abs/2410.00174 (2024) - [i11]Yanjun Gao, Skatje Myers, Shan Chen, Dmitriy Dligach, Timothy A. Miller, Danielle S. Bitterman, Guanhua Chen, Anoop M. Mayampurath, Matthew M. Churpek, Majid Afshar:
Position Paper On Diagnostic Uncertainty Estimation from Large Language Models: Next-Word Probability Is Not Pre-test Probability. CoRR abs/2411.04962 (2024) - 2023
- [j8]Weipeng Zhou, Meliha Yetisgen, Majid Afshar, Yanjun Gao, Guergana Savova, Timothy A. Miller:
Improving model transferability for clinical note section classification models using continued pretraining. J. Am. Medical Informatics Assoc. 31(1): 89-97 (2023) - [j7]Yanjun Gao, Dmitriy Dligach, Timothy A. Miller, John R. Caskey, Brihat Sharma, Matthew M. Churpek, Majid Afshar:
DR.BENCH: Diagnostic Reasoning Benchmark for Clinical Natural Language Processing. J. Biomed. Informatics 138: 104286 (2023) - [j6]Yanjun Gao, Dmitriy Dligach, Timothy Miller, Matthew M. Churpek, Özlem Uzuner, Majid Afshar:
Progress Note Understanding - Assessment and Plan Reasoning: Overview of the 2022 N2C2 Track 3 shared task. J. Biomed. Informatics 142: 104346 (2023) - [c13]Brihat Sharma, Yanjun Gao, Timothy A. Miller, Matthew M. Churpek, Majid Afshar, Dmitriy Dligach:
Multi-Task Training with In-Domain Language Models for Diagnostic Reasoning. ClinicalNLP@ACL 2023: 78-85 - [c12]Weipeng Zhou, Majid Afshar, Dmitriy Dligach, Yanjun Gao, Timothy Miller:
Improving the Transferability of Clinical Note Section Classification Models with BERT and Large Language Model Ensembles. ClinicalNLP@ACL 2023: 125-130 - [c11]Yanjun Gao, Dmitriy Dligach, Timothy Miller, Majid Afshar:
Overview of the Problem List Summarization (ProbSum) 2023 Shared Task on Summarizing Patients' Active Diagnoses and Problems from Electronic Health Record Progress Notes. BioNLP@ACL 2023: 461-467 - [i10]Yanjun Gao, Dmitriy Dligach, Timothy A. Miller, Matthew M. Churpek, Özlem Uzuner, Majid Afshar:
Progress Note Understanding - Assessment and Plan Reasoning: Overview of the 2022 N2C2 Track 3 Shared Task. CoRR abs/2303.08038 (2023) - [i9]Brihat Sharma, Yanjun Gao, Timothy A. Miller, Matthew M. Churpek, Majid Afshar, Dmitriy Dligach:
Multi-Task Training with In-Domain Language Models for Diagnostic Reasoning. CoRR abs/2306.04551 (2023) - [i8]Yanjun Gao, Dmitriy Dligach, Timothy Miller, Matthew M. Churpek, Majid Afshar:
Overview of the Problem List Summarization (ProbSum) 2023 Shared Task on Summarizing Patients' Active Diagnoses and Problems from Electronic Health Record Progress Notes. CoRR abs/2306.05270 (2023) - [i7]Yanjun Gao, Ruizhe Li, John R. Caskey, Dmitriy Dligach, Timothy A. Miller, Matthew M. Churpek, Majid Afshar:
Leveraging A Medical Knowledge Graph into Large Language Models for Diagnosis Prediction. CoRR abs/2308.14321 (2023) - 2022
- [j5]Yanjun Gao, Dmitriy Dligach, Leslie Christensen, Samuel Tesch, Ryan Laffin, Dongfang Xu, Timothy A. Miller, Özlem Uzuner, Matthew M. Churpek, Majid Afshar:
A scoping review of publicly available language tasks in clinical natural language processing. J. Am. Medical Informatics Assoc. 29(10): 1797-1806 (2022) - [j4]Meliha Yetisgen, Özlem Uzuner, Yanjun Gao, Diwakar Mahajan:
Call for papers: Special issue on clinical natural language processing for secondary use applications. J. Biomed. Informatics 133: 104152 (2022) - [j3]Yanjun Gao, Su Luan Wong, Mas Nida Md. Khambari, Nooreen Noordin:
A bibliometric analysis of online faculty professional development in higher education. Res. Pract. Technol. Enhanc. Learn. 17(1): 17 (2022) - [j2]Patricia Marybelle Davies, Rebecca J. Passonneau, Smaranda Muresan, Yanjun Gao:
Analytical Techniques for Developing Argumentative Writing in STEM: A Pilot Study. IEEE Trans. Educ. 65(3): 373-383 (2022) - [c10]Yanjun Gao, Dmitriy Dligach, Timothy Miller, Dongfang Xu, Matthew M. Churpek, Majid Afshar:
Summarizing Patients' Problems from Hospital Progress Notes Using Pre-trained Sequence-to-Sequence Models. COLING 2022: 2979-2991 - [c9]Yanjun Gao, Dmitriy Dligach, Timothy A. Miller, Samuel Tesch, Ryan Laffin, Matthew M. Churpek, Majid Afshar:
Hierarchical Annotation for Building A Suite of Clinical Natural Language Processing Tasks: Progress Note Understanding. LREC 2022: 5484-5493 - [i6]Yanjun Gao, Dmitriy Dligach, Timothy A. Miller, Samuel Tesch, Ryan Laffin, Matthew M. Churpek, Majid Afshar:
Hierarchical Annotation for Building A Suite of Clinical Natural Language Processing Tasks: Progress Note Understanding. CoRR abs/2204.03035 (2022) - [i5]Yanjun Gao, Dmitriy Dligach, Timothy A. Miller, Dongfang Xu, Matthew M. Churpek, Majid Afshar:
Summarizing Patients Problems from Hospital Progress Notes Using Pre-trained Sequence-to-Sequence Models. CoRR abs/2208.08408 (2022) - [i4]Yanjun Gao, Dmitriy Dligach, Timothy A. Miller, John R. Caskey, Brihat Sharma, Matthew M. Churpek, Majid Afshar:
DR.BENCH: Diagnostic Reasoning Benchmark for Clinical Natural Language Processing. CoRR abs/2209.14901 (2022) - 2021
- [c8]Yanjun Gao, Ting-Hao Kenneth Huang, Rebecca J. Passonneau:
ABCD: A Graph Framework to Convert Complex Sentences to a Covering Set of Simple Sentences. ACL/IJCNLP (1) 2021: 3919-3931 - [c7]Yanjun Gao, Rebecca J. Passonneau:
Automated Assessment of Quality and Coverage of Ideas in Students' Source-Based Writing. AIED (2) 2021: 465-470 - [i3]Yanjun Gao, Ting-Hao Kenneth Huang, Rebecca J. Passonneau:
ABCD: A Graph Framework to Convert Complex Sentences to a Covering Set of Simple Sentences. CoRR abs/2106.12027 (2021) - [i2]Yanjun Gao, Lulu Liu, Jason Wang, Xin Chen, Huayan Wang, Rui Zhang:
EVOQUER: Enhancing Temporal Grounding with Video-Pivoted BackQuery Generation. CoRR abs/2109.04600 (2021) - [i1]Yanjun Gao, Dmitriy Dligach, Leslie Christensen, Samuel Tesch, Ryan Laffin, Dongfang Xu, Timothy A. Miller, Özlem Uzuner, Matthew M. Churpek, Majid Afshar:
A Scoping Review of Publicly Available Language Tasks in Clinical Natural Language Processing. CoRR abs/2112.05780 (2021)
2010 – 2019
- 2019
- [j1]Boce Xue, Baohua Chang, Guodong Peng, Yanjun Gao, Zhijie Tian, Dong Du, Guoqing Wang:
A Vision Based Detection Method for Narrow Butt Joints and a Robotic Seam Tracking System. Sensors 19(5): 1144 (2019) - [c6]Yanjun Gao, Alex Driban, Brennan Xavier McManus, Elena Musi, Patricia Marybelle Davies, Smaranda Muresan, Rebecca J. Passonneau:
Rubric Reliability and Annotation of Content and Argument in Source-Based Argument Essays. BEA@ACL 2019: 507-518 - [c5]Yanjun Gao, Chen Sun, Rebecca J. Passonneau:
Automated Pyramid Summarization Evaluation. CoNLL 2019: 404-418 - 2018
- [c4]Yanjun Gao, Patricia Marybelle Davies, Rebecca J. Passonneau:
Automated Content Analysis: A Case Study of Computer Science Student Summaries. BEA@NAACL-HLT 2018: 264-272 - [c3]Yanjun Gao, Andrew Warner, Rebecca J. Passonneau:
PyrEval: An Automated Method for Summary Content Analysis. LREC 2018 - 2017
- [c2]Yanjun Gao, Madhu C. Reddy, Bernard J. Jansen:
ShopWithMe!: Collaborative Information Searching and Shopping for Online Retail. HICSS 2017: 1-10 - 2016
- [c1]Yanjun Gao, Madhu C. Reddy, Bernard J. Jansen:
Shop Together, Search Together: Collaborative E-commerce. CHI Extended Abstracts 2016: 2081-2087
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-02-02 23:23 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint