HuatuoGPT-II, One-stage Training for Medical Adaption of LLMs

Chen, Junying; Wang, Xidong; Ji, Ke; Gao, Anningzhe; Jiang, Feng; Chen, Shunian; Zhang, Hongbo; Song, Dingjie; Xie, Wenya; Kong, Chuyi; Li, Jianquan; Wan, Xiang; Li, Haizhou; Wang, Benyou

Computer Science > Computation and Language

arXiv:2311.09774 (cs)

[Submitted on 16 Nov 2023 (v1), last revised 15 Sep 2024 (this version, v2)]

Title:HuatuoGPT-II, One-stage Training for Medical Adaption of LLMs

Authors:Junying Chen, Xidong Wang, Ke Ji, Anningzhe Gao, Feng Jiang, Shunian Chen, Hongbo Zhang, Dingjie Song, Wenya Xie, Chuyi Kong, Jianquan Li, Xiang Wan, Haizhou Li, Benyou Wang

View PDF HTML (experimental)

Abstract:Adapting a language model into a specific domain, a.k.a `domain adaption', is a common practice when specialized knowledge, e.g. medicine, is not encapsulated in a general language model like Llama2. The challenge lies in the heterogeneity of data across the two training stages, as it varies in languages, genres, or formats. To tackle this and simplify the learning protocol, we propose to transform heterogeneous data, from the both pre-training and supervised stages, into a unified, simple input-output pair format. We validate the new protocol in the domains where proprietary LLMs like ChatGPT perform relatively poorly, such as Traditional Chinese Medicine. The developed model, HuatuoGPT-II, has shown state-of-the-art performance in Chinese medicine domain on a number of benchmarks, e.g. medical licensing exams. It even outperforms proprietary models like ChatGPT and GPT-4 in some aspects, especially in Traditional Chinese Medicine. Expert manual evaluations further validate HuatuoGPT-II's advantages over existing LLMs. Notably, HuatuoGPT-II was benchmarked in a fresh Chinese National Medical Licensing Examination where it achieved the best performance, showcasing not only its effectiveness but also its generalization capabilities.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2311.09774 [cs.CL]
	(or arXiv:2311.09774v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2311.09774

Submission history

From: Junying Chen [view email]
[v1] Thu, 16 Nov 2023 10:56:24 UTC (3,981 KB)
[v2] Sun, 15 Sep 2024 08:41:01 UTC (2,362 KB)

Computer Science > Computation and Language

Title:HuatuoGPT-II, One-stage Training for Medical Adaption of LLMs

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:HuatuoGPT-II, One-stage Training for Medical Adaption of LLMs

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators