SeRTS: Self-Rewarding Tree Search for Biomedical Retrieval-Augmented Generation

Hu, Minda; Zong, Licheng; Wang, Hongru; Zhou, Jingyan; Li, Jingjing; Gao, Yichen; Wong, Kam-Fai; Li, Yu; King, Irwin

arXiv:2406.11258 (cs)

[Submitted on 17 Jun 2024 (v1), last revised 16 Oct 2024 (this version, v2)]

Abstract: Large Language Models (LLMs) have shown great potential in the biomedical domain with the advancement of retrieval-augmented generation (RAG). However, existing retrieval-augmented approaches face challenges in addressing diverse queries and documents, particularly for medical knowledge queries, resulting in sub-optimal performance. To address these limitations, we propose a novel plug-and-play LLM-based retrieval method called Self-Rewarding Tree Search (SeRTS) based on Monte Carlo Tree Search (MCTS) and a self-rewarding paradigm. By combining the reasoning capabilities of LLMs with the effectiveness of tree search, SeRTS boosts the zero-shot performance of retrieving high-quality and informative results for RAG. We further enhance retrieval performance by fine-tuning LLMs with Proximal Policy Optimization (PPO) objectives using the trajectories collected by SeRTS as feedback. Controlled experiments using the BioASQ-QA dataset with GPT-3.5-Turbo and Llama2-7b demonstrate that our method significantly improves the performance of the BM25 retriever and surpasses the strong baseline of self-reflection in both efficiency and scalability. Moreover, SeRTS generates higher-quality feedback for PPO training than self-reflection. Our proposed method effectively adapts LLMs to document retrieval tasks, enhancing their ability to retrieve highly relevant documents for RAG in the context of medical knowledge queries. This work presents a significant step forward in leveraging LLMs for accurate and comprehensive biomedical question answering.

Comments:	This work has been accepted by EMNLP 2024
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2406.11258 [cs.CL]
	(or arXiv:2406.11258v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2406.11258

Submission history

From: Minda Hu
[v1] Mon, 17 Jun 2024 06:48:31 UTC (545 KB)
[v2] Wed, 16 Oct 2024 06:32:50 UTC (927 KB)
