This is the official repository for our paper "Stop-RAG: Value-Based Retrieval Control for Iterative RAG".
We recommend using uv with Python 3.11 for dependency management. To install the necessary dependencies, run:
```
pip install uv
uv sync --no-dev
```

To run Stop-RAG training and evaluate the results, follow the steps below. All experiments were run on 4 H100 GPUs.
There are three types of variables to set:
- `DATASET`: the name of the dataset to use (`musique`, `hotpotqa`, or `2wikimultihopqa`)
- `RETRIEVER`: the type of retriever to use (`contriever` or `bm25`)
- `METHOD`: the base pipeline to use (`ours` or `corag`)
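For example, the three variables can be exported once in the shell and reused in every command below. The `musique`/`contriever`/`ours` combination here is just one illustrative choice, and `PROCESSED_DIR` is a hypothetical convenience variable for this sketch, not something the scripts require:

```shell
# One illustrative combination of the three settings.
DATASET=musique
RETRIEVER=contriever
METHOD=ours

# The processed-data directory produced by the dataset step follows this
# layout (PROCESSED_DIR is only a convenience variable for this example).
PROCESSED_DIR="data/processed/${DATASET}/${METHOD}/${RETRIEVER}"
echo "$PROCESSED_DIR"
# → data/processed/musique/ours/contriever
```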
First, download all datasets and build retrieval embeddings from the corpora.
```
./scripts/download.sh {RETRIEVER}
```

Datasets will be stored in `data/raw/{DATASET}`, and the retrieval corpora, embeddings, and indexes will be saved in `data/corpus/{DATASET}`.
Run the chosen pipeline to prepare the training and evaluation data.
```
./scripts/dataset.sh {DATASET} {RETRIEVER} {METHOD}
```

The processed data will be saved in `data/processed/{DATASET}/{METHOD}/{RETRIEVER}`.
Now run the training script. WandB logging is enabled by default, so the environment variables `WANDB_API_KEY` and `WANDB_ENTITY` must be set. To disable WandB logging, set `WANDB_DISABLED=true`.
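For example, the logging environment might be configured like this before launching training; the key and entity values below are placeholders, not real credentials:

```shell
# Placeholder WandB credentials; replace with your own account's values.
export WANDB_API_KEY="your-api-key"
export WANDB_ENTITY="your-entity"

# Or, to run without WandB logging entirely:
export WANDB_DISABLED=true
```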
```
./scripts/train.sh {DATASET} {RETRIEVER} {METHOD}
```

To run evaluation, first compute the scores for all trained checkpoints and find the best checkpoint and threshold.
```
./scripts/stop_rag_find_best.sh {DATASET} {RETRIEVER} {METHOD}
```

This script prints the best checkpoint and threshold. Substitute these values into the following command to run the final evaluation.
```
./scripts/stop_rag_test.sh {DATASET} {RETRIEVER} {METHOD} {CKPT} {THRESHOLD}
```

You can also evaluate the LLM-Stop baseline for comparison.
```
./scripts/llm_stop_test.sh {DATASET} {RETRIEVER} {METHOD}
```

The code for Contriever is adapted from EfficientRAG, and the dataset download scripts are adapted from IRCoT.