Cannot reproduce the results reported in the paper

Hi, thank you very much for the impressive work and for open-sourcing the code and checkpoints!
I am currently trying to reproduce the results reported in the paper, but I am encountering some discrepancies and would appreciate your guidance.

I download the official checkpoint you provided from Huggingface: reasonrag/Qwen2.5-7B-Instruct-ReasonRAG


I ran the code ` inference.py ` using this checkpoint. However, the index building instructions do not seem to provide the processed indexes/wiki18.jsonl file.
```
# Download wikidump
wget https://archive.org/download/enwiki-20181220/enwiki-20181220-pages-articles.xml.bz2

# Build index
python -m flashrag.retriever.index_builder \
  --retrieval_method bge \
  --model_path /BAAI/bge-base-en-v1.5 \
  --corpus_path indexes/wiki18.jsonl \
  --save_dir indexes/ \
  --use_fp16 \
  --max_length 512 \
  --batch_size 256 \
  --pooling_method mean \
  --faiss_type Flat 
```
 I then used the setting from **SearchR1** with **e5** as the retriever and return topk=3 results for each query (The wiki corpus and index are set as SearchR1 version). The other parameters remain as default (e.g., max_iter=8). Below are my reproduced results:
<img width="1377" height="149" alt="Image" src="https://github.com/user-attachments/assets/68252184-9e5e-4955-9535-c1058aba1a71" />

<img width="684" height="439" alt="Image" src="https://github.com/user-attachments/assets/12973b93-5ce4-458c-aec0-e37d3d39288c" />

With this setup, my reproduced results show a non-negligible gap compared to the numbers reported in the paper. 

May I kindly ask for the detailed inference parameter settings for reproduce the results in the paper? Is it possible to also provide the code or already processed `wiki18.jsonl` for bge retriever?

Thanks!


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Cannot reproduce the results reported in the paper #2

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Cannot reproduce the results reported in the paper #2

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions