BiCA: Effective Biomedical Dense Retrieval with Citation-Aware Hard Negatives

arXiv — cs.CLWednesday, November 12, 2025 at 5:00:00 AM
BiCA represents a significant advancement in biomedical dense retrieval by addressing the challenges of hard-negative mining in this domain. By utilizing citation links from a dataset of 20,000 PubMed articles, the approach effectively distinguishes between relevant and non-duplicate documents, enhancing the training of retrieval models. The results indicate consistent improvements in zero-shot retrieval performance, as measured by nDCG@10, and outperforming baseline models on long-tailed topics using Success@5. This research highlights the potential of leveraging document link structures to generate highly informative negatives, paving the way for more effective retrieval systems in biomedical and scientific contexts.
— via World Pulse Now AI Editorial System

Was this article worth reading? Share it