Retrieval-augmented generation: a hybrid approach to assessing retrieved documents similarity, LLM confidence, and system stability

Read the full article See related articles

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

The retrieval-augmented generation approach relies on retrieving relevant documents to enhance large language model prompts and improve model outputs. However, existing metrics like cosine similarity, precision@k, and recall@k, to name just a few, fail to account for the confidence and stability of retrieval and generation. We propose a novel approach, retrieval confidence score, and its extension, asymptotic retrieval confidence score, which combines semantic similarity, large language model confidence, and stability across multiple generations. Asymptotic retrieval confidence score potentially provides a robust approach for evaluating retrieval-augmented generation systems, possibly suggesting a better solution for combining results across retrieval, generation, and evaluation stages.

Article activity feed