Interpretable Conversation Routing via the Latent Embeddings Approach

Daniil Maksymenko
Oleksii Turuta

Abstract

Large language models (LLMs) are rapidly being deployed in question-answering and support systems to automate customer experience across all domains, including medical use cases. Models in such environments must handle many kinds of tasks, such as general knowledge questions, queries to external sources, and function calling. Some cases may not require full text generation at all, and others may need different prompts or even different models. A routing step can manage all of this. This paper focuses on interpretable few-shot approaches to conversation routing, such as latent embeddings retrieval. The work presents a benchmark, a thorough analysis, and a set of visualizations of how latent embeddings routing works for long-context conversations in a multilingual, domain-specific environment. The results show that the latent embeddings router achieves performance on par with LLM-based routers while offering additional interpretability and a higher level of control over model decision-making.
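The general idea of routing by embeddings retrieval can be illustrated with a minimal sketch. It is not the authors' implementation: the `embed` function is a toy bag-of-words stand-in for a real embedding model, and the route labels, few-shot examples, `k`, and `threshold` values are hypothetical choices made only for illustration.

```python
import math
from collections import Counter


def embed(text):
    """Toy stand-in for a real embedding model: bag-of-words token counts."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


# Hypothetical few-shot examples: (utterance, route label).
EXAMPLES = [
    ("what are the side effects of ibuprofen", "knowledge"),
    ("what is a normal blood pressure", "knowledge"),
    ("book an appointment with a doctor", "function_call"),
    ("cancel my appointment for tomorrow", "function_call"),
    ("hello there", "smalltalk"),
    ("thank you goodbye", "smalltalk"),
]


def route(query, examples=EXAMPLES, k=3, threshold=0.1):
    """Return (route, neighbours) for a query.

    The route is chosen by a similarity-weighted vote over the k nearest
    examples. The neighbours are returned so the decision can be inspected.
    """
    q = embed(query)
    scored = sorted(
        ((cosine(q, embed(text)), text, label) for text, label in examples),
        reverse=True,
    )
    neighbours = [s for s in scored[:k] if s[0] >= threshold]
    if not neighbours:
        # Nothing is similar enough: hand over to a default route.
        return "fallback", []
    votes = Counter()
    for score, _, label in neighbours:
        votes[label] += score
    return votes.most_common(1)[0][0], neighbours


if __name__ == "__main__":
    chosen, evidence = route("please book an appointment for me")
    print(chosen)
    for score, text, label in evidence:
        print(f"{score:.2f}  {label:14s} {text}")
```

Because the router returns the retrieved examples together with their similarity scores, each decision can be traced back to the specific examples that caused it, and behavior can be adjusted by editing the example set rather than retraining a model.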

Version published to 10.3390/computation12120237
Dec 1, 2024
Version published to 10.20944/preprints202410.2295.v1
Oct 30, 2024

LFOSum: Summarizing Long-form Opinions with Large Language Models

This article has 2 authors:
1. Mir Tafseer Nayeem
2. Davood Rafiei
This article has no evaluations
Latest version Oct 28, 2024

DemoCraft: Using In-Context Learning to Improve Code Generation in Large Language Models

This article has 2 authors:
1. Kapu Nirmal Joshua
2. Mihit Sreejith
This article has no evaluations
Latest version Oct 31, 2024

Dynamic Context-Aware Representation for Semantic Alignment in Large Language Models

This article has 4 authors:
1. Jason Baronova
2. Catherine Stevens
3. Logan Tennant
4. Alfred MacPhee
This article has no evaluations
Latest version Oct 25, 2024