Comments (3)
To determine whether a user's question can be answered by a single RAG query or requires an agent to break it down and perform multiple retrievals, you can follow these guidelines:
- Single-document query: If the question pertains to a specific aspect or a summary of a single document, a single RAG query is sufficient. For example:
  - "Does dispatcher X support IPv6?"
  - "What is statistic counter Y?"
- Multi-document query: If the question involves comparing or summarizing information across multiple documents, it requires an agent to break down the query and perform multiple retrievals. For example:
  - "Find all dispatcher types that support IPv6."
  - "Find all statistic counters related to the billing function."
For implementation, you can consider the following approaches:
Approach 1: Initial LLM Call to Classify Query Complexity
- Initial LLM Call: Use an LLM to classify the query as either simple or complex.
- Routing based on classification:
  - If classified as simple, send the query to the RAG pipeline directly.
  - If classified as complex, use an agent to break down the query and perform multiple retrievals.
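Approach 1 can be sketched in plain Python. Note that `classify_complexity`, `route_query`, and the `rag_engine`/`agent` interfaces below are illustrative names, not LlamaIndex APIs, and the keyword heuristic merely stands in for a real LLM classification call:

```python
def classify_complexity(query: str) -> str:
    """Stand-in for an LLM classification call: in practice you would
    prompt the model to label the query 'simple' or 'complex'. A crude
    keyword heuristic keeps this sketch self-contained."""
    multi_doc_markers = ("all", "compare", "across", "every")
    words = query.lower().split()
    return "complex" if any(m in words for m in multi_doc_markers) else "simple"

def route_query(query: str, rag_engine, agent):
    """Send simple queries straight to RAG; hand complex ones to the agent.
    `rag_engine.query` and `agent.run` are assumed interfaces."""
    if classify_complexity(query) == "complex":
        return agent.run(query)
    return rag_engine.query(query)
```

In a real deployment, the classifier prompt would include the two example categories above as few-shot guidance.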
Approach 2: Initial RAG Query with Post-Processing
- Initial RAG Query: Always send the query to the RAG first.
- Post-processing:
  - Let the LLM judge whether the answer is comprehensive enough.
  - If it is not, switch to an agent that performs multiple retrievals and produces a more detailed response.
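The fallback flow of Approach 2 can be sketched as follows. Here `is_comprehensive` is a made-up name standing in for an LLM judge call, and the word-count check is only a placeholder so the sketch runs:

```python
def is_comprehensive(query: str, answer: str) -> bool:
    """Stand-in for an LLM judge: in practice, ask the model whether
    `answer` fully addresses `query`. A length check keeps this runnable."""
    return len(answer.split()) >= 20

def answer_with_fallback(query: str, rag_engine, agent):
    """Try a single RAG pass first; escalate to the agent only if the
    judged answer is too thin. Interfaces are assumed, not LlamaIndex APIs."""
    first_pass = rag_engine.query(query)
    if is_comprehensive(query, first_pass):
        return first_pass
    return agent.run(query)
```

The trade-off versus Approach 1: every query pays the cost of the first RAG pass, but simple queries avoid an extra classification call.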
Example Implementation
For simple queries, a single vector-store search is enough (this snippet appears to assume a Qdrant-backed store):

# One retrieval pass over the vector store
response = self._client.search(
    collection_name=self.collection_name,
    query_vector=query_embedding,
    limit=query.similarity_top_k,
    query_filter=query_filter,
)
For complex queries:

# Decompose the question, retrieve per sub-question, then synthesize
sub_questions = sub_question_query_engine.generate_sub_questions(main_query)
sub_questions = prune_sub_questions(sub_questions)
answers = []
for sub_question in sub_questions:
    answer = sub_question_query_engine.retrieve_answer(sub_question)
    answers.append(answer)
final_answer = combine_answers(answers)
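The helpers `prune_sub_questions` and `combine_answers` referenced above are not defined in the snippet; minimal placeholder versions might look like this (in practice, `combine_answers` would be another LLM synthesis call rather than a concatenation):

```python
def prune_sub_questions(sub_questions: list[str], max_questions: int = 5) -> list[str]:
    """Drop duplicate sub-questions and cap the fan-out to bound retrieval cost."""
    seen, pruned = set(), []
    for q in sub_questions:
        if q not in seen:
            seen.add(q)
            pruned.append(q)
    return pruned[:max_questions]

def combine_answers(answers: list[str]) -> str:
    """Naive synthesis: join the sub-answers. A real implementation would
    pass them to an LLM with a summarization prompt."""
    return "\n\n".join(answers)
```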
Additionally, consider using the Router Query Engine, which may be useful for routing queries to the appropriate engine based on their complexity [1].
By following these guidelines and approaches, you can effectively determine whether a user's question should be answered by a single RAG query or if it requires an agent to perform multiple retrievals.
To continue talking to Dosu, mention @dosu.
@jerryjliu, could you please take a look at my question, since you are the author of https://learn.deeplearning.ai/courses/building-agentic-rag-with-llamaindex/lesson/1/introduction? :-)
Could you please provide some suggestions? Thanks!