run-llama / rags Goto Github PK

View Code? Open in Web Editor NEW

5.9K 55.0 593.0 160 KB

Build ChatGPT over your data, all with natural language

License: MIT License

Python 99.85% Makefile 0.15%

agent chatbot chatgpt gpts llamaindex llm openai rag streamlit

rags's Introduction

RAGs

Build.a.RAGs.bot.powered.by.LlamaIndex.-.26.November.2023.mp4

RAGs is a Streamlit app that lets you create a RAG pipeline from a data source using natural language.

You get to do the following:

Describe your task (e.g. "load this web page") and the parameters you want from your RAG systems (e.g. "i want to retrieve X number of docs")
Go into the config view and view/alter generated parameters (top-k, summarization, etc.) as needed.
Query the RAG agent over data with your questions.

This project is inspired by GPTs, launched by OpenAI.

Installation and Setup

Clone this project, go into the rags project folder. We recommend creating a virtual env for dependencies (python3 -m venv .venv).

poetry install --with dev

By default, we use OpenAI for both the builder agent as well as the generated RAG agent. Add .streamlit/secrets.toml in the home folder.

Then put the following:

openai_key = "<openai_key>"

Then run the app from the "home page" file.


streamlit run 1_🏠_Home.py

NOTE: If you've upgraded the version of RAGs, and you're running into issues on launch, you may need to delete the cache folder in your home directory (we may have introduced breaking changes in the stored data structure between versions).

Detailed Overview

The app contains the following sections, corresponding to the steps listed above.

1. 🏠 Home Page

This is the section where you build a RAG pipeline by instructing the "builder agent". Typically to setup a RAG pipeline you need the following components:

Describe the dataset. Currently we support either a single local file or a web page. We're open to suggestions here!
Describe the task. Concretely this description will be used to initialize the "system prompt" of the LLM powering the RAG pipeline.
Define the typical parameters for a RAG setup. See the below section for the list of parameters.

2. ⚙️ RAG Config

This section contains the RAG parameters, generated by the "builder agent" in the previous section. In this section, you have a UI showcasing the generated parameters and have full freedom to manually edit/change them as necessary.

Currently the set of parameters is as follows:

System Prompt
Include Summarization: whether to also add a summarization tool (instead of only doing top-k retrieval.)
Top-K
Chunk Size
Embed Model
LLM

If you manually change parameters, you can press the "Update Agent" button in order to update the agent.

If you don't see the `Update Agent` button, that's because you haven't created the agent yet. Please go to the previous "Home" page and complete the setup process.

We can always add more parameters to make this more "advanced" 🛠️, but thought this would be a good place to start.

3. Generated RAG Agent

Once your RAG agent is created, you have access to this page.

This is a standard chatbot interface where you can query the RAG agent and it will answer questions over your data.

It will be able to pick the right RAG tools (either top-k vector search or optionally summarization) in order to fulfill the query.

Supported LLMs and Embeddings

Builder Agent

By default the builder agent uses OpenAI. This is defined in the core/builder_config.py file.

You can customize this to whatever LLM you want (an example is provided for Anthropic).

Note that GPT-4 variants will give the most reliable results in terms of actually constructing an agent (we couldn't get Claude to work).

Generated RAG Agent

You can set the configuration either through natural language or manually for both the embedding model and LLM.

LLM: We support the following LLMs, but you need to explicitly specify the ID to the builder agent.
- OpenAI: ID is "openai:<model_name>" e.g. "openai:gpt-4-1106-preview"
- Anthropic: ID is "anthropic:<model_name>" e.g. "anthropic:claude-2"
- Replicate: ID is "replicate:<model_name>"
- HuggingFace: ID is "local:<model_name>" e.g. "local:BAAI/bge-small-en"
Embeddings: Supports text-embedding-ada-002 by default, but also supports Hugging Face models. To use a hugging face model simply prepend with local, e.g. local:BAAI/bge-small-en.

Resources

Running into issues? Please file a GitHub issue or join our Discord.

This app was built with LlamaIndex Python.

See our launch blog post here.

rags's People

Contributors

Stargazers

Watchers

Forkers

hblink jeffara johnnewton gisocr snehankekre isayahc danieltea singlethrowdata lguzzon-scratchbook oagostinho zbalsara21 mz0in tangtc1981 ylannb chunhualiu tomchapin ridzuan05 prabindh george988 mthad rkp64 pradeepvaranasi aphexus sven10hove yavin-owens schalise moij sean-in-the-library dineshdyne abduibasit nikofalke anoopshrma benjaminearlevans brunotech bxck75 neeland ssshuishui muharremokutan jeekim gilby56 farhadfa22 chuukwudi seer-bi aurora779 foolafroos eltociear hh36000 abhaysavani22 manijeh-a omarofo alvarfilipe kunalwik bitkeepbit gijigae thomasbradenroche techthiyanes gabrielmendonca1 gabriel-dee chuashaocong moead99 davidlanz kalasforever kevintruong sunholo-data psacher hooliday3 lihqi pp4810319474 bombolino lisabetkush ridha226 yoowat maherboug logan-markewich redmercy-dev tuhinmallick daedraglob01 nemuiyarou appcrafts jonniechirchill redlegenddev pmirla oxcheaphi p1051790616 mani1soni saviorrazu-mudng miraclene-goldma xadsorcept ankurnine6 scorpipeens sbrightdark shampatil99 amir2pl uraucam vcpandya soccertarycubacken sharifmrcreed leevaleeth mbakpur123 adeliavale

rags's Issues

About llama_index version

WHY only support llama-index==0.9.7 ?
I want to use LLM like gemini which can not be found in 0.9.7.
Hope for your reply.

Agent is ready , how to query pdf ?

Hi ,

I've got a chance to set filepath during this conversation at Home page :

But bot at Generated RAG Agent page still unable to answer questions about the PDF :

How to query specific PDF ? Where exactly put the pdf ? Now it's at "rags" sub-folder named "files"

Installation issue

I have tried now several times to install rags, but I always get this error message:

(base) kalle@MacBook-Air rags % poetry install --with dev
Installing dependencies from lock file

No dependencies to install or update

Installing the current project: rags (0.0.5)
The current project could not be installed: No file/folder found for package rags
If you do not want to install the current project use --no-root

Any suggestions?

Metaphor Key - for cloud deploy - no .toml and no .env

I am trying to deploy to Azure App Serivices. I have ARM templates that work fine. The one issue I am having is that i need to set API keys as if they are stored as environment variables.
For OpenAI -
os.environ["OPENAI_API_KEY"] = st.secrets.openai_key
from utils.py makes sense

but am not seeing anything that straightforward for metaphor

Am I missing something?

If the Agent can be persisted

Once I loaded the PDF and was able to ask questions about it I want to save the agent to use it between launches of 1_🏠_Home.py.
Or should 1_🏠_Home.py is intended to run as online service ? Any way to create the Agent programmatically once the service was down or migrated ?

metaphor to requirements.txt

deploying to azure app services, during setup it uses requirements.txt to load up the virtual environment.
believe that metaphor-python needs to be in there.

Or am I missing something ;^)

How to upload files?

I turn to page RAG Config, but it shows "File/URL paths (not editable)".So where can I upload PDFs?

any way to support micorosft azure open ai?and how to config?

how to load local modell???????????

streamlit: command not found

after poetry install when I use streamlit it says:
streamlit: command not found

Deployment?

Is there a way to deploy the created agent only? thanks

requirements.txt specifies unreleased version of llama-index

Following tutorial on the blog post: https://blog.llamaindex.ai/introducing-rags-your-personalized-chatgpt-experience-over-your-data-2b9d140769b1

the command pip install -r requirements.txt runs an error as the .txt file specifies llama-index==0.9.7

The current release is 0.9.24

https://github.com/run-llama/rags/issues/7#issue-2005861281

#7 (comment)

Installation error using poetry

Hello,

I get this error when I run poetry install --with dev:

HTTPSConnectionPool(host='github.com', port=443): Max retries exceeded with url: /openai/CLIP.git/info/refs?service=git-upload-pack (Caused by SSLError(SSLCertVerificationError(1, '[SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: unable to get local issuer certificate (_ssl.c:1129)')))

Can't find any working solution on the internet, do you have any idea what causes this ?

Why RAG?

What is and why one needs a RAG?

Support streaming chat responses

🔍 Description

LlamaIndex chat engines support streaming responses. It would be a small UX improvement if rags could support streaming the engine's responses to the Streamlit frontend such that users don't have to wait until the entire response is generated.

The only issue is that .stream_chat uses async functions. But Streamlit runs in a separate thread that doesn’t have an event loop by default. To make it work, the implementation will need to create an event loop and run the .stream_chat call inside it.

Happy to submit a PR for this!

"Update Agent" creates an error and deletes the Agent

After creating an agent and navigating to "RAG Config" for the purpose of selecting the checkbox labeled "Include Summarization (only works for GPT-4)" an error is produced and the Agent is deleted after clicking "Update Agent".

Error:

4cbf5338609c0124420e189378280428b5ed4cff09d96c93798bfb38d857b5d7

implement using llamacpp as LLM model

i am trying to implement using open source llm model with llamacpp but getting this error

"ValueError: Must pass in vector index for CondensePlusContextChatEngine."
i am new to llamaindex also can anyone help me what exactly i need to configure in order to run the RAGs

[ENH] can we use azure openai endpoints?

Hi.

What would be needed in order to use azure openai endpoints?
I think some changes in utils/_resolve_llm are certainly needed?

Any advice? I'd like to work on this topic, but I was thinking that someone else might have alreadt thought about that and could provide some feedback.

Cheers

Stop response generation in langchain framework

Python code:
qa_chain = RetrievalQA.from_chain_type(llm=turbo_llm,
chain_type="stuff",
retriever=compression_retriever,
return_source_documents=True
)
response = qa_chain("What is Langchain?")

This is the python code I am using to query over a PDF by following RAG approach.
My requirement is, if it takes more than 1 minute to generate the response then it should stop response generation from the backend.
How I can do that? Is there any python code architecture available for this?

test

+++start+++5FNiK1XxojuyyADBZU4cMmjkamutRyUUfpuYWxgmQoxAVGVE9wv6W9QvSLdH8fcy3FB3ivia5JuFb8WKEzQsWouuUkevqbNjBL54YdoVSJ2K3R9NGZVW8sY16jjFQ6vpfPhvyGF1JLYnjboSGDQo1MAkQhzVhrLPAovhNpovL9n1xVshK11fT9Ns8g+++end+++

.

Error during running answer to Home page

Hi,

I am getting this error during submitting an answer to "What RAG bot do you want to build?" question :

I guess it relates to openai credentials . What do you mean : "Please .streamlit/secrets.toml in the home folder."

Is "home" folder is "rags" folder - the roor folder of repo or home folder of host machine ?

BadRequestError: Error code: 400 & General Observations

@jerryjliu was having a great session building out a bot. then things started to get weird. the conversation on Home - setting up the bot - I could not really tell if I was getting RAGs advice and information or general GPT4. That is, after a while it seemed the setup process was being hallucinated. I then went to Generated RAG Agent to test how much of the system prompt conversation was internalized. The results we pretty poor. I copied and pasted the conversation from the generated agent and fed to the Home (need to have names for these different actors, it's confusing) and asked home if they were good responses or not. I says that they were not. we talk about modifications. it does them. the results are no better so we do the same and when I go to test the latest tweak, I go to GenRAG and my first prompt is can you try that last one again.. then poof

BadRequestError: Error code: 400 - {'error': {'message': "Invalid value for 'content': expected a string, got null.", 'type': 'invalid_request_error', 'param': 'messages.[122].content', 'code': None}}

While this is probably some trivial issue, I believe there are issues to address regarding the general behavior of the system and how some aspects present to the user.

To that end, I have attached my project (minus the .toml with my key) I have also copy/pasted the conversations from both Home and GenRAG - they are in the folder _trouble.

Please let me know if there is anyway I can be helpful. I need this to be awesome ;^)
I have also opened a thread on discord with you and @logan-markewich tagged - has some other conceptual questions and ideas. Thanks for all you do.
rags_error_build_nickknyc.zip

Suggested feature: Support two or more data sets

Add support for responding against data in two (or more) sets of documents; one with common file sets and the second having unique documents for each user.

How additional tools can be set on Home page bot ?

Hi ,

During the Agent creating I got the question about additional tools agent should use :

How to answer here if I want to add the Summary or Key tool , for instance

Installation failed with poetry

Hi,
I am running on Windows 11, and I have created a separate venv for this project.

when I run "poetry install --with dev", I got the following five lines of message, error message
Installing dependencies from lock file
No dependencies to install or update
Installing the current project: rags (0.0.2)
The current project could not be installed: No file/folder found for package rags
If you do not want to install the current project use --no-root

To resolve the problem, I have deleted venv and github repo, and then recreated the venv and re-clone the repo, but the problem above persist.

Pls help,
thanks,
Sean

this package needs which Python? 3.9? 3.10? or 3.11? thanks

Something To Consider: RAGs UI

@jerryjliu just want to share this repo is saw - possible UI for RAGs
https://github.com/admineral/Openai-Assistant-API-UI

Cheers

Gradio demo

Hi, awesome work, would be great to support a gradio demo for this as well, check out this guide to get started: https://huggingface.co/docs/hub/spaces-sdks-gradio, cc: @yvrjsharma

INTEGRATE GOOGLE COLAB

Suggestion : it would be great if you integrate the google col lab feature so we can run over it and improve the project

If multiple PDF agents can be defined ?

Hi ,

I succeeded to create some PDF documnet agent . I wonder if it's possible to create agent on several PDFs or create agent per PDF and reference them by name or similar way in "Generated RAG Agent" chat ?

AttributeError

I dont see any streamlit download

can someone please explain this step
By default, we use OpenAI for both the builder agent as well as the generated RAG agent. Please .streamlit/secrets.toml in the home folder.

I dont see any strealit/secrets.toml downloaded although I did run requirements.txt

How to run this project?

After stream run 1_home.py. How I can build a agent? by input what and how I can upload or point a file, have a example? No matter what I input, it always be:
system_prompt=None file_paths=[] docs=[] tools=[] rag_params=RAGParams(include_summarization=False, top_k=2, chunk_size=1024, embed_model='default', llm='gpt-4-1106-preview') agent=None
Thanks a lot!

ModuleNotFoundError

New option during setting up agent - prompts

Hi ,

I see some new option to setup prompts 👍

Where exactly these prompts are used ?

run-llama / rags Goto Github PK

rags's Introduction

RAGs

Installation and Setup

Detailed Overview

1. 🏠 Home Page

2. ⚙️ RAG Config

3. Generated RAG Agent

Supported LLMs and Embeddings

Builder Agent

Generated RAG Agent

Resources

rags's People

Contributors

Stargazers

Watchers

Forkers

rags's Issues

🔍 Description

Recommend Projects

Recommend Topics

Recommend Org