Compare the base model's answers with a simple RAG version.
The RAG documents come from a dataset (text from web pages and PDFs) built with public data extracted from https://www.thoughtworks.com; see the other notebook for the scraper.
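The RAG flow being compared can be sketched roughly as follows. This is a minimal, hypothetical illustration with keyword-overlap retrieval (the notebook may use a vector store instead); `retrieve`, `build_prompt`, and the sample documents are assumptions, not the notebook's actual code.

```python
def retrieve(question: str, documents: list[str], k: int = 2) -> list[str]:
    """Score each document by word overlap with the question, return top-k."""
    q_words = set(question.lower().split())
    scored = sorted(
        documents,
        key=lambda d: len(q_words & set(d.lower().split())),
        reverse=True,
    )
    return scored[:k]


def build_prompt(question: str, context: list[str]) -> str:
    """Prepend the retrieved context so the model answers from the documents."""
    ctx = "\n".join(f"- {c}" for c in context)
    return f"Answer using only this context:\n{ctx}\n\nQuestion: {question}\nAnswer:"


docs = [
    "Thoughtworks publishes the Technology Radar twice a year.",
    "Retrieval-augmented generation grounds answers in retrieved documents.",
]
question = "What is retrieval-augmented generation?"
prompt = build_prompt(question, retrieve(question, docs))
```

The resulting `prompt` would then be sent to both the base model and the RAG version for comparison.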
Comments from this first approach:
Checking the sample generated at the end of the notebook, you can see that the RAG version reduces some hallucinations compared with the base Mistral model.
It also reduces verbosity in the answers.
The retriever strategy is very basic, which strongly affects the results.
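One reason a basic overlap retriever hurts results is that it weights common words ("the", "is") as heavily as rare, discriminative ones. A sketch of a slightly better variant, weighting query terms by inverse document frequency (hypothetical code, not from the notebook):

```python
import math


def idf_scores(documents: list[str]) -> dict[str, float]:
    """Inverse document frequency: terms in fewer documents get higher weight."""
    n = len(documents)
    df: dict[str, int] = {}
    for doc in documents:
        for word in set(doc.lower().split()):
            df[word] = df.get(word, 0) + 1
    return {w: math.log(n / c) for w, c in df.items()}


def rank(question: str, documents: list[str]) -> list[str]:
    """Rank documents by the summed IDF of query terms they contain."""
    idf = idf_scores(documents)
    q_words = set(question.lower().split())

    def score(doc: str) -> float:
        return sum(idf.get(w, 0.0) for w in q_words & set(doc.lower().split()))

    return sorted(documents, key=score, reverse=True)


corpus = [
    "the radar covers tools",
    "the radar covers rag techniques",
    "the company has many offices",
]
best = rank("what is rag", corpus)[0]
```

Here "the" appears in every document, so its IDF is zero and it no longer dominates the score; only the rare term "rag" moves a document up the ranking.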
The base model seems to be trained on data up to 2022, so it fails on questions involving more recent content, such as RAG topics. This supports the benefits of a RAG strategy, but it should also be compared against a fine-tuning approach.
There is a bug in the model prediction: the batch call does not work because it is not using the extra GPU correctly.
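Until the batch path is fixed, one common workaround is to split the prompt list into explicit chunks and run each chunk as a separate forward pass (optionally pinning chunks to devices in round-robin fashion). The helper below is a hypothetical sketch of that chunking step, not the notebook's code:

```python
def chunk(items: list, size: int) -> list[list]:
    """Split a list into consecutive chunks of at most `size` items,
    e.g. one chunk per forward pass or per GPU."""
    if size < 1:
        raise ValueError("chunk size must be >= 1")
    return [items[i:i + size] for i in range(0, len(items), size)]


prompts = ["q1", "q2", "q3", "q4", "q5"]
devices = ["cuda:0", "cuda:1"]  # assumed two-GPU setup

# Assign chunks to devices round-robin; each (device, batch) pair would
# then be tokenized and sent to that single device for generation.
assignments = [
    (devices[i % len(devices)], batch)
    for i, batch in enumerate(chunk(prompts, 2))
]
```

This sidesteps the broken batch call by making device placement explicit per chunk, at the cost of some manual bookkeeping.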