frankziako / ask-fsdl

This project forked from the-full-stack/ask-fsdl

Document Q&A over The Full Stack's Corpus

Home Page: https://fsdl.me/join-discord-askfsdl

Shell 5.70% Python 71.41% Makefile 7.25% Jupyter Notebook 15.64%

ask-fsdl's Introduction

🥞🦜 askFSDL 🦜🥞

askFSDL is a demonstration of a retrieval-augmented question-answering application.

You can try it out via the Discord bot frontend in the Full Stack Discord!

We use our educational materials as a corpus: the Full Stack LLM Bootcamp, the Full Stack Deep Learning course, and the Opinionated LLM++ Lit Review.

So the resulting application is great at answering questions like

  • Which is cheaper: running experiments on cheap, slower GPUs or fast, more expensive GPUs?
  • How do I build an ML team?
  • What's a data flywheel?
  • Should I use a dedicated vector store for my embeddings?
  • What is zero-shot chain-of-thought reasoning?

EXPERIMENTAL: run it yourself

This project is under rapid development, so expect sharp edges while setting it up in your environment.

Thanks to community contributions, we can share a best-effort guide to running the application yourself here.

Note that this application uses cloud services. For most of these services, regular usage of the app will fall under the free tier. However, OpenAI API calls can easily become expensive, so make sure to set usage limits to prevent surprise bills.

Stack

We use langchain to organize our LLM invocations and prompt magic.

We stood up a MongoDB instance on Atlas to store our cleaned and organized document corpus. See the Running ETL to Build the Document Corpus notebook for details.

For fast search of relevant documents to insert into our prompt, we use a FAISS index.
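The retrieve-then-insert step described above can be sketched in plain Python. This is an illustrative stand-in, not the project's code: it uses a brute-force cosine-similarity scan where the application uses a FAISS index, and the toy documents, the three-dimensional "embeddings", and the names `DOCS`, `top_k`, and `build_prompt` are all hypothetical.

```python
import math

# Toy corpus: (text, embedding) pairs. The texts and 3-d vectors are
# hypothetical placeholders; real embeddings come from an embedding model.
DOCS = [
    ("Notes on GPU cost trade-offs.", [0.9, 0.1, 0.0]),
    ("Notes on data flywheels.", [0.1, 0.9, 0.1]),
    ("Notes on zero-shot chain-of-thought.", [0.0, 0.2, 0.9]),
]


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def top_k(query_vec, docs, k=2):
    """Return the texts of the k documents most similar to the query.

    Brute-force scan over every document; a vector index such as FAISS
    serves the same purpose at much larger scale.
    """
    ranked = sorted(docs, key=lambda d: cosine(query_vec, d[1]), reverse=True)
    return [text for text, _ in ranked[:k]]


def build_prompt(question, sources):
    """Insert the retrieved sources into a simple question-answering prompt."""
    context = "\n".join(f"- {s}" for s in sources)
    return (
        "Answer using only these sources:\n"
        f"{context}\n\n"
        f"Question: {question}\nAnswer:"
    )


if __name__ == "__main__":
    # Hypothetical query embedding that lies close to the second document.
    hits = top_k([0.05, 0.95, 0.05], DOCS, k=1)
    print(build_prompt("What's a data flywheel?", hits))
```

The prompt string would then be sent to the language model; that call is left out here because it depends on the provider and credentials.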

We host the application backend on Modal, which provides serverless execution and scaling. That's also where we execute batch jobs, like writing to the document store and refreshing the vector index.

For creating a simple user interface in pure Python, we use Gradio. This UI is great for quick tests without deploying a full frontend but with a better developer experience than curl-ing from the command line.

We host the Discord bot on Modal as well, relying on Discord's interactions endpoints to run the bot serverlessly.
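With an interactions endpoint, Discord sends each interaction to the bot as an HTTP POST, so no long-lived gateway connection is needed. A minimal sketch of the request handling follows, assuming a single slash command whose first option carries the question. The `answer_question` helper is a hypothetical placeholder for the backend call, and the sketch omits the Ed25519 signature check that Discord requires on every request (the `X-Signature-Ed25519` and `X-Signature-Timestamp` headers), which a real endpoint must perform first.

```python
import json

# Type codes defined by Discord's interactions API.
PING = 1                          # interaction type: endpoint health check
APPLICATION_COMMAND = 2           # interaction type: slash command invoked
PONG = 1                          # response type: acknowledge a PING
CHANNEL_MESSAGE_WITH_SOURCE = 4   # response type: reply with a message


def answer_question(question):
    """Hypothetical stand-in for the question-answering backend."""
    return f"(answer to: {question})"


def handle_interaction(body):
    """Map a raw interaction request body (JSON text) to a response dict."""
    payload = json.loads(body)
    kind = payload.get("type")

    if kind == PING:
        return {"type": PONG}

    if kind == APPLICATION_COMMAND:
        # Assumes the question is the first option of the slash command.
        options = payload.get("data", {}).get("options", [])
        question = options[0]["value"] if options else ""
        return {
            "type": CHANNEL_MESSAGE_WITH_SOURCE,
            "data": {"content": answer_question(question)},
        }

    raise ValueError(f"unsupported interaction type: {kind!r}")


if __name__ == "__main__":
    print(handle_interaction('{"type": 1}'))
```

Because Discord expects an initial response within a few seconds, a slow model call is usually handled by returning a deferred response (type 5) and sending the answer in a follow-up message.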

We use Gantry to monitor model behavior in production and collect feedback from users.

ask-fsdl's People

Contributors

charlesfrye, ido777, candidosales, sidsharma22
