Giter Site home page Giter Site logo

a1's Introduction

KDB.AI Logo

The example KDB.AI samples provided aim to demonstrate examples of the use of the KDB.AI vector database in a number of scenarios ranging from getting started guides to industry specific use-cases.

In order to execute these notebooks, you will need to sign up for access to the KDB.AI Cloud Portal. Here, you will be given a Entrypoint URL and an API Key which can be used to connect to a KDB.AI Cloud session and execute the code against.

What is KDB.AI?

KDB.AI is a time-based vector database that allows developers to build scalable, reliable, and real-time applications by providing advanced search, recommendation, and personalization for Generative AI applications. KDB.AI is a key component of full-stack Generative AI applications that use Retrieval Augmented Generation (RAG).

Built by KX, the creators of kdb+, KDB.AI provides users with the ability to combine unstructured vector embedding data with structured time-series datasets to allow for hybrid use-cases which benefit from the rigor of conventional time-series data analytics and the usage patterns provided by vector databases within the Generative AI space.

What does KDB.AI support?

KDB.AI supports the following feature set:

  • Multiple index types: Flat, IVF, IVFPQ and HNSW.
  • Multiple distance metrics: Euclidean, Inner-Product, Cosine.
  • Top-N and metadata filtered retrieval
  • Python and REST Interfaces

Sample Breakdown

At this time, the repository contains the following samples:

Getting Started

Use-Cases

What Platforms Does KDB.AI Integrate With?

  • ChatGPT Retrieval Plugin: Example showing a question and answer session using a ChatGPT retrieval plugin using KDB.AI Vector Database.
  • Langchain: Example showing a question and answer session using a Langchain integration with the KDB.AI Vector Database.

Setup

This section details the setup steps required to run these samples locally on your machine.

Prerequisites

This setup guide assumes the following:

  1. You are using a Unix terminal or similar
  2. You have python >= 3.8 installed
  3. You have pip installed

Install Python Packages

  1. Use the requirements.txt file in the repository to install the relevent Python packages needed to run all of the samples:

    pip install -r requirements.txt

Install English Sentence Tokeniser

  1. Open a Python interpreter:

    python3
  2. Install the punkt data resouce for the nltk Python package:

    import nltk
    nltk.download("punkt")
  3. Exit the Python interpreter:

    exit()

View & Execute The Samples

  1. Run a jupter notebook session:

    jupyter notebook --no-browser

    This will load up the jupyter session in the background and display a URL on screen for you.

  2. Paste this URL into your browser

    This will bring up the samples for you to interact with.

Dataset Disclaimer

In this repository, we may make available to you certain datasets for use with the Software. You are not obliged to use such datasets (with the Software or otherwise), but any such use is at your own risk. Any datasets that we may make available to you are provided β€œas is” and without any warranty, including as to their accuracy or completeness. We accept no liability for any use you may make of such datasets.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.