
kubeagi / arcadia


A diverse, simple, and secure one-stop LLMOps platform

Home Page: http://www.kubeagi.com/

License: Apache License 2.0

Shell 11.25% Dockerfile 0.37% Makefile 0.73% Go 59.72% Smarty 4.61% Mustache 0.45% Python 22.87%
agents golang kubernetes langchain large-language-models llm llmops rag real-time-data retrieval-augmented-generation

arcadia's People

Contributors

0xff-dev, abirdcfly, bjwswang, carrotzpc, dayuy, dependabot[bot], ggservice007, hoega, huangqg, lanture1064, nkwangleigit, wangxinbiao, xxxxibo, y9rabbito, zqq454224016


arcadia's Issues

Use chromaDB to store embedded documents

Original ChromaDB (Py/JS) provides:

  1. source texts > call an embedding model (OpenAI, Hugging Face Hub, text2vec (runs locally on CPU), etc.) > store in the database > return success
  2. query text > compare against the database directly (using the queryText func) > return the stored text with a distance value, where a smaller value means a closer match.
  3. It also claims users can supply their own embeddings by using collection.add([embeddings] = [*vector data*]...) or add their own embedding function

chroma official document site
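
The store-then-query flow described above can be sketched with a toy in-memory collection. This is a minimal illustration using only the Python standard library, not ChromaDB's actual API: `ToyCollection`, `toy_embed`, and `cosine_distance` are hypothetical names, and the bag-of-words "embedding" merely stands in for a real embedding model. It also shows the third point, where callers pass their own embeddings instead of relying on the built-in function.

```python
import math
from collections import Counter


def toy_embed(text):
    """Hypothetical stand-in for an embedding model: bag-of-words counts."""
    return Counter(text.lower().split())


def cosine_distance(a, b):
    """Return 1 - cosine similarity; a smaller value means a closer match."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 1.0
    return 1.0 - dot / (norm_a * norm_b)


class ToyCollection:
    """Toy in-memory collection (an assumption for illustration only)."""

    def __init__(self, embed=toy_embed):
        self._embed = embed
        self._rows = []  # list of (id, text, vector)

    def add(self, ids, documents, embeddings=None):
        # Step 1: embed the source texts unless the caller supplies vectors.
        if embeddings is None:
            embeddings = [self._embed(doc) for doc in documents]
        for doc_id, doc, vec in zip(ids, documents, embeddings):
            self._rows.append((doc_id, doc, vec))
        return True

    def query(self, query_text, n_results=1):
        # Step 2: embed the query and rank stored texts by distance.
        q = self._embed(query_text)
        scored = [(cosine_distance(q, vec), doc_id, doc)
                  for doc_id, doc, vec in self._rows]
        scored.sort(key=lambda row: row[0])
        return scored[:n_results]


col = ToyCollection()
col.add(ids=["a", "b"],
        documents=["kubernetes operator for llm", "cooking pasta at home"])
best = col.query("llm operator", n_results=1)
print(best[0][1], round(best[0][0], 4))
```

The query returns `(distance, id, text)` tuples sorted so the closest stored text comes first, which mirrors the "less means closer" behaviour noted in the list.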

add CLI to interact with arcadia

arctl is designed to be a replacement for graphql-server that can be used to interact with arcadia locally.

  • Datasource management
  • Dataset management
  • Knowledge management
  • Model management
  • LLM management
  • Worker management

start arcadia

  • add a cluster
  • init kubebb core
  • add kubebb repo
  • add kubeagi repo
  • deploy arcadia operator

Feature List

  • LLM management
  • Prompt management
  • Embedding Full Support
    • Zhipu AI Embedding
    • chroma vector store

AGI Dataset

Data is an essential part of working with LLMs, both during model training and during prompt optimization (as prompt context with embedding models), so it is worth providing a better way to manage it.

The overall workflow will be

  1. Upload files to built-in storage such as MinIO (optional if the files come from another object storage)

  2. Create a Dataset which contains

  • data sources: can be local files or object storage
  • embedding parameters: used to handle the uploaded files

  3. Fetch files from built-in/external object storage

  4. Split/load documents into the vector store
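
The fetch and split steps of this workflow can be sketched as follows. This is a rough illustration under stated assumptions, not arcadia's implementation: `DatasetSpec`, `split_text`, and `build_chunks` are hypothetical names, `chunk_size` and `chunk_overlap` are assumed examples of embedding parameters, and the `fetch` callable stands in for downloading a file from built-in or external object storage.

```python
from dataclasses import dataclass


@dataclass
class DatasetSpec:
    """Hypothetical Dataset shape: data sources plus chunking parameters."""
    sources: list            # e.g. local paths or object storage keys
    chunk_size: int = 200    # assumed parameter: characters per chunk
    chunk_overlap: int = 20  # assumed parameter: characters shared by neighbours


def split_text(text, chunk_size, chunk_overlap):
    """Split text into fixed-size character chunks that overlap."""
    if chunk_size <= 0 or not 0 <= chunk_overlap < chunk_size:
        raise ValueError("need chunk_size > 0 and 0 <= chunk_overlap < chunk_size")
    step = chunk_size - chunk_overlap
    pieces = []
    start = 0
    while start < len(text):
        pieces.append(text[start:start + chunk_size])
        if start + chunk_size >= len(text):
            break  # this chunk already reaches the end of the text
        start += step
    return pieces


def build_chunks(spec, fetch):
    """Fetch every source, then split it into records ready for embedding."""
    chunks = []
    for src in spec.sources:
        text = fetch(src)  # stands in for an object storage download
        pieces = split_text(text, spec.chunk_size, spec.chunk_overlap)
        for n, piece in enumerate(pieces):
            chunks.append({"id": f"{src}#{n}", "source": src, "text": piece})
    return chunks


# A dict plays the role of object storage in this sketch.
fake_storage = {"docs/a.txt": "abcdefghij"}
spec = DatasetSpec(sources=["docs/a.txt"], chunk_size=4, chunk_overlap=1)
chunks = build_chunks(spec, fake_storage.__getitem__)
print([c["text"] for c in chunks])
```

Each resulting record keeps its source and a stable id, so the chunks could then be embedded and loaded into whichever vector store the Dataset targets.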
