wuzhengxuan / vearch

This project forked from vearch/vearch

distributed vector search for AI-native applications

License: Apache License 2.0

vearch's Introduction

Overview

Vearch is a cloud-native distributed vector database for efficient similarity search of embedding vectors in your AI applications.

Key features

  • Hybrid search: Both vector search and scalar filtering.

  • Performance: Fast vector retrieval that searches across millions of objects in milliseconds.

  • Scalability & Reliability: Replication and elastic scaling out.
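
To make the hybrid search feature concrete, here is a minimal, self-contained sketch of the idea: apply a scalar filter first, then rank the surviving documents by vector similarity. It is a brute-force illustration only and does not reflect how Vearch implements search internally. The names `hybrid_search` and `cosine`, the document layout, and the `category` field are all hypothetical.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def hybrid_search(docs, query_vec, scalar_filter, k=3):
    """Keep documents that pass the scalar filter, then return the ids
    of the k documents most similar to query_vec (hypothetical helper)."""
    candidates = [d for d in docs if scalar_filter(d)]
    ranked = sorted(
        candidates,
        key=lambda d: cosine(d["vector"], query_vec),
        reverse=True,
    )
    return [d["id"] for d in ranked[:k]]


# Toy data set: the "category" scalar field is an assumption for illustration.
docs = [
    {"id": "a", "category": "shoes", "vector": [1.0, 0.0]},
    {"id": "b", "category": "shoes", "vector": [0.6, 0.8]},
    {"id": "c", "category": "bags", "vector": [1.0, 0.1]},
    {"id": "d", "category": "shoes", "vector": [0.0, 1.0]},
]

result = hybrid_search(
    docs, [1.0, 0.0], lambda d: d["category"] == "shoes", k=2
)
print(result)
```

A real deployment would replace the linear scan with an approximate nearest neighbor index; the sketch only shows how the scalar filter and the vector ranking combine.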

Document

Quick start

Deploy vearch cluster on k8s

Add charts through the repo

$ helm repo add vearch https://vearch.github.io/vearch-helm
$ helm repo update && helm install my-release vearch/vearch

Add charts from local

$ git clone https://github.com/vearch/vearch-helm.git && cd vearch-helm
$ helm install my-release ./charts -f ./charts/values.yaml

Start by docker-compose

$ cd cloud
$ cp ../config/config.toml .
$ docker-compose up

Deploy by Docker: To get started quickly with the Vearch Docker image, see DeployByDocker.

Compile from source: To build Vearch from source code, see SourceCompileDeployment.

APIs and Use Cases

VisualSearch: Vearch can be leveraged to build a complete visual search system to index billions of images. The image retrieval plugin for object detection and feature extraction is also required. For more information, please refer to Quickstart.md.

PythonSDK: The Vearch Python SDK lets you use Vearch locally. Install it with pip install vearch. For details, see APIPythonSDK.md.

Components

Vearch Architecture

Master: Responsible for schema management, cluster-level metadata, and resource coordination.

Router: Provides the RESTful API (upsert, delete, search, and query) and handles request routing and result merging.

PartitionServer (PS): Hosts document partitions with Raft-based replication. Gamma, the core vector search engine, is built on Faiss. It stores, indexes, and retrieves vectors and scalars.
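
The Router's result-merging role can be pictured with a small sketch: each partition returns its own top hits, and the merged answer is the global top k. This is a conceptual illustration and not Vearch's actual code. The function `merge_top_k`, the `(score, doc_id)` tuple layout, and the assumption that a higher score means a better match are all hypothetical.

```python
import heapq
from itertools import islice


def merge_top_k(partition_results, k):
    """Merge per-partition hit lists into one global top-k list.

    Each input list must already be sorted by descending score and
    contain (score, doc_id) tuples (assumed layout, for illustration).
    """
    # heapq.merge expects ascending order, so negate the score in the key.
    merged = heapq.merge(*partition_results, key=lambda hit: -hit[0])
    return list(islice(merged, k))


# Hypothetical hits returned by three partitions.
partition_1 = [(0.95, "doc-1"), (0.70, "doc-4")]
partition_2 = [(0.90, "doc-7"), (0.85, "doc-2")]
partition_3 = [(0.60, "doc-9")]

top_hits = merge_top_k([partition_1, partition_2, partition_3], k=3)
print(top_hits)
```

Because every partition list is already sorted, the merge is lazy and stops after k hits instead of sorting all results together.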

Reference

Reference to cite when you use Vearch in a research paper:

@misc{li2019design,
      title={The Design and Implementation of a Real Time Visual Search System on JD E-commerce Platform}, 
      author={Jie Li and Haifeng Liu and Chuanghua Gui and Jianyu Chen and Zhenyun Ni and Ning Wang},
      year={2019},
      eprint={1908.07389},
      archivePrefix={arXiv},
      primaryClass={cs.IR}
}

Community

You can report bugs or ask questions in the issues page of the repository.

For public discussion of Vearch or for questions, you can also send email to [email protected].

Our Slack: https://vearchwrokspace.slack.com

Known Users

You are welcome to register your company name in this issue: vearch#230 (listed in order of registration).

License

Licensed under the Apache License, Version 2.0. For details, see LICENSE and NOTICE.
