Giter Site home page Giter Site logo

francis-oss / nebullvm Goto Github PK

View Code? Open in Web Editor NEW

This project forked from nebuly-ai/optimate

0.0 0.0 0.0 1.01 MB

πŸš€ Plug and play modules to boost the performances of your AI systems πŸš€

Home Page: https://www.nebuly.com/

License: Apache License 2.0

Shell 0.62% Python 74.16% CMake 5.50% Jupyter Notebook 19.47% Dockerfile 0.27%

nebullvm's Introduction







Plug and play modules to boost the performances of your AI systems

NebullvmΒ is an ecosystem of plug and play modules to boost the performances of your AI systems. The optimization modules are stack-agnostic and work with any library.

The performances of language, vision and generative models strongly depend on input data/prompting, model architecture and hardware. These are not independent factors, and making optimal choices on all fronts is hard. Our open source modules help you to automatically combine these factors, thus bringing incredibly fast and efficient AI systems to your fingertips.

If you like the idea, give us a star to show your support for the project ⭐

Documentation

Please find here the full documentation on:

  • Installation
  • Getting started (quick view and examples)
  • Notebooks
  • Ecosystem and integrations
  • Product structure

What can this help with?

Our optimization modules are designed to be easily integrated into your system, providing a quick and seamless boost to its performance. Simply plug and play to start realizing the benefits of optimized performance right away:

βœ…Β Speedster: Automatically apply the best set of SOTA optimization techniques to achieve the maximum inference speed-up on your hardware.

βœ…Β GPU Manager: Effortlessly maximize the utilization of GPU resources in a Kubernetes cluster through real-time dynamic partitioning and elastic quotas.

βœ…Β OpenAlphaTensor: Increase the computational performances of an AI model with custom-generated matrix multiplication algorithms.

βœ…Β Forward-Forward: The Forward Forward algorithm is a method for training deep neural networks that replaces the backpropagation forward and backward passes with two forward passes.

Next modules and roadmap

We are actively working on incorporating the following modules, as requested by members of our community, in upcoming releases:

  • Promptify: Effortlessly personalize APIs generative models from OpenAI, Cohere, HF to your specific writing style and context leveraging human feedback.
  • CloudSurfer: Automatically discover the optimal cloud configuration and hardware on AWS, GCP and Azure to run your AI models.
  • OptiMate: Interactive tool guiding savvy users in achieving the best inference performance out of a given model / hardware setup.
  • TrainingSim: Easily simulate the training of large AI models on a distributed infrastructure to predict training behaviours without actual implementation.

Contributing

As an open source project in a rapidly evolving field, we welcome contributions of all kinds, including new features, improved infrastructure, and better documentation. If you're interested in contributing, please see the linked page for more information on how to get involved.


Join the community | Contribute to the library

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.