octoml Goto Github PK
Name: OctoAI
Type: Organization
Bio: Optimizing machine learning using machine learning
Location: Seattle
Blog: octo.ai
Name: OctoAI
Type: Organization
Bio: Optimizing machine learning using machine learning
Location: Seattle
Blog: octo.ai
A repo containing code examples that feature OctoAI's LLM solution
OctoAI LLM RAG samples
Custom dyld version inherited from original Apple dyld implementation
Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.
Fork of MLCommons inference repository to test TVM integration
CK MLOps components
A simple Python harness to run an ONNX model in various concurrency and replication configurations against MLCommon's LoadGen to measure throughput.
A collection of pre-trained, state-of-the-art models in the ONNX format
A collection of OctoAI-based demos for OctoML's Youtube Channel etc.
Cartoonizer demo for OctoAI compute service launch
Examples of how to build Generative AI applications powered by the OctoAI compute service.
OctoAI's OctoShop! Transform photos with the power of words and generative AI!
A collection of test models for the OctoML AI acceleration service
Repository for OctoML-affiliated Helm Charts
A code sample that shows how to use 🦜️🔗langchain, 🦙llama_index and a hosted LLM endpoint to do a standard chat or Q&A about a pdf document
Home for OctoML PyTorch Profiler
ONNX Runtime(ORT) Go Live, is a python package that automates the process of accelerating models with ONNX Runtime(ORT). It contains two parts including model conversion to ONNX with correctness checking and auto performance tuning with ORT. Users can run these two together through a single pipeline or run them independently as needed.
Protobuf definitions for onnx models
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
DataDog integration with the OpenTelemetry crate, copied and adapted from opentelemetry-contrib https://github.com/open-telemetry/opentelemetry-rust/tree/master/opentelemetry-contrib.
Dev repo for power measurement for the MLPerf™ benchmarks
Build TVM docker image for production compilation deployments
A fork of tvm/unity
A fork of tvm/unity
docs for octoml/relax: https://octoml.github.io/relax-site/
A translator from a serialized relay graph layout into a structured graph object in JavaScript/TypeScript
Crate for authenticating Server to Server Apps for Google Cloud Engine.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.