Giter Site home page Giter Site logo

Hi there 👋

zhaohb's GitHub Stats

🤔,我使用的编程语言

Top Langs

zhaohongbo's Projects

hip icon hip

HIP: C++ Heterogeneous-Compute Interface for Portability

how_to_optimize_in_gpu icon how_to_optimize_in_gpu

This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.

inferencehelper icon inferencehelper

Helper Class for Deep Learning Inference Frameworks: TensorFlow Lite, TensorRT, OpenCV, ncnn, MNN, SNPE, Arm NN, NNAbla

interview_internal_reference icon interview_internal_reference

2019年最新总结,阿里,腾讯,百度,美团,头条等技术面试题目,以及答案,专家出题人分析汇总。

jetson-inference icon jetson-inference

Hello AI World guide to deploying deep-learning inference networks and deep vision primitives with TensorRT and NVIDIA Jetson.

jittor icon jittor

Jittor is a high-performance deep learning framework based on JIT compiling and meta-operators.

kernl icon kernl

Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.

kuiperinfer icon kuiperinfer

带你从零实现一个高性能的深度学习推理库,支持大模型 llama2 、Unet、Yolov5、Resnet等模型的推理。Implement a high-performance deep learning inference library step by step

llm-action icon llm-action

本项目旨在分享大模型相关技术原理以及实战经验。

llm-course icon llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

micronet icon micronet

micronet, a model compression and deploy lib. compression: 1、quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference)、Low-Bit(≤2b)/Ternary and Binary(TWN/BNN/XNOR-Net); post-training-quantization(PTQ), 8-bit(tensorrt); 2、 pruning: normal、regular and group convolutional channel pruning; 3、 group convolution structure; 4、batch-normalization fuse for quantization. deploy: tensorrt, fp32/fp16/int8(ptq-calibration)、op-adapt(upsample)、dynamic_shape

mimalloc icon mimalloc

mimalloc is a compact general purpose allocator with excellent performance.

min-tfs-client icon min-tfs-client

A lightweight python gRPC client to communicate with TensorFlow Serving

minpy icon minpy

NumPy interface with mixed backend execution

mkl-dnn icon mkl-dnn

Intel(R) Math Kernel Library for Deep Neural Networks (Intel(R) MKL-DNN)

ml-workspace icon ml-workspace

🛠 All-in-one web-based IDE specialized for machine learning and data science.

mmdeploy icon mmdeploy

MMDeploy is an open-source deep learning model deployment toolset

mmdnn icon mmdnn

MMdnn is a set of tools to help users inter-operate among different deep learning frameworks. E.g. model conversion and visualization. Convert models between Caffe, Keras, MXNet, Tensorflow, CNTK, PyTorch Onnx and CoreML.

model_server icon model_server

A scalable inference server for models optimized with OpenVINO™

model_split icon model_split

A simple CNN network is implemented on FPGA, but the resource of FPGA is not fully used, only to achieve the purpose of calculation

models icon models

Model Zoo for Intel® Architecture: contains Intel optimizations for running deep learning workloads on Intel® Xeon® Scalable processors

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.