Giter Site home page Giter Site logo

Hi there 👋, I'm zyt1024!

About me

  • 🌱 I’m currently learning Mlsys and Cuda

  • ❤️ I love writing C/C++ and Python


Github Stats



Profile views counter




Generated using Github Profilinator

zyt1024's Projects

caffe icon caffe

Caffe: a fast open framework for deep learning.

camp icon camp

飞桨护航计划集训营

chip-knn icon chip-knn

[FPT'20] CHIP-KNN: Configurable and HIgh-Performance K-Nearest Neighbors Accelerator on Cloud FPGAs

deeppoint-v2-fpga icon deeppoint-v2-fpga

The code repository of DGCNN on FPGA: Acceleration of The Point Cloud Classifier Using FPGAs

dgcnn-on-fpga icon dgcnn-on-fpga

PLEASE USE THE NEW REPO https://github.com/salehjg/DeepPoint-V2-FPGA . The deprecated in-order-queue-based repository for "DGCNN on FPGA: Acceleration of The Point CloudClassifier Using FPGAs".

docs icon docs

Documentations for PaddlePaddle

examples icon examples

A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.

fastllm icon fastllm

纯c++的全平台llm加速库,支持python调用,chatglm-6B级模型单卡可达10000+token / s,支持glm, llama, moss基座,手机端流畅运行

gemm_hls icon gemm_hls

Scalable systolic array-based matrix-matrix multiplication implemented in Vivado HLS for Xilinx FPGAs.

hls4ml icon hls4ml

Machine learning on FPGAs using HLS

how_to_optimize_in_gpu icon how_to_optimize_in_gpu

This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.

kuiperinfer icon kuiperinfer

带你从零实现一个高性能的深度学习推理库,Implement a high-performance deep learning inference library step by step

ncnn icon ncnn

ncnn is a high-performance neural network inference framework optimized for the mobile platform

paddle icon paddle

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)

particlenet icon particlenet

Implementation of the jet classification network in ParticleNet: Jet Tagging via Particle Clouds

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.