Giter Site home page Giter Site logo

wangbiao108's Projects

hnswlib icon hnswlib

Header-only C++/python library for fast approximate nearest neighbors

holedconv icon holedconv

Holed Convolution Layer for Semantic Segmentation in MatConvNet

how_to_optimize_in_gpu icon how_to_optimize_in_gpu

This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.

hpcc-platform icon hpcc-platform

HPCC Systems (High Performance Computing Cluster) is an open source, massive parallel-processing computing platform for big data processing and analytics.

hugectr icon hugectr

HugeCTR is a high efficiency GPU framework designed for Click-Through-Rate (CTR) estimating training

keras-yolo3 icon keras-yolo3

A Keras implementation of YOLOv3 (Tensorflow backend)

lightllm icon lightllm

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

llama icon llama

Inference code for LLaMA models

llvm-project icon llvm-project

The LLVM Project is a collection of modular and reusable compiler and toolchain technologies. Note: the repository does not accept github pull requests at this moment. Please submit your patches at http://reviews.llvm.org.

marvin icon marvin

Marvin: A Minimalist GPU-only N-Dimensional ConvNets Framework

mask_rcnn icon mask_rcnn

Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow

maskrcnn-benchmark icon maskrcnn-benchmark

Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch.

maxas icon maxas

Assembler for NVIDIA Maxwell architecture

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.