Giter Site home page Giter Site logo

I'm ZZK

我是一名使命感爆棚的初级工程师,我喜欢:

  • 晚上开会拉通对齐
  • 一有问题就拉双方领导进群
  • 艾特人,cc
  • 写日报,一点屁事能写一页
  • 知道他人休假还硬要找人
  • 不问别人是否方便直接拨会议电话

我不喜欢:

  • 优化性能
  • 写博客
  • 用FLStudio编曲
  • 冲咖啡

图片

ZZK's github stats

Code

ZZK's Projects

cv-cuda icon cv-cuda

CV-CUDA™ is an open-source, graphics processing unit (GPU)-accelerated library for cloud-scale image processing and computer vision.

data icon data

A PyTorch repo for data loading and utilities to be shared by the PyTorch domain libraries.

deeprec icon deeprec

DeepRec is a recommendation engine based on TensorFlow.

deepspeed icon deepspeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

designpattern icon designpattern

C++11全套设计模式-23种指针的用法(a full DesignPattern implement with c++11)

docs icon docs

Documentations for PaddlePaddle

dynolog icon dynolog

Dynolog is a telemetry daemon for performance monitoring and tracing. It exports metrics from different components in the system like the linux kernel, CPU, disks, Intel PT, GPUs etc. Dynolog also integrates with pytorch and can trigger traces for distributed training applications.

edgegpt icon edgegpt

Reverse engineered API of Microsoft's Bing Chat AI

eetq icon eetq

Easy and Efficient Quantization for Transformers

excalidraw icon excalidraw

Virtual whiteboard for sketching hand-drawn like diagrams

fairring icon fairring

Fairring (FAIR + Herring) is a plug-in for PyTorch that provides a process group for distributed training that outperforms NCCL at large scales

fast_io icon fast_io

Significantly faster input/output for C++20

fbgemm icon fbgemm

FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/

fbtt-embedding icon fbtt-embedding

This is a Tensor Train based compression library to compress sparse embedding tables used in large-scale machine learning models such as recommendation and natural language processing. We showed this library can reduce the total model size by up to 100x in Facebook’s open sourced DLRM model while achieving same model quality. Our implementation is faster than the state-of-the-art implementations. Existing the state-of-the-art library also decompresses the whole embedding tables on the fly therefore they do not provide memory reduction during runtime of the training. Our library decompresses only the requested rows therefore can provide 10,000 times memory footprint reduction per embedding table. The library also includes a software cache to store a portion of the entries in the table in decompressed format for faster lookup and process.

flash_attention_inference icon flash_attention_inference

Performance of the C++ interface of flash attention, flash attention v2 and self decoding attention in large language model (LLM) inference scenarios.

flexgen icon flexgen

Running large language models like OPT-175B/GPT-3 on a single GPU. Up to 100x faster than other offloading systems.

fp6_llm icon fp6_llm

An efficient GPU support for LLM inference with 6-bit quantization (FP6).

fuser icon fuser

A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")

gemma_pytorch icon gemma_pytorch

The official PyTorch implementation of Google's Gemma models

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.