Deepware's Projects

how_to_optimize_in_gpu

A series on GPU optimization that explains in detail how to optimize programs on the GPU. It covers several basic kernel optimizations, including elementwise, reduce, sgemv, and sgemm. The performance of these kernels is at or near the theoretical limit.

intel-edge-ai-for-iot-developers-nanodegree-program

Leverage the Intel® Distribution of OpenVINO™ Toolkit to fast-track development of high-performance computer vision and deep learning inference applications, and run pre-trained deep learning models for computer vision on-premises. You will identify key hardware specifications of various hardware types (CPU, VPU, FPGA, and Integrated GPU), and use the Intel® DevCloud for the Edge to test model performance on those hardware types. Finally, you will use software tools to optimize deep learning models to improve the performance of Edge AI systems.

learning-nvdla-notes

NVDLA is an open-source DL/ML accelerator that is well suited to individuals and college students. These are the notes I took while learning and experimenting with it. I hope this page helps you a bit. Contact me: [email protected]

libposit

A library for working with the posit number type.

lina

A high-level performance analysis tool for FPGA-based accelerators

magicube

Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores.

mann-fpga

Energy-Efficient Inference Accelerator for Memory-Augmented Neural Networks on an FPGA (DATE-19)

marlann

Multiply-Accumulate and Rectified-Linear Accelerator for Neural Networks

mgrn

Memory-Gated Recurrent Networks (AAAI 2021)

mlflow

Open source platform for the machine learning lifecycle

mmm_sa_2by2_posit_4_0

This repository contains a matrix-matrix multiply unit implemented as a 2x2 systolic array operating on posit<4,0> numbers

mvu

Neural Network accelerator powered by MVUs and RISC-V.

n2s3_examples

Examples of projects using the n2s3 neuromorphic accelerator

nacu

Repository for "NACU: A Non-Linear Arithmetic Unit for Neural Networks"

ne16

Neural Engine, 16 input channels

nemo

NEural Minimizer for pytOrch

neural-compressor

Intel® Neural Compressor (formerly known as Intel® Low Precision Optimization Tool) aims to provide unified APIs for network compression technologies, such as low precision, sparsity, pruning, and knowledge distillation, across different deep learning frameworks to pursue the best inference performance.

neural-networks-on-silicon

This was originally a collection of papers on neural network accelerators. Now it's more like my selection of research on deep learning and computer architecture.

nitta

NITTA - Tool for Hard Real-Time CGRA Processors

nn-size-reducing-papers

A collection of works on reducing model sizes or on ASIC/FPGA accelerators for machine learning

nocgen

A NoC (Network-on-Chip) generator that produces a Verilog HDL model of a NoC consisting of on-chip routers

nocrouter

RTL Network-on-Chip Router Design in SystemVerilog by Andrea Galimberti, Filippo Testa and Alberto Zeni
