Giter Site home page Giter Site logo

fvoid / cuda-optimization-guide Goto Github PK

View Code? Open in Web Editor NEW

This project forked from xiaosong9905/cuda-optimization-guide

0.0 0.0 0.0 37.26 MB

Xiao's CUDA Optimization Guide [Active Adding New Contents]

License: GNU General Public License v3.0

cuda-optimization-guide's Introduction

CUDA Optimization Guide

Acknowlegement

This repo was originally part of my HPC Note. With more content related to CUDA added to the original note, I decided to open a seprate repo dedicated to CUDA optimization.

Correction on mistakes is highly welcomed. Please post a issue if you found one.

To open markdown file with better format (e.g. proper image resize, spacing, side bar, etc), typora is recommended (its beta version is free).

Disclaimer

I do not contain the copyright of some image files included in this note. The copyright belongs to the original author.

Any content inside this repo is OPEN FOR EDUCATION PURPOSE but NOT ALLOWED FOR COMMERCIAL USE.

File Structure

# Difference in architecture difference behind CPU and GPU
CPUvsGPU.md

# Memory model of CUDA and memory related optimization techniques (including synchronization)
MemoryModel.md

# Program model of CUDA and program related optimization techniques (including stream)
ProgramModel.md

# Arithmetic related topic. accuracy, speed, etc
Arithmetic.md

# Measure Performence (including nsight, etc)
MeasurePerformence.md

# Other common use optimization techniques that not included as part of programmodel / memory model
CommonOptimizationTechniques.md

# Computation capacity of each generation GPU
ComputationCapacity.md

# Cases that refer to the above optimization techniques and show how those optimization techniques can be applied to real applications.
Cases.md

# Overview of what library NVIDIA provide and functionality of each library
Library.md

Major Refrence

Note: I also refer to other papers / blogs that's not listed below.

  • Courses

    • UIUC ECE 408
    • UIUC ECE 508
    • UC Berkeley CS 267
    • CMU 16.418
  • Book (CUDA)

    • Programing Massively Parallel Processors 3rd edition
    • CUDA C++ Best Practices Guide
    • CUDA C++ Programing Guide
    • Professional CUDA C Progaming
  • Book (Arch)

    • General-Purpose Graphics Processor Architecture
    • Processor Microarchitecture: An Implementation Perspective
  • Papers

    • Algorithm and Data Optimization Techniques for Scaling to Massively Threaded Systems
  • Blogs & code

    • CUTLASS: Fast Linear Algebra in CUDA C++ link

    • Cutlass github

cuda-optimization-guide's People

Contributors

xiaosong9905 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.