damionFan's Projects
Examples of how to create colorful, annotated equations in Latex using Tikz.
Code and preprocessed dataset for EMNLP 2019 paper titled "Aspect-based Sentiment Classification with Aspect-specific Graph Convolutional Networks"
CGBN: CUDA Accelerated Multiple Precision Arithmetic (Big Num) using Cooperative Groups
In PyTorch Learing Neural Networks Likes CNN(Convolutional Neural Networks for Sentence Classification (Y.Kim, EMNLP 2014) 、LSTM、BiLSTM、DeepCNN 、CLSTM、CNN and LSTM
Fast implementation of BERT inference directly on NVIDIA (CUDA, CUBLAS) and Intel MKL
cublas gemm benchmark (fp32 fp16 int8 fp16(tensor core) int8(tensor core))
CUDA by Example, written by two senior members of the CUDA software platform team, shows programmers how to employ this new technology. The authors introduce each area of CUDA development through working examples.
pdf
Fast CUDA Kernels for ResNet Inference.
Code from the "CUDA Crash Course" YouTube series by CoffeeBeforeArch
CUDA Templates for Linear Algebra Subroutines
AlexNet,GoogleNet,VGG和ResNet网络架构的demo示例
本项目将《动手学深度学习》(Dive into Deep Learning)原书中的MXNet实现改为PyTorch实现。
Download files from Google Drive using Python 2 or Python 3
DRAMsim3: a Cycle-accurate, Thermal-Capable DRAM Simulator
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
Fast-Fourier Transform in 2D. Examination of Cooley-Tukey Algorithm for 2D FFT, image I/O for FFT, and a shared memory implementation of FFT on the GPU.