Tirumal Naidu's Projects
AMBA v.3 APB v.1 Specification Complaint Slave SRAM Core design and testbench. The testbench is developed using System Verilog and UVM and can be used as standalone Verification IP (VIP).
BaseJump STL: A Standard Template Library for SystemVerilog
Implementations of Buffets, which are efficient, composable idioms for implementing Explicit Decoupled Data Orchestration.
An optimized neural network operator library for chips base on Xuantie CPU.
A self-contained online book containing a library of FPGA design modules and related coding/design guides.
Berkeley's Systolic Array Generator
Hardware implementation of a configurable Convolutional Module.
Verilog implementations of 6 different hardware multiplier architectures
RTL, Cmodel, and testbench for NVDLA
MAERI: A DNN accelerator with reconfigurable interconnects to support flexible dataflow (http://synergy.ece.gatech.edu/tools/maeri/)
Open deep learning compiler stack for Kendryte K210 AI accelerator
NVDLA SW
NVIDIA Linux open GPU kernel module source
OpenXuantie - OpenC910 Core
OpenCL HLS based CNN Accelerator on Intel DE10 Nano FPGA.
SpO2 and Heart Beat Measurement of PPG Data using Fast Fourier Transform
System Verilog based Verification of MIPS 5 staged pipelined processor using UVM environment
Rachit Nigam's personal website.
A dedicated graphical processor for ray tracing
Some notes on RISC-V
Tengine is a lite, high performance, modular inference engine for embedded device
Machine learning compiler based on MLIR for Sophgo TPU.
tvm learn