Name: Michael Klemm
Type: User
Company: AMD
Bio: Principal Member of Technical Staff in the Compilers, Languages, Runtimes & Tools team, Machine Learning & Software Engineering group at AMD.
Location: Aschheim, Germany
Blog: https://www.dontknow.de
Michael Klemm's Projects
AOMP is an open source Clang/LLVM based compiler with added support for the OpenMP® API on Radeon™ GPUs. Use this repository for releases, issues, documentation, packaging, and examples.
Sample codes for the CPU fun blog https://cpufun.substack.com
A MPI distributed stream benchmark, useful to identifying nodes with poor memory performance and characterising memory bandwidth variation over systems.
My test repository to learn D.
C++ standards drafts
This project maintains and develops a Fortran parser called fparser2 written purely in Python which supports Fortran 2003 and some Fortran 2008. A legacy parser fparser1 is also available but is not supported. The parsers were originally part of the f2py project by Pearu Peterson.
GPUFORT: S2S translation tool for CUDA Fortran and Fortran+X in the spirit of hipify
Demonstration of various hardware effects.
Demonstration of various hardware effects on CUDA GPUs.
Multi-backend implementation of SYCL for CPUs and GPUs
improved colored diff
This is a set of simple programs that can be used to explore the features of a parallel platform.
simple terminal UI for git commands
Performance monitoring and benchmarking suite
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.
Little OpenMP Library
a small build system with a focus on speed
NWChem: Open Source High-Performance Computational Chemistry
files to create Docker containers
NWChem TCE CCSD(T) loop-driven kernels for performance optimization experiments
OpenMP GCC support
A delightful community-driven (with 1,200+ contributors) framework for managing your zsh configuration. Includes 200+ optional plugins (rails, git, OSX, hub, capistrano, brew, ant, php, python, etc), over 140 themes to spice up your morning, and an auto-update tool so that makes it easy to keep up with the latest updates from the community.
Exercise and Solution code for OpenCL training (Intro and Advanced)
High Performance Linpack for Next-Generation AMD HPC Accelerators
Bandwidth test for ROCm
OpenMP Offloading Validation & Verification Suite; Official repository. We have migrated from bitbucket!! For documentation, results, publication and presentations, please check out our website ->