This repository is for the practice of triton.
You can check my answer for the assignment of AI compiler study below link. https://github.com/Philipshrimp/triton_practice/blob/main/python/tutorials/fused_rope.py
Unfortunately, this code isn't implemented completely.
I used Anaconda for my implementation envs.
Python 3.8 CUDA 11.7
pip list argcomplete 3.2.3 Brotli 1.0.9 certifi 2024.2.2 charset-normalizer 2.0.4 contourpy 1.1.1 cycler 0.12.1 exceptiongroup 1.2.0 filelock 3.13.1 flake8-blind-except 0.2.1 flake8-builtins 2.2.0 flake8-class-newline 1.6.0 flake8-comprehensions 3.14.0 flake8-deprecated 2.2.1 flake8-docstrings 1.7.0 flake8-import-order 0.18.2 flake8-quotes 3.4.0 fonttools 4.50.0 idna 3.4 importlib_resources 6.4.0 iniconfig 2.0.0 kiwisolver 1.4.5 matplotlib 3.7.5 mkl-fft 1.3.8 mkl-random 1.2.4 mkl-service 2.4.0 numpy 1.24.3 packaging 24.0 pandas 2.0.3 pillow 10.2.0 pip 23.3.1 pluggy 1.4.0 pydocstyle 6.3.0 pyparsing 3.1.2 PySocks 1.7.1 pytest 8.1.1 pytest-repeat 0.9.3 pytest-rerunfailures 14.0 python-dateutil 2.9.0.post0 pytz 2024.1 requests 2.31.0 setuptools 68.2.2 six 1.16.0 snowballstemmer 2.2.0 tomli 2.0.1 torch 1.13.1 torchaudio 0.13.1 torchvision 0.14.1 triton 2.2.0 typing_extensions 4.9.0 tzdata 2024.1 urllib3 2.1.0 wheel 0.41.2 zipp 3.18.1