πββοΈ I previously worked at JD Explore Academy and Tencent AI Lab, and previously Ph.D. at Univ. of Sydney.
π Working on the whole pipeline of LLM R&D, including efficient and sufficient training, alignment, evaluations, compression, multilingualism, multimodality and much more.
πͺ I'm keen on bodybuilding (5 years+), marathon (completed first half marathon (126min) in Beijing-2016 and most recent half marathon (86min) in Sydney-2019π . will resume training in 2024πͺπ»).
π₯ I (onceπ ) enjoy cooking.
π I like to spend Sundays with my cats (two from 2020-2023, one from 2023).
alphadl / attention-is-all-you-need-pytorch Goto Github PK
View Code? Open in Web Editor NEWThis project forked from jadore801120/attention-is-all-you-need-pytorch
A PyTorch implementation of the Transformer model in "Attention is All You Need".
License: MIT License