Kai Zhang's Projects
A curated index to track AI-powered products.
CS 451/651 431/631 Data-Intensive Distributed Computing (Winter 2018) at the University of Waterloo
BiomedGPT: A Unified and Generalist Biomedical Generative Pre-trained Transformer for Vision, Language, and Multimodal Tasks
Tools for curating biomedical training data for large-scale language modeling
Use ChatGPT to summarize arXiv papers. Speeds up the whole research workflow by using ChatGPT for paper summarization, polishing, reviewing, and review responses.
Data compression of English text using the compressed trie data structure.
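500 Questions on Deep Learning: a question-and-answer treatment of commonly used topics in probability, linear algebra, machine learning, deep learning, computer vision, and other popular areas, written to help the author and any readers who need it. The book has 18 chapters and more than 500,000 characters. Given the author's limited expertise, readers are kindly asked to point out any errors. To be continued... For collaboration, contact [email protected]. All rights reserved; infringement will be pursued. Tan 2018.06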
Implementation of [Memory-adaptive Depth-wise Heterogenous Federated Learning]
FedML - The federated and distributed machine learning library enabling machine learning anywhere at any scale. It's backed by FedML, Inc (https://FedML.ai). Supporting large-scale geo-distributed training, cross-device federated learning on smartphones/IoTs, cross-silo federated learning on data silos, and research simulation. Best Paper Award at NeurIPS 2020 Federated Learning workshop. FedML’s core technology is backed by years of cutting-edge research represented in 50+ publications in ML/FL Algorithms, Security/Privacy, Systems, and Applications, as well as 10 years of industrial experience in Distributed Systems, Cloud Computing, and Mobile/IoT Systems.
Official code for the paper "Efficient Federated Learning on Knowledge Graphs via Privacy-preserving Relation Embedding Aggregation"
Solving algorithm problems is all about patterns, and labuladong is all you need! English version supported! Crack LeetCode, not only how, but also why.
Implemented federated learning for binary classification (tabular data) with PyTorch. The data fuzzification technique and local differential privacy mechanism are applied to protect data privacy.
A curated list of adversarial attacks and defenses papers on graph-structured data.
Practical course about Large Language Models.
Repository for 3 papers on Summarization and Entailment for Medical User-Generated Questions.
MIMIC Code Repository: Code shared by the research community for the MIMIC-III database
Example download scripts for the OASIS3 project
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
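The most complete curated corpus of modern Chinese poetry: 2K+ poets, 42K+ poems, 8M+ characters, covering every school from the May Fourth Movement to the present. Continuously expanding...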
Tabular Deep Learning Library for PyTorch
My personal homepage
LLM based agents with proactive interactions, long-term memory, external tool integration, and local deployment capabilities.
A playbook for systematically maximizing the performance of deep learning models.
[EMNLP 2022] A Unified Framework and Analysis for Structured Knowledge Grounding with Text-to-Text Language Models
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch