Topic: deepspeed
Something interesting about deepspeed
deepspeed,Instruction fine-tuning of large models with a single codebase
User: 5663015
deepspeed,Framework, Model & Kernel Optimizations for Distributed Deep Learning - Data Hack Summit
User: abhilash1910
deepspeed,🥈50th place in Bristol-Myers Squibb – Molecular Translation competition🥈
User: affjljoo3581
deepspeed,
User: afogarty85
Home Page: http://seekinginference.com/
deepspeed,Train a Performer Dual Encoder to get Language Agnostic Sentence Embeddings like LABSE
User: andresoble
deepspeed,Train 🤗transformers with DeepSpeed: ZeRO-2, ZeRO-3
User: beomi
Home Page: https://wiki.beomi.net/transformers-deepspeed-new-bert-model.html
deepspeed,DeepSpeed tutorials, annotated examples, and study notes (efficient training of large models)
User: bobo0810
deepspeed,Create an environment within AzureML that supports Deepspeed training, execute some example training processes thereon.
User: cdw
deepspeed,Train LLMs (bloom, llama, baichuan2-7b, chatglm3-6b) with DeepSpeed pipeline mode. Faster than ZeRO/ZeRO++/FSDP.
User: coincheung
deepspeed,[T] ~ Nova Wallet ~ GUI wallet for windows on the bittensor network polkadot you can use this to store your TAO under a polkadot address [T]
User: damomineraleo
Home Page: https://bittensor.com/
deepspeed,llama2 finetuning with deepspeed and lora
Organization: git-cloner
Home Page: https://gitclone.com/aiit/chat/
deepspeed,① A toy large model for recommender systems based on LLaMA2, SASRec, and Meta's generative recommenders. ② Notes and experiments on the official implementation of Meta's generative recommenders.
User: glb400
deepspeed,Simple and efficient RevNet-Library for PyTorch with XLA and DeepSpeed support and parameter offload
Organization: homebrewnlp
Home Page: https://github.com/HomebrewNLP/revlib
deepspeed,GLake: optimizing GPU memory management and IO transmission.
Organization: intelligent-machine-learning
deepspeed,LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Organization: internlm
Home Page: https://lmdeploy.readthedocs.io/en/latest/
deepspeed,A full pipeline to finetune Alpaca LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the Alpaca architecture. Basically ChatGPT but with Alpaca
User: jackaduma
deepspeed,A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the ChatGLM architecture. Basically ChatGPT but with ChatGLM
User: jackaduma
deepspeed,Sample code and guidelines for fine-tuning any open-source GPT model using #deepspeed and #huggingface
User: jistiak
deepspeed,All about large language models
User: l294265421
deepspeed,Just record my journey to advance and democratize artificial intelligence through ZeRO and MSOS DeepSpeed
User: limccn
deepspeed,Samples for fine-tuning HuggingFace models with AzureML
User: linydub
deepspeed,This repository demonstrates LLM execution on CPUs using packages like llamafile, emphasizing low-latency, high-throughput, and cost-effective benefits for inference and serving.
User: mddunlap924
deepspeed,Implementation of autoregressive language model using improved Transformer and DeepSpeed pipeline parallelism.
User: nawnoes
deepspeed,llm-inference is a platform for publishing and managing llm inference, providing a wide range of out-of-the-box features for model deployment, such as UI, RESTful API, auto-scaling, computing resource management, monitoring, and more.
Organization: opencsgs
deepspeed,An Easy-to-use, Scalable and High-performance RLHF Framework (Support 70B+ full tuning & LoRA & Mixtral & KTO)
Organization: openllmai
Home Page: https://huggingface.co/OpenLLMAI
deepspeed,Collaborative Training of Large Language Models in an Efficient Way
Organization: openmoss
Home Page: https://openlmlab-collie.readthedocs.io
deepspeed,Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
Organization: pku-alignment
Home Page: https://pku-beaver.github.io
deepspeed,Training & Implementation of chatbots leveraging GPT-like architecture with the aitextgen package to enable dynamic conversations.
User: pszemraj
deepspeed,The official implementation of paper "Demystifying Instruction Mixing for Fine-tuning Large Language Models"
User: reason-wang
deepspeed,Application of the L2HMC algorithm to simulations in lattice QCD.
User: saforem2
Home Page: https://saforem2.github.io/l2hmc-qcd/
deepspeed,Large Language Models for All, 🦙 Cult and More, Stay in touch !
User: shm007g
Home Page: https://shm007g.github.io/LLaMA-Cult-and-More/
deepspeed,A framework for benchmarking various DNN inference engines.
User: siahuat0727
deepspeed,Code base for the paper "Instruction Tuned Models are Quick Learners".
User: srsawant34
deepspeed,Continue pre-training a large language model with your own tokenizer.
User: taishan1994
deepspeed,quick is a simple trainer built on top of PyTorch & DeepSpeed for making my deep learning model training smoother & faster.
User: thevasudevgupta
deepspeed,Guide: Finetune GPT2-XL (1.5 Billion Parameters) and finetune GPT-NEO (2.7 B) on a single GPU with Huggingface Transformers using DeepSpeed
User: xirider
deepspeed,Shaping Language Models with Cognitive Insights
Organization: xplainmind
deepspeed,Scripts of LLM pre-training and fine-tuning (w/wo LoRA, DeepSpeed)
User: xyjigsaw
deepspeed,Natural Language Processing (NLP) and Large Language Models (LLM) with Fine-Tuning LLM and Trainer with DeepSpeed
User: yanste
Home Page: https://www.kaggle.com/code/yannicksteph/nlp-llm-fine-tuning-trainer-deepspeed
deepspeed,Transformer OCR by Torch Lightning
User: yoosunghyun
deepspeed,An Open-sourced Knowledgeable Large Language Model Framework.
Organization: zjunlp
Home Page: http://knowlm.zjukg.cn/