Light

opendilab / awesome-rlhf Goto Github PK

View Code? Open in Web Editor NEW

3.2K 58.0 201.0 550 KB

A curated list of reinforcement learning with human feedback resources (continually updated)

License: Apache License 2.0

deep-learning deep-reinforcement-learning human-feedback reinforcement-learning rlhf large-language-models

awesome-rlhf's People

Contributors

Stargazers

Watchers

Forkers

paparazz1 jovany-wang yuanyuansiyuan kztao tomekkorbak dumpmemory ndtands jaearly jangocheng fl77n ttb-git swang848 kevinking xdong97 circlez1992 wujunde thinkali jerryi00 ai-awe kaifahmad1 oriskunk gitliubo shawnhowell zzmjohn 15738897318 bai123-123 hiha3456 linhuixiao wanghaiijiaocool techthiyanes jeffrey28 sandeshkatakam aucan chenzihao008 yiran-hao lewieyasu renormalizedkat simranhk wangxuebing0906 huangzhizhong0305 mbrukman twilwa goswamig sikkha dbibil 01kevin01 ningvw520 sullamij yinxiaotian0923 xharut2022 hhy5277 super-rain jianantian lizonghan1107 zhangbeibei0902 wanglongzheng0313 sunjian0523 shizhediao joker3456 evdcush tinlytin bertzhaomh 520jefferson ruoyugao jxzhangjhu kerlinn sjm1992st tiandiao123 shiluyou-tal b-kartal yiranvang jinlmsft timothyxxx xianglunkai awesomeagitech publiusau brunokk kp-forks buteomf nitesh4146 catherinezhou noticeable wentaoy-19 qzj-debug tvjoseph chaoqu12 phd-tianlv zhanghanghitomi ambier tasmimul-huda zetangforward zhecanjameswang priyabrata017 f2-song lzhang22 ab1992ao mengkunzhao kejingjing88212 whitefu gavin90s

awesome-rlhf's Issues

What is the main library to scale up RL training for LLMs?

Assuming you have a reward model (say open assistant reward model) and a target model (say LLaMA), and you want to train it at scale on a multinode setup. What is the best code base for this? DeepSeed-chat?

RLHF in MARL

I want to konw if there are any researches focusing on leveraging RLHF in MARL problems.

About Multi-modal RLHF

Hi, I wonder if there is a multi-modal dataset for RLHF training of multi-modal large language models.

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.