rackloom / alpaca_farm
This project is forked from tatsu-lab/alpaca_farm.
A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.
Home Page: https://arxiv.org/abs/2305.14387
License: Apache License 2.0