lorenzofamiglini / instructgoose Goto Github PK
View Code? Open in Web Editor NEWThis project forked from xrsrke/instructgoose
Implementation of Reinforcement Learning from Human Feedback (RLHF)
Home Page: https://xrsrke.github.io/instructGOOSE/
License: MIT License