-
Notifications
You must be signed in to change notification settings - Fork 102
Pull requests: RLHFlow/RLHF-Reward-Modeling
Pull requests list
add experiment setup and results for the math prm
#41
by hanningzhang
was merged Nov 9, 2024
Loading...
Rlhflow math: evaluation code and evaluation description in readme
#40
by hanningzhang
was merged Nov 8, 2024
Loading...
Pixi package management; notebooks folders; quarto paper setup.
#39
by professorwug
was closed Nov 9, 2024
Loading...
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.