- 
  Notifications
 You must be signed in to change notification settings 
- Fork 102
Pull requests: RLHFlow/RLHF-Reward-Modeling
Pull requests list
 add experiment setup and results for the math prm
 
 
 
 #41
 by hanningzhang
 
 was merged Nov 9, 2024 
 
 
 
 
 
 Loading...
 
 
 
 
 
 Rlhflow math: evaluation code and evaluation description in readme
 
 
 
 #40
 by hanningzhang
 
 was merged Nov 8, 2024 
 
 
 
 
 
 Loading...
 
 
 
 
 
 Pixi package management; notebooks folders; quarto paper setup.
 
 
 
 #39
 by professorwug
 
 was closed Nov 9, 2024 
 
 
 
 
 
 Loading...
 
 
 
 
 
 
 ProTip!
 Type g i on any issue or pull request to go back to the issue listing page.