Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

Pull requests: pytorch/rl

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[Tests] Fix vmas seeding test CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Environments Adds or modifies an environment wrapper
#3210 opened Oct 16, 2025 by matteobettini Loading...
[Feature] Aggregation strategies CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#3209 opened Oct 16, 2025 by vmoens Loading...
[Feature] kl_mask_threshold CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#3208 opened Oct 16, 2025 by vmoens Loading...
[Feature] CISPO CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#3207 opened Oct 16, 2025 by vmoens Loading...
[Feature] DAPO CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#3206 opened Oct 16, 2025 by vmoens Loading...
[Refactor] Refactor GRPO as a separate class CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#3205 opened Oct 16, 2025 by vmoens Loading...
[Test] Fix flaky parallel test CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#3204 opened Oct 16, 2025 by vmoens Loading...
[Feature] Add support for trackio CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#3196 opened Oct 14, 2025 by Xmaster6y Loading...
4 of 6 tasks
[CI] Use uv instead of conda CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Data Data-related PR, will launch data-related jobs Environments/Isaac Isaac-lab CI Environments Adds or modifies an environment wrapper
#3193 opened Oct 14, 2025 by vmoens Loading...
[Feature] Documentation CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#3192 opened Oct 14, 2025 by vmoens Loading...
[Feature] SAC Trainer CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#3191 opened Oct 14, 2025 by vmoens Loading...
[Feature] PPO Trainer Updates CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#3190 opened Oct 14, 2025 by vmoens Loading...
[Feature] Trainer Algorithms - Configuration System CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#3189 opened Oct 14, 2025 by vmoens Loading...
[Feature] Trainer Infrastructure - Timing and Utilities CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#3188 opened Oct 14, 2025 by vmoens Loading...
[Feature] Collectors - Weight Sync Scheme Integration CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#3187 opened Oct 14, 2025 by vmoens Loading...
[Feature] vLLM Weight Synchronization Schemes CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#3186 opened Oct 14, 2025 by vmoens Loading...
[Feature] Weight Synchronization Schemes - Core Infrastructure CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#3185 opened Oct 14, 2025 by vmoens Loading...
[Feature] Transform Module - ModuleTransform and Ray Service Refactor CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#3184 opened Oct 14, 2025 by vmoens Loading...
[Feature] Storage Shared Initialization for Multiprocessing CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#3183 opened Oct 14, 2025 by vmoens Loading...
[Bugfix] Wrong minari download first element bug Something isn't working CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Data Data-related PR, will launch data-related jobs Environments Adds or modifies an environment wrapper
#3106 opened Jul 31, 2025 by marcosgalleterobbva Loading...
3 of 6 tasks
[Refactor] refactor noisy linear CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#3082 opened Jul 17, 2025 by vmoens Loading...
10 tasks
Fix Habitat CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#3065 opened Jul 14, 2025 by vmoens Loading...
[Algorithm] DPO CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. enhancement New feature or request
#3025 opened Jun 23, 2025 by vmoens Loading...
[Feature] Added EXP3 Scoring function in continuation with pr #2358 CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#3013 opened Jun 18, 2025 by ParamThakkar123 Loading...
3 of 10 tasks
[Feature] Neptune logger CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#3008 opened Jun 17, 2025 by vmoens Loading...
10 tasks
Previous 1 3 4 5
Previous
ProTip! Type g i on any issue or pull request to go back to the issue listing page.

AltStyle によって変換されたページ (->オリジナル) /