Skip to content

Pull requests: NVIDIA-NeMo/RL

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

feat: Add advanced nsys options to the wrapper Documentation Improvements or additions to documentation
#2461 opened May 11, 2026 by zswerth Loading…
4 tasks done
Deepseek v4 support - Automodel path
#2460 opened May 11, 2026 by sharonyu-115 Contributor Draft
4 tasks
ci: add MODEL_FAMILY and TEST_TYPE to test CONFIG blocks
#2459 opened May 11, 2026 by kajalj22 Contributor Loading…
1 of 2 tasks
feat: support staleness-window in ReplayBufferNew
#2458 opened May 11, 2026 by yuki-97 Contributor Loading…
feat: add online DPO training community-request Documentation Improvements or additions to documentation
#2456 opened May 10, 2026 by taivu1998 Loading…
feat(grpo): add SAPO actor loss community-request Documentation Improvements or additions to documentation
#2455 opened May 10, 2026 by taivu1998 Loading…
feat(grpo): support async multiple dataloaders community-request Documentation Improvements or additions to documentation
#2454 opened May 10, 2026 by taivu1998 Loading…
fix(infra): dev pod RBAC, macOS install scripts, helm fixes CI:Lfast Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2450 opened May 10, 2026 by terrykong Collaborator Loading…
5 tasks
fix(nrl-k8s): rewrite --config in entrypoint to honor CLI RECIPE arg CI:Lfast Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2449 opened May 9, 2026 by hemildesai Contributor Loading…
2 of 3 tasks
refactor: refactor async utils CI:Lfast Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2448 opened May 9, 2026 by yuki-97 Contributor Loading…
fix(megatron): delegate packed CP slicing to MCore
#2445 opened May 8, 2026 by zyzhou5 Loading…
4 tasks
feat(vllm): add delta-compressed collective refit
#2444 opened May 8, 2026 by HollowMan6 Member Loading…
4 tasks done
fix: fix skip_reference_policy_logprobs_calculation and skip_prev_logprobs CI:Lfast Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2443 opened May 8, 2026 by jinglinglingling Loading…
2
6
Mxin/moe mamba sft Documentation Improvements or additions to documentation
#2442 opened May 8, 2026 by mxinO Contributor Draft
4 tasks
feat: data plane transfer queue integration CI:L1 Run doctests, unit tests, and functional tests
#2439 opened May 7, 2026 by ZhiyuLi-Nvidia Contributor Loading…
4 tasks done
Dynamo Nemo-RL K8s integration
#2429 opened May 6, 2026 by jthomson04 Contributor Draft
4 tasks
[WIP] don't review Documentation Improvements or additions to documentation
#2420 opened May 6, 2026 by shuyixiong Contributor Draft
4 tasks
feat: Auto research skill community-request waiting-on-maintainers Waiting on maintainers to respond
#2419 opened May 6, 2026 by vinhngx Contributor Loading…
fix: handle non-contiguous tensors in IPC weight refit CI:Lfast Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version) community-request waiting-on-maintainers Waiting on maintainers to respond
#2418 opened May 5, 2026 by jlcanta Loading…
3 of 4 tasks
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.