-
Notifications
You must be signed in to change notification settings - Fork 375
Pull requests: NVIDIA-NeMo/RL
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
feat: Add advanced nsys options to the wrapper
Documentation
Improvements or additions to documentation
#2461
opened May 11, 2026 by
zswerth
Loading…
4 tasks done
Deepseek v4 support - Automodel path
#2460
opened May 11, 2026 by
sharonyu-115
Contributor
•
Draft
4 tasks
ci: add MODEL_FAMILY and TEST_TYPE to test CONFIG blocks
#2459
opened May 11, 2026 by
kajalj22
Contributor
Loading…
1 of 2 tasks
feat: support staleness-window in ReplayBufferNew
#2458
opened May 11, 2026 by
yuki-97
Contributor
Loading…
feat: add online DPO training
community-request
Documentation
Improvements or additions to documentation
#2456
opened May 10, 2026 by
taivu1998
Loading…
feat(grpo): add SAPO actor loss
community-request
Documentation
Improvements or additions to documentation
#2455
opened May 10, 2026 by
taivu1998
Loading…
feat(grpo): support async multiple dataloaders
community-request
Documentation
Improvements or additions to documentation
#2454
opened May 10, 2026 by
taivu1998
Loading…
feat(eval): support NeMo-Gym multi-turn rollouts
community-request
#2453
opened May 10, 2026 by
taivu1998
Loading…
feat(grpo): log per-optimizer step metrics
community-request
#2452
opened May 10, 2026 by
taivu1998
Loading…
refactor: clarify next-token logprob utilities
community-request
#2451
opened May 10, 2026 by
taivu1998
Loading…
fix(infra): dev pod RBAC, macOS install scripts, helm fixes
CI:Lfast
Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2450
opened May 10, 2026 by
terrykong
Collaborator
Loading…
5 tasks
fix(nrl-k8s): rewrite Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
--config in entrypoint to honor CLI RECIPE arg
CI:Lfast
#2449
opened May 9, 2026 by
hemildesai
Contributor
Loading…
2 of 3 tasks
refactor: refactor async utils
CI:Lfast
Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2448
opened May 9, 2026 by
yuki-97
Contributor
Loading…
fix(megatron): delegate packed CP slicing to MCore
#2445
opened May 8, 2026 by
zyzhou5
Loading…
4 tasks
feat(vllm): add delta-compressed collective refit
#2444
opened May 8, 2026 by
HollowMan6
Member
Loading…
4 tasks done
fix: fix skip_reference_policy_logprobs_calculation and skip_prev_logprobs
CI:Lfast
Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2443
opened May 8, 2026 by
jinglinglingling
Loading…
feat: data plane transfer queue integration
CI:L1
Run doctests, unit tests, and functional tests
#2439
opened May 7, 2026 by
ZhiyuLi-Nvidia
Contributor
Loading…
4 tasks done
[WIP] don't review
Documentation
Improvements or additions to documentation
#2420
opened May 6, 2026 by
shuyixiong
Contributor
•
Draft
4 tasks
feat: Auto research skill
community-request
waiting-on-maintainers
Waiting on maintainers to respond
#2419
opened May 6, 2026 by
vinhngx
Contributor
Loading…
fix: handle non-contiguous tensors in IPC weight refit
CI:Lfast
Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
community-request
waiting-on-maintainers
Waiting on maintainers to respond
#2418
opened May 5, 2026 by
jlcanta
Loading…
3 of 4 tasks
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.