-
Notifications
You must be signed in to change notification settings - Fork 436
Pull requests: NVIDIA-NeMo/RL
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
test: re-enable mcore tests
CI:L1
Run doctests, unit tests, and functional tests
#2919
opened Jun 24, 2026 by
shanmugamr1992
Contributor
Loading…
4 tasks
fix: improve error message when NeMo Gym returns no generation data
CI:Lfast
Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2918
opened Jun 24, 2026 by
yfw
Contributor
Loading…
4 tasks
fix(async-grpo): starvation diagnostics, gym logging, and dataloader exhaustion
#2917
opened Jun 24, 2026 by
saumishr
Contributor
Loading…
4 tasks
feat: ThreadSafeTimer, container init timing, and Ray telemetry
#2916
opened Jun 24, 2026 by
saumishr
Contributor
Loading…
4 tasks
feat: R3 gym notq router replay
CI:L1
Run doctests, unit tests, and functional tests
#2915
opened Jun 24, 2026 by
zyzhou5
Contributor
Loading…
ci: Bump Megatron-Bridge to 9f69b72
CI:L1
Run doctests, unit tests, and functional tests
#2911
opened Jun 24, 2026 by
svcnvidia-nemo-ci
Contributor
Loading…
feat: support R3 async RL without TQ
CI:L1
Run doctests, unit tests, and functional tests
#2908
opened Jun 23, 2026 by
zyzhou5
Contributor
Loading…
fix(data): extract env_name from list-form multi-dataset configs
community-request
#2907
opened Jun 23, 2026 by
yuchenwang3
Loading…
fix: fix fp8 hf_config_overrides bug
CI:Lfast
Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2904
opened Jun 23, 2026 by
ashors1
Contributor
Loading…
4 tasks
fix: Reconcile #2315 and #2612
CI:L1
Run doctests, unit tests, and functional tests
#2902
opened Jun 23, 2026 by
tdene
Contributor
Loading…
4 tasks
[draft] docs: add Qwen3.5 model guide and model-family hub
Documentation
Improvements or additions to documentation
#2900
opened Jun 23, 2026 by
sharonyu-115
Contributor
Loading…
4 tasks
feat: Update Megatron Inference API interface
CI:L1
Run doctests, unit tests, and functional tests
#2891
opened Jun 22, 2026 by
tdene
Contributor
Loading…
4 tasks
fix: load policy in compute dtype when optimizer holds fp32 master weights
community-request
#2888
opened Jun 22, 2026 by
Doondi-Ashlesh
Loading…
feat(loss): support TIS lower bound
CI:Lfast
Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
Documentation
Improvements or additions to documentation
#2886
opened Jun 22, 2026 by
macandro96
Contributor
Loading…
4 tasks
feat: add multiple penalties for model behaviour
CI:Lfast
Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2885
opened Jun 22, 2026 by
macandro96
Contributor
Loading…
4 tasks
feat: async colocated GRPO with Megatron inference
CI:L1
Run doctests, unit tests, and functional tests
#2884
opened Jun 22, 2026 by
tdene
Contributor
Loading…
4 tasks
fix: topk fp32 chunk memory
community-request
waiting-on-maintainers
Waiting on maintainers to respond
#2883
opened Jun 22, 2026 by
odedovadia
Contributor
Loading…
2 of 4 tasks
feat: add Mistral Medium 3.5 (128B) text-only DAPO support
CI:L1
Run doctests, unit tests, and functional tests
#2875
opened Jun 19, 2026 by
sharonyu-115
Contributor
Loading…
3 of 4 tasks
DRAT: fix: run Nemotron Nano v2 workplace assistant recipe
#2868
opened Jun 18, 2026 by
snowmanwwg
Contributor
Loading…
4 tasks
DRAFT fix: prefer real NeMo-Gym package in actor
#2867
opened Jun 18, 2026 by
snowmanwwg
Contributor
Loading…
4 tasks
feat(megatron): add large-scale MoE tuning knobs and longer PG timeout
community-request
waiting-on-maintainers
Waiting on maintainers to respond
#2866
opened Jun 18, 2026 by
dafu-wu
Loading…
1 of 4 tasks
fix: tokenize system-led conversations for templates requiring a user turn
community-request
waiting-on-maintainers
Waiting on maintainers to respond
#2864
opened Jun 17, 2026 by
bzantium
Loading…
3 of 4 tasks
feat: add NCCL timeout config, stale ZMQ socket cleanup, OmegaConfig resolvers
community-request
waiting-on-maintainers
Waiting on maintainers to respond
#2862
opened Jun 17, 2026 by
puneeshkhanna
Loading…
1 of 4 tasks
Previous Next
ProTip!
no:milestone will show everything without a milestone.