THUDM / slime Public

Pull requests: THUDM/slime

114 Open 1,340 Closed

Pull requests list

fix: add eval-before-train to train_async.py (parity with train.py)

#1906 opened May 13, 2026 by Taosheng-ty

4 tasks done

feat: filter logits by loss_mask before log_probs/entropy computation

#1905 opened May 13, 2026 by Taosheng-ty

5 of 6 tasks

fix: preserve fused 3D expert tensors for Qwen3.5 MoE in torch_dist→H…

#1904 opened May 12, 2026 by rouchenzi

fix: restore actor weights after loading OPD teacher checkpoint

#1903 opened May 12, 2026 by canlin03

Neutralize zero-advantage samples to skip wasted forward compute

#1901 opened May 11, 2026 by nanjiangwill Collaborator

fix: align correct-sample rewards with DP-local lengths

#1900 opened May 10, 2026 by miamia0

Add SwanLab tracking support

#1898 opened May 9, 2026 by asckaya

[docker] upgrade to v0.5.11 (label: run-ci-image)

#1892 opened May 6, 2026 by zhuzilin Contributor

fix: add fallback for --save-hf when Megatron-Bridge lacks model support

#1881 opened Apr 30, 2026 by WangHong-yang Contributor

3 tasks done

feat(profile): safer torch.profiler defaults + per-grad-step capture

#1879 opened Apr 29, 2026 by leofan-lab Contributor

Add Megatron-Bridge LoRA support for GRPO actor training

#1865 opened Apr 26, 2026 by taivu1998

Add SAPO policy loss objective

#1864 opened Apr 26, 2026 by taivu1998

fix: guard DP-imbalance empty micro-batches under dynamic batching

#1860 opened Apr 24, 2026 by leofan-lab Contributor

fix: rebind asyncio Semaphore and HTTP client on event-loop change

#1858 opened Apr 24, 2026 by leofan-lab Contributor

fix: make Megatron one-shot train() assumptions idempotent across slime rollouts

#1857 opened Apr 24, 2026 by leofan-lab Contributor

feat(gemma4): add Gemma4 26B-A4B MoE and 31B dense support

#1855 opened Apr 24, 2026 by leofan-lab Contributor

Add GLM5 SFT support

#1844 opened Apr 20, 2026 by samaritan1998

Fix double prepare_grads / loss-scaler-double-update in train_one_step

#1842 opened Apr 17, 2026 by jthomy

fix(sft): enable max-length filtering for messages datasets

#1841 opened Apr 17, 2026 by none0663 Contributor

3 tasks done

Fix missing activation checkpointing (recompute) parameters in bridge mode

#1833 opened Apr 14, 2026 by XJL010622

[build] Add A100 support: patch set, offline-friendly conda build, and examples

#1832 opened Apr 14, 2026 by jason9693

fix(gemma3): use GeGLU activation instead of SwiGLU

#1825 opened Apr 10, 2026 by leofan-lab Contributor

fix address already in use

#1819 opened Apr 8, 2026 by xutianming Contributor

Add Qwen3 VLM CI (label: run-ci-changed)

#1814 opened Apr 7, 2026 by zhuzilin Contributor

fix: auto-fallback to flash_attn for Qwen3.5 on pre-Hopper GPUs (head_dim=256)

#1808 opened Apr 6, 2026 by dadiaomengmeimei
