hiyouga / LlamaFactory Public

Notifications You must be signed in to change notification settings
Fork 8.7k
Star 70.8k

Code
Issues 952
Pull requests 44
Discussions
Actions
Security and quality 4
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Security and quality
Insights

Pull requests: hiyouga/LlamaFactory

Labels 13 Milestones 0

New pull request New

44 Open 1,252 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

fix(data/converter): handle None tool_calls in OpenAI-style messages

#10455 opened May 1, 2026 by Anai-Guo

Loading…

[train] fix loss aggregation bug in SFT and PT training

#10454 opened Apr 30, 2026 by 4teven

Loading…

[xpu] extend runs_on test markers to include xpu

#10445 opened Apr 29, 2026 by singhalshubham03

Loading…

2 tasks

[model] support DeepSeek V4 pending

This problem is yet to be addressed

#10434 opened Apr 25, 2026 by isLinXu Contributor

Loading…

[V1] support reward training stage pending

This problem is yet to be addressed

#10431 opened Apr 25, 2026 by frozenleaves Collaborator

Loading…

[model-test] Add test for Llama-3.1-8B-Instruct model

#10426 opened Apr 23, 2026 by pankd

Loading…

2 tasks done

[chat] add thinking token injection for reasoning models in all engines

#10424 opened Apr 23, 2026 by kally788

Loading…

4 tasks

[chat] fix enable_thinking=None overriding template defaults

#10423 opened Apr 23, 2026 by kally788

Loading…

3 tasks

fix: add ignore_mismatched_sizes option to model loader

#10420 opened Apr 22, 2026 by octo-patch

Loading…

avoid the EOFError issue when run chat with a sample prompt at a jupyter notebook

#10409 opened Apr 19, 2026 by zhangnju

Loading…

Optimize Qwen video token metadata preprocessing pending

This problem is yet to be addressed

#10404 opened Apr 17, 2026 by luca-888

Loading…

fix: materialize FSDP2 model on CPU when CPU offloading is enabled

#10403 opened Apr 17, 2026 by octo-patch

Loading…

fix: handle mm_token_type_ids in collator and packing tests pending

This problem is yet to be addressed

#10397 opened Apr 16, 2026 by markmochi200

Loading…

2 tasks done

fix: bump transformers upper bound to <=5.6.0 for Gemma4 support

#10390 opened Apr 14, 2026 by Anai-Guo

Loading…

fix: add None check for feature_extractor in Gemma4Plugin audio processing

#10388 opened Apr 13, 2026 by Ricardo-M-L

Loading…

2 tasks done

fix: prevent training hang when WebUI client disconnects

#10383 opened Apr 11, 2026 by Ricardo-M-L

Loading…

4 tasks done

fix: Ray placement group over-allocation and NCCL hang on GPU-less head node

#10380 opened Apr 10, 2026 by Ricardo-M-L

Loading…

5 tasks

[feat] support HyperParallel PT training and activation optimization

#10370 opened Apr 9, 2026 by Cui-yshoho Contributor • Draft

fix: sanitize subprocess call in launcher.py

#10369 opened Apr 9, 2026 by orbisai0security

Loading…

3 tasks done

[feat] Add support FA3

#10368 opened Apr 8, 2026 by y2sman

Loading…

2 tasks done

fix: use json.loads with Path.read_text() instead of json.load with Path

#10361 opened Apr 6, 2026 by satishkc7

Loading…

replace chat.py pop(0) with deque.popleft()

#10356 opened Apr 5, 2026 by nameearly

Loading…

1 task

[fix] parser json.load PosixPath bug

#10354 opened Apr 5, 2026 by Belle0918

Loading…

2 tasks done

[perf] Skip unused lm_head projection and hidden state storage in RM trainer

#10353 opened Apr 5, 2026 by tonywang1990

Loading…

4 tasks done

[ray] fix placement group over-allocation and NCCL hang on GPU-less head node

#10349 opened Apr 3, 2026 by ilover311

Loading…

2 tasks done

Previous 1 2 Next

Previous Next

ProTip! Type g p on any issue or pull request to go back to the pull request listing page.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!