Pull requests: huggingface/transformers
Pull requests list
Docs: clarify testing dependencies installation in AGENTS.md
#42691
opened Dec 8, 2025 by
OliverZhangA
Fix some static cache tensor indexing for ROCm in gemma3
#42688
opened Dec 7, 2025 by
Abdennacer-Badaoui
Draft
Fix: Skip weight initialization for quantized int8 models
#42686
opened Dec 7, 2025 by
AnjaliiD
Add is_flash_attn_4_available() detection function
#42683
opened Dec 7, 2025 by
vasanthrpjan1-boop
3 of 8 tasks
Fix post_process_semantic_segmentation removing valid class in Conditional DETR
#42681
opened Dec 7, 2025 by
vasanthrpjan1-boop
Fixed failing BioGPT batch generation test
#42677
opened Dec 6, 2025 by
Sai-Suraj-27
1 of 5 tasks
Fixed failing Bart-Model Integration Tests
#42676
opened Dec 6, 2025 by
Sai-Suraj-27
1 of 5 tasks
Fix parallelism_config being overwritten in TP-only training
#42671
opened Dec 6, 2025 by
arrdel
Support having multiple sub-processors (of any kind) in the same processor
#42667
opened Dec 5, 2025 by
yonigozlan
Remove Neptune integration references and deprecate NeptuneCallback
#42666
opened Dec 5, 2025 by
qgallouedec
FIX Error when trying to load non-LoRA PEFT
#42663
opened Dec 5, 2025 by
BenjaminBossan
2 of 5 tasks
New Feature: Enabling Speculative Decoding with Batch Size > 1 (If draft and target model share tokenizer)
#42655
opened Dec 5, 2025 by
YanivDorGalron
[Ernie 4.5 Moe] Fix routing, weights, and update expectations
#42653
opened Dec 5, 2025 by
vasqu
Only default rope_parameters to empty dict if there is something to put in it
#42651
opened Dec 5, 2025 by
hmellor