Pull requests: huggingface/transformers
Pull requests list
Force real tensors and clone state_dict in src/transformers/modeling_utils.py
#38114
opened May 13, 2025 by
MutugiD
Fix incorrect batching audio index calculation for Phi-4-Multimodal
#38103
opened May 13, 2025 by
Isotr0py
5 tasks
disable deepspeed when setting up fake trainer
#38101
opened May 13, 2025 by
winglian
5 tasks
In Llama4 fix wrongly inverted causal attention mask when using SDPA implementation
#38094
opened May 12, 2025 by
sogartar
Omit creation of positional IDs within ESM if applicable
#38089
opened May 12, 2025 by
simonlevine
Draft
Add optional RMSNorm support to BitNet quantization (config + layers)
#38087
opened May 12, 2025 by
Codys12
3 of 5 tasks
Refactor MambaCache to modeling_mamba.py (parity with Zamba)
#38086
opened May 12, 2025 by
manueldeprada
fix multi-image case for llava-onevision
#38084
opened May 12, 2025 by
cyr0930
3 of 5 tasks
Cache System Refactor: Layered Architecture
#38077
opened May 12, 2025 by
manueldeprada
Draft
7 of 25 tasks
Fix temporal padding in Qwen2VLImageProcessor when the number of frames is not divisible by temporal_patch_size
#38076
opened May 12, 2025 by
ritwickchaudhry
Updated the Model docs - for the ALIGN model
#38072
opened May 11, 2025 by
1himan
3 of 5 tasks
Added scores in the streamer classes based on generation flag
#38064
opened May 10, 2025 by
LuisCarlos-104171