Skip to content

Pull requests: huggingface/transformers

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

[save_pretrained ] Skip collecting duplicated weight
#36409 opened Feb 26, 2025 by wejoncy Loading…
5 tasks
set non_blocking=True when move data from cpu to gpu
#36408 opened Feb 26, 2025 by Hukongtao Loading…
5 tasks
Add THL-150 model architecture implementation
#36407 opened Feb 25, 2025 by ErebusTN Loading…
2 of 3 tasks
Fix edge case for continue_final_message
#36404 opened Feb 25, 2025 by Rocketknight1 Loading…
Add DeepSeek V2 Model into Transformers
#36400 opened Feb 25, 2025 by VladOS95-cyber Draft
4 of 5 tasks
[preview] configs as dataclasses
#36396 opened Feb 25, 2025 by gante Draft
fix: prevent model access error during Optuna hyperparameter tuning
#36395 opened Feb 25, 2025 by emapco Loading…
2 of 5 tasks
Update "who to tag" / "who can review"
#36394 opened Feb 25, 2025 by gante Loading…
Added Cosmos model files
#36389 opened Feb 25, 2025 by Tanuj-rai Loading…
1 of 5 tasks
【WIP】add recommendations for Ascend NPU using flash_attn
#36383 opened Feb 25, 2025 by zheliuyu Loading…
1 of 5 tasks
[generate] torch.distributed-compatible DynamicCache
#36373 opened Feb 24, 2025 by gante Loading…
[WIP]: LLaVa for VLLM's transformers backend
#36367 opened Feb 24, 2025 by zucchini-nlp Loading…
Fix typos
#36358 opened Feb 23, 2025 by triplechecker-com Loading…
Add EfficientLoFTR model New model Vision
#36355 opened Feb 23, 2025 by sbucaille Loading…
4 of 5 tasks
fix: use correct type annotation for padding_side
#36349 opened Feb 23, 2025 by winstxnhdw Loading…
4 of 5 tasks
set empty list of suppress tokens to None
#36344 opened Feb 22, 2025 by Lewington-pitsos Loading…
1 of 2 tasks
ProTip! Follow long discussions with comments:>50.