-
Notifications
You must be signed in to change notification settings - Fork 28.1k
Pull requests: huggingface/transformers
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[save_pretrained ] Skip collecting duplicated weight
#36409
opened Feb 26, 2025 by
wejoncy
Loading…
5 tasks
set non_blocking=True when move data from cpu to gpu
#36408
opened Feb 26, 2025 by
Hukongtao
Loading…
5 tasks
Add THL-150 model architecture implementation
#36407
opened Feb 25, 2025 by
ErebusTN
Loading…
2 of 3 tasks
Refactor siglip2 fast image processor
Processing
Vision
#36406
opened Feb 25, 2025 by
yonigozlan
Loading…
[Model] Optimize BERT memory usage and improve code readability
#36401
opened Feb 25, 2025 by
eleanorTurintech
Loading…
Add DeepSeek V2 Model into Transformers
#36400
opened Feb 25, 2025 by
VladOS95-cyber
•
Draft
4 of 5 tasks
fix: prevent model access error during Optuna hyperparameter tuning
#36395
opened Feb 25, 2025 by
emapco
Loading…
2 of 5 tasks
Handle DAC conversion when using weight_norm with newer PyTorch versions
#36393
opened Feb 25, 2025 by
edwko
Loading…
Fix rescale normalize inconsistencies in fast image processors
#36388
opened Feb 25, 2025 by
yonigozlan
Loading…
【WIP】add recommendations for Ascend NPU using flash_attn
#36383
opened Feb 25, 2025 by
zheliuyu
Loading…
1 of 5 tasks
Fix: Use config.use_sliding_window instead of config.sliding_window
#36377
opened Feb 24, 2025 by
KarthikaRajagopal44
Loading…
Support loading Quark quantized models in Transformers
#36372
opened Feb 24, 2025 by
fxmarty-amd
Loading…
fix: use correct type annotation for
padding_side
#36349
opened Feb 23, 2025 by
winstxnhdw
Loading…
4 of 5 tasks
Fix test isolation for clear_import_cache utility
#36345
opened Feb 22, 2025 by
sambhavnoobcoder
Loading…
set empty list of suppress tokens to None
#36344
opened Feb 22, 2025 by
Lewington-pitsos
Loading…
1 of 2 tasks
Previous Next
ProTip!
Follow long discussions with comments:>50.