huggingface / transformers Public


Pull requests: huggingface/transformers

552 Open 18,595 Closed

Pull requests list

[save_pretrained ] Skip collecting duplicated weight

#36409 opened Feb 26, 2025 by wejoncy

5 tasks

set non_blocking=True when move data from cpu to gpu

#36408 opened Feb 26, 2025 by Hukongtao

5 tasks

Add THL-150 model architecture implementation

#36407 opened Feb 25, 2025 by ErebusTN

2 of 3 tasks

Refactor siglip2 fast image processor (labels: Processing, Vision)

#36406 opened Feb 25, 2025 by yonigozlan

[generate] Run custom generation code from the Hub

#36405 opened Feb 25, 2025 by gante • Draft

Fix edge case for continue_final_message

#36404 opened Feb 25, 2025 by Rocketknight1

[Model] Optimize BERT memory usage and improve code readability

#36401 opened Feb 25, 2025 by eleanorTurintech

Add DeepSeek V2 Model into Transformers

#36400 opened Feb 25, 2025 by VladOS95-cyber • Draft

4 of 5 tasks

[preview] configs as dataclasses

#36396 opened Feb 25, 2025 by gante • Draft

fix: prevent model access error during Optuna hyperparameter tuning

#36395 opened Feb 25, 2025 by emapco

2 of 5 tasks

Update "who to tag" / "who can review"

#36394 opened Feb 25, 2025 by gante

Handle DAC conversion when using weight_norm with newer PyTorch versions

#36393 opened Feb 25, 2025 by edwko

Added Cosmos model files

#36389 opened Feb 25, 2025 by Tanuj-rai

1 of 5 tasks

Fix rescale normalize inconsistencies in fast image processors

#36388 opened Feb 25, 2025 by yonigozlan

【WIP】add recommendations for Ascend NPU using flash_attn

#36383 opened Feb 25, 2025 by zheliuyu

1 of 5 tasks

Fix: Use config.use_sliding_window instead of config.sliding_window

#36377 opened Feb 24, 2025 by KarthikaRajagopal44

[generate] torch.distributed-compatible DynamicCache

#36373 opened Feb 24, 2025 by gante

Support loading Quark quantized models in Transformers

#36372 opened Feb 24, 2025 by fxmarty-amd

[WIP]: LLaVa for VLLM's transformers backend

#36367 opened Feb 24, 2025 by zucchini-nlp

Fix typos

#36358 opened Feb 23, 2025 by triplechecker-com

Introduce numpy/numba optimization to Qwen2VLImageProcessor optimization (labels: Processing, Vision)

#36356 opened Feb 23, 2025 by Isotr0py • Draft

5 tasks

Add EfficientLoFTR model (labels: New model, Vision)

#36355 opened Feb 23, 2025 by sbucaille

4 of 5 tasks

fix: use correct type annotation for padding_side

#36349 opened Feb 23, 2025 by winstxnhdw

4 of 5 tasks

Fix test isolation for clear_import_cache utility

#36345 opened Feb 22, 2025 by sambhavnoobcoder

set empty list of suppress tokens to None

#36344 opened Feb 22, 2025 by Lewington-pitsos

1 of 2 tasks
