Issues: hiyouga/LLaMA-Factory
Issues list
For 2-machine, 8-GPU training with DeepSpeed, how can the model be sharded into 16 parts? Currently it is only sharded into 8.
bug
Something isn't working
pending
This problem is yet to be addressed
#7066
opened Feb 25, 2025 by
joyyyhuang
1 task done
Question about the think tag in FunctionFormatter
bug
Something isn't working
pending
This problem is yet to be addressed
#7064
opened Feb 25, 2025 by
zhangch-ss
1 task done
Problems arising from inference
bug
Something isn't working
pending
This problem is yet to be addressed
#7062
opened Feb 25, 2025 by
yaosheng-zhang
1 task done
Uneven GPU memory allocation when fine-tuning Qwen2.5-14B on multiple GPUs
bug
Something isn't working
pending
This problem is yet to be addressed
#7055
opened Feb 24, 2025 by
Jimmy-L99
1 task done
Memory usage keeps growing during training in streaming mode; is this expected?
bug
Something isn't working
pending
This problem is yet to be addressed
#7049
opened Feb 24, 2025 by
caoxu915683474
1 task done
Dataset image path incorrectly loaded (wrong image paths in multimodal dataset)
bug
Something isn't working
pending
This problem is yet to be addressed
#7046
opened Feb 24, 2025 by
SovietLongbow
1 task done
Feature to specify the GPUs on each node for Ray multi-node, multi-GPU training
enhancement
New feature or request
pending
This problem is yet to be addressed
#7045
opened Feb 24, 2025 by
rexjm
1 task done
Connecting the API to chatbox raises ERROR: Exception in ASGI application
bug
Something isn't working
pending
This problem is yet to be addressed
#7044
opened Feb 24, 2025 by
eyexin
1 task done
Inability to effectively fine-tune models with built-in inference capabilities
bug
Something isn't working
pending
This problem is yet to be addressed
#7042
opened Feb 23, 2025 by
lxcxjxhx
1 task done
Long context full SFT validation causes OOM
bug
Something isn't working
pending
This problem is yet to be addressed
#7041
opened Feb 23, 2025 by
Yixi-Rao
1 task done
Deployment errors on Colab and Kaggle
bug
Something isn't working
pending
This problem is yet to be addressed
#7037
opened Feb 22, 2025 by
DullJZ
1 task done
Training Qwen2.5-vl with the latest LLaMA-Factory has become very slow
bug
Something isn't working
pending
This problem is yet to be addressed
#7030
opened Feb 21, 2025 by
leon-cas
1 task done
Fine-tuning deepseek r1
enhancement
New feature or request
pending
This problem is yet to be addressed
#7027
opened Feb 21, 2025 by
ZTurboX
1 task done
UnicodeDecodeError: 'utf-8' codec can't decode byte 0xc0 in position 17: invalid start byte
bug
Something isn't working
pending
This problem is yet to be addressed
#7016
opened Feb 20, 2025 by
DspringL
1 task done
Error when running llama factory sft of qwen2-7b on Ascend 910A: E40024: 2025-02-20-14:05:46.947.014 Failed call Python Func/Meathod [get_binfile_sha256_hash_from_c],
bug
Something isn't working
npu
This problem is related to NPU devices
pending
This problem is yet to be addressed
#7014
opened Feb 20, 2025 by
Goo-goo-goo
1 task done
After configuring port mapping on the server, the WebUI does not display correctly; the frontend CSS files are missing
bug
Something isn't working
pending
This problem is yet to be addressed
#7013
opened Feb 20, 2025 by
HemiFate
1 task done
Model performance degrades when using the API
bug
Something isn't working
pending
This problem is yet to be addressed
#7010
opened Feb 20, 2025 by
zzysos
1 task done
How to use MPS with llama-factory on a Mac?
enhancement
New feature or request
pending
This problem is yet to be addressed
#7001
opened Feb 19, 2025 by
catman002
1 task done
OOM during full-parameter fine-tuning of qwen-2.5-32b on A800 7*80g?
bug
Something isn't working
pending
This problem is yet to be addressed
#6999
opened Feb 19, 2025 by
coinfist-lucian
1 task done
Full fine-tuning of minicpm_2_6o: eval_loss on the validation set is not computed or printed, and the eval_loss plot is not drawn
bug
Something isn't working
pending
This problem is yet to be addressed
#6988
opened Feb 18, 2025 by
Maflyflyy
1 task done
[help] How to add rules to every entry of a dataset without affecting the return value
bug
Something isn't working
pending
This problem is yet to be addressed
#6984
opened Feb 18, 2025 by
frankyuan
1 task done
How to return the prob of the output tokens?
enhancement
New feature or request
pending
This problem is yet to be addressed
#6980
opened Feb 18, 2025 by
PapaMadeleine2022
1 task done
converting model Error: unknown data type: I32
bug
Something isn't working
pending
This problem is yet to be addressed
#6971
opened Feb 17, 2025 by
nemoisfash
1 task done
Fine-tuning DeepSeek-R1-Distill-Qwen-32B with deepspeed zero-3-offload hangs the system, with no response for a long time
bug
Something isn't working
pending
This problem is yet to be addressed
#6964
opened Feb 17, 2025 by
erichuazhou
1 task done