Skip to content

Issues: hiyouga/LLaMA-Factory

🚨FAQs | 常见问题🚨
#4614 opened Jun 28, 2024 by hiyouga
Open
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

使用deepspeed进行2机8卡训练时,怎么把模型切成16份呢?我发现现在只会切成8份。 bug Something isn't working pending This problem is yet to be addressed
#7066 opened Feb 25, 2025 by joyyyhuang
1 task done
关于FunctionFormatter中think标签的疑问 bug Something isn't working pending This problem is yet to be addressed
#7064 opened Feb 25, 2025 by zhangch-ss
1 task done
Problems arising from Inferrence bug Something isn't working pending This problem is yet to be addressed
#7062 opened Feb 25, 2025 by yaosheng-zhang
1 task done
多卡微调Qwen2.5-14B显存分配不均 bug Something isn't working pending This problem is yet to be addressed
#7055 opened Feb 24, 2025 by Jimmy-L99
1 task done
使用streaming模式,但内存随着训练会增加,符合预期吗? bug Something isn't working pending This problem is yet to be addressed
#7049 opened Feb 24, 2025 by caoxu915683474
1 task done
Dataset image path incorrectly loaded 多模态数据集图像路径错误 bug Something isn't working pending This problem is yet to be addressed
#7046 opened Feb 24, 2025 by SovietLongbow
1 task done
Ray多机多卡训练指定每个节点的显卡功能 enhancement New feature or request pending This problem is yet to be addressed
#7045 opened Feb 24, 2025 by rexjm
1 task done
api接入chatbox报错ERROR: Exception in ASGI application bug Something isn't working pending This problem is yet to be addressed
#7044 opened Feb 24, 2025 by eyexin
1 task done
Inability to effectively fine-tune models with built-in inference capabilities bug Something isn't working pending This problem is yet to be addressed
#7042 opened Feb 23, 2025 by lxcxjxhx
1 task done
Long context full SFT validation causes OOM bug Something isn't working pending This problem is yet to be addressed
#7041 opened Feb 23, 2025 by Yixi-Rao
1 task done
colab和kaggle平台部署报错 bug Something isn't working pending This problem is yet to be addressed
#7037 opened Feb 22, 2025 by DullJZ
1 task done
基于最新的LLaMA-Factory训练Qwen2.5-vl, 训练变得非常慢 bug Something isn't working pending This problem is yet to be addressed
#7030 opened Feb 21, 2025 by leon-cas
1 task done
deepseek r1微调 enhancement New feature or request pending This problem is yet to be addressed
#7027 opened Feb 21, 2025 by ZTurboX
1 task done
UnicodeDecodeError: 'utf-8' codec can't decode byte 0xc0 in position 17: invalid start byte bug Something isn't working pending This problem is yet to be addressed
#7016 opened Feb 20, 2025 by DspringL
1 task done
使用昇腾910A llama factory sft qwen2-7b时报错E40024: 2025-02-20-14:05:46.947.014 Failed call Python Func/Meathod [get_binfile_sha256_hash_from_c], bug Something isn't working npu This problem is related to NPU devices pending This problem is yet to be addressed
#7014 opened Feb 20, 2025 by Goo-goo-goo
1 task done
在服务器端配置端口映射后,WebUI无法正常显示。确实前端css文件 bug Something isn't working pending This problem is yet to be addressed
#7013 opened Feb 20, 2025 by HemiFate
1 task done
使用API模型能力变差 bug Something isn't working pending This problem is yet to be addressed
#7010 opened Feb 20, 2025 by zzysos
1 task done
mac本用llama-factory如何用MPS ? enhancement New feature or request pending This problem is yet to be addressed
#7001 opened Feb 19, 2025 by catman002
1 task done
A800 7*80g 全参微调qwen-2.5-32b OOM? bug Something isn't working pending This problem is yet to be addressed
#6999 opened Feb 19, 2025 by coinfist-lucian
1 task done
启动WebUI失败 bug Something isn't working pending This problem is yet to be addressed
#6989 opened Feb 18, 2025 by xfrqh
1 task done
minicpm_2_6o全量微调验证集eval_loss不计算不打印,也不绘制eval_loss图 bug Something isn't working pending This problem is yet to be addressed
#6988 opened Feb 18, 2025 by Maflyflyy
1 task done
[help]如何添加规则到数据集的每个条目中,并且不影响返回值 bug Something isn't working pending This problem is yet to be addressed
#6984 opened Feb 18, 2025 by frankyuan
1 task done
怎么返回输出token的prob呢? enhancement New feature or request pending This problem is yet to be addressed
#6980 opened Feb 18, 2025 by PapaMadeleine2022
1 task done
converting model Error: unknown data type: I32 bug Something isn't working pending This problem is yet to be addressed
#6971 opened Feb 17, 2025 by nemoisfash
1 task done
用deepspeed zero-3-offload去微调DeepSeek-R1-Distill-Qwen-32B,系统卡住,长时间无反应 bug Something isn't working pending This problem is yet to be addressed
#6964 opened Feb 17, 2025 by erichuazhou
1 task done
ProTip! Find all open issues with in progress development work with linked:pr.