Thanks for reporting this issue. Did you use the latest release of vllm-ascend? If so, you could try setting a specific seed on the request; that might help with the accuracy issue.
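For reference, a seed can be passed in the request body of the OpenAI-compatible API that `vllm serve` exposes. This is only a sketch: the port and model path below match the reporter's command, and the prompt content is a placeholder.

```shell
# Hedged example: send a chat completion request with a fixed seed
# (assumes the server started by `vllm serve` is listening on port 10004).
curl http://localhost:10004/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
        "model": "/home/share/models/Qwen2.5-VL-3B-Instruct",
        "messages": [{"role": "user", "content": "Describe this image."}],
        "seed": 42
      }'
```

With the same seed and sampling parameters, repeated requests should produce reproducible output, which makes it easier to tell a genuine accuracy bug from sampling noise.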
My command is:
```shell
vllm serve /home/share/models/Qwen2.5-VL-3B-Instruct \
  --limit_mm_per_prompt image=5 \
  --dtype float16 \
  --port 10004 \
  --tensor-parallel-size 4 \
  --gpu-memory-utilization 0.9 \
  --max-model-len 32768
```
I found that the 7B model responds with only exclamation marks, while the 3B model works fine.