Patch release for older vllm engine lora support in gateway plugins #599

varungup90 · 2025-01-24T06:04:01Z

No description provided.

zhangjyr · 2025-01-24T18:09:24Z

pkg/plugins/gateway/gateway.go

@@ -410,17 +410,18 @@ func (s *Server) HandleResponseBody(ctx context.Context, requestID string, req *
 					Key: "x-error-response-unmarshal", RawValue: []byte("true"),
 				}}},
 				err.Error()), complete
-		} else if res.Model != model {


This check is used to handle error messages from the vLLM model. I assumed "model" from request header should equal to "res.Model" returned from the response. If that is not the case, replacing with len(res.Model) == 0 is ok.

Patch release for gateway plugins

d69b114

Jeffwan changed the title ~~Patch release for gateway plugins~~ Patch release for older vllm engine lora support in gateway plugins Jan 24, 2025

Jeffwan approved these changes Jan 24, 2025

View reviewed changes

Jeffwan merged commit c85514d into main Jan 24, 2025
10 checks passed

Jeffwan deleted the patch-rel branch January 24, 2025 06:16

Jeffwan mentioned this pull request Jan 24, 2025

Lora request failed due to recent added validation in gateway #601

Closed

zhangjyr reviewed Jan 24, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Patch release for older vllm engine lora support in gateway plugins #599

Patch release for older vllm engine lora support in gateway plugins #599

varungup90 commented Jan 24, 2025

zhangjyr Jan 24, 2025

Patch release for older vllm engine lora support in gateway plugins #599

Patch release for older vllm engine lora support in gateway plugins #599

Conversation

varungup90 commented Jan 24, 2025

zhangjyr Jan 24, 2025

Choose a reason for hiding this comment