Ollama defaults to a context size of 2048 tokens, and Goose often exceeds that window; when this happens, Ollama truncates the input, which leads to suboptimal results from the LLM.

Output from Ollama when this occurs:

```
time=2025-02-16T13:27:35.103Z level=WARN source=runner.go:129 msg="truncating input prompt" limit=2048 prompt=3520 keep=4 new=2048
```

When sending a request to Ollama, the context size can be specified by adding the following to the payload:

```json
"options": { "num_ctx": 4096 }
```
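For example, a minimal sketch of a request that sets a larger window (the `/api/generate` endpoint and the `options` field are part of Ollama's API; the model name and prompt are placeholders, not taken from this issue):

```sh
# Request a 4096-token context for this call; "llama3.2" is only an example
# model name, substitute whichever model Goose is configured to use.
curl http://localhost:11434/api/generate -d '{
  "model": "llama3.2",
  "prompt": "Summarize the plan for this repo.",
  "options": { "num_ctx": 4096 }
}'
```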
The recommendation (seen in the above link) is to create your own Ollama model with a larger context size and point to it through the same API. This seems convoluted and not the right solution for this problem. A short-term solution, assuming you're running an Ollama service locally, would be to update your existing instance with the desired context window; see the sketch below.
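One possible way to do that (a sketch, assuming a local `ollama` CLI and an already-pulled model; "llama3.2" is only an example name) is to set the parameter in an interactive session and save it back over the existing model:

```sh
# Open an interactive session with the model Goose points at
ollama run llama3.2

# Inside the session, raise the context window and overwrite the model entry:
#   /set parameter num_ctx 4096
#   /save llama3.2
# Exit with /bye; later requests to this model will use the larger context.
```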
If the above work goes through, we could add a helper in Goose to set the environment variable, but I'll defer to the main Goose team to decide whether that's appropriate.