Add Ollama multi-server fallback with in-memory blacklisting

Fork: 0

root / navi-1

Browse code Add Ollama multi-server fallback with in-memory blacklisting - New FallbackOllamaBackend (navi/llm/fallback.py): tries servers and models in priority order; on LLMConnectionError blacklists the server for the process lifetime, on LLMModelNotFoundError blacklists the (server, model) pair — eliminates latency from repeated failed probes - OllamaBackend now raises typed LLMConnectionError / LLMModelNotFoundError instead of bare LLMBackendError; accepts list[str] \| str \| None for model - AgentProfile.model changed from str to list[str] (str auto-normalised); all profiles updated to ["gemma4:31b-cloud", "gemma4:26b-a4b-it-q4_K_M"] - New config field OLLAMA_BACKENDS_FILE: path to [{host, api_key?}] JSON; when set, registry creates FallbackOllamaBackend instead of OllamaBackend - ollama_backends.json template added (gitignored — contains API key) - current_model ContextVar type widened to list[str] \| str \| None Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> feature/navi-code master vmkdemo
1 parent 470d0be commit 511dc463e302f1e5ee169f941ea5ac9019235a99 Eugene Sukhodolskiy authored on 24 Apr

Browse code

- New FallbackOllamaBackend (navi/llm/fallback.py): tries servers and
  models in priority order; on LLMConnectionError blacklists the server
  for the process lifetime, on LLMModelNotFoundError blacklists the
  (server, model) pair — eliminates latency from repeated failed probes
- OllamaBackend now raises typed LLMConnectionError / LLMModelNotFoundError
  instead of bare LLMBackendError; accepts list[str] | str | None for model
- AgentProfile.model changed from str to list[str] (str auto-normalised);
  all profiles updated to ["gemma4:31b-cloud", "gemma4:26b-a4b-it-q4_K_M"]
- New config field OLLAMA_BACKENDS_FILE: path to [{host, api_key?}] JSON;
  when set, registry creates FallbackOllamaBackend instead of OllamaBackend
- ollama_backends.json template added (gitignored — contains API key)
- current_model ContextVar type widened to list[str] | str | None

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

feature/navi-code master vmkdemo

1 parent 470d0be commit 511dc463e302f1e5ee169f941ea5ac9019235a99

Eugene Sukhodolskiy authored on 24 Apr

Patch

Unified Split

Showing 14 changed files

Ignore Space Show notes View .env.example

Ignore Space Show notes View .gitignore

Ignore Space Show notes View navi/config.py

Ignore Space Show notes View navi/core/registry.py

Ignore Space Show notes View navi/exceptions.py

Ignore Space Show notes View navi/llm/fallback.py 0 → 100644

Ignore Space Show notes View navi/llm/ollama.py

Ignore Space Show notes View navi/profiles/base.py

Ignore Space Show notes View navi/profiles/developer/config.json

Ignore Space Show notes View navi/profiles/loader.py

Ignore Space Show notes View navi/profiles/secretary/config.json

Ignore Space Show notes View navi/profiles/server_admin/config.json

Ignore Space Show notes View navi/profiles/tool_developer/config.json

Ignore Space Show notes View navi/tools/base.py

Show line notes below