root/navi-1

Fork: 0

root / navi-1

History for navi-1 / navi / llm / fallback.py

2026-05-12	ac44c84 Browse files » Remove dead LLMBackend.stream() method ... The method was defined on all backends (Ollama, FallbackOllama, OpenAI) and in the base LLMBackend interface, but was never called by agent.py or messages.py. stream_complete() covers all streaming use cases. - navi/llm/base.py: remove abstract stream() method - navi/llm/ollama.py: remove OllamaBackend.stream() - navi/llm/fallback.py: remove FallbackOllamaBackend.stream() - navi/llm/openai_backend.py: remove OpenAIBackend.stream() - docs/tech_debt_review: mark item 54 as fixed 236 tests passing. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 12 May
2026-05-11	cebc073 Browse files » Fix ollama_backends / FallbackOllamaBackend issues ... - registry.py: always use FallbackOllamaBackend (unified backend). Enables model priority lists in all deployments, not just multi-server. - agent.py: add missing think=profile.think_enabled to run() (REST endpoint). - compressor.py: fix model param type (str → list[str] \| str \| None). - fallback.py: harden load_servers_from_file against missing/bad JSON files and entries without host. Add clear_blacklists() for manual reset. - admin.py: add POST /admin/ollama/clear-blacklists endpoint. - tech_debt_review: document dead stream() methods. - tests: add tests for single-server fallback, bad file handling, missing host skipping, and blacklist clearing. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 11 May
2026-04-29	30dd183 Browse files » Align Ollama HTTP timeout with LLM timeouts Eugene Sukhodolskiy committed on 29 Apr
2026-04-29	098401a Browse files » Stability fixes batch — tech debt review 2026-04-29 ... Critical: - Concurrent WS run race guard (#1) - Tool task cancellation on generator teardown (#2) - StopAsyncIteration kills fallback chain (#3) - Session loading race with _lastLoadId guard (#4) - ContentCard .match() crash on non-string result (#5) - Image data type guard in buildMessageList (#6) High: - Cap WS replay buffer at 500 events (#7) - Deduplicate memory extraction task with asyncio.Lock (#9) - TTL-based fallback blacklisting (5 min) (#10) - Subagent tool exception isolation (#11) - Inline image size/count validation on WS (#12) - Clean up orphaned file on DB insert failure (#13) - Deep watch streamingMsg for auto-scroll (#14) - WS_SCHEME wss:// support for HTTPS (#15) - Sending guard against duplicate message sends (#16) - Global unhandledrejection listener in API layer (#17) Medium: - Cap planning_logs at 20 entries (#22) - Store cleanup_loop task reference (#23) - BaseException → Exception in _run_with_sentinel (#24) - Propagate SystemExit in agent loop (#25) - Configurable output_reserve_tokens (#26) - Always reloadSession on session_sync (#30) - FIFO queue for confirm dialogs (#31) - Reset body.overflow on ImageLightbox unmount (#32) - try/finally in fallback copy (#33) - _isConnecting guard in WS send() (#34) Low: - Lazy-init deps.py singletons (#36) - Replace __import__ with direct imports (#38) - Preserve token count 0 in ollama.py (#39) - Clear orphaned streamingMsg on reconnect reload (#43) - Escape single quote in UserMessage (#44) - Polyfill-free findLast replacement (#48) - Match <table> tags with attributes in markdown (#49) - Attach copy buttons only when msg.done (#50) - Fix hasMeta falsy-0 bug (#53) Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 29 Apr
2026-04-28	c874cbe Browse files » Wire pgvector semantic search into memory system ... - Add vector(768) column + HNSW index to memory_facts - Add LLMBackend.embed() with Ollama + fallback implementation - MemoryStore: cosine-distance search with ILIKE fallback - New memory tool params: source, confidence, expires_days, source_context - Update extractor, sqlite_store, deps wiring - Add pgvector to requirements Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 28 Apr
2026-04-26	b5b11be Browse files » changed llm & new ollama param Eugene Sukhodolskiy committed on 26 Apr
2026-04-25	52b4069 Browse files » Tune profile sampling configs Eugene Sukhodolskiy committed on 25 Apr
2026-04-24	511dc46 Browse files » Add Ollama multi-server fallback with in-memory blacklisting ... - New FallbackOllamaBackend (navi/llm/fallback.py): tries servers and models in priority order; on LLMConnectionError blacklists the server for the process lifetime, on LLMModelNotFoundError blacklists the (server, model) pair — eliminates latency from repeated failed probes - OllamaBackend now raises typed LLMConnectionError / LLMModelNotFoundError instead of bare LLMBackendError; accepts list[str] \| str \| None for model - AgentProfile.model changed from str to list[str] (str auto-normalised); all profiles updated to ["gemma4:31b-cloud", "gemma4:26b-a4b-it-q4_K_M"] - New config field OLLAMA_BACKENDS_FILE: path to [{host, api_key?}] JSON; when set, registry creates FallbackOllamaBackend instead of OllamaBackend - ollama_backends.json template added (gitignored — contains API key) - current_model ContextVar type widened to list[str] \| str \| None Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 24 Apr