root/navi-1

Fork: 0

root / navi-1

History for navi-1 / navi / config.py

2026-06-16	29d39ac Browse files » Add internal navi_ui MCP server for structured UI components ... - Add navi/mcp/ui_server.py: FastMCP streamable_http server on port 8001 exposing render_component(component_name, payload, session_id). - Start server in main lifespan before container creation so McpManager can connect; wire orchestrator once container is ready; clean up on shutdown. - Add env settings NAVI_UI_MCP_ENABLED/HOST/PORT. - Add mcp_servers.d/navi_ui.json config with the 'ui' tool group. - Frontend: dispatch ui_component websocket event, store in chat.js, render placeholder UiComponentCard inside AssistantMessage.vue. - Unit tests for ui_server tool and chat.onUiComponent. Co-Authored-By: Claude <noreply@anthropic.com> Eugene Sukhodolskiy committed 9 days ago
2026-05-25	7dcec4c Browse files » Add archive message pagination, configurable WS replay buffer ... Backend: - Add archive_threshold to Session model and getSession response - Add next_before_seq to archive endpoint for cursor pagination - Make WS replay buffer size configurable via WS_REPLAY_BUFFER_SIZE Webclient: - Add getArchivedMessages API function - Add archive pagination state and loadArchivedMessages to chat store - MessageList: auto-load older messages on scroll-to-top with scroll position preservation and loading spinner Docs: update config.md with new env vars Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 25 May
2026-05-25	6cea761 Browse files » Wire archive trigger into agent after compression ... After _do_compress_and_save finishes, if the total persisted message count (db_next_sequence) exceeds session_messages_window (default 1000), the agent now calls archive_old_messages() to move older rows into session_messages_archive. Adds session_messages_window config and unit tests for archive SQL. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 25 May
2026-05-24	4f78099 Browse files » Raise first-chunk timeout to 90s and retry same server+model before fallback ... - config.py: llm_stream_first_chunk_timeout 180s → 90s - fallback.py stream_complete: wrap gen.__anext__() in asyncio.wait_for() with llm_stream_first_chunk_timeout; on TimeoutError or LLMConnectionError sleep 2s and retry once on the same server+model before blacklisting/fallback Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 24 May
2026-05-18	a97c203 Browse files » Make Settings immutable (frozen=True) and fix all test mutations ... - Add frozen=True to SettingsConfigDict in navi/config.py - Convert model_validator to mode="before" since mode="after" cannot mutate frozen instances - Replace all field-level monkeypatches in tests with whole-Settings object replacement - Ensure cross-module settings consistency (content_store, session_files, share_file, content_publish, filesystem) 392 passed, 1 skipped Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 18 May
2026-05-08	db0261b Browse files » Add multi-user sandbox: filesystem, terminal, code_exec, security policy ... - filesystem, share_file: sandbox non-admin users to user_data/<user_id>/ - terminal: working_dir sandbox + allowlist + dangerous pattern block for users - code_exec: sandbox CWD and temp files to user_data/<user_id>/ for users - context_builder: inject dynamic security policy into LLM context (user/admin) - config: terminal_user_allowed_commands setting - agent: wire user_id/user_role through ContextBuilder.build() - base: add current_user_role ContextVar; run_ephemeral inherits role Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 8 May
2026-05-04	08f4015 Browse files » Fix default gnauth profile path to /account/profile ... Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 4 May
	f1ccb57 Browse files » Revert "Fix avatar: use Gravatar instead of non-existent profile fields" ... This reverts commit `f485e54`. Eugene Sukhodolskiy committed on 4 May
	f485e54 Browse files » Fix avatar: use Gravatar instead of non-existent profile fields ... Investigated gnexus-auth UserinfoController and found that the profile response only contains: username, display_name, first_name, last_name, phone, birth_date, country, city, locale, timezone. There is no picture or avatar_url field. - Add make_gravatar_url() helper in navi/auth/__init__.py - Update deps.py to generate Gravatar URL from user email - Update config.py default gnauth_profile_path to /account/profile - Update .env.example comment accordingly - Frontend already handles avatar_url correctly Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 4 May
	8e2ae02 Browse files » Add avatar display and gnexus-auth profile link ... Backend: - User model: add avatar_url field - auth/deps.py: extract avatar_url from auth_user.profile (picture/avatar_url) - auth.py /auth/me: return avatar_url + computed profile_url - config.py: add gnauth_profile_path setting - .env.example: document GNAUTH_PROFILE_PATH Frontend: - AppSidebar.vue: show user avatar (or initial fallback) next to name - Clicking user info opens gnexus-auth profile in new tab - Rebuild dist/ Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 4 May
	c1fd543 Browse files » Fix pydantic-settings env var name mapping for auth ... Pydantic-settings converts snake_case field names to UPPER_CASE env vars by removing underscores. gnexus_auth_client_id became GNEXUS_AUTH_CLIENT_ID but .env used GNAUTH_CLIENT_ID. Rename all Settings fields from gnexus_auth_* to gnauth_* so they map correctly to GNAUTH_* env vars. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 4 May
2026-05-03	3014ba6 Browse files » Multi-user auth via gnexus-auth OAuth + hybrid role/permission model ... - Integrate gnexus-auth-client-py (GAuthClient) for OAuth flow, token refresh, and webhook parsing - Add navi/auth/ package: User model, Fernet encryptor, client singleton, deps (get_current_user, require_admin, require_permission) - New tables: navi_users, user_auth_sessions (auto-created on startup) - Session/memory isolation by user_id with legacy NULL support - Cookie-based auth proxy: /auth/login, /callback, /logout, /me - Webhook receiver /webhooks/gnexus-auth handling user events, global logout, session revocation, role/permission changes - Admin endpoints (/admin/*) gated by role + permissions - Webclient auth store with isAdmin/hasPermission guards - Admin-only profile filtering in /agents/profiles - 200/200 tests passing Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 3 May
2026-04-29	a139c32 Browse files » Clarify share file publishing boundaries Eugene Sukhodolskiy committed on 29 Apr
	8f68841 Browse files » Architecture extensibility — event bus, middleware, auto-discovery, Pydantic profiles ... - EventBus: async pub/sub for AgentEvents, WebSocket subscribes instead of direct yield - Declarative serialization: AgentEvent.to_wire() on all event types - Auto-discovery for LLM backends (_discover_backends) and workers (scan navi/workers/*.py) - AgentProfile: Pydantic BaseModel with extra='allow', @field_validator for model coercion - Tool middleware chain: pre/post execute hooks via ToolRegistry.add_middleware() - LoggingMiddleware: built-in, logs every tool call - Fix pg_trgm DDL: conditional GIN indexes via DO $$ block, no CREATE EXTENSION - New files: event_bus.py, middleware.py, logging_middleware.py Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 29 Apr
	7b672c3 Browse files » Remove SQLite legacy support ... SQLite is no longer supported; PostgreSQL is now required. - Delete navi/core/sqlite_session_store.py - Delete navi/memory/sqlite_store.py - Remove SqliteSessionStore from navi/core/__init__.py exports - deps.py: drop SQLite fallback, raise RuntimeError if DATABASE_URL missing - config.py: remove db_path setting - pyproject.toml & requirements.txt: drop aiosqlite dependency - .gitignore: remove navi.db entry - tech_debt_review_2026-04-29.md: mark #8 as REMOVED Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 29 Apr
	098401a Browse files » Stability fixes batch — tech debt review 2026-04-29 ... Critical: - Concurrent WS run race guard (#1) - Tool task cancellation on generator teardown (#2) - StopAsyncIteration kills fallback chain (#3) - Session loading race with _lastLoadId guard (#4) - ContentCard .match() crash on non-string result (#5) - Image data type guard in buildMessageList (#6) High: - Cap WS replay buffer at 500 events (#7) - Deduplicate memory extraction task with asyncio.Lock (#9) - TTL-based fallback blacklisting (5 min) (#10) - Subagent tool exception isolation (#11) - Inline image size/count validation on WS (#12) - Clean up orphaned file on DB insert failure (#13) - Deep watch streamingMsg for auto-scroll (#14) - WS_SCHEME wss:// support for HTTPS (#15) - Sending guard against duplicate message sends (#16) - Global unhandledrejection listener in API layer (#17) Medium: - Cap planning_logs at 20 entries (#22) - Store cleanup_loop task reference (#23) - BaseException → Exception in _run_with_sentinel (#24) - Propagate SystemExit in agent loop (#25) - Configurable output_reserve_tokens (#26) - Always reloadSession on session_sync (#30) - FIFO queue for confirm dialogs (#31) - Reset body.overflow on ImageLightbox unmount (#32) - try/finally in fallback copy (#33) - _isConnecting guard in WS send() (#34) Low: - Lazy-init deps.py singletons (#36) - Replace __import__ with direct imports (#38) - Preserve token count 0 in ollama.py (#39) - Clear orphaned streamingMsg on reconnect reload (#43) - Escape single quote in UserMessage (#44) - Polyfill-free findLast replacement (#48) - Match <table> tags with attributes in markdown (#49) - Attach copy buttons only when msg.done (#50) - Fix hasMeta falsy-0 bug (#53) Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 29 Apr
2026-04-28	cbb1e5d Browse files » Add dedicated CPU embedding server for memory backfill ... - Install Ollama CPU-only on 192.168.1.168 server - Pull nomic-embed-text:latest on server - Create systemd service ollama-embed.service (0.0.0.0:11434) - Add embedding_ollama_host / embedding_ollama_api_key to config.py - Update deps.py to build separate embedding backend when host configured - Update backfill_embeddings.py to use dedicated embedding backend - Add _generate_embeddings batch helper and backfill_embeddings to store.py - Backfilled 119 existing facts with embeddings Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 28 Apr
2026-04-28	c874cbe Browse files » Wire pgvector semantic search into memory system ... - Add vector(768) column + HNSW index to memory_facts - Add LLMBackend.embed() with Ollama + fallback implementation - MemoryStore: cosine-distance search with ILIKE fallback - New memory tool params: source, confidence, expires_days, source_context - Update extractor, sqlite_store, deps wiring - Add pgvector to requirements Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 28 Apr
2026-04-25	65ffb4d Browse files » Add context providers: dynamic system message injection per LLM call ... - navi/context_providers/ registry + built-in public_url provider (global, always injected) - context_providers/ user directory, hot-reloaded via reload_tools - AgentProfile.context_providers field for per-profile opt-in providers - Agent._collect_context_injections() called before every tool-calling loop - reload_tools now reloads both user tools and user context providers - manuals/write_context_provider.md for Navi, docs/context_providers.md reference Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 25 Apr
2026-04-24	511dc46 Browse files » Add Ollama multi-server fallback with in-memory blacklisting ... - New FallbackOllamaBackend (navi/llm/fallback.py): tries servers and models in priority order; on LLMConnectionError blacklists the server for the process lifetime, on LLMModelNotFoundError blacklists the (server, model) pair — eliminates latency from repeated failed probes - OllamaBackend now raises typed LLMConnectionError / LLMModelNotFoundError instead of bare LLMBackendError; accepts list[str] \| str \| None for model - AgentProfile.model changed from str to list[str] (str auto-normalised); all profiles updated to ["gemma4:31b-cloud", "gemma4:26b-a4b-it-q4_K_M"] - New config field OLLAMA_BACKENDS_FILE: path to [{host, api_key?}] JSON; when set, registry creates FallbackOllamaBackend instead of OllamaBackend - ollama_backends.json template added (gitignored — contains API key) - current_model ContextVar type widened to list[str] \| str \| None Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 24 Apr
2026-04-22	25ecc63 Browse files » Use gemma4 cloud model by default Eugene Sukhodolskiy committed on 22 Apr
2026-04-22	7eea278 Browse files » Support Ollama Cloud API key Eugene Sukhodolskiy committed on 22 Apr
2026-04-20	9704a92 Browse files » Autonomous reasoning improvements: budget, anchoring, anti-stall, validation ... - AgentProfile: per-profile thinking mechanics flags (think_enabled, iteration_budget_enabled, goal_anchoring, anti_stall, step_validation, planning_reflect, adaptive_replan) — all profiles updated in config.json - Iteration budget: inject remaining iterations into context so model knows when to wrap up; urgency levels at ≤7 and ≤3 remaining - Goal anchoring: inject original goal + todo state every N iterations to prevent drift on long tasks - Anti-stall: two signals — no todo progress for N iterations, or identical tool calls repeated N times; warning injected into context - Todo step validation: marking done requires a validation field describing how result was verified; failed gets a soft nudge with tip for re-planning - stream_complete: add think param to base class, ollama and openai backends - Summarizer: raise max_tokens 1024→3000, expand system prompt with user-preferences section and verbatim-value instructions - Compression card: persist to session.messages (is_compression flag on Message), show expandable summary in webclient with markdown body - ToolResult.to_message_content: always include output on failure so tracebacks and error details reach the model (fixes silent Error: None) - Developer profile: fix subagent profile secretary→developer, add write_tool to subagent_tools, clarify write_tool vs filesystem in system prompt Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 20 Apr
2026-04-17	0c3dc98 Browse files » Planning phases, context compression, and tool improvements ... Agent: - Planning now a 3-phase async generator: Analysis → Execution plan → AIHelper critic - Yield PlanningStatus events before each phase (UI progress labels) - Phase 1 runs with think=True for deeper analysis - Phase 2 includes available tool list so executor assignments are accurate - Phase 3: independent critic pass validates and corrects TOOL: names against real tool list - Planning converted from list return to async generator (fixes token accounting) Backend: - Context compression threshold: 80% → 70% to trigger earlier - Compressor summary prompt: structured sections (goal, work state, key facts, outputs, errors) - Terminal output capped at 5000 chars to prevent context flooding - Web search: region=wt-wt for DDG, country=ALL for Brave, language=all for SearxNG - Scratchpad: mandate writing a 'goal' section at start of multi-step tasks - secretary max_iterations: 40→25, temperature: 0.7→0.5 - server_admin max_iterations: 40→20 Webclient: - ThinkingCard strips <thought> XML tags leaked by Ollama - planning_status WS event wired to chat.onPlanningStatus() Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 17 Apr
2026-04-16	02c7dd8 Browse files » Fix gmail auth: read credentials from settings, not os.environ ... pydantic-settings loads .env only into the Settings object — it does not populate os.environ. Added gmail_address and gmail_app_password fields to Settings; gmail tool now reads from settings instead of os.environ. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 16 Apr
2026-04-15	2d2bf84 Browse files » Migrate storage to PostgreSQL with SQLite fallback; misc fixes ... - Add PgSessionStore (asyncpg pool) and PgMemoryStore replacing aiosqlite - Keep SqliteSessionStore + SqliteMemoryStore for zero-dependency quick start - Selection logic in deps.py: DATABASE_URL set → PG, else → SQLite - Add asyncpg>=0.29 to dependencies; add DATABASE_URL / DB_PATH to config - Add RESPONSE HYGIENE rule to persona: never echo tool output or plan state - Add developer profile user tools: weather, internal_monitor - Update README: developer profile, DB section, current tool/profile state Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 15 Apr
	4b64763 Browse files » Add explicit output token budget for summarizer (context_summary_max_tokens) ... Previously there was no num_predict set for the summarization LLM call, so Ollama used its server default (often 128 tokens — very short summaries). - Add max_tokens param to LLMBackend.complete() and OllamaBackend (→ num_predict) - Add context_summary_max_tokens: int = 1024 to config - Thread it through compress_context() and CompressionWorker Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 15 Apr
	96548a1 Browse files » Expand summarization budget for better context quality ... - _MAX_SUMMARY_INPUT_CHARS: 12k → 24k chars (2x input fed to summarizer) - context_keep_recent: 10 → 8 turns (2 more turns go into each summary batch) - Summarizer prompt: replace "Be brief" with "Be thorough" — capture code/config snippets and enough detail to continue the conversation without original messages Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 15 Apr
2026-04-14	a8d0b37 Browse files » Add share_file tool and session-lifetime file storage ... Session file directories now live until the session is deleted, not 24h TTL. Cleanup loop only removes orphaned dirs (session gone from DB). New share_file tool: copies any file to the session directory and returns a clickable download URL. Navi can call this after generating any file the user will want to keep. New GET /sessions/{id}/files/{filename} endpoint serves files with correct Content-Disposition (inline for images/HTML/PDF, attachment for everything else). Added PUBLIC_URL config key for building correct download links behind reverse proxies. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 14 Apr
2026-04-14	33b2880 Browse files » Improve filesystem, web search, context guard, and subagent narration ... filesystem: add find (glob), info (stat), move, append actions; read now supports offset/limit with hard 1MB guard; list shows sizes, dates, optional recursion. web_search: retry DDG across auto/html/lite backends; add optional Brave Search API and SearXNG fallbacks configured via .env. agent: fix ContextTooLargeError to surface as Navi response instead of raw system error; fix _check_context_size to calculate from remaining budget (window - output_reserve) rather than a fixed 92% threshold. persona: add ReAct narration instruction to subagent briefing template. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 14 Apr