root/navi-1

Fork: 0

root / navi-1

History for navi-1 / navi / api

2026-05-04	03e5f11 Browse files » Revert "Force login prompt in gnexus-auth OAuth flow" ... This reverts commit `f232f21`. Eugene Sukhodolskiy committed on 4 May
	f232f21 Browse files » Force login prompt in gnexus-auth OAuth flow ... Add prompt=login to authorization URL so gnexus-auth always shows the login form instead of silently re-authenticating via existing session. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 4 May
	605f817 Browse files » Fix missing Annotated/Depends imports in auth.py and UI button class ... - auth.py: add from typing import Annotated and from fastapi import Depends to fix 422 Unprocessable Content on /auth/me and /auth/logout - AppSidebar.vue: replace btn-ghost with btn-primary for login/logout buttons Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 4 May
	c1fd543 Browse files » Fix pydantic-settings env var name mapping for auth ... Pydantic-settings converts snake_case field names to UPPER_CASE env vars by removing underscores. gnexus_auth_client_id became GNEXUS_AUTH_CLIENT_ID but .env used GNAUTH_CLIENT_ID. Rename all Settings fields from gnexus_auth_* to gnauth_* so they map correctly to GNAUTH_* env vars. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 4 May
	d65490e Browse files » Add graceful auth-not-configured guards ... - auth_login/auth_callback return 503 when GNAUTH_CLIENT_ID/SECRET are empty - webhooks return 503 when OAuth not configured - _resolve_user returns None early if auth not configured, avoiding crash during anonymous requests when gnexus-auth is not set up Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 4 May
	dc14fe0 Browse files » Support dynamic redirect_uri for multi-domain OAuth ... - get_gauth_client(redirect_uri=...) creates per-request client with dynamic redirect_uri while keeping shared state/PKCE stores - auth_login/auth_callback derive redirect_uri from request.base_url so both localhost and server addresses work with one gnexus-auth client Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 4 May
2026-05-03	3014ba6 Browse files » Multi-user auth via gnexus-auth OAuth + hybrid role/permission model ... - Integrate gnexus-auth-client-py (GAuthClient) for OAuth flow, token refresh, and webhook parsing - Add navi/auth/ package: User model, Fernet encryptor, client singleton, deps (get_current_user, require_admin, require_permission) - New tables: navi_users, user_auth_sessions (auto-created on startup) - Session/memory isolation by user_id with legacy NULL support - Cookie-based auth proxy: /auth/login, /callback, /logout, /me - Webhook receiver /webhooks/gnexus-auth handling user events, global logout, session revocation, role/permission changes - Admin endpoints (/admin/*) gated by role + permissions - Webclient auth store with isAdmin/hasPermission guards - Admin-only profile filtering in /agents/profiles - 200/200 tests passing Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 3 May
2026-05-02	31c8534 Browse files » Paginate session list loading Eugene Sukhodolskiy committed on 2 May
2026-05-01	e8b54fd Browse files » Disable thinking stalls for 3D subagents Eugene Sukhodolskiy committed on 1 May
2026-05-01	970f21e Browse files » Improve 3D modeling validation prompts Eugene Sukhodolskiy committed on 1 May
2026-04-30	dce281e Browse files » Improve content publishing UX Eugene Sukhodolskiy committed on 30 Apr
2026-04-29	8f68841 Browse files » Architecture extensibility — event bus, middleware, auto-discovery, Pydantic profiles ... - EventBus: async pub/sub for AgentEvents, WebSocket subscribes instead of direct yield - Declarative serialization: AgentEvent.to_wire() on all event types - Auto-discovery for LLM backends (_discover_backends) and workers (scan navi/workers/*.py) - AgentProfile: Pydantic BaseModel with extra='allow', @field_validator for model coercion - Tool middleware chain: pre/post execute hooks via ToolRegistry.add_middleware() - LoggingMiddleware: built-in, logs every tool call - Fix pg_trgm DDL: conditional GIN indexes via DO $$ block, no CREATE EXTENSION - New files: event_bus.py, middleware.py, logging_middleware.py Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 29 Apr
	7b672c3 Browse files » Remove SQLite legacy support ... SQLite is no longer supported; PostgreSQL is now required. - Delete navi/core/sqlite_session_store.py - Delete navi/memory/sqlite_store.py - Remove SqliteSessionStore from navi/core/__init__.py exports - deps.py: drop SQLite fallback, raise RuntimeError if DATABASE_URL missing - config.py: remove db_path setting - pyproject.toml & requirements.txt: drop aiosqlite dependency - .gitignore: remove navi.db entry - tech_debt_review_2026-04-29.md: mark #8 as REMOVED Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 29 Apr
	098401a Browse files » Stability fixes batch — tech debt review 2026-04-29 ... Critical: - Concurrent WS run race guard (#1) - Tool task cancellation on generator teardown (#2) - StopAsyncIteration kills fallback chain (#3) - Session loading race with _lastLoadId guard (#4) - ContentCard .match() crash on non-string result (#5) - Image data type guard in buildMessageList (#6) High: - Cap WS replay buffer at 500 events (#7) - Deduplicate memory extraction task with asyncio.Lock (#9) - TTL-based fallback blacklisting (5 min) (#10) - Subagent tool exception isolation (#11) - Inline image size/count validation on WS (#12) - Clean up orphaned file on DB insert failure (#13) - Deep watch streamingMsg for auto-scroll (#14) - WS_SCHEME wss:// support for HTTPS (#15) - Sending guard against duplicate message sends (#16) - Global unhandledrejection listener in API layer (#17) Medium: - Cap planning_logs at 20 entries (#22) - Store cleanup_loop task reference (#23) - BaseException → Exception in _run_with_sentinel (#24) - Propagate SystemExit in agent loop (#25) - Configurable output_reserve_tokens (#26) - Always reloadSession on session_sync (#30) - FIFO queue for confirm dialogs (#31) - Reset body.overflow on ImageLightbox unmount (#32) - try/finally in fallback copy (#33) - _isConnecting guard in WS send() (#34) Low: - Lazy-init deps.py singletons (#36) - Replace __import__ with direct imports (#38) - Preserve token count 0 in ollama.py (#39) - Clear orphaned streamingMsg on reconnect reload (#43) - Escape single quote in UserMessage (#44) - Polyfill-free findLast replacement (#48) - Match <table> tags with attributes in markdown (#49) - Attach copy buttons only when msg.done (#50) - Fix hasMeta falsy-0 bug (#53) Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 29 Apr
2026-04-28	cabfce8 Browse files » Fix system prompt leakage into chat history; polish content cards ... Backend: - websocket.py + agent.py: separate user-visible display_message from LLM user_message. System hints (image/file attachments) no longer leak into session.messages and appear after page reload. - main.py: add ensure_tables() on startup so session_content table is created before first publish. - profiles: add kimi-k2.6:cloud to all model lists as fallback. Frontend: - ContentCard.vue: remove border-radius, add scrollbar styles, fix metadata fallback parsing so cards survive page reload. - content-viewers/*.html: add matching scrollbar styles. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 28 Apr
	b88b7c0 Browse files » Add content hosting system with inline viewers ... Backend: - Add navi/content/ directory for published files - Add content_store.py with publish/list/delete/cleanup functions - Add content_publish tool for publishing files as viewable content - Add /content static file mount in main.py - Add /content-viewers mount for viewer pages - Extend ToolEvent with metadata field - Forward metadata through websocket tool_call events - Update Agent to include metadata in ToolEvent Frontend: - Add ContentCard.vue component for displaying published content - Add viewer pages: stl.html (Three.js), svg.html, html.html, pdf.html - Update AssistantMessage.vue to render ContentCard for content_publish - Update chat store to preserve metadata in tool cards - Update websocket protocol docs with metadata field Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 28 Apr
	cbb1e5d Browse files » Add dedicated CPU embedding server for memory backfill ... - Install Ollama CPU-only on 192.168.1.168 server - Pull nomic-embed-text:latest on server - Create systemd service ollama-embed.service (0.0.0.0:11434) - Add embedding_ollama_host / embedding_ollama_api_key to config.py - Update deps.py to build separate embedding backend when host configured - Update backfill_embeddings.py to use dedicated embedding backend - Add _generate_embeddings batch helper and backfill_embeddings to store.py - Backfilled 119 existing facts with embeddings Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 28 Apr
	c874cbe Browse files » Wire pgvector semantic search into memory system ... - Add vector(768) column + HNSW index to memory_facts - Add LLMBackend.embed() with Ollama + fallback implementation - MemoryStore: cosine-distance search with ILIKE fallback - New memory tool params: source, confidence, expires_days, source_context - Update extractor, sqlite_store, deps wiring - Add pgvector to requirements Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 28 Apr
	d9e9f4d Browse files » Stop image_view hallucinations on inline-attached images ... The model was inventing fake paths/URLs (e.g. files.oaiusercontent.com, /home/ubuntu/navi-1/input_file_0.png) and calling image_view on them when the user attached an image directly in chat — the image was already in the multimodal context, but the tool description and lack of a signal pushed the model to "load" it anyway. - websocket.py: when a user message has inline images, append a brief note that they are already in context. - image_view.py: soften the description — keep proactive use for paths and URLs the model genuinely cannot see, but tell it inline images don't need this tool. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 28 Apr
2026-04-25	d025bfc Browse files » Fix websocket.py: unpack 4-tuple from get_registries(), pass cp_registry to Agent ... Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 25 Apr
2026-04-25	65ffb4d Browse files » Add context providers: dynamic system message injection per LLM call ... - navi/context_providers/ registry + built-in public_url provider (global, always injected) - context_providers/ user directory, hot-reloaded via reload_tools - AgentProfile.context_providers field for per-profile opt-in providers - Agent._collect_context_injections() called before every tool-calling loop - reload_tools now reloads both user tools and user context providers - manuals/write_context_provider.md for Navi, docs/context_providers.md reference Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 25 Apr
2026-04-24	470d0be Browse files » Make server I/O non-blocking; update docs ... - Wrap all heavy filesystem ops in asyncio.to_thread: filesystem tool (read/write/append/list/find/info/move/delete/query/smart_edit), image_view (read_bytes), share_file (shutil.copy2), write_tool (write_text, _register_user_tool), session_files (shutil.rmtree, iterdir), sessions upload endpoint (sync open/write → to_thread) - Make delete_session_dir async; update its caller in sessions.py - docs/config.md: fix wrong defaults (threshold 0.70, keep_recent 8), remove phantom SESSION_FILES_TTL_HOURS, add LLM timeouts, DATABASE_URL, PUBLIC_URL, Gmail, CONTEXT_SUMMARY_MAX_TOKENS sections - docs/profiles.md: add missing tool_developer profile to table - android-client: add WebView remote debugging; remove unused toolbar menu - Remove stale helper scripts and test files Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 24 Apr
2026-04-21	b43428a Browse files » WebSocket event replay buffer for disconnect resilience ... On reconnect to an active agent run the server now replays all events emitted since the turn started, then switches to live forwarding. This eliminates the gap where tool cards, thinking blocks and stream deltas were permanently lost after a network blip. Server (_AgentRun): - events: list[dict] buffers every serialised agent event - broadcast() serialises and appends before putting in subscriber queues - reconnect flow: subscribe → replay_count snapshot → stream_start → replay events[0:replay_count] → live _stream_to_client Client: - onStreamStart() removes the frozen ghost message instead of marking done=true, so replay cleanly rebuilds the message from scratch - replayMode flag suppresses animations during replay - onReplayStart/onReplayEnd handlers set/clear the flag and restore animate on the message once live events resume Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 21 Apr
2026-04-20	94e32e9 Browse files » Planning debug panel, todo auto-populate, scratchpad/persona improvements ... - Planning debug panel: new Planning tab in debug/index.html shows raw phase 1/2 outputs and token counts per planning run, stored in session.planning_logs (new column in both SQLite and PostgreSQL) - New GET /sessions/{id}/planning API endpoint - PlanningDebugData internal event wires _run_planning() output into session storage; never forwarded to WebSocket clients - Phase 3 (plan critic) disabled — to be reworked with reflect integration - Todo tool: auto-populated from plan steps after phase 2; model only needs to call update/view, not set - Scratchpad: clarified description and persona instructions; removed context_transfer from user-facing docs (internal mechanism only) - web_search: switched to ddgs package, SearXNG as primary backend, DDG html-only fallback; added find_up action to filesystem tool - Persona: added SCRATCHPAD and TODO sections with clear usage rules; added NAVI.md project context instructions - chat.js: fixed subagent planning event fallthrough into parent UI; statusLabel cleared on first stream delta Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 20 Apr
2026-04-17	59f01b3 Browse files » Route subagent planning events into spawn_agent card in the UI ... Previously PlanningStatus/PlanReady had no is_subagent flag, so subagent planning spinners and plan cards rendered as top-level Navi planning UI. Backend: - Add is_subagent field to PlanningStatus and PlanReady events - _run_planning accepts is_subagent param, passes it through all yields - run_ephemeral calls _run_planning with is_subagent=True - websocket.py forwards is_subagent in planning_status and plan_ready messages Frontend (chat.js): - onPlanningStatus: if is_subagent, set planningLabel on the last spawn_agent card instead of msg.statusLabel - onPlanReady: if is_subagent, push plan into spawn card steps and clear planningLabel; otherwise behave as before Frontend (ToolCard.vue): - Render subagent-planning-indicator (spinner + label) when planningLabel set - Render plan cards inside subagent steps using the same plan-card pattern Also includes leftover session changes: spawn_agent default 40 in description and manual, updated manual content. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 17 Apr
	0c3dc98 Browse files » Planning phases, context compression, and tool improvements ... Agent: - Planning now a 3-phase async generator: Analysis → Execution plan → AIHelper critic - Yield PlanningStatus events before each phase (UI progress labels) - Phase 1 runs with think=True for deeper analysis - Phase 2 includes available tool list so executor assignments are accurate - Phase 3: independent critic pass validates and corrects TOOL: names against real tool list - Planning converted from list return to async generator (fixes token accounting) Backend: - Context compression threshold: 80% → 70% to trigger earlier - Compressor summary prompt: structured sections (goal, work state, key facts, outputs, errors) - Terminal output capped at 5000 chars to prevent context flooding - Web search: region=wt-wt for DDG, country=ALL for Brave, language=all for SearxNG - Scratchpad: mandate writing a 'goal' section at start of multi-step tasks - secretary max_iterations: 40→25, temperature: 0.7→0.5 - server_admin max_iterations: 40→20 Webclient: - ThinkingCard strips <thought> XML tags leaked by Ollama - planning_status WS event wired to chat.onPlanningStatus() Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 17 Apr
	d8ce61a Browse files » Add Prompts and Tools tabs to debug page ... Backend: - GET /agents/prompts — returns full built system prompt for every profile, broken into sections (persona / profile / profiles block) with char/token counts; mirrors Agent._build_system_prompt() exactly - GET /agents/tools — now includes parameters schema alongside name and description Debug page: - Tab bar: Context / Prompts / Tools - Prompts tab: profile sidebar + collapsible sections per prompt part (persona, profile prompt, profiles block), togglable tools list - Tools tab: searchable list of all tools with description and parameter table (name, type, description, required marker) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 17 Apr
2026-04-16	f83886a Browse files » Fix WS disconnect and missed stream on reconnect ... Two related problems: - During long AIHelper calls (non-streaming LLM), no data flows to the WebSocket and browsers drop the connection after ~30-60s of inactivity. Fixed with a 20s heartbeat: _stream_to_client now uses asyncio.wait_for and sends {"type":"heartbeat"} on timeout to keep the connection alive. - After reconnect, if the agent finished while the client was offline, _runs no longer holds the session and no stream_start is sent. Client would reconnect silently with no response shown. Fixed by sending {"type":"session_sync"} on every new WS connection (after reattach completes or immediately when no run is active) so the client knows to reload session history. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 16 Apr
	f74de4f Browse files » Persist context token count: return from API, restore on session load ... - GET /sessions/{id} now returns context_token_count and max_context_tokens (max pulled from settings.ollama_num_ctx) - loadSession() in chat store sets contextTokens/maxContextTokens from the response so ContextBar shows the last known fill level immediately on load, not only after the first new message - Restore v-if guard on ContextBar (hides for brand-new sessions with 0 tokens) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 16 Apr
	a338f8b Browse files » Add response metrics: elapsed time, tool calls, token count ... Server: - Message model: elapsed_seconds, tool_call_count, token_count fields (display-only, excluded from LLM context via exclude_none) - StreamEnd event: carries same three fields - agent.run_stream: tracks turn start time, counts ToolEvent completions, writes metrics onto the final assistant Message before saving to DB - WebSocket: forwards metrics in stream_end payload Client: - chat.onStreamEnd: attaches elapsed_seconds, tool_call_count, token_count to the streaming message on completion - buildMessageList: scans each assistant group for metrics from history - AssistantMessage: renders .msg-meta-row below the response — timer icon + Xs · wrench icon + N tools · coins icon + Nk tokens · time (each item only shown if present; time pushed right via margin-left: auto) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 16 Apr