root/navi-1

Fork: 0

root / navi-1

2026-04-14	8c88f49 Browse files » Fix LLM hang: stop button during prefill, context guard, timeouts ... Root cause: during prefill (processing input tokens), Ollama emits no HTTP chunks. The `async for chunk in stream_complete()` loop body never executes, so stop_event is never checked — Stop button has no effect. Same issue with complete() calls (planning, compression): blocking await with no cancellation path. Fixes: _iter_stream_guarded() (agent.py, module-level): Wraps any stream_complete() generator. Polls stop_event every 1s while waiting for the next chunk using asyncio.wait() — so Stop works even during multi-minute prefill. On stop or timeout, calls aclose() on the generator which closes the HTTP connection to Ollama → generation halts → GPU drops to idle. Applied to both run_stream() and run_ephemeral(). _check_context_size() (Agent method): Estimates context tokens (chars/4 + 500 per image) before every LLM call. Raises ContextTooLargeError (new NaviError subclass) at 92% of ollama_num_ctx — before Ollama ever receives the request. _run_planning() timeouts: Both complete() calls (phase 1 and 2) wrapped with asyncio.wait_for(). Timeout logged and planning skipped gracefully — execution continues. New config (config.py): llm_complete_timeout = 120s llm_stream_first_chunk_timeout = 180s (prefill budget) llm_stream_chunk_timeout = 60s (inter-token budget) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 14 Apr
	8b09439 Browse files » Update README: fix run command, WS events, structure, profiles ... - Add --reload-dir navi --port 8000 to run command - WebSocket events: full sequence including plan_ready, thinking_delta, turn_thinking, tool_started, context_compressed - Structure: add memory/, workers/, tools/ (user), manuals/, docs/ - Profiles: correct tool list (all profiles share same full toolset), add planning/temperature columns - Extension: document user tool format in tools/ without registry edit - Add config section with link to docs/config.md Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 14 Apr
	594ad9e Browse files » Improve planning: two-phase pipeline and orchestrator discipline ... agent.py: - _run_planning() now runs two sequential LLM calls: Phase 1 (analysis): reformulate task, identify subtasks and unknowns; skip immediately if DIRECT. Phase 2 (execution plan): assign each subtask an executor — TOOL/AGENT/SELF — using a structured ## Plan format. Phase 2 context = analysis (embedded in system prompt) + last user message only; full history excluded to keep focus on plan structure. - Warn in logs when plan lacks TOOL/AGENT/SELF executor assignments. persona.txt: - MANDATORY sequence: step 0 = scratchpad init before anything else; todo tasks must mirror plan steps exactly (same order, same executors). - PLAN → EXECUTION BINDING: explicit rule — never switch an AGENT step to inline execution silently. - SCRATCHPAD: initialize sections at task start, not after first tool call; write context to scratchpad before briefing subagents. - Fix typo in BRIEFING ("sub-lagent" → "sub-agent"). - Replace stale Knowledge Retrieval Protocol with accurate one-liner. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 14 Apr
	0cd74d0 Browse files » Merge branch 'master' of https://git.gnexus.space/git/root/navi-1 root committed on 14 Apr
	8ddf69d Browse files » . ubuntu committed on 14 Apr
	2108274 Browse files » self edited ubuntu committed on 14 Apr
	af76dab Browse files » Add interactive HTML architecture visualization ... Single self-contained HTML file covering all backend subsystems: component diagram, request lifecycle, agent loop, tool system, WebSocket protocol with event sequences, API reference, profiles, memory, and config. Dark theme, sticky sidebar nav, no external deps. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 14 Apr
	b0a89e5 Browse files » Add backend documentation ... 10 markdown files covering all backend subsystems: architecture, agent loop, tool system, sessions, WebSocket protocol, profiles, memory, config, and API reference. Structured for AI readability with a single entry point (docs/index.md) and cross-references. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 14 Apr
2026-04-11	d843858 Browse files » Review and tighten all system prompts ... persona.txt: - PLANNING: threshold now 'plan has 2+ steps' instead of '2+ tool calls' - MEMORY: remove mandatory session-start memory_search (was conflicting with planning order); replace with contextual trigger rules - SCRATCHPAD: add 'if you've written anything to it' qualifier before read - DELEGATION: clarify sequential spawning is fine; tighten when-not-to-spawn secretary: trim redundant execution discipline, add 'test in code_exec before writing to disk' rule, profile-specific scratchpad sections server_admin: add explicit diagnostic workflow (gather → diagnose → act), profile-specific scratchpad sections, expanded safety and delegation guidance smart_home: add 'read-before-act' rule (check entity state before modifying), profile-specific scratchpad sections, tighten safety rules Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 11 Apr
	b292ddd Browse files » Strengthen Navi planning/delegation, unify toolsets, isolate subagent scratchpad ... persona.txt: - DELEGATION: 'default to spawning, not to doing inline' — stronger default, clearer triggers, explicit when-not-to-spawn rules - PLANNING: ties automatic planning phase to mandatory todo(op='set') as first tool call; reconciles pre-loop plan with in-loop execution discipline - SCRATCHPAD: new section — when to write, section naming conventions, mandatory read before final answer Profiles (secretary, server_admin, smart_home): - All three now share the same 18-tool set (each file independent) - planning_enabled=True on all three - scratchpad and web_search added to smart_home - System prompts updated with scratchpad/todo execution discipline sections agent.py run_ephemeral: - Each subagent gets a unique session ID (subagent_<uuid>) for scratchpad isolation — parallel or sequential subagents no longer share working notes Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 11 Apr
	efb870f Browse files » Skip planning phase for simple/direct requests ... The planning prompt now asks the model to respond with "DIRECT" if the request doesn't need multiple steps. Added a regex fallback: if the response has no numbered steps it's also discarded. This prevents plan cards appearing for conversational replies that would just duplicate the final message. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 11 Apr
	fe6d7bc Browse files » Add planning phase and scratchpad tool for smarter task execution ... - ScratchpadTool: session-scoped working notepad with named sections (write/append/read/clear). Lets Navi capture intermediate findings between tool calls instead of losing track of them. - Planning phase: when profile.planning_enabled=True, a fast pre-loop LLM call (think=False, no tools) outlines a numbered plan before any actions are taken. The plan is injected into session context as an assistant message so the model naturally continues from it. - PlanReady event + plan_ready WebSocket message + plan card in UI (green-tinted, collapsible, mirroring thinking card design). - secretary and server_admin profiles: planning_enabled=True, scratchpad added to enabled_tools, system prompts updated with explicit execution discipline instructions. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 11 Apr
	65428f5 Browse files » Fix WebSocket state corruption preventing messages after first reply ... Replace concurrent WS reads (_stream_recv + recv_task.cancel()) with HTTP stop endpoint (POST /sessions/{id}/stop). Cancelling a background receive_text() task corrupted Starlette's WS state, breaking all subsequent receives. Now the WS has a single reader at all times. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 11 Apr
	551f838 Browse files » Fix mobile viewport height jumps (100dvh) ... - Replace 100vh with 100dvh on .app and mobile sidebar — dvh tracks the actual visible viewport and adjusts when the browser address bar or virtual keyboard appear/disappear, eliminating layout jumps in Chrome - Add overflow: hidden to html/body to prevent page-level scroll leaking through when the virtual keyboard resizes the viewport Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 11 Apr
2026-04-10	7fdab7a Browse files » Add responsive layout for mobile ... - Sidebar becomes a fixed drawer on ≤768px: slides in from left with transform + transition, backdrop overlay behind it - Hamburger ☰ button in chat header (hidden on desktop, visible on mobile) - Backdrop click closes sidebar; session select also closes it - Wider message bubbles (90%) and tighter padding on small screens - No layout changes on desktop Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 10 Apr
	42c6012 Browse files » Refactor client input state management ... - Replace setStreamMode + syncSendButton + setInputEnabled (three functions competing for the same button) with a single updateInputUI() that derives button state from streaming, inputAllowed, uploadCount — eliminates the bug where upload completion during streaming could reset the stop button - Add inputAllowed module var as explicit "session is ready" flag - Remove stale client/app.js (was shadowing client/js/app.js, never loaded) - Clean up localStorage draft on session delete to prevent key accumulation Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 10 Apr
	86402e0 Browse files » Add stop button and fix context compression hang ... Stop generation: - Client: send button toggles to red ■ during streaming; sends {type:stop} via WS - Server: _stream_recv concurrently reads incoming messages during streaming using asyncio.wait — stop signal is handled immediately without polling - Cooperative stop via asyncio.Event (current_stop_event ContextVar): agent breaks out of LLM async-for cleanly so aclose() fires → Ollama stream closes gracefully, model stays in VRAM. No task.cancel() which would eject the model. - StreamStopped event propagates through run_stream/run_ephemeral; sub-agents stop via the same shared stop_event inherited through task context Context compression fix: - compress_context passes think=False to llm.complete() — no extended reasoning during summarization which caused GPU hang - Input truncated to 12k chars before sending to summarizer - LLMBackend.complete() / OllamaBackend.complete() accept think: bool \| None override Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 10 Apr
	b3fb945 Browse files » ssh_exec icon 🖧→🔌 ... Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 10 Apr
	84da729 Browse files » Terminal icon ⚡, subagent max_iterations 20→100 ... Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 10 Apr
	5d53f0c Browse files » Fix ssh_exec: is_closing() → is_closed() (asyncssh API) ... SSHClientConnection has no is_closing() — that's asyncio Transport. The correct public method is is_closed(). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 10 Apr
	da71d96 Browse files » Fix save(): persist profile_id to DB ... profile_id was never included in the UPDATE statement — only set on initial INSERT. Profile switches appeared to work in-memory but reverted on page reload or server restart. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 10 Apr
	5386b61 Browse files » Fix profile switch: reload tools/schema after switch_profile tool call ... switch_profile updates profile_id in DB, but run_stream() held a stale local session object — the final save would overwrite the change, and subsequent LLM calls in the same turn still used the old tool schemas. After each tool-call iteration, compare DB profile_id with the local session object. On mismatch: update session.profile_id, reload profile, tools, tool_schemas, and llm backend so the next LLM call gets the correct schema and the final save preserves the new profile. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 10 Apr
	9746fcc Browse files » Add ssh_exec to secretary profile ... Was missing, causing 'tool not found' when Navi tried to SSH from secretary mode. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 10 Apr
	f68b071 Browse files » Dynamic system prompt — inject per-call instead of storing in context ... System prompt is no longer stored in session.context. Instead, _build_context() prepends the current profile's system prompt fresh on every LLM call. This means profile switches take effect immediately on the next message — no stale prompt lingering in stored context. Also strips any existing system messages from context for migration safety (old sessions that have one stored will still work). _with_memory() removed, replaced by _build_context(context, profile, mem). run_ephemeral() context no longer includes system message either. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 10 Apr
	d68d059 Browse files » Fix ssh_exec error reporting + circular import in switch_profile ... ssh_exec: - Non-zero exit status: output (stdout/stderr) now included in error field so model can actually see why the command failed - All error cases: error field now contains human-readable message instead of short code (no_target, permission_denied, timeout) - Named connections from ssh_hosts.json: default known_hosts=none same as ad-hoc connections (was missing, caused host key failures) switch_profile: - Move ProfileSwitched import inside execute() to break circular import: tools.__init__ → switch_profile → core.events → core.agent → core.registry → tools Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 10 Apr
	4cd6727 Browse files » docs: add profile_switched WS event and switch_profile flow ... Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 10 Apr
	2012de2 Browse files » Profile switch: emit WS event so client updates UI immediately ... ProfileSwitched event emitted by switch_profile tool via current_event_sink. Client handles profile_switched: updates chat header, profile selector, and local sessions[] — no page refresh needed. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 10 Apr
	2b9cdc1 Browse files » Add switch_profile tool for automatic profile switching ... Navi can now switch her own profile mid-session when the task domain changes. The new profile (tools + system prompt) takes effect from the next user message. Injected with session_store + profile_registry like SpawnAgentTool. Added to all profiles' enabled_tools and persona. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 10 Apr
	d6f695f Browse files » Add API reference documentation ... Covers: REST endpoints (health, profiles, tools, sessions, files, messages), WebSocket protocol (all event types, client→server format, typical event sequences), error codes. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 10 Apr
	a1c5abc Browse files » Add workspace dir + clean up junk from project root ... - workspace/ — persistent dir for Navi's long-term files (scripts, notes, data); excluded from git, Navi instructed to use it instead of project root - .gitignore: session_files/ and workspace/* added - persona.txt: WORKSPACE section pointing Navi to workspace/ - Deleted scanner.py, network_scan_results.txt, targets.txt (Navi artifacts) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 10 Apr