root/navi-1

Fork: 0

root / navi-1

2026-04-10	a1c5abc Browse files » Add workspace dir + clean up junk from project root ... - workspace/ — persistent dir for Navi's long-term files (scripts, notes, data); excluded from git, Navi instructed to use it instead of project root - .gitignore: session_files/ and workspace/* added - persona.txt: WORKSPACE section pointing Navi to workspace/ - Deleted scanner.py, network_scan_results.txt, targets.txt (Navi artifacts) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 10 Apr
2026-04-10	1e8b65e Browse files » Major feature batch: visibility, planning, file uploads, streaming ... - stream_complete(): streaming with tools for all LLM turns — thinking now streams as ThinkingDelta/ThinkingEnd in real-time during tool- selection turns, not just on the final response - todo built-in tool: session-scoped plan manager (set/view/update/clear); persona + all profiles updated with mandatory planning instructions - TurnThinking event: sub-agent thinking forwarded to parent sink as a collapsible block in the spawn_agent card - File uploads: non-image files uploaded via XHR, shown as badges in message bubble; SVG treated as regular file (not base64 image) - session_files: POST /sessions/{id}/files, TTL cleanup, forbidden exts - WebSocket reconnect: _AgentRun broadcast pattern, re-attach mid-stream - UI: favicon, sidebar logo, turn-thinking cards, subagent thinking blocks, token counter, draft persistence, file progress bar - Removed AgentNote (content is always None alongside tool_calls) - Ollama stream_complete: tool_calls captured from non-final chunk (done=False) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 10 Apr
2026-04-09	4112e36 Browse files » Live tool visibility: pending cards, sub-agent step log ... Backend: - ToolStarted event: emitted before tool execution begins so client can render a pending card with spinner immediately - ToolEvent gains is_subagent flag; ToolStarted same - current_event_sink ContextVar in tools/base.py — run_stream() sets it to an asyncio.Queue before create_task(); run_ephemeral() reads it and puts ToolStarted/ToolEvent into the queue as each sub-agent step runs - run_stream() tool loop: sequential execution via create_task() + polling drain loop (20ms sleep); yields ToolStarted → sub-agent events from sink → ToolEvent (completed) for each tool call - run_ephemeral() rewritten to inline sequential tool execution with sink emission (replaces _execute_tool_calls gather) - _run_single_tool() helper extracted for run_stream() - websocket.py handles tool_started and adds is_subagent to tool_call Frontend: - appendPendingToolCard(): creates card with spinner; spawn_agent opens body immediately to show sub-agent log as it fills - finalizeToolCard(): fills result, removes spinner, adds toggle; strips "[Sub-agent result — ...]" reminder prefix from displayed text - appendSubagentStep() / finalizeSubagentStep(): live step log inside spawn_agent card — each sub-agent tool call gets a ↳ row - app.js: tool_started → pending card; tool_call → finalize card; is_subagent routing to sub-step vs main card; abandonStream() resets pendingToolCard/pendingSubStep - CSS: .spinner-inline for card headers; .subagent-log / .subagent-step for nested step display; .tool-body-open for always-open spawn_agent body; .tool-card.pending suppresses chevron Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 9 Apr
	ae4efaa Browse files » spawn_agent: fix model behavior — synthesis reminder + lower delegation threshold ... - ToolResult.output now prefixed with explicit reminder that the user cannot see sub-agent output and the orchestrator must present findings in its own final response; reminder appears adjacent to the result so the model reads it in context - spawn_agent description updated: lowers threshold from "3+" to "2+" tool calls, makes USER CANNOT SEE warning prominent, tightens wording - persona.txt DELEGATION section: clearer rule (default to spawning for multi-step sub-tasks), explicit "never end turn after spawn results", removed abstract threshold in favour of concrete examples Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 9 Apr
	2efec37 Browse files » Add spawn_agent: sub-agent delegation with isolated context ... - Agent.run_ephemeral() — runs a sub-agent loop without a persistent session; accepts exclude_tools to block recursion; logs start/complete - session_store made Optional in Agent.__init__ (None for ephemeral runs) - SpawnAgentTool (navi/tools/spawn_agent.py): spawns an isolated Agent for a focused task; resolves profile from parent session via ContextVar; blocks spawn_agent recursion via exclude_tools=["spawn_agent"] - build_default_registries() accepts session_store param; registers SpawnAgentTool after BackendRegistry is built (patches _backend_registry) - deps.py passes _session_store to build_default_registries - All profiles: spawn_agent added to enabled_tools, max_iterations 10→30 - persona.txt: DELEGATION section — when/how to use spawn_agent Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 9 Apr
	b190a4b Browse files » Client: wider sidebar, 2-line preview, loading spinners ... - Sidebar width: 260px → 300px - Session preview: nowrap/ellipsis → 2-line clamp (-webkit-line-clamp: 2) - Action buttons: align-items flex-start + 2px padding-top so they sit at the top of multi-line items instead of vertically centered - Spinner shown in sidebar session list during initial data fetch - Spinner shown in chat area during session history load; cleared on both success and error Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 9 Apr
	a911bfc Browse files » Client UX: profile-filtered sessions, draft persistence, stream safety ... - Profile selector now filters session list: only sessions of the selected profile are shown; switching profile opens the most recent session for that profile (or empty state if none exist) - openSession() syncs profileSelect to the opened session's profile so the list always reflects what's on screen - Session items no longer show redundant profile name (list is filtered) - Bug fix: switching sessions while streaming now calls abandonStream() before ws.disconnect() — resets state without touching DOM, preventing WS onClose from calling finishStream() on the wrong session - Textarea no longer disabled during streaming — only the Send button is locked; user can type the next message while model is still responding - Draft auto-saved to localStorage per session on every keystroke, restored on session switch, cleared on send — survives page refresh Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 9 Apr
	108f65d Browse files » SSH connection pooling: per-session, 20-minute TTL ... - Pool keyed by session_id:host:port:username — parallel sessions share no state even when targeting the same server - Per-key asyncio.Lock prevents concurrent connection creation races - TTL (20 min) and is_closing() checked on every access; expired/closed connections are evicted and replaced transparently - On disconnect error during command execution: evict + retry once with fresh connection - current_session_id ContextVar (set by Agent before tool calls) is read in ssh_exec to build the pool key without changing tool signatures Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 9 Apr
	d04c75d Browse files » Fix naive/aware datetime comparison in session store and memory extraction ... Old sessions stored datetimes without timezone offset. _row_to_session now always returns timezone-aware datetimes via _parse_dt() helper, fixing the TypeError when comparing session.last_active against timezone.utc cutoffs. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 9 Apr
	56611d7 Browse files » Add long-term user memory system ... Architecture: - navi/memory/store.py: MemoryStore backed by SQLite (memory_facts, memory_summary, session_memory_state tables in navi.db) - navi/memory/extractor.py: LLM-based fact extraction from sessions + summary regeneration (triggered after session goes idle >30 min) - Fact upsert uses UNIQUE(category, key) — same key always overwrites, no duplicates or stale contradictions - Keyword search across category + key + value (LIKE-based, no extra deps) Context injection: - Memory summary injected as an ephemeral system message on every LLM call via Agent._with_memory() — never persisted to session.context Tools (all profiles): - memory_search(query): keyword search against fact DB; persona instructs model to call it at session start and before personal-context questions - memory_forget(key, category?): delete a specific fact on user request Extraction trigger: - On new session creation, fire-and-forget background task checks all sessions idle >30 min with unprocessed messages → runs extraction Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 9 Apr
	fb3bb54 Browse files » Add web_view tool: headless browser with text extraction and screenshot ... - New built-in tool web_view: opens URL in headless Chromium via Playwright, strips nav/footer/scripts, returns clean readable text (capped at 20k chars). Optional screenshot=true returns a PNG injected into context as an image. Handles JS-rendered pages and SPAs (waits for networkidle by default). - http_request description updated: explicitly says to use web_view for human- readable pages, http_request for APIs/JSON/custom auth. - web_view added to all three profiles. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 9 Apr
	ef771e5 Browse files » Add /debug page: inspect LLM context by session ID ... - GET /sessions/{id}/context — returns session.context (what the model sees) with message count and total char count - /debug — standalone HTML page, session ID from URL hash Shows each context message: role badge, full content, tool calls as JSON, images as thumbnails, per-message and total char/token estimates Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 9 Apr
	527d684 Browse files » Fix NAVI_PERSONA not loading: move persona to persona.txt ... python-dotenv cannot parse multi-line env values that contain blank lines or lines resembling key=value assignments (like the write_tool code examples). The persona was silently dropped, leaving Navi with no identity. Solution: navi_persona_file setting (default empty) — if set, the persona is read from the specified file at startup via a model_validator. The .env now points to persona.txt instead of embedding the multi-line string. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 9 Apr
	7b1fa2c Browse files » Fix context loss: ensure system prompt is always present in LLM context ... Replaced `if not session.context:` with a role-based check so the system message is inserted whenever it is absent — not just for brand-new sessions. Root cause: backward-compat sessions (context column was empty) had their context initialised from session.messages, which never contains a system message. The old check (`if not session.context:`) saw a non-empty list and skipped the system prompt, so every subsequent request ran without it — Navi had no persona and no profile instructions. Also add context_token_count field to Session (follow-up for token counter fix — persistence wiring comes in next commit). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 9 Apr
2026-04-08	bdd3786 Browse files » Review fixes: events module, circular imports, deps, vision-aware compression ... - Extract all AgentEvent dataclasses to navi/core/events.py; import from there in agent.py and __init__.py — eliminates circular import between workers and core - workers/compressor.py: remove runtime import hack, use navi.core.events - workers/base.py: WorkerResult.events typed as list[AgentEvent] (was Any) - api/deps.py: replace @lru_cache on mutable list with module-level singletons (_registries, _workers) - core/compressor.py: _format_for_summary returns (text, images); images passed to summarization LLM so vision models describe them in summary; non-vision models silently ignore the images field; docstring updated - client/js/app.js: add comment explaining is_summary backward compat branch Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 8 Apr
	ece41eb Browse files » Switch all profiles to gemma4:e4b-it-q8_0 ... Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 8 Apr
	261459a Browse files » Separate display history from LLM context; formalize worker system ... Architecture change: - session.messages: full display history, never modified by compression - session.context: what the LLM sees, may be compressed by workers - System messages go only into context (not display history) - Image injections (synthetic) go only into context - User/assistant/tool messages go into both SQLite: add context column with backward-compat migration (empty context → initialized from messages on load) Workers (navi/workers/): - Worker ABC + WorkerContext + WorkerResult (base.py) - CompressionWorker: compresses session.context when above threshold - build_default_workers() returns [CompressionWorker()] - Agent accepts workers list, runs them after StreamEnd - Workers injected via deps.py get_workers() (lru_cached singleton) - WebSocket agent construction also receives workers Compressor: compress_context() now takes context[], not messages[] Config: context_keep_recent 6 → 10 Agent: _run_workers() collects events from all workers and yields them Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 8 Apr
	802c186 Browse files » Add context compression: rolling summarization when context fills up ... Mechanism: - After streaming ends, if context_tokens >= threshold (80% of num_ctx), compress old turns into a summary message using the same LLM - Partition: keep system msg + last N turns verbatim (default 6); everything older goes to the summarizer - Tool call groups (assistant + tool results) never split across boundary - Existing summary messages folded into new compression pass — no stack growth - Summary stored as Message(role=user, is_summary=True) after system msg - On failure: logged, session left unchanged (non-fatal) New files: - navi/core/compressor.py: should_compress, partition_messages, compress_session (pure logic, testable without agent) New config (navi/config.py): - context_compression_enabled: bool = True - context_compression_threshold: float = 0.80 - context_keep_recent: int = 6 - context_summary_temperature: float = 0.3 New agent event: ContextCompressed(messages_before, messages_after) Message.is_summary: bool field marks compressed history blocks Client: - context_compressed WS event → subtle inline notice in message list - loadHistory: is_summary messages rendered as collapsible summary cards - style.css: .summary-card, .compression-notice Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 8 Apr
	72f5a6f Browse files » Token counter: hide on session switch, hide when no data ... Reset counter to hidden on openSession() — prevents stale token data from previous session showing when switching. Also hide when stream_end arrives without token info. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 8 Apr
	9c0c6b3 Browse files » Add context token counter: 64k default, live UI display ... - config: ollama_num_ctx default 8192 → 65536 - LLMChunk: add prompt_tokens / completion_tokens fields - OllamaBackend.stream: populate token counts from final chunk (prompt_eval_count + eval_count when chunk.done) - StreamEnd: add context_tokens and max_context_tokens - Agent.run_stream: capture token counts, pass to StreamEnd - websocket: include context_tokens / max_context_tokens in stream_end - index.html: split chat-header into title span + token-counter span - sidebar.js: updateChatHeader targets #chat-header-title, not innerHTML - app.js: updateTokenCounter() shows "X/Y (Z%) tokens", colors: gray <50%, amber 50–79%, red ≥80% - style.css: .token-counter, .warn, .danger styles Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 8 Apr
	f5f8d90 Browse files » Server review fixes: profile model routing, sorting, datetime, cleanup ... - LLMBackend.complete/stream: add model param; OllamaBackend uses it over self.model, enabling per-profile model selection - BackendRegistry.get(): remove unused model param - Agent: pass profile.model to complete() and stream() - Profiles: correct model to gemma4:e2b-it-q8_0 (was leftover e4b) - InMemorySessionStore.list_all(): fix sort (pinned+newest first, was pinned+oldest) — now consistent with SQLite ORDER BY - session.py, sqlite_session_store.py: datetime.utcnow() → datetime.now(timezone.utc) (deprecated since Python 3.12) - _base_options(): accept temperature param, remove dead default - deps.py: rename _registries → get_registries (public API) - websocket.py: update import accordingly Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 8 Apr
	92ef26f Browse files » Client review fixes: bug fix, dead code, deduplication, CSS vars ... - renderPreviewStrip: fix index-capture bug (splice by value, not index) - api.js: remove dead sendMessage (messaging goes via WebSocket only) - utils.js: extract shared esc() and timeLabel() from chat.js + sidebar.js - enrichSession: remove unnecessary async/Promise.all wrapper - style.css: thinking-card colors moved to CSS variables in :root - WsClient: remove unused #sessionId field Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 8 Apr
	aa34e55 Browse files » Update .gitignore: ignore tmp files and tool data files ... Add .tmp, task_manager.json, tools/*_data.json (runtime data written by user tools like user_notes). Remove brain_kb.py (broken syntax). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 8 Apr
	1cf8604 Browse files » Add project docs, tool manuals, and initial user tools ... - CLAUDE.md: comprehensive project reference for Claude Code sessions - manuals/write_tool.md: full format reference with working example - tools/_template.py: canonical module-level tool format - tools/enabled.json: auto-include list (get_current_datetime, user_notes) - get_current_datetime, user_notes: first Navi-written tools Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 8 Apr
	a8bbe84 Browse files » Add thinking blocks UI: collapsible reasoning display ... Thinking card appears on first thinking_delta, auto-collapses on thinking_end. Re-openable like tool cards. Blue color scheme to distinguish from regular tool results. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 8 Apr
	0aa7f6f Browse files » Wire self-extension tools into all profiles; improve tool descriptions ... All profiles now include write_tool, list_tools, tool_manual, reload_tools. User tools from enabled.json merged in at runtime via Agent._tool_list(). Built-in tool descriptions rewritten to be more LLM-actionable. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 8 Apr
	3d8b37e Browse files » Add self-extension tool system: write_tool, list_tools, tool_manual ... - loader.py: module-level format (name/description/parameters/execute) preferred, class-based as fallback; isolated errors per file - write_tool: validates + writes tools/name.py, reloads registry, adds to tools/enabled.json in one call - list_tools: live tool list from registry (prevents hallucination) - tool_manual: serves manuals/*.md or auto-generates from schema - reload_tools: hot-reload without server restart - registry: registry injection pattern for tools that need it; _builtin_names set to guard against reload overwriting builtins Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 8 Apr
	c9ee0ec Browse files » Add thinking/reasoning streaming support ... Enable Ollama think param and stream reasoning chunks to client. New agent events: ThinkingDelta, ThinkingEnd. Config gains ollama_think and ollama_num_ctx settings. WebSocket protocol updated accordingly. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 8 Apr
	9a8056d Browse files » Add multimodal image support and client UX improvements ... Server: - Add ImageViewTool (load image from file/URL, returns base64) - Add images field to Message model with created_at timestamp - Agent run/run_stream accept images param; inject image messages after image_view tool calls - WebSocket handler accepts images array from client, strips data URI prefix - All profiles include image_view tool - Fix tool call serialization (model_dump mode=json for datetime) - Add no-store cache headers for static files Client: - Image attachment: file picker button + clipboard paste + preview strip with remove - Images rendered in chat bubbles; loaded from history - Tool cards rebuilt as div+CSS toggle (fixes details/overflow-hidden collapse bug) - Tool cards appear before response bubble (lazy bubble creation on first stream_delta) - Typing indicator persists through tool calls, removed only when text starts streaming - Tool cards restored from history on page reload - Message timestamps stored via created_at field, shown correctly in history - Session ID reflected in URL hash for bookmarking; restored on page load - Remove localStorage session tracking (server last_active used instead) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 8 Apr
	f3ce16d Browse files » Pin sessions + larger code font ... Server: - Session.pinned field (bool, default False) - SqliteSessionStore: ALTER TABLE migration for existing DBs, set_pinned(), ORDER BY pinned DESC, last_active DESC - PATCH /sessions/{id}/pin endpoint Client: - Pin/unpin button appears on hover (📌/📍) - Pinned sessions get blue left border and always-visible pin icon - Local sort mirrors server sort (pinned first) - Code blocks font-size bumped from 0.82em to 0.9em Eugene Sukhodolskiy committed on 8 Apr