root/navi-1

Fork: 0

root / navi-1

History for navi-1 / navi / core / agent.py

2026-05-21	7ac12f7 Browse files » Fix stop button responsiveness and shutdown CancelledError ... Agent loop (_execute_tools_with_sink): - Poll stop_event every 1s while draining the event sink via asyncio.wait_for - When stopped, cancel the tool task, yield a synthetic ToolEvent failure, append a cancellation message to session, yield StreamStopped, and return - Pass stop_event into _execute_tools_with_sink call site Subagent runner: - Check stop_event at the start of each tool in turn_tool_calls loop - Returns early with ("", False) when stopped mid-batch McpManager.disconnect_all(): - Disconnect clients sequentially instead of asyncio.gather - Handle asyncio.CancelledError per-client to avoid shutdown traceback AppContainer.shutdown(): - Catch BaseException instead of Exception for MCP and DB cleanup Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 21 May
2026-05-21	e04b4ca Browse files » Fix token counting: show only completion tokens, not cumulative prompt+completion ... The token_count displayed next to assistant messages was summing prompt_tokens + completion_tokens across ALL tool-calling iterations, giving hundreds of thousands of tokens for multi-turn conversations. Now: - token_count (coins icon) = only completion tokens generated by the model - context_tokens (ContextBar) = only prompt tokens (context size sent to LLM) This gives users a realistic measure of how much the model actually generated. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 21 May
2026-05-16	fbc7cb1 Browse files » Step 5-7: Extract async generators from run_stream, unify run() as wrapper ... - _compression_events_preturn / _compression_events_midturn - _consume_stream (uses StreamState) - _execute_tools_with_sink - run() is now a thin wrapper around run_stream() collecting StreamEnd - Remove dead imports (json, LLMChunk) - Mark god-object decomposition complete in architecture_weak_spots.md Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 16 May
	7ecf1b1 Browse files » Step 4: Extract SubAgentRunner from run_ephemeral() ... - Create navi/core/subagent_runner.py with full sub-agent loop logic - Move _iter_stream_guarded to navi/core/stream_guard.py - Move _check_context_size to ContextCompressor.check_context_size() - Extract build_tool_list() and load_user_enabled_tools() to tool_utils.py - Agent.run_ephemeral() becomes a thin wrapper delegating to SubAgentRunner - Remove ~310 lines from agent.py - All existing run_ephemeral tests pass unchanged Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 16 May
	8bd25a7 Browse files » Step 3: Extract AntiStallMonitor from run_stream() ... - Create navi/core/anti_stall.py with AntiStallMonitor class - Encapsulates stall detection (todo progress + repeated tool calls) - Encapsulates adaptive re-plan (failed todo step detection) - Provides init() / pre_turn() / post_turn() two-phase interface - Remove ~50 lines of stall/replan logic from agent.py run_stream() - Remove _todo_status_snapshot and _todo_failed_steps helpers from agent.py - Update AgentTurnContext: remove stall fields (now live in AntiStallMonitor) - Add 13 unit tests for pre_turn and post_turn behavior Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 16 May
	d004530 Browse files » Step 2: Extract AgentTurnContext dataclass from run_stream() ... Move 10 turn-level local variables from run_stream() into AgentTurnContext: - turn_start, tool_call_count, turn_tokens, subagent_tokens - stall_no_todo, stall_repeat_tools, prev_tool_sigs - known_failed, replan_msg, injected_fact_ids This makes run_stream() readable and prepares the ground for AntiStallMonitor (Step 3) which will consume this context. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 16 May
	d67992a Browse files » Extract ContextCompressor, fix STL viewer, expand test suite, add architecture audit docs ... - Extract ContextCompressor from agent.py (Step 1 of god-object refactor) - Add retry + hard-truncate fallback logic to ContextCompressor - Add unit tests: agent loop (14), compressor (18), KV store (8), auth encrypt (3), auth deps (13), todo/scratchpad/image_view/memory - Fix WebGL STL viewer: allow-same-origin sandbox + graceful fallback - Add CompressionStarted event and client-side compression notice - Add docs/architecture_weak_spots.md and plan_01_god_object_agent.md - Update test_events.py and test_agent_context_size.py for new logic Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 16 May
	d812e65 Browse files » Add inherit_system_prompt and is_subagent_only mechanisms ... - inherit_system_prompt: subagent parameter to prepend parent's system prompt as a base layer before subagent specialisation - is_subagent_only: profile flag blocking switch_profile, allowing spawn_agent only; shown with [subagent only] tag in list_profiles - Document both in docs/profiles.md and docs/tools.md Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 16 May
	7489e6a Browse files » Enhance native toolset and add persistent KV store ... - Add PostgreSQL-backed KvStore (navi/store/) for session-scoped data. - Migrate todo and scratchpad from in-memory dicts to KvStore. - Filesystem: add copy, grep, diff actions; compress description. - CodeExec: remove language param, expose working_dir in schema. - ImageView: resize to 1024px JPEG + Content-Type guard for URLs. - Memory list: return distinct categories instead of all facts. - SSH: add scp action with upload/download support. - Update CLAUDE.md (Postgres-only), docs/tools.md, add docs/store.md. - Fix agent/planning/context_builder async signatures for todo helpers. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 16 May
2026-05-15	e4984fa Browse files » fix(recall): stabilize scheduled callback system and improve UX ... Backend fixes: - stop_session now stops headless recall runs via _busy_sessions dict - _fire_recall sets user ContextVars so tools work correctly - MaxIterationsReached treated as success, not failure - skip_next_recall uses GREATEST(trigger_at, now) for overdue recalls - schedule_recall rejects past trigger times - timezone offset double-adjustment fixed for aware datetimes - _fire_recall registers _AgentRun for reconnect/replay support - session_sync race with stream_start fixed Frontend improvements: - Recall banner moved to ChatHeader with live Cancel/Skip buttons - Recall messages styled with is_recall flag and badge - Real-time recall updates via WebSocket (recall_update events) - Recall filter moved to sessions-header as toggle button - Session list shows clock icon for sessions with pending recall - Empty state messages for empty/filtered session lists - Fixed missing api import in ChatHeader.vue Tests: - Updated scheduler_loop tests for _busy_sessions dict change Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 15 May
2026-05-15	438b239 Browse files » Add self-recall (scheduled callback) system ... Core features: - schedule_recall tool: once/recurring/immediate callbacks - manage_recall tool: cancel/skip/list scheduled recalls - Natural-language time parser (ISO, relative, "tomorrow at 09:00") - PostgreSQL-backed RecallScheduler with lazy pool init - Background recall_scheduler_loop with asyncio.Semaphore(3) - _busy_sessions guard prevents user messages during headless runs - Agent.run() preserves thinking field for session history visibility - API endpoints: GET/DELETE/POST for session recall, admin list - Frontend: recall badge, filter, cancel/skip in sidebar and chat header - Tests: parser, scheduler CRUD, tools, API, scheduler loop (53 tests) - Manuals: schedule_recall.md and manage_recall.md Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 15 May
2026-05-13	1f6f538 Browse files » Persist uploaded files in messages, live file tree updates, and UI polish ... Backend: - Add `files` field to `Message` model so uploaded file metadata survives page refresh - Pass `files` through websocket handler → `agent.run_stream` / `agent.run` - `list_tools`: make `profile_id` required; return error instead of all-tools fallback Webclient: - Call `fetchFiles()` after successful file upload for immediate Files tab update - Live refresh file tree on filesystem (write/edit/append/mkdir/rm/cp/mv), terminal, and code_exec tool calls - Add manual refresh button (desktop) and pull-to-refresh (mobile) to Files tab - Fix live link updates: move regex creation inside per-message loop to avoid lastIndex state leak - Restore full profile name text next to avatar in ChatHeader; hide avatar in header - Fix mobile ArtifactsPanel: collapse tab text labels so close button fits with 3 tabs Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 13 May
2026-05-13	61a9c7a Browse files » Fix agent.py _tool_list to use colon-delimited MCP names ... The main registry and API routes were migrated, but Agent._tool_list still built names with the old underscore format. This meant MCP tools were silently dropped from the tool list passed to the LLM, causing "tool not found" when the model correctly called mcp:server:tool names. Also updates stale docstring in McpTools. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 13 May
2026-05-12	deb7846 Browse files » Clarify knowledge persistence prompts Eugene Sukhodolskiy committed on 12 May
2026-05-12	de38f31 Browse files » Auto-inject relevant memory facts into LLM context on every user turn. ... Semantic search against the vector memory store runs in parallel with context provider collection. Number of injected facts scales with message length (1–3) to reduce noise on short queries. Guardrails: min length gate, per-turn deduplication, and structured logging at info level for observability. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 12 May
2026-05-11	cebc073 Browse files » Fix ollama_backends / FallbackOllamaBackend issues ... - registry.py: always use FallbackOllamaBackend (unified backend). Enables model priority lists in all deployments, not just multi-server. - agent.py: add missing think=profile.think_enabled to run() (REST endpoint). - compressor.py: fix model param type (str → list[str] \| str \| None). - fallback.py: harden load_servers_from_file against missing/bad JSON files and entries without host. Add clear_blacklists() for manual reset. - admin.py: add POST /admin/ollama/clear-blacklists endpoint. - tech_debt_review: document dead stream() methods. - tests: add tests for single-server fallback, bad file handling, missing host skipping, and blacklist clearing. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 11 May
	c466bef Browse files » Add deterministic line-based file editing (edit_lines), rating UI fix, and session refresh ... - filesystem.py: add edit_lines action (deterministic line ops via operations array) + numbered param for read (1-based line numbers in output) + clarify four editing modes in tool description - chat.js: fix rating IDs for streaming messages (assign h_ ID on stream_end) - SessionList.vue: mobile pull-to-refresh with PTR_THRESHOLD=80 - AppSidebar.vue: desktop refresh button next to Conversations header - planning.py: knowledge source assessment in Phase 1 - debug panel: MCP servers tab + resolved tools per profile - NAVI.md: reposition as neutral quick-reference Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 11 May
	6bcff13 Browse files » Inject MCP server instructions into system prompt ... - McpClient: collect instructions from MCP initialize handshake - McpServerConfig: add 'instructions' field for Navi-side overlay - McpManager.get_instructions(): merge server + config instructions - ContextBuilder: new _mcp_context_msg() injects MCP server descriptions into every LLM context as a system message - Agent passes mcp_manager to ContextBuilder - mcp_servers.json: add overlay instructions for gnexus-book Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 11 May
	651731c Browse files » Refactor MCP integration: server groups in profiles ... - mcp_servers.json: add 'groups' (read/write/admin) for gnexus-book - AgentProfile: new 'mcp_servers' field (server_name -> group list) - Profile loader: parse and persist 'mcp_servers' in config.json - Agent._tool_list(): expands mcp_servers into concrete tool names via McpManager.resolve_group(), wildcard '*' supported - /agents/profiles API: includes 'mcp_servers' in response - Profiles no longer list individual mcp_ tools in 'enabled_tools' - discuss: gnexus-book read group - server_admin: gnexus-book read+write+admin groups Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 11 May
2026-05-08	bba283f Browse files » Propagate user profile to LLM context via current_user_info ContextVar ... - Extend User model: username, first_name, last_name, phone, birth_date, country, city, locale (all from gnexus-auth profile) - navi_users DDL: add new profile columns - auth/deps + auth/callback: populate new fields on upsert - /auth/me: return all profile fields - Add current_user_info ContextVar for full user profile propagation - websocket + messages: set current_user_info before agent.run() - run_ephemeral: inherit and restore current_user_info - ContextBuilder: _user_context_msg() injects [User context] with name, email, location, locale, role into LLM system messages - _security_policy_msg: reads user_id/role from ContextVar directly Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 8 May
	db0261b Browse files » Add multi-user sandbox: filesystem, terminal, code_exec, security policy ... - filesystem, share_file: sandbox non-admin users to user_data/<user_id>/ - terminal: working_dir sandbox + allowlist + dangerous pattern block for users - code_exec: sandbox CWD and temp files to user_data/<user_id>/ for users - context_builder: inject dynamic security policy into LLM context (user/admin) - config: terminal_user_allowed_commands setting - agent: wire user_id/user_role through ContextBuilder.build() - base: add current_user_role ContextVar; run_ephemeral inherits role Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 8 May
	df61f76 Browse files » Add per-user filesystem sandbox via current_user_id ContextVar ... - tools/base.py: add current_user_id ContextVar (set by Agent before every tool call, cleared after) - core/agent.py: set current_user_id in run_stream from session.user_id and in run_ephemeral from parent_session.user_id; restore in finally - tools/filesystem.py: _check_path resolves all paths inside user_data/<user_id>/ when current_user_id is present; legacy mode (no user_id) falls back to FS_ALLOWED_PATHS - tools/share_file.py: validate source path is inside user sandbox Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 8 May
2026-05-04	8e074a0 Browse files » Fix NameError in run_ephemeral: session was undefined ... run_ephemeral doesn't have a session variable. Pass user_id from the parent session (looked up via parent_session_id) instead of referencing non-existent session variable. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 4 May
2026-05-03	3014ba6 Browse files » Multi-user auth via gnexus-auth OAuth + hybrid role/permission model ... - Integrate gnexus-auth-client-py (GAuthClient) for OAuth flow, token refresh, and webhook parsing - Add navi/auth/ package: User model, Fernet encryptor, client singleton, deps (get_current_user, require_admin, require_permission) - New tables: navi_users, user_auth_sessions (auto-created on startup) - Session/memory isolation by user_id with legacy NULL support - Cookie-based auth proxy: /auth/login, /callback, /logout, /me - Webhook receiver /webhooks/gnexus-auth handling user events, global logout, session revocation, role/permission changes - Admin endpoints (/admin/*) gated by role + permissions - Webclient auth store with isAdmin/hasPermission guards - Admin-only profile filtering in /agents/profiles - 200/200 tests passing Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 3 May
2026-05-02	8b3ebca Browse files » Refine 3D modeler workflow Eugene Sukhodolskiy committed on 2 May
2026-05-01	2b76e92 Browse files » Simplify 3D SCAD subagent prompt Eugene Sukhodolskiy committed on 1 May
2026-05-01	e8b54fd Browse files » Disable thinking stalls for 3D subagents Eugene Sukhodolskiy committed on 1 May
2026-04-30	dce281e Browse files » Improve content publishing UX Eugene Sukhodolskiy committed on 30 Apr
2026-04-29	9a8606e Browse files » Extract PlanningEngine, ContextBuilder, ToolExecutor from agent.py ... - navi/core/planning.py: new 3-phase planning pipeline (~390 lines) - navi/core/context_builder.py: system prompt caching, memory/context injection, goal anchoring (~160 lines) - navi/core/tool_executor.py: tool execution with middleware chain (~150 lines) - navi/core/agent.py: reduced from ~1420 to ~770 lines; delegates to extracted classes All compilation verified. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 29 Apr
2026-04-29	8f68841 Browse files » Architecture extensibility — event bus, middleware, auto-discovery, Pydantic profiles ... - EventBus: async pub/sub for AgentEvents, WebSocket subscribes instead of direct yield - Declarative serialization: AgentEvent.to_wire() on all event types - Auto-discovery for LLM backends (_discover_backends) and workers (scan navi/workers/*.py) - AgentProfile: Pydantic BaseModel with extra='allow', @field_validator for model coercion - Tool middleware chain: pre/post execute hooks via ToolRegistry.add_middleware() - LoggingMiddleware: built-in, logs every tool call - Fix pg_trgm DDL: conditional GIN indexes via DO $$ block, no CREATE EXTENSION - New files: event_bus.py, middleware.py, logging_middleware.py Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 29 Apr