root/navi-1

Fork: 0

root / navi-1

History for navi-1 / navi / core

2026-04-30	bfc61e2 Browse files » Open artifact links externally on Android Eugene Sukhodolskiy committed on 30 Apr
	169ea8d Browse files » Fix 3D modeling feedback: session context, CORS, and PNG workflow ... - Embed session_id into the main system prompt so the model reliably sees the correct session files directory instead of hallucinating one. - Add global CORS middleware to fix STL viewer iframe fetch errors. - Switch modeler_3d PNG previews from content_publish to image_view so renders are debug-only for Navi, not user-facing cards. - Remove literal {session_id} template from persona.txt. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 30 Apr
	ffb2c6b Browse files » Add 3D modeling profile with model_3d and render_3d tools ... New profile: - modeler_3d: 3D model design for 3D printing (OpenSCAD → STL) New tools: - model_3d: compile .scad → binary STL via OpenSCAD CLI - render_3d: render PNG previews from STL (up to 3 views, 400×300) Both tools return openscad_not_found error if OpenSCAD is not installed. Docs: - README.md: add modeler_3d to profile table - docs/profiles.md: add modeler_3d to active profiles - docs/tools.md: add model_3d and render_3d to builtin tools - manuals/model_3d.md, manuals/render_3d.md: full usage manuals - modeler_3d system_prompt.txt: OpenSCAD-first workflow with dedicated tools Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 30 Apr
	dce281e Browse files » Improve content publishing UX Eugene Sukhodolskiy committed on 30 Apr
2026-04-29	30dd183 Browse files » Align Ollama HTTP timeout with LLM timeouts Eugene Sukhodolskiy committed on 29 Apr
	fe973b2 Browse files » Bootstrap test suite — Phase 1 unit tests ... - docs/testing.md: testing strategy, mock strategy, phase breakdown - tests/conftest.py: autouse fixture to reset navi.config.settings per test - tests/conftest_factory.py: FakeLLMBackend, FakeTool, make_profile, make_registry helpers - tests/unit/core/test_events.py: wire serialization for all 15 event dataclasses - tests/unit/core/test_compressor.py: should_compress, partition_messages, format_for_summary, compress_context - tests/unit/core/test_registry.py: ToolRegistry, ProfileRegistry, BackendRegistry - tests/unit/core/test_context_builder.py: system prompt caching, persona injection, goal anchor, iteration budget - tests/unit/profiles/test_base.py: Pydantic model coercion, defaults, extra fields - navi/core/context_builder.py: use module-level `import navi.config` instead of `from navi.config import settings` so tests can swap the singleton 59 tests passing. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 29 Apr
	9a8606e Browse files » Extract PlanningEngine, ContextBuilder, ToolExecutor from agent.py ... - navi/core/planning.py: new 3-phase planning pipeline (~390 lines) - navi/core/context_builder.py: system prompt caching, memory/context injection, goal anchoring (~160 lines) - navi/core/tool_executor.py: tool execution with middleware chain (~150 lines) - navi/core/agent.py: reduced from ~1420 to ~770 lines; delegates to extracted classes All compilation verified. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 29 Apr
	8f68841 Browse files » Architecture extensibility — event bus, middleware, auto-discovery, Pydantic profiles ... - EventBus: async pub/sub for AgentEvents, WebSocket subscribes instead of direct yield - Declarative serialization: AgentEvent.to_wire() on all event types - Auto-discovery for LLM backends (_discover_backends) and workers (scan navi/workers/*.py) - AgentProfile: Pydantic BaseModel with extra='allow', @field_validator for model coercion - Tool middleware chain: pre/post execute hooks via ToolRegistry.add_middleware() - LoggingMiddleware: built-in, logs every tool call - Fix pg_trgm DDL: conditional GIN indexes via DO $$ block, no CREATE EXTENSION - New files: event_bus.py, middleware.py, logging_middleware.py Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 29 Apr
	5cebfcd Browse files » Architecture fixes batch — NaN validation, ILIKE indexes, prompt cache, N+1 batching ... - _vector_to_str: reject NaN/Inf via math.isfinite() to avoid invalid pgvector syntax - memory DDL: add pg_trgm + GIN trigram indexes on category/key/value for fast ILIKE fallback - _build_system_prompt: cache per-profile to avoid rebuilding every iteration - backfill_embeddings: batch UPDATEs via executemany instead of N+1 loop No new Python deps; pg_trgm is a PostgreSQL extension auto-created on startup. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 29 Apr
	6dad458 Browse files » Architecture cleanup: old_webclient, SSH shutdown, todo decoupling, ContextVar reset ... - Remove old_webclient directory and /static mount from main.py - Add shutdown handler in main.py to close all SSH pooled connections - Decouple agent.py from todo module internals: - Add public API to navi/tools/todo.py: get_task_snapshot, get_failed_steps, get_progress_message, set_tasks, render_todo_lines - Replace all direct _plans/_STATUS_ICON/_Task imports in agent.py with API calls - Wrap run_ephemeral in try/finally to restore _sid_var and _model_var after subagent Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 29 Apr
	7b672c3 Browse files » Remove SQLite legacy support ... SQLite is no longer supported; PostgreSQL is now required. - Delete navi/core/sqlite_session_store.py - Delete navi/memory/sqlite_store.py - Remove SqliteSessionStore from navi/core/__init__.py exports - deps.py: drop SQLite fallback, raise RuntimeError if DATABASE_URL missing - config.py: remove db_path setting - pyproject.toml & requirements.txt: drop aiosqlite dependency - .gitignore: remove navi.db entry - tech_debt_review_2026-04-29.md: mark #8 as REMOVED Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 29 Apr
	098401a Browse files » Stability fixes batch — tech debt review 2026-04-29 ... Critical: - Concurrent WS run race guard (#1) - Tool task cancellation on generator teardown (#2) - StopAsyncIteration kills fallback chain (#3) - Session loading race with _lastLoadId guard (#4) - ContentCard .match() crash on non-string result (#5) - Image data type guard in buildMessageList (#6) High: - Cap WS replay buffer at 500 events (#7) - Deduplicate memory extraction task with asyncio.Lock (#9) - TTL-based fallback blacklisting (5 min) (#10) - Subagent tool exception isolation (#11) - Inline image size/count validation on WS (#12) - Clean up orphaned file on DB insert failure (#13) - Deep watch streamingMsg for auto-scroll (#14) - WS_SCHEME wss:// support for HTTPS (#15) - Sending guard against duplicate message sends (#16) - Global unhandledrejection listener in API layer (#17) Medium: - Cap planning_logs at 20 entries (#22) - Store cleanup_loop task reference (#23) - BaseException → Exception in _run_with_sentinel (#24) - Propagate SystemExit in agent loop (#25) - Configurable output_reserve_tokens (#26) - Always reloadSession on session_sync (#30) - FIFO queue for confirm dialogs (#31) - Reset body.overflow on ImageLightbox unmount (#32) - try/finally in fallback copy (#33) - _isConnecting guard in WS send() (#34) Low: - Lazy-init deps.py singletons (#36) - Replace __import__ with direct imports (#38) - Preserve token count 0 in ollama.py (#39) - Clear orphaned streamingMsg on reconnect reload (#43) - Escape single quote in UserMessage (#44) - Polyfill-free findLast replacement (#48) - Match <table> tags with attributes in markdown (#49) - Attach copy buttons only when msg.done (#50) - Fix hasMeta falsy-0 bug (#53) Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 29 Apr
2026-04-28	cabfce8 Browse files » Fix system prompt leakage into chat history; polish content cards ... Backend: - websocket.py + agent.py: separate user-visible display_message from LLM user_message. System hints (image/file attachments) no longer leak into session.messages and appear after page reload. - main.py: add ensure_tables() on startup so session_content table is created before first publish. - profiles: add kimi-k2.6:cloud to all model lists as fallback. Frontend: - ContentCard.vue: remove border-radius, add scrollbar styles, fix metadata fallback parsing so cards survive page reload. - content-viewers/*.html: add matching scrollbar styles. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 28 Apr
2026-04-28	b88b7c0 Browse files » Add content hosting system with inline viewers ... Backend: - Add navi/content/ directory for published files - Add content_store.py with publish/list/delete/cleanup functions - Add content_publish tool for publishing files as viewable content - Add /content static file mount in main.py - Add /content-viewers mount for viewer pages - Extend ToolEvent with metadata field - Forward metadata through websocket tool_call events - Update Agent to include metadata in ToolEvent Frontend: - Add ContentCard.vue component for displaying published content - Add viewer pages: stl.html (Three.js), svg.html, html.html, pdf.html - Update AssistantMessage.vue to render ContentCard for content_publish - Update chat store to preserve metadata in tool cards - Update websocket protocol docs with metadata field Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 28 Apr
2026-04-26	b5b11be Browse files » changed llm & new ollama param Eugene Sukhodolskiy committed on 26 Apr
2026-04-25	b016a54 Browse files » Strengthen todo progress discipline Eugene Sukhodolskiy committed on 25 Apr
	d878bb0 Browse files » Add structured planning review and adaptive depth Eugene Sukhodolskiy committed on 25 Apr
	e67e7a5 Browse files » Improve compression and memory prompts Eugene Sukhodolskiy committed on 25 Apr
	65ffb4d Browse files » Add context providers: dynamic system message injection per LLM call ... - navi/context_providers/ registry + built-in public_url provider (global, always injected) - context_providers/ user directory, hot-reloaded via reload_tools - AgentProfile.context_providers field for per-profile opt-in providers - Agent._collect_context_injections() called before every tool-calling loop - reload_tools now reloads both user tools and user context providers - manuals/write_context_provider.md for Navi, docs/context_providers.md reference Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 25 Apr
2026-04-24	b1a5f44 Browse files » Set temperature=1.0, top_k=64, top_p=0.95 for all profiles (Google recommended for gemma4) ... Also fixes discuss profile memory tools: use combined "memory" tool name, not nonexistent split variants. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 24 Apr
	66fb042 Browse files » Add per-phase planning flags and planning_mandatory ... - planning_mandatory: disables DIRECT shortcut, forces all phases to run - planning_phase1_enabled / phase2_enabled / phase3_enabled: per-phase toggles - planning_phase2_enabled replaces planning_reflect_enabled (migrated in loader with backward compat) - Migrate all profile configs; rewrite docs/profiles.md as full config reference Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 24 Apr
	511dc46 Browse files » Add Ollama multi-server fallback with in-memory blacklisting ... - New FallbackOllamaBackend (navi/llm/fallback.py): tries servers and models in priority order; on LLMConnectionError blacklists the server for the process lifetime, on LLMModelNotFoundError blacklists the (server, model) pair — eliminates latency from repeated failed probes - OllamaBackend now raises typed LLMConnectionError / LLMModelNotFoundError instead of bare LLMBackendError; accepts list[str] \| str \| None for model - AgentProfile.model changed from str to list[str] (str auto-normalised); all profiles updated to ["gemma4:31b-cloud", "gemma4:26b-a4b-it-q4_K_M"] - New config field OLLAMA_BACKENDS_FILE: path to [{host, api_key?}] JSON; when set, registry creates FallbackOllamaBackend instead of OllamaBackend - ollama_backends.json template added (gitignored — contains API key) - current_model ContextVar type widened to list[str] \| str \| None Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 24 Apr
2026-04-22	7eea278 Browse files » Support Ollama Cloud API key Eugene Sukhodolskiy committed on 22 Apr
2026-04-21	f7c7a17 Browse files » Agent improvements: mandatory planning, tool cleanup, smart_edit fixes ... - Planning now mandatory on first message of every session (force_plan) - RESOURCES, COMMITMENTS, ATOMICITY fields added to planning phase 1 - Todo auto-injected at iteration 0 so model tracks steps immediately - Execution trigger injected after plan to prevent model treating plan as response - Split developer profile: tool_developer (Navi tools) vs developer (general code) - Simplified persona.txt: trimmed redundant content now handled by mechanics - AIHelper.ask(): 120s timeout via asyncio.wait_for to prevent smart_edit hangs - filesystem._smart_edit(): atomic write via temp file + os.replace() - Removed 5 junk user tools (game project artifacts, trivial utilities) - Removed instagram tools (to be rewritten); cleaned enabled.json Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 21 Apr
	e9d1e77 Browse files » Remove code-specific scoping rules from planning prompt ... Keep only the universal comma test heuristic — code-specific rules were too narrow and cluttered the prompt. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 21 Apr
	0db1ea6 Browse files » Tighten AGENT step scoping in planning prompt ... Added comma test heuristic: if a step description lists things with 'and' or commas, each item is a separate step. Added code-specific guidance: one step = one file or one focused feature addition, never scaffold + logic + helpers combined. Replaced abstract good/bad examples with concrete code implementation examples. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 21 Apr
2026-04-20	98c0be9 Browse files » Adaptive re-plan on todo step failure ... When a todo step is newly marked failed, queue a targeted system message for the next iteration prompting the model to revise its remaining pending steps before continuing. Enabled by adaptive_replan_enabled flag (on by default in developer profile). Zero overhead when no failure occurs. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 20 Apr
	9704a92 Browse files » Autonomous reasoning improvements: budget, anchoring, anti-stall, validation ... - AgentProfile: per-profile thinking mechanics flags (think_enabled, iteration_budget_enabled, goal_anchoring, anti_stall, step_validation, planning_reflect, adaptive_replan) — all profiles updated in config.json - Iteration budget: inject remaining iterations into context so model knows when to wrap up; urgency levels at ≤7 and ≤3 remaining - Goal anchoring: inject original goal + todo state every N iterations to prevent drift on long tasks - Anti-stall: two signals — no todo progress for N iterations, or identical tool calls repeated N times; warning injected into context - Todo step validation: marking done requires a validation field describing how result was verified; failed gets a soft nudge with tip for re-planning - stream_complete: add think param to base class, ollama and openai backends - Summarizer: raise max_tokens 1024→3000, expand system prompt with user-preferences section and verbatim-value instructions - Compression card: persist to session.messages (is_compression flag on Message), show expandable summary in webclient with markdown body - ToolResult.to_message_content: always include output on failure so tracebacks and error details reach the model (fixes silent Error: None) - Developer profile: fix subagent profile secretary→developer, add write_tool to subagent_tools, clarify write_tool vs filesystem in system prompt Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 20 Apr
	94e32e9 Browse files » Planning debug panel, todo auto-populate, scratchpad/persona improvements ... - Planning debug panel: new Planning tab in debug/index.html shows raw phase 1/2 outputs and token counts per planning run, stored in session.planning_logs (new column in both SQLite and PostgreSQL) - New GET /sessions/{id}/planning API endpoint - PlanningDebugData internal event wires _run_planning() output into session storage; never forwarded to WebSocket clients - Phase 3 (plan critic) disabled — to be reworked with reflect integration - Todo tool: auto-populated from plan steps after phase 2; model only needs to call update/view, not set - Scratchpad: clarified description and persona instructions; removed context_transfer from user-facing docs (internal mechanism only) - web_search: switched to ddgs package, SearXNG as primary backend, DDG html-only fallback; added find_up action to filesystem tool - Persona: added SCRATCHPAD and TODO sections with clear usage rules; added NAVI.md project context instructions - chat.js: fixed subagent planning event fallthrough into parent UI; statusLabel cleared on first stream delta Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 20 Apr
2026-04-17	59f01b3 Browse files » Route subagent planning events into spawn_agent card in the UI ... Previously PlanningStatus/PlanReady had no is_subagent flag, so subagent planning spinners and plan cards rendered as top-level Navi planning UI. Backend: - Add is_subagent field to PlanningStatus and PlanReady events - _run_planning accepts is_subagent param, passes it through all yields - run_ephemeral calls _run_planning with is_subagent=True - websocket.py forwards is_subagent in planning_status and plan_ready messages Frontend (chat.js): - onPlanningStatus: if is_subagent, set planningLabel on the last spawn_agent card instead of msg.statusLabel - onPlanReady: if is_subagent, push plan into spawn card steps and clear planningLabel; otherwise behave as before Frontend (ToolCard.vue): - Render subagent-planning-indicator (spinner + label) when planningLabel set - Render plan cards inside subagent steps using the same plan-card pattern Also includes leftover session changes: spawn_agent default 40 in description and manual, updated manual content. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 17 Apr