root/navi-1

Fork: 0

root / navi-1

History for navi-1 / navi / memory

2026-04-29	02f2d8e Browse files » Split memory/store.py into focused mixins ... - _ddl.py: table creation (conditional on pgvector/pg_trgm) - _embeddings.py: EmbeddingMixin — pgvector checks, embed generation, backfill - _facts.py: FactMixin — upsert, search (vector + ILIKE), delete, list, count - _summary.py: SummaryMixin — get/set conversation summary - _session_state.py: SessionStateMixin — extraction tracking per session - store.py: reduced to ~60-line facade composing all mixins No external API changes — MemoryStore remains the single interface. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 29 Apr
	8f68841 Browse files » Architecture extensibility — event bus, middleware, auto-discovery, Pydantic profiles ... - EventBus: async pub/sub for AgentEvents, WebSocket subscribes instead of direct yield - Declarative serialization: AgentEvent.to_wire() on all event types - Auto-discovery for LLM backends (_discover_backends) and workers (scan navi/workers/*.py) - AgentProfile: Pydantic BaseModel with extra='allow', @field_validator for model coercion - Tool middleware chain: pre/post execute hooks via ToolRegistry.add_middleware() - LoggingMiddleware: built-in, logs every tool call - Fix pg_trgm DDL: conditional GIN indexes via DO $$ block, no CREATE EXTENSION - New files: event_bus.py, middleware.py, logging_middleware.py Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 29 Apr
	5cebfcd Browse files » Architecture fixes batch — NaN validation, ILIKE indexes, prompt cache, N+1 batching ... - _vector_to_str: reject NaN/Inf via math.isfinite() to avoid invalid pgvector syntax - memory DDL: add pg_trgm + GIN trigram indexes on category/key/value for fast ILIKE fallback - _build_system_prompt: cache per-profile to avoid rebuilding every iteration - backfill_embeddings: batch UPDATEs via executemany instead of N+1 loop No new Python deps; pg_trgm is a PostgreSQL extension auto-created on startup. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 29 Apr
	7b672c3 Browse files » Remove SQLite legacy support ... SQLite is no longer supported; PostgreSQL is now required. - Delete navi/core/sqlite_session_store.py - Delete navi/memory/sqlite_store.py - Remove SqliteSessionStore from navi/core/__init__.py exports - deps.py: drop SQLite fallback, raise RuntimeError if DATABASE_URL missing - config.py: remove db_path setting - pyproject.toml & requirements.txt: drop aiosqlite dependency - .gitignore: remove navi.db entry - tech_debt_review_2026-04-29.md: mark #8 as REMOVED Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 29 Apr
	098401a Browse files » Stability fixes batch — tech debt review 2026-04-29 ... Critical: - Concurrent WS run race guard (#1) - Tool task cancellation on generator teardown (#2) - StopAsyncIteration kills fallback chain (#3) - Session loading race with _lastLoadId guard (#4) - ContentCard .match() crash on non-string result (#5) - Image data type guard in buildMessageList (#6) High: - Cap WS replay buffer at 500 events (#7) - Deduplicate memory extraction task with asyncio.Lock (#9) - TTL-based fallback blacklisting (5 min) (#10) - Subagent tool exception isolation (#11) - Inline image size/count validation on WS (#12) - Clean up orphaned file on DB insert failure (#13) - Deep watch streamingMsg for auto-scroll (#14) - WS_SCHEME wss:// support for HTTPS (#15) - Sending guard against duplicate message sends (#16) - Global unhandledrejection listener in API layer (#17) Medium: - Cap planning_logs at 20 entries (#22) - Store cleanup_loop task reference (#23) - BaseException → Exception in _run_with_sentinel (#24) - Propagate SystemExit in agent loop (#25) - Configurable output_reserve_tokens (#26) - Always reloadSession on session_sync (#30) - FIFO queue for confirm dialogs (#31) - Reset body.overflow on ImageLightbox unmount (#32) - try/finally in fallback copy (#33) - _isConnecting guard in WS send() (#34) Low: - Lazy-init deps.py singletons (#36) - Replace __import__ with direct imports (#38) - Preserve token count 0 in ollama.py (#39) - Clear orphaned streamingMsg on reconnect reload (#43) - Escape single quote in UserMessage (#44) - Polyfill-free findLast replacement (#48) - Match <table> tags with attributes in markdown (#49) - Attach copy buttons only when msg.done (#50) - Fix hasMeta falsy-0 bug (#53) Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 29 Apr
2026-04-28	cbb1e5d Browse files » Add dedicated CPU embedding server for memory backfill ... - Install Ollama CPU-only on 192.168.1.168 server - Pull nomic-embed-text:latest on server - Create systemd service ollama-embed.service (0.0.0.0:11434) - Add embedding_ollama_host / embedding_ollama_api_key to config.py - Update deps.py to build separate embedding backend when host configured - Update backfill_embeddings.py to use dedicated embedding backend - Add _generate_embeddings batch helper and backfill_embeddings to store.py - Backfilled 119 existing facts with embeddings Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 28 Apr
	b5f2793 Browse files » Enrich memory extractor with tool calls/results in transcript ... - _EXTRACT_SYSTEM now explains 4 transcript entry types and instructs LLM to trust tool results over chat, return source/source_context - _extract_facts builds tool_call_map, appends [Tool call] and [Tool result] lines with truncation (500/200 chars) - Transcript capped at 12k chars (head+tail, drop middle) - Parse source/source_context from LLM response; map confidence: tool_call/auto_discovery=95, user_explicit=90, default=70 - Add TODO comment about deferred semantic deduplication Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 28 Apr
	aeac902 Browse files » Add pgvector migration script for memory_facts ... - ALTER TABLE memory_facts: embedding, source, confidence, expires_at, source_context - CREATE INDEX: hnsw(embedding), expires, source+category - Safe to run multiple times (IF NOT EXISTS) - Reads DATABASE_URL from settings Eugene Sukhodolskiy committed on 28 Apr
	c874cbe Browse files » Wire pgvector semantic search into memory system ... - Add vector(768) column + HNSW index to memory_facts - Add LLMBackend.embed() with Ollama + fallback implementation - MemoryStore: cosine-distance search with ILIKE fallback - New memory tool params: source, confidence, expires_days, source_context - Update extractor, sqlite_store, deps wiring - Add pgvector to requirements Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 28 Apr
2026-04-25	e67e7a5 Browse files » Improve compression and memory prompts Eugene Sukhodolskiy committed on 25 Apr
2026-04-21	4050c24 Browse files » Improve memory search: normalize query, AND-first, relevance scoring ... - _normalize_query(): hyphens/underscores/slashes/dots → word boundaries, strip all other punctuation, lowercase — fixes comma-separated keyword bug - Auto-dump: if ≤ 60 facts in DB, skip search and return all (no false negatives in a small personal memory store) - AND-first: try matching all terms; fall back to OR only when AND returns nothing - OR-fallback with scoring: facts matching more terms rank higher (score DESC), ties broken by recency Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 21 Apr
2026-04-15	2d2bf84 Browse files » Migrate storage to PostgreSQL with SQLite fallback; misc fixes ... - Add PgSessionStore (asyncpg pool) and PgMemoryStore replacing aiosqlite - Keep SqliteSessionStore + SqliteMemoryStore for zero-dependency quick start - Selection logic in deps.py: DATABASE_URL set → PG, else → SQLite - Add asyncpg>=0.29 to dependencies; add DATABASE_URL / DB_PATH to config - Add RESPONSE HYGIENE rule to persona: never echo tool output or plan state - Add developer profile user tools: weather, internal_monitor - Update README: developer profile, DB section, current tool/profile state Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 15 Apr
2026-04-09	56611d7 Browse files » Add long-term user memory system ... Architecture: - navi/memory/store.py: MemoryStore backed by SQLite (memory_facts, memory_summary, session_memory_state tables in navi.db) - navi/memory/extractor.py: LLM-based fact extraction from sessions + summary regeneration (triggered after session goes idle >30 min) - Fact upsert uses UNIQUE(category, key) — same key always overwrites, no duplicates or stale contradictions - Keyword search across category + key + value (LIKE-based, no extra deps) Context injection: - Memory summary injected as an ephemeral system message on every LLM call via Agent._with_memory() — never persisted to session.context Tools (all profiles): - memory_search(query): keyword search against fact DB; persona instructs model to call it at session start and before personal-context questions - memory_forget(key, category?): delete a specific fact on user request Extraction trigger: - On new session creation, fire-and-forget background task checks all sessions idle >30 min with unprocessed messages → runs extraction Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Eugene Sukhodolskiy committed on 9 Apr