|
Set first-chunk timeout to 90s and retry same server+model before fallback
- config.py: llm_stream_first_chunk_timeout 180s → 90s
- fallback.py stream_complete: wrap gen.__anext__() in asyncio.wait_for() with llm_stream_first_chunk_timeout; on TimeoutError or LLMConnectionError, sleep 2s and retry once on the same server+model before blacklisting/fallback

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> |
|---|
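The retry behaviour described in the commit message can be sketched roughly as follows. This is a minimal illustration and not the actual code in navi/llm/fallback.py: the function name `stream_with_first_chunk_retry`, the `make_stream` factory, and the local `LLMConnectionError` class are hypothetical stand-ins, and the blacklisting/fallback step is left to the caller, which sees the re-raised error after the second failed attempt.

```python
import asyncio
from typing import AsyncIterator, Callable


class LLMConnectionError(Exception):
    """Hypothetical stand-in for the project's connection error."""


async def stream_with_first_chunk_retry(
    make_stream: Callable[[], AsyncIterator[str]],  # hypothetical factory: opens a new stream on the same server+model
    first_chunk_timeout: float = 90.0,  # mirrors llm_stream_first_chunk_timeout
    retry_delay: float = 2.0,  # pause before the single retry
) -> AsyncIterator[str]:
    """Yield chunks, retrying once if the first chunk is late or the connection fails."""
    for attempt in range(2):
        gen = make_stream()
        try:
            # Only the first chunk is guarded by the timeout.
            first = await asyncio.wait_for(gen.__anext__(), timeout=first_chunk_timeout)
        except StopAsyncIteration:
            return  # the stream ended without producing anything
        except (asyncio.TimeoutError, LLMConnectionError):
            aclose = getattr(gen, "aclose", None)
            if aclose is not None:
                await aclose()  # release the failed stream
            if attempt == 0:
                await asyncio.sleep(retry_delay)
                continue  # retry once on the same server+model
            raise  # second failure: the caller would blacklist and fall back here
        yield first
        async for chunk in gen:
            yield chunk
        return
```

A caller would wrap this in its own `try`/`except` and move on to the next server or model only when the error propagates out.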
|
|
| navi/config.py |
|---|
| navi/llm/fallback.py |
|---|