Initial voice TTS server with F5-TTS backend, WebSocket streaming and warm-up
- FastAPI WebSocket API for real-time chunked TTS

- F5-TTS GPU backend with reference caching and auto-transcription

- Dummy backend for offline tests

- Streaming text segmenter, session state, stop/resume

- Startup warm-up, PCM16/base64 audio, basic tests

- Documentation: architecture, protocol, usage, roadmap, technical notes
1 parent 6308e7c commit 8be0fd641d43f476a80d6f1cf72df143159d5568
@Eugene Sukhodolskiy Eugene Sukhodolskiy authored 12 days ago
Showing 25 changed files
View
.env.example 0 → 100644
View
.gitignore 0 → 100644
View
README.md
View
docs/01_overview.md 0 → 100644
View
docs/02_architecture.md 0 → 100644
View
docs/03_websocket_protocol.md 0 → 100644
View
docs/04_roadmap.md 0 → 100644
View
docs/05_usage.md 0 → 100644
View
docs/06_technical_notes.md 0 → 100644
View
pyproject.toml 0 → 100644
View
requirements.txt 0 → 100644
View
scripts/download_f5_tts.py 0 → 100644
View
src/voice_tts/api/protocol.py 0 → 100644
View
src/voice_tts/api/server.py 0 → 100644
View
src/voice_tts/audio/formats.py 0 → 100644
View
src/voice_tts/config.py 0 → 100644
View
src/voice_tts/main.py 0 → 100644
View
src/voice_tts/session/state.py 0 → 100644
View
src/voice_tts/tts/engine.py 0 → 100644
View
src/voice_tts/tts/f5_backend.py 0 → 100644
View
src/voice_tts/tts/segmenter.py 0 → 100644
View
src/voice_tts/tts/utils.py 0 → 100644
View
tests/test_segmenter.py 0 → 100644
View
tests/test_server.py 0 → 100644
View
voices/.gitkeep 0 → 100644