Initial voice TTS server with F5-TTS backend, WebSocket streaming and warm-up
...
- FastAPI WebSocket API for real-time chunked TTS
- F5-TTS GPU backend with reference caching and auto-transcription
- Dummy backend for offline tests
- Streaming text segmenter, session state, stop/resume
- Startup warm-up, PCM16/base64 audio, basic tests
- Documentation: architecture, protocol, usage, roadmap, technical notes
Eugene Sukhodolskiy
committed
12 days ago