Skip to content

Text-to-speech bot: Intelligent multi-cloud orchestration platform for text-to-speech services with automatic routing .

Notifications You must be signed in to change notification settings

Kuxry/Text-to-speech-bot

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

🎤 TTS Gateway - Multi-Platform Text-to-Speech

FastAPI React Azure-TTS

Intelligent multi-cloud orchestration platform for text-to-speech services with automatic routing and enterprise-grade management.

✨ Core Features

Multi-Cloud Support

Platform Status Features
Azure Cognitive ✅ Live Neural Voices, SSML, Real-time
Edge-TTS ✅ Live Free Tier, 20+ Voices
AWS Polly 🚧 Beta Neural & Standard Voices
Google Cloud 🚧 Planned WaveNet Voices

Technical Highlights

  • Smart Routing: Auto-select optimal TTS engine based on cost/language
  • Real-time Streaming: Low-latency audio delivery (MP3/WAV)
  • Enterprise Ready:
    • Full request tracing (X-Request-ID)
    • Structured logging with request context
    • Automated temp file cleanup

🛠️ Architecture

graph TD
    WebApp -->|API Call| TTS-Gateway
    TTS-Gateway -->|Load Balance| Azure-TTS
    TTS-Gateway -->|Fallback| Edge-TTS
    TTS-Gateway -->|Premium| AWS-Polly
    TTS-Gateway --> Cache[(Redis)]
Loading

🚀 Quick Start

Backend Setup

# Install dependencies
pip install -r requirements.txt

# Configure environment
cp .env.example .env

# Start service (dev)
uvicorn backend.main:app --reload

Frontend Setup

cd frontend
npm install
npm start

🌐 API Documentation

Generate Speech

startLine: 9
endLine: 18

Request Example:

curl -X POST "https://YOUR_DOMAIN/api/generate" \
  -H "Content-Type: application/json" \
  -d '{
    "text": "Welcome to TTS Gateway",
    "voice": "en-US-JennyNeural",
    "azure_key": "YOUR_KEY",
    "azure_region": "eastus"
  }'

🔮 Roadmap

Q3 2024

  • AWS Polly integration
  • Audio post-processing pipeline
  • Rate limiting

Q4 2024

  • Google Cloud TTS support
  • Multi-tenant architecture
  • Voice cloning API

🌍 Deployment

PythonAnywhere Config

startLine: 38
endLine: 45

Vercel Frontend

Set environment variables:

startLine: 1
endLine: 7

🤝 Contribution

See CONTRIBUTING.md for development setup and guidelines.


🐛 Report Issue

About

Text-to-speech bot: Intelligent multi-cloud orchestration platform for text-to-speech services with automatic routing .

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published