Silero Models: pre-trained enterprise-grade STT / TTS models and benchmarks.
Enterprise-grade STT made refreshingly simple (seriously, see benchmarks). We provide quality comparable to Google's STT (and sometimes even better) and we are not Google.
As a bonus:
• No Kaldi;
• No compilation;
• No 20-step instructions;
Also we have published TTS models that satisfy the following criteria:
• One-line usage;
• A large library of voices;
• A fully end-to-end pipeline;
• Naturally sounding speech;
• No GPU or training required;
• Minimalism and lack of dependencies;
• Faster than real-time on one CPU thread (!!!);
• Support for 16kHz and 8kHz out of the box;
Speech-To-Text
All of the provided models are listed in the models.yml file. Any meta-data and newer versions will be added there.
You can look it on the link:
https://github.com/snakers4/silero-models
Pages 1