All Voice AI Tools
8,165 tools ranked by quality score · Page 12 of 82
| # | Tool | Score | Tier |
|---|---|---|---|
| 1101 |
fedden/RenderMan
Command line C++ and Python VSTi Host library with MFCC, FFT, RMS and audio... |
|
Emerging |
| 1102 |
stefantaubert/en-tts
Command-line interface and Python library for synthesizing English texts into speech. |
|
Emerging |
| 1103 |
alexpinel/Dot
Text-To-Speech, RAG, and LLMs. All local! |
|
Emerging |
| 1104 |
tema6120/ForgetMeNot
A flashcard app for Android. |
|
Emerging |
| 1105 |
OpenCOVID19CoughCheck/CoughCheckApp
Development of AI audio app to compare the cough of a Coronavirus (COVID-19)... |
|
Emerging |
| 1106 |
bold-ronin/lira
A Voice-First AI Companion |
|
Emerging |
| 1107 |
superstarryeyes/lue
Terminal eBook Reader with Audiobook-Quality Text-to-Speech — Supports EPUB,... |
|
Emerging |
| 1108 |
stefantaubert/mel-cepstral-distance
A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral... |
|
Emerging |
| 1109 |
pnlpal/pnl-reader
PNL Reader: read quietly or read aloud |
|
Emerging |
| 1110 |
nobody132/masr
中文语音识别; Mandarin Automatic Speech Recognition; |
|
Emerging |
| 1111 |
kurianbenoy/Indic-Subtitler
Open source subtitling platform 💻 for transcribing and translating... |
|
Emerging |
| 1112 |
keonlee9420/PortaSpeech
PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative... |
|
Emerging |
| 1113 |
Rongjiehuang/GenerSpeech
PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model... |
|
Emerging |
| 1114 |
AASHISHAG/deepspeech-german
Automatic Speech Recognition (ASR) - German |
|
Emerging |
| 1115 |
benmaster82/writher
Voice-powered productivity for Windows |
|
Emerging |
| 1116 |
TimoBolkart/voca
This codebase demonstrates how to synthesize realistic 3D character... |
|
Emerging |
| 1117 |
deepgram-starters/django-voice-agent
Get started using Deepgram's Voice Agent with this Django demo app |
|
Emerging |
| 1118 |
DmitryRyumin/OpenAV
An open-source library for recognition of speech commands in the user... |
|
Emerging |
| 1119 |
sai9640nayak/StreamingKokoroJS
Unlimited text-to-speech in the Browser using Kokoro-JS, 100% local, 100%... |
|
Emerging |
| 1120 |
goodmike31/pl-asr-bigos-tools
Extendable toolkit for comprehensive evaluation of ASR systems. Currently... |
|
Emerging |
| 1121 |
mikopbx/ModuleSmartIVR
Модуль умной маршрутизации для 1C:Предприятия |
|
Emerging |
| 1122 |
huawei-noah/Speech-Backbones
This is the main repository of open-sourced speech technology by Huawei... |
|
Emerging |
| 1123 |
t0mer/tts-stt
Small pyhon flask container allowing us to convert Text to Speech and Speech to Text |
|
Emerging |
| 1124 |
sp-nitech/DNN-HSMM
pytorch implementation of DNN-HSMM for TTS |
|
Emerging |
| 1125 |
sovaai/sova-asr
SOVA ASR (Automatic Speech Recognition) |
|
Emerging |
| 1126 |
rhulha/StreamingKokoroJS
Unlimited text-to-speech in the Browser using Kokoro-JS, 100% local, 100%... |
|
Emerging |
| 1127 |
ponlponl123/-Prototype-AIVTuber
a open-source Artificial Intelligence Virtual Youtuber (AI VTuber), (this... |
|
Emerging |
| 1128 |
novoic/surfboard
Novoic's audio feature extraction library |
|
Emerging |
| 1129 |
EricBatlle/UnityAndroidSpeechRecognizer
🗣️ Speech recognition on Unity and Android without the annoying google popup! |
|
Emerging |
| 1130 |
timmo001/home-assistant-assist-desktop
Use Home Assistant Assist on the desktop. Compatible with Windows, MacOS, and Linux |
|
Emerging |
| 1131 |
AIFSH/ComfyUI-XTTS
a custom comfyui node for coqui-ai/TTS's xtts module! support 17 languages... |
|
Emerging |
| 1132 |
soundhound/hound-sdk-web-example
An example of how to work with text and voice requests using the Houndify... |
|
Emerging |
| 1133 |
hujingshuang/MTrans
Multi-source Translation |
|
Emerging |
| 1134 |
rishikksh20/melgan
MelGAN implementation with Multi-Band and Full Band supports... |
|
Emerging |
| 1135 |
JosefAlbers/WTM
Blazing fast whisper turbo for ASR (speech-to-text) tasks |
|
Emerging |
| 1136 |
wangkaisine/mrcp-plugin-with-freeswitch
使用FreeSWITCH接受用户手机呼叫,通过UniMRCP... |
|
Emerging |
| 1137 |
FireRedTeam/FireRedASR2S
A SOTA Industrial-Grade All-in-One ASR system with ASR, VAD, LID, and Punc... |
|
Emerging |
| 1138 |
SamYuan1990/flet_sherpa_onnx
flet_sherpa_onnx an ASR/STT library for flet basing on sherpa-onnx |
|
Emerging |
| 1139 |
Picovoice/speech-to-intent-benchmark
benchmark for Speech-to-Intent engines |
|
Emerging |
| 1140 |
George0828Zhang/torch_cif
A fast parallel PyTorch implementation of the "CIF: Continuous... |
|
Emerging |
| 1141 |
qianchang/zici
字词:收集国学/汉语字词拼音相关资源 |
|
Emerging |
| 1142 |
Appen/UHV-OTS-Speech
A data annotation pipeline to generate high-quality, large-scale speech... |
|
Emerging |
| 1143 |
chandran-jr/Noteify
🔎A Currency Detection app for the visually impaired which automatically... |
|
Emerging |
| 1144 |
tomasz-oponowicz/spoken_language_identification
Identify a spoken language using artificial intelligence (LID). |
|
Emerging |
| 1145 |
keonlee9420/WaveGrad2
PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement... |
|
Emerging |
| 1146 |
zceng/LVCNet
LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation |
|
Emerging |
| 1147 |
haguro/elevenlabs-go
A Go API client library for the ElevenLabs speech synthesis platform |
|
Emerging |
| 1148 |
Ezdokz1337/sunona-v0.001
🎤 Build and deploy intelligent voice AI agents in minutes with Sunona, your... |
|
Emerging |
| 1149 |
mitchib1440/SpeakThat
The world's most comprehensive notification reader for Android devices. |
|
Emerging |
| 1150 |
darkautism/sensevoice-rs
A Rust-based, SenseVoiceSmall |
|
Emerging |
| 1151 |
xyqfer/reader
毕业设计-基于智能手机的报纸阅读器 |
|
Emerging |
| 1152 |
GinoShun/Accent-Activation-Steering
Official code for "Activation Steering for Accent Adaptation in Speech... |
|
Emerging |
| 1153 |
HachiroSan/google-pronouncer
🔊 Download pronunciation audio files from Google's dictionary service.... |
|
Emerging |
| 1154 |
jonatasgrosman/asrecognition
ASRecognition: just an easy-to-use library for Automatic Speech Recognition. |
|
Emerging |
| 1155 |
ai-learning-tools/viva-translate
Real-time translation copilot for your browser |
|
Emerging |
| 1156 |
karim23657/Persian-tts-coqui
Persian/Farsi text to speech(TTS) training using coqui tts |
|
Emerging |
| 1157 |
felixchenfy/Speech-Commands-Classification-by-LSTM-PyTorch
Classification of 11 types of audio clips using MFCCs features and LSTM.... |
|
Emerging |
| 1158 |
sevangelatos/py-ttspico
Python svox picotts wrapper |
|
Emerging |
| 1159 |
thetobysiu/Deepstory
Deepstory turns a text/generated text into a video where the character is... |
|
Emerging |
| 1160 |
thewh1teagle/piper-onnx
Use piper TTS with onnxruntime |
|
Emerging |
| 1161 |
aws-solutions/content-localization-on-aws
Automatically generate multi-language subtitles using AWS AI/ML services.... |
|
Emerging |
| 1162 |
MohammedRashad/FPGA-Speech-Recognition
Expiremental Speech Recognition System using VHDL & MATLAB. |
|
Emerging |
| 1163 |
R1ckShi/AESRC2020
[ICASSP2021] Data preperation scripts, training pipeline and baseline... |
|
Emerging |
| 1164 |
rorpage/openfaas-text-to-speech
Generate an MP3 of text using Google's Text-to-Speech |
|
Emerging |
| 1165 |
dbklim/Voice_ChatBot
Chatbot in russian with speech recognition using PocketSphinx and speech... |
|
Emerging |
| 1166 |
wit-ai/android-voice-demo
Example on how to build a voice-enabled Android app with Wit.ai |
|
Emerging |
| 1167 |
lablab-ai/OpenAI_Whisper_Streamlit
A minimalistic automatic speech recognition streamlit based webapp powered... |
|
Emerging |
| 1168 |
gooofy/py-marytts
Python MaryTTS HTTP client library |
|
Emerging |
| 1169 |
rainygirl/rspeaker
말귀를 알아듣고 뉴스도 요약해 읽어줍니다 |
|
Emerging |
| 1170 |
yl4579/StyleTTS-VC
Official Implementation of StyleTTS-VC |
|
Emerging |
| 1171 |
upskyy/Transformer-Transducer
PyTorch implementation of "Transformer Transducer: A Streamable Speech... |
|
Emerging |
| 1172 |
LiberSonora/LiberSonora
LiberSonora,寓意“自由的声音”,是一个 AI 赋能的、强大的、开源有声书工具集,包含智能字幕提取、AI标题生成、多语言翻译等功能,支持... |
|
Emerging |
| 1173 |
developers-cosmos/Mimasa
Real time multilingual face translator |
|
Emerging |
| 1174 |
keonlee9420/Cross-Speaker-Emotion-Transfer
PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based... |
|
Emerging |
| 1175 |
opensource-spraakherkenning-nl/Kaldi_NL
Code related to the Dutch instance and user groups of the KALDI speech... |
|
Emerging |
| 1176 |
hopkira/k9
Latest main K9 robot repository with 3D vision, local STT/TTS with GPT-3 and... |
|
Emerging |
| 1177 |
Gmzxdotzz/Dia-TTS-Server
Self-host the powerful Dia TTS model. This server offers a user-friendly Web... |
|
Emerging |
| 1178 |
taresh18/TTSizer
🎙️ Automatically transcribe audio/video into high-quality, speaker-specific... |
|
Emerging |
| 1179 |
pandeydivesh15/AVSR-Deep-Speech
Google Summer of Code 2017 Project: Development of Speech Recognition Module... |
|
Emerging |
| 1180 |
yuhr/langue
A modern platform for conlanging. Currently in the planning stage. |
|
Emerging |
| 1181 |
mozilla/DeepSpeech-examples
Examples of how to use or integrate DeepSpeech |
|
Emerging |
| 1182 |
niker/EdgeTtsSharp
EdgeTTS Sharp is a library that provides an easy-to-use, realtime-streaming,... |
|
Emerging |
| 1183 |
alex-vt/WhisperInput
Offline voice input panel & keyboard with punctuation for Android. |
|
Emerging |
| 1184 |
candlewill/Speech-Corpus-Collection
A Collection of Speech Corpus for ASR and TTS |
|
Emerging |
| 1185 |
Hecate2/sukasuka-vocal-dataset-builder
すかすかアニメボカロデータセット。1st anime vocal dataset. Extract audio (vocal) files from... |
|
Emerging |
| 1186 |
AmphionTeam/FlexiCodec
[ICLR2026] FlexiCodec: A Dynamic Neural Audio Codec for Low Frame Rates |
|
Emerging |
| 1187 |
jtkim-kaist/VAD
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM... |
|
Emerging |
| 1188 |
kaituoxu/Speech-Transformer
A PyTorch implementation of Speech Transformer, an End-to-End ASR with... |
|
Emerging |
| 1189 |
Pankaj-Baranwal/pocketsphinx
Updated ROS bindings to pocketsphinx |
|
Emerging |
| 1190 |
ttop32/coqui_tts_korea
Korean TTS using coqui TTS (glowtts and multiband melgan) - 한국어 TTS |
|
Emerging |
| 1191 |
bawangxx/XZVoice
Free and open source text-to-speech software |
|
Emerging |
| 1192 |
journey-ad/CosyVoice2-Ex
CosyVoice2 功能扩充(预训练音色推理/3s极速复刻/自然语言控制/自动识别/音色模型保存/API) |
|
Emerging |
| 1193 |
tover0314-w/opentypeless
Talkmore with Opentypeless. Type with your voice. Anywhere. Talk -... |
|
Emerging |
| 1194 |
nyrahealth/CrisperWhisper
Verbatim Automatic Speech Recognition with improved word-level timestamps... |
|
Emerging |
| 1195 |
chenmingxiang110/Chinese-automatic-speech-recognition
Chinese speech recognition |
|
Emerging |
| 1196 |
jojojaeger/whisper-streamlit
this master thesis project is based on OpenAI Whisper with the goal to... |
|
Emerging |
| 1197 |
flogy/gatsby-mdx-tts
🗣 Adds speech output to your Gatsby site using Amazon Polly. |
|
Emerging |
| 1198 |
jsugg/ser
The AI-powered ser Python package is a tool for recognizing and analyzing... |
|
Emerging |
| 1199 |
linux-speakup/espeakup
a light weight connector for espeak-ng and speakup |
|
Emerging |
| 1200 |
seanghay/KLEA
An open-source Khmer Word to Speech Model. Just single word not sentence! |
|
Emerging |