All Voice AI Tools
8,165 tools ranked by quality score · Page 23 of 82
| # | Tool | Score | Tier |
|---|---|---|---|
| 2201 |
deepgram-devs/flask-live-chatgpt-text-to-speech
Get started using Deepgram's Live ChatGPT Text-to-Speech with this Flask demo app |
|
Emerging |
| 2202 |
silenterus/deepspeech-cleaner
Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Framework |
|
Emerging |
| 2203 |
parzibyte/tts-js
Demostración de speechSynthesis con JavaScript: TTS o Síntesis de habla |
|
Emerging |
| 2204 |
Hamahmi/kaldi-tut
This is a Kaldi tutorial for beginners |
|
Emerging |
| 2205 |
OssiaAI/OssiaVoice
Ossia is an accessibility tool for those unable to speak or type; Ossia... |
|
Emerging |
| 2206 |
nico-byte/whisper-web
The Whisper Web Transcription Server is a Python-based real-time... |
|
Emerging |
| 2207 |
BayramAnnakov/gmail-to-podcast
Transform Gmail newsletters into AI-generated podcast conversations using... |
|
Emerging |
| 2208 |
LonePheasantWarrior/TalkifyTTS
云端大模型驱动的 Android 语音合成应用(TTS引擎)。支持豆包、腾讯、微软、千问等模型。An Android text-to-speech... |
|
Emerging |
| 2209 |
LonePheasantWarrior/VolcengineTTS
基于火山引擎豆包语音服务的在线TTS安卓应用 (An online TTS Android application based on the... |
|
Emerging |
| 2210 |
MiguelsPizza/local-transcription-mcp--parakeet-tdt-0.6b-v2--
Local MCP server that converts and transcribes video and audio files 100% on device |
|
Emerging |
| 2211 |
rishikksh20/LightSpeech
LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search |
|
Emerging |
| 2212 |
prohetamine/tor-speech
🔉 Yandex & Google + Tor |
|
Emerging |
| 2213 |
ankushbhatia2/django-speech-to-text
A small API for speech to text made in Django. |
|
Emerging |
| 2214 |
6Morpheus6/Chattered
All in one Gradio interface for chatterbox. Voice cloning from uploaded... |
|
Emerging |
| 2215 |
ikfly/java-tts
java-tts 文本转语音 |
|
Emerging |
| 2216 |
golemfactory/g-flite
g-flite: flite app distributed over Golem Network |
|
Emerging |
| 2217 |
purvanshjoshi/IndiVoice-DeepASR
Deep Learning framework for Indian-accented Speech-to-Text using Whisper and... |
|
Emerging |
| 2218 |
Lightning-Universe/Echo
Production-ready audio and video transcription app that can run on your... |
|
Emerging |
| 2219 |
adhadse/Deepdubpy
A complete end-to-end Deep Learning system to generate high quality human... |
|
Emerging |
| 2220 |
innovatorved/whisper-openai-gradio-implementation
Whisper is an automatic speech recognition (ASR) system Gradio Web UI Implementation |
|
Emerging |
| 2221 |
jaoafa/ChatWatcher
🗣 Discord voice-chat speech recognition |
|
Emerging |
| 2222 |
timoil/whisper-subtitles
🎬 AI-powered localhost subtitle generator for hearing-impaired users.... |
|
Emerging |
| 2223 |
M86xKC/edge-tts
Simple TTS using MS Edge built-in voices |
|
Emerging |
| 2224 |
PareekshithPalat/Transcriptor
The Transcriptor is a subtitle extractor, lightweight web application built... |
|
Emerging |
| 2225 |
jim11662418/General_Instrument_CTS256_SP0256_Speech_Synthesizer
Vintage General Instrument Speech Synthesizer CTS256 with SP0256 |
|
Emerging |
| 2226 |
samsad35/source-filter-vae
[SpeechCom Journal] Learning and controlling the source-filter... |
|
Emerging |
| 2227 |
BenLubar/espeak
Package espeak is a wrapper around espeak-ng that works both natively and in... |
|
Emerging |
| 2228 |
Kaljurand/Diktofon
An Android app, a dictaphone with Estonian speech-to-text |
|
Emerging |
| 2229 |
nexxeln/spotify-voice-control
Voice control for Spotify through the terminal |
|
Emerging |
| 2230 |
junjie-xyz/whisper-video
Generate subtitles for all the videos in a folder with OpenAI's Whisper... |
|
Emerging |
| 2231 |
heartsuit/BaiduASRAndTTS
Using Baidu API. ASR: Automatic Speech Recognition;TTS: Text To Speech;... |
|
Emerging |
| 2232 |
jx1100370217/DFCNN-master
这是一个基于全卷积神经网络的语音识别系统 |
|
Emerging |
| 2233 |
Yukaii/gakuon
Review Anki cards using Generative AI voice |
|
Emerging |
| 2234 |
JustinGOSSES/spoken-floodplain
Website that verbally tells users when they enter or leave a floodplain in... |
|
Emerging |
| 2235 |
Babakinha/Dectalk
A Simple package for using Dectalk |
|
Emerging |
| 2236 |
zerospeech/benchmarks
A command line tool that helps use the "Zero Ressource Challenge" benchmarks |
|
Emerging |
| 2237 |
MelvilQ/stacksrs
A simple Spaced Repetition app for Android. |
|
Emerging |
| 2238 |
vectominist/spin
Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for... |
|
Emerging |
| 2239 |
Leonard2310/LibrAI
iOS app with AI for an immersive audiobook experience, text-to-speech and... |
|
Emerging |
| 2240 |
ikarago/Talkinator
Talkinator is an easy to use text-to-speech-app for Windows 10-devices |
|
Emerging |
| 2241 |
lelosaiyan/J.A.R.V.I.S.
A voice virtual desktop assistant for Windows 7/10 |
|
Emerging |
| 2242 |
matusstas/openai-whisper-microservice
This is an OpenAI Whisper automatic speech recognition microservice |
|
Emerging |
| 2243 |
noir-neo/UniSpeech
iOS speech framework native plugin for Unity |
|
Emerging |
| 2244 |
qkl9527/voice-assistant
基于Funasr的[实时]AI语音助手 |
|
Emerging |
| 2245 |
orianemartin/WhispGrid
A Whisper to TextGrid script that I use to automatize Corpus Annotation on... |
|
Emerging |
| 2246 |
charstorm/vilberta
Voice chatbot with voice+screen output to show that "not everything needs to... |
|
Emerging |
| 2247 |
dcavar/ELAN2split
Split ELAN Annotation Files and corresponding speech files into a corpus... |
|
Emerging |
| 2248 |
systoolz/dosbtalk
unofficial API implementation for Text-to-Speech Engine by First Byte |
|
Emerging |
| 2249 |
alisolphp/EchoTalk
A browser-based language training app using Shadowing technique with... |
|
Emerging |
| 2250 |
tuhinpal/text-to-speech
Text to Speech using Google's Library (Made for Fun) |
|
Emerging |
| 2251 |
SupernovifieD/FreeSpeechToText
A python program that extracts text from audio files - .mp3 or .wav - for free! |
|
Emerging |
| 2252 |
MazueraAlvaro/speech-recognition-asterisk
A script for speech recognition in asterisk |
|
Emerging |
| 2253 |
ORI-Muchim/One-Click-VITS-Training
VITS(Data Preprocessing + Whisper ASR + Text Preprocessing + Modification... |
|
Emerging |
| 2254 |
chienhsiang-hung/voice-and-wav-cloning
通過少量語音與影片樣本生成高質量的語音與影片克隆 ( AI 人像口白生成 ),並提供多種音頻處理技術來提升音質和真實感。 |
|
Emerging |
| 2255 |
codekraft-studio/vue-speech
Vue integration and components for the Web Speech API |
|
Emerging |
| 2256 |
yc9701/pansori-tedxkr-corpus
Korean ASR Corpus generated from TEDx talks |
|
Emerging |
| 2257 |
dialpad/mucs_2021_dialpad
Dialpad team's submission to the MUCS 2021 workshop |
|
Emerging |
| 2258 |
huckiyang/QuantumSpeech-QCNN
IEEE ICASSP 21 - Quantum Convolution Neural Networks for Speech Processing... |
|
Emerging |
| 2259 |
hebbihebb/MBook
EPUB to M4B using Maya1 |
|
Emerging |
| 2260 |
nhut-ngnn/Voice-Based-Age-and-Gender-Recogniton
[ICTC'24] - "Voice-Based Age and Gender Recognition: A Comparative Study of... |
|
Emerging |
| 2261 |
HarunoriKawano/BEST-RQ
Implementation of the paper "Self-supervised Learning with Random-projection... |
|
Emerging |
| 2262 |
placebokkk/e6870
assignments for e6870 ASR class |
|
Emerging |
| 2263 |
maetshju/flux-blstm-implementation
An implementation of the Graves & Schmidhuber (2005) bidirectional LSTM in Flux. |
|
Emerging |
| 2264 |
mattzzz/rick-voice
Give any bot the voice of Rick Sanchez |
|
Emerging |
| 2265 |
indonesian-nlp/multilingual-asr
Multilingual Speech Recognition for Indonesian Languages |
|
Emerging |
| 2266 |
HuuHuy227/XphoneBert_Vits2
VITS2 extended with XPhoneBERT encoder |
|
Emerging |
| 2267 |
markhliu/mpt
Code repository for the book Make Python Talk |
|
Emerging |
| 2268 |
darsh-1010/Jarvis-A-Voice-Based-Assistant-Powered-by-LLaMA
Jarvis is a voice-based assistant built in Python that simplifies daily... |
|
Emerging |
| 2269 |
kostas2370/Video-Creator
This project is to automate the video creation. |
|
Emerging |
| 2270 |
thevickypedia/Jarvis_UI
Light weight UI to interact with Jarvis via API calls |
|
Emerging |
| 2271 |
yanorei32/winrt-tts-server
A simple Web Based Windows Runtime (WinRT) Speech Synthesis API |
|
Emerging |
| 2272 |
mo7amedaliEbaid/run-tracker
A flutter run tracker app - clean architecture |
|
Emerging |
| 2273 |
go-restream/supertts
🎧 Supertonic TTS ONNX Inference Openai Speech REST API |
|
Emerging |
| 2274 |
opensource-spraakherkenning-nl/asr_nl
Dutch Speech Recognition webservice |
|
Emerging |
| 2275 |
Vaibhavs10/ml-with-audio
HF's ML for Audio study group |
|
Emerging |
| 2276 |
botbahlul/Live-Subtitle
ANDROID APP that can RECOGNIZE VLC LIVE AUDIO/VIDEO STREAMING (using free... |
|
Emerging |
| 2277 |
void-xtreme/audible-text-editor
An automated Sinhala audio Text Editor for visually impaired and blind students |
|
Emerging |
| 2278 |
drivendataorg/childrens-speech-recognition-benchmark-pub
Tutorial code for the On Top of Pasketti: Children’s Speech Recognition Challenge |
|
Emerging |
| 2279 |
shreyasnisal/SpeechProgrammer
The Speech Programmer writes code based on voice commands. Right now it only... |
|
Emerging |
| 2280 |
chimechallenge/chime-utils
Scripts for data generation, scoring and data manifest preparation for... |
|
Emerging |
| 2281 |
Tristan296/Universal-MacAssistant
Advanced Personal Assistant created for macOS that utilises AppleScripts,... |
|
Emerging |
| 2282 |
saurabhchalke/whisper-meta-quest
Running speech-to-text in a Meta Quest headset using OpenAI's Whisper tiny model |
|
Emerging |
| 2283 |
Hamtech-ai/wav2vec2-fa
fine-tune Wav2vec2. an ASR model released by Facebook |
|
Emerging |
| 2284 |
HaoQChen/iflytek_awaken_asr
use iflytek's technology to realize awaken and order recognition |
|
Emerging |
| 2285 |
pncnmnp/phoenix10.1
Creates personalized radio stations with your own radio jockey! |
|
Emerging |
| 2286 |
heyfoz/python-youtube-transcription
This repository contains Python scripts and a local Flask web application... |
|
Emerging |
| 2287 |
Ralireza/spoken-digit-recognition
Classifying English spoken digit by Hidden Markov Model |
|
Emerging |
| 2288 |
syntithenai/opensnips
Open source projects related to Snips https://snips.ai/. |
|
Emerging |
| 2289 |
yokawasa/vscode-translator-voice
VS Code extension for multi-language text translation and TTS... |
|
Emerging |
| 2290 |
AceCentre/pasco
Phrase Auditory Scanning COmmunicator - AAC App for iOS and the Web |
|
Emerging |
| 2291 |
theamazing0/global-subtitles-main
Closed Captioning Everywhere, With Assembly AI |
|
Emerging |
| 2292 |
candlewill/Ossian
Ossian: A simple language-independent Text-to-speech frontend |
|
Emerging |
| 2293 |
atomicoo/Tacotron2-PyTorch
PyTorch implementation of Tacotron-2. Tacotron-2 的 PyTorch 实现。 |
|
Emerging |
| 2294 |
dokuniev/claude-voice
Hear which Claude Code session needs you — speaks the repo and branch name out loud |
|
Emerging |
| 2295 |
Helther/voice-pick-tbot
Text To Speech Synthesis Telegram Bot with voice customization |
|
Emerging |
| 2296 |
18F/tts-buy-challengegov-ideation
Market research documents related to the Challenge.gov Ideation Platform. |
|
Emerging |
| 2297 |
BullShark/JSpeak
A Text to Speech Reader Front-end that Reads from the Clipboard and with... |
|
Emerging |
| 2298 |
GetProjectsIdea/Convert-Text-to-Speech-in-Python
Text to speech is a process to convert any text into voice. Text to speech... |
|
Emerging |
| 2299 |
HasnainDarkNet/DarKVoice
DarKVoice is an open-source voice assistant and audio processing tool built... |
|
Emerging |
| 2300 |
AkojimaSLP/Frame-by-frame-closed-form-update-for-mask-based-adaptive-MVDR-beamforming
speech-enhacement |
|
Emerging |