All Voice AI Tools
8,165 tools ranked by quality score · Page 6 of 82
| # | Tool | Score | Tier |
|---|---|---|---|
| 501 |
AlexandreSajus/JARVIS
Your own personal voice assistant: Voice to Text to LLM to Speech, displayed... |
|
Established |
| 502 |
keshavbhatt/glate
Open Source Google Translator and TTS App for Linux Desktop |
|
Established |
| 503 |
sveinbjornt/hear
Command line interface for the built-in speech recognition and transcription... |
|
Established |
| 504 |
yl4579/StarGANv2-VC
StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for... |
|
Established |
| 505 |
goodatlas/zeroth
Kaldi-based Korean ASR (한국어 음성인식) open-source project |
|
Established |
| 506 |
amanvirparhar/chaplin
A real-time silent speech recognition tool. |
|
Established |
| 507 |
zzw922cn/Automatic_Speech_Recognition
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow |
|
Established |
| 508 |
VRCWizard/TTS-Voice-Wizard
Speech to Text to Speech. Song now playing. Sends text as OSC messages to... |
|
Established |
| 509 |
Finrandojin/alexandria-audiobook
AI-powered multi-voice audiobook generator — LLM script annotation, voice... |
|
Established |
| 510 |
Azure-Samples/Cognitive-Services-Voice-Assistant
Welcome to the Microsoft Voice Assistant samples repository! Here you will... |
|
Established |
| 511 |
moeru-ai/unspeech
🗣️🔊 Your Text-to-Speech Services, All-in-One. |
|
Established |
| 512 |
svc-develop-team/so-vits-svc
SoftVC VITS Singing Voice Conversion |
|
Established |
| 513 |
gustavostz/whisper-clip
WhisperClip simplifies your life by automatically transcribing audio... |
|
Established |
| 514 |
deepgram-starters/flask-transcription
Get started using Deepgram's Pre-Recorded Transcription with this Flask demo app |
|
Established |
| 515 |
NaomiProject/Naomi
The Naomi Project is an open source, technology agnostic platform for... |
|
Established |
| 516 |
SamirPaulb/real-time-voice-translator
A desktop application that uses AI to translate voice between languages in... |
|
Established |
| 517 |
travisvn/openai-edge-tts
Free, high-quality text-to-speech API endpoint to replace OpenAI, Azure, or... |
|
Established |
| 518 |
XnneHangLab/XnneHangLab
不会聊天的字幕提取器不是一个好 B 站下载器~ |
|
Established |
| 519 |
davidmartinrius/speech-dataset-generator
🔊 Create labeled datasets, enhance audio quality, identify speakers, support... |
|
Established |
| 520 |
ekwek1/soprano-factory
Soprano-Factory: Train your own 2000x realtime text-to-speech model |
|
Established |
| 521 |
FunAudioLLM/Fun-ASR
Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab. |
|
Established |
| 522 |
sergenes/runandread-audiobook
🚀 Open-source project for creating high-quality AI TTS-narrated audiobooks... |
|
Emerging |
| 523 |
Lex-au/Vocalis
Speech-to-speech AI assistant with natural conversation flow, mid-speech... |
|
Emerging |
| 524 |
junzew/HanTTS
Chinese Text-to-Speech web service |
|
Emerging |
| 525 |
PriesiaMioShirakana/DragonianVoice
多个SVC/TTS的C++推理库 |
|
Emerging |
| 526 |
tugstugi/pytorch-dc-tts
Text to Speech with PyTorch (English and Mongolian) |
|
Emerging |
| 527 |
NevilPatel01/RVC-WebUI-MacOS
Optimized Retrieval-based Voice Conversion WebUI for Apple Silicon Macs... |
|
Emerging |
| 528 |
DragonComputer/Dragonfire
the open-source virtual assistant for Ubuntu based Linux distributions |
|
Emerging |
| 529 |
dessa-oss/fake-voice-detection
Using temporal convolution to detect Audio Deepfakes |
|
Emerging |
| 530 |
dhruvapte26/B.E.N.J.I.
B.E.N.J.I.- The Impossible Missions Force's digital assistant |
|
Emerging |
| 531 |
techiaith/pyfestival
Amlapiwr Python C ar gyfer hwyluso rhaglennu gyda Festival | A Python C... |
|
Emerging |
| 532 |
p0p4k/vits2_pytorch
unofficial vits2-TTS implementation in pytorch |
|
Emerging |
| 533 |
OpenVoiceOS/ovos-buildroot
Open Voice Operating System - Buildroot edition is a minimalistic linux OS... |
|
Emerging |
| 534 |
gionanide/Speech_Signal_Processing_and_Classification
Front-end speech processing aims at extracting proper features from short-... |
|
Emerging |
| 535 |
botbahlul/PyAutoSRT
PySimpleGUI based DESKTOP APP to AUTO GENERATE SUBTITLE FILE (using free... |
|
Emerging |
| 536 |
arghyasur1991/Spark-TTS-Unity
Unity package for using Spark-TTS on-device models. This is a C# port of... |
|
Emerging |
| 537 |
juntaosun/ComeCut
「来剪」轻量级视频编辑器。网页版、桌面版等均可免费使用,功能灵感源自 CapCut 等编辑器。A Lightweight Video Editor.... |
|
Emerging |
| 538 |
createcandle/voco
Privacy friendly voice control for the Candle Controller / WebThings... |
|
Emerging |
| 539 |
nitaiaharoni1/whisper-speech-to-text
Whisper Speech-to-Text is a JavaScript library for recording and... |
|
Emerging |
| 540 |
Poeschl/Hassio-Addons
The repository for my Home Assistant Supervisor Add-ons. |
|
Emerging |
| 541 |
Artrajz/vits-simple-api
A simple VITS HTTP API, developed by extending Moegoe with additional features. |
|
Emerging |
| 542 |
myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell. Audio foundation model. |
|
Emerging |
| 543 |
CodersCreative/natural-tts
A rust crate for easily implementing Text-To-Speech into your rust programs. |
|
Emerging |
| 544 |
vasistalodagala/whisper-finetune
Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR)... |
|
Emerging |
| 545 |
speechmatics/speechmatics-python-sdk
Python SDKs for Speechmatics APIs |
|
Emerging |
| 546 |
rishikksh20/iSTFTNet-pytorch
iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating... |
|
Emerging |
| 547 |
metavoiceio/metavoice-src
Foundational model for human-like, expressive TTS |
|
Emerging |
| 548 |
lixiangyu890601/EasyAICC-Easy-AI-Call-Center
外呼系统,智能外呼,自动外呼系统,人工外呼,呼叫中心 |
|
Emerging |
| 549 |
OpenBMB/UltraEval-Audio
Your faithful, impartial partner for audio evaluation — know yourself, know... |
|
Emerging |
| 550 |
C-Loftus/QuickPiperAudiobook
With one command, create a natural-sounding audiobook from a variety of... |
|
Emerging |
| 551 |
thuhcsi/Crystal
Crystal - C++ implementation of a unified framework for multilingual TTS... |
|
Emerging |
| 552 |
snakers4/silero-stress
Silero Stress — pre-trained enterprise-grade automated stress and homograph... |
|
Emerging |
| 553 |
JJWRoeloffs/transcribe_align_textgrid
A small wrapper package around whisper-timestamped. Create force-aligned... |
|
Emerging |
| 554 |
ARBML/klaam
Arabic speech recognition, classification and text-to-speech. |
|
Emerging |
| 555 |
artibex/piper-http
Creates a docker image that runs the piper http service |
|
Emerging |
| 556 |
nullabork/talkbot
Text-to-speech and translation bot for Discord |
|
Emerging |
| 557 |
robmsmt/KerasDeepSpeech
A Keras CTC implementation of Baidu's DeepSpeech for model experimentation |
|
Emerging |
| 558 |
drankush/VoxRad
VOXRAD is a voice transcription application for radiologists leveraging... |
|
Emerging |
| 559 |
zzw922cn/awesome-speech-recognition-speech-synthesis-papers
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis,... |
|
Emerging |
| 560 |
Steve0929/tiktok-tts
Provides a simple way to generate text-to-speech audio files using TikTok's... |
|
Emerging |
| 561 |
Audio-WestlakeU/VINP
Official PyTorch implementation of 'VINP: Variational Bayesian Inference... |
|
Emerging |
| 562 |
deepgram/deepgram-go-sdk
Official Go SDK for Deepgram. |
|
Emerging |
| 563 |
rakeshvar/rnn_ctc
Recurrent Neural Network and Long Short Term Memory (LSTM) with... |
|
Emerging |
| 564 |
google/tacotron
Audio samples accompanying publications related to Tacotron, an end-to-end... |
|
Emerging |
| 565 |
litagin02/rvc-tts-webui
Text-to-Speech Gradio webui using RVC and edge-tts |
|
Emerging |
| 566 |
SlapBot/stephanie-va
Stephanie is an open-source platform built specifically for voice-controlled... |
|
Emerging |
| 567 |
nvidia-riva/common
Protocol buffers and other common resources. |
|
Emerging |
| 568 |
ceuk/speech-recognition-aws-polyfill
Polyfill for the SpeechRecognition browser API using AWS Transcribe as a fallback |
|
Emerging |
| 569 |
iMicknl/azure-podcast-generator
Generate an engaging podcast based on your document using Azure OpenAI and... |
|
Emerging |
| 570 |
santi-pdp/pase
Problem Agnostic Speech Encoder |
|
Emerging |
| 571 |
NeonGeckoCom/neon-tts-plugin-coqui
Coqui AI TTS plugin |
|
Emerging |
| 572 |
Picovoice/leopard
On-device speech-to-text engine powered by deep learning |
|
Emerging |
| 573 |
woheller69/whisperIME
Android Input Method Editor (IME) based on Whisper |
|
Emerging |
| 574 |
seungwonpark/melgan
MelGAN vocoder (compatible with NVIDIA/tacotron2) |
|
Emerging |
| 575 |
stimm-ai/stimm
The Open Source Voice Agent Platform. Orchestrate ultra-low latency AI... |
|
Emerging |
| 576 |
belambert/asr-evaluation
Python module for evaluating ASR hypotheses (e.g. word error rate, word... |
|
Emerging |
| 577 |
modal-labs/quillman
A voice chat app |
|
Emerging |
| 578 |
mozilla/DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text... |
|
Emerging |
| 579 |
pedroetb/tts-api
Text to speech REST API for multiple TTS engines |
|
Emerging |
| 580 |
hetpandya/youtube_tts_data_generator
A python library to generate speech dataset from Youtube videos |
|
Emerging |
| 581 |
eheikes/tts
Tools to convert text to speech :books::speech_balloon: |
|
Emerging |
| 582 |
voice-cloning-app/Voice-Cloning-App
A Python/Pytorch app for easily synthesising human voices |
|
Emerging |
| 583 |
thevickypedia/py3-tts
Offline Text To Speech library for python |
|
Emerging |
| 584 |
davidamacey/OpenTranscribe
Self-hosted AI-powered transcription platform with speaker diarization,... |
|
Emerging |
| 585 |
jim-schwoebel/voicebook
🗣️ A book and repo to get you started programming voice computing... |
|
Emerging |
| 586 |
savbell/whisper-writer
💬📝 A small dictation app using OpenAI's Whisper speech recognition model. |
|
Emerging |
| 587 |
ddPn08/rvc-webui
liujing04/Retrieval-based-Voice-Conversion-WebUI reconstruction project |
|
Emerging |
| 588 |
opendilab/CleanS2S
High-quality and streaming Speech-to-Speech interactive agent in a single... |
|
Emerging |
| 589 |
ActiveNick/HoloBot
HoloBot is a reusable 3D interface that allows HoloLens & VR users to... |
|
Emerging |
| 590 |
keonlee9420/STYLER
Official repository of STYLER: Style Factor Modeling with Rapidity and... |
|
Emerging |
| 591 |
lucoiso/UEAzSpeech
This plugin integrates Azure Speech Cognitive Services in Unreal Engine. |
|
Emerging |
| 592 |
liangstein/Chinese-speech-to-text
Chinese Speech To Text Using Wavenet |
|
Emerging |
| 593 |
avinashvarna/sanskrit_tts
Sanskrit text to speech |
|
Emerging |
| 594 |
advanced-media-inc/amivoice-api-client-library
AmiVoice API Client Library and the sample programs |
|
Emerging |
| 595 |
travisvn/edge-tts-client
Client-side (web browser) implementation of Edge TTS package — Microsoft... |
|
Emerging |
| 596 |
albirrkarim/react-speech-highlight-demo
React / Vanilla JS Text to Speech with highlighting the words and sentences... |
|
Emerging |
| 597 |
ModelTC/LightTTS
LightTTS is a lightweight TTS inference framework optimized for CosyVoice2... |
|
Emerging |
| 598 |
zlargon/google-tts
Google TTS (Text-To-Speech) for node.js |
|
Emerging |
| 599 |
enhuiz/vall-e
An unofficial PyTorch implementation of the audio LM VALL-E |
|
Emerging |
| 600 |
Aivis-Project/AIVM-Generator
Aivis Voice Model File (.aivm/.aivmx) Generator / Editor |
|
Emerging |