All Voice AI Tools
8,165 tools ranked by quality score · Page 35 of 82
| # | Tool | Score | Tier |
|---|---|---|---|
| 3401 |
shockless/asr-transformer
Transformer for Automatic Speech Recognition |
|
Emerging |
| 3402 |
aishoot/DTWSpeech
A simple application of DTW Algorithm in isolate word speech recognition. |
|
Emerging |
| 3403 |
hackzilla/SpeechRecognition
A simple yet powerful SwiftUI app for iOS that demonstrates speech... |
|
Emerging |
| 3404 |
rainu/wow-quest-reader
A World of Warcraft Addon which can read the quest text with meant of AI... |
|
Emerging |
| 3405 |
avarayr/yap-for-cursor
Yap for Cursor - Voice To Text integration for Cursor IDE |
|
Emerging |
| 3406 |
France-Travail/TradEmploi-FrontEnd
Frontend of TradEmploi |
|
Emerging |
| 3407 |
DanielLin94144/Test-time-adaptation-ASR-SUTA
Test-time adaptation for speech recognition model by single utterance. The... |
|
Emerging |
| 3408 |
Audio-WestlakeU/UMA-ASR
This repository is the official implementation of unimodal aggregation (UMA)... |
|
Emerging |
| 3409 |
AndreCoutinhom/voice_translator
With the help from a Youtube channel tutorial video, Chat GPT instructions... |
|
Emerging |
| 3410 |
kindo-tk/virtual_assistant
Personal Voice assistant using python |
|
Emerging |
| 3411 |
b4rtaz/voice-assistant-net-server
Voice Assistant Server for VSCode |
|
Emerging |
| 3412 |
AWAS666/Pngify.me
Pngtuber app build on Avalonia.UI with twitch integration and a ttspet |
|
Emerging |
| 3413 |
victormgross/RealVideo
📹 Create engaging video calls with RealVideo, a WebSocket-based system that... |
|
Emerging |
| 3414 |
poretsky/freespeech
English text preprocessor for MBROLA speech synthesizer |
|
Emerging |
| 3415 |
lingualogic/speech-framework
Javascript/Typescript Framework für Spracheingabe/ausgaben und Dialogverarbeitung. |
|
Emerging |
| 3416 |
mohanchandrass/Sentient-NPC-Lightweight-Offline-AI-Voice-Dialogue-Framework
A research-oriented lightweight offline AI NPC dialogue and voice... |
|
Emerging |
| 3417 |
Ryan-M-Smith/Quinton-VoiceAssistant
A simple voice assistant |
|
Emerging |
| 3418 |
allpaqa-jgk/twitch_text_to_speech_bot
Text to Speech bot using Twitch IRC for mac and (linux and windows |
|
Emerging |
| 3419 |
AEJays/edge-tts-nodejs
Node version of edge-tts / Node版本的edge-tts |
|
Emerging |
| 3420 |
SaranshKejriwal/Harold_Finch
Face recognition via voice Commands (OpenCV Python + SpeechRecognition 3.1.3) |
|
Emerging |
| 3421 |
mohaimenulislamshawon/text-to-voice-speech-converter
The program is created based on google text to speech or voice converter... |
|
Emerging |
| 3422 |
CMsmartvoice/Unet-TTS
One-shot TTS with Improved Unseen Speaker and Style Transfer |
|
Emerging |
| 3423 |
jharrilim/RasaDocker
Docker image with Rasa + Anaconda + Tensorflow + portaudio + PyAudio +... |
|
Emerging |
| 3424 |
Aditya-Mishra799/NLP-Speech-Translator-Website
A modern web application for translating and converting speech to text in... |
|
Emerging |
| 3425 |
dilukshann7/Vocaluxe
Python program to extract vocals from YouTube videos for free |
|
Emerging |
| 3426 |
nikolaStanojkovski/Assistive_Bus_Helper
An Android application that allows visually impaired people to hear which... |
|
Emerging |
| 3427 |
svarlamov/aws-polly-node-typescript-demo
Demo of how to use AWS Polly text-to-speech in a web app using NodeJS,... |
|
Emerging |
| 3428 |
SzLeaves/asr-webapp
ASR Web APP 中文语音识别实验室APP,使用Django构建,包含中文语音转文字与中文语音聊天机器人模块 |
|
Emerging |
| 3429 |
netcookies/Edge-TTS-Proxy
Edge-TTS-Proxy 插件将 Microsoft Edge TTS(文本到语音)服务集成到 Home Assistant... |
|
Emerging |
| 3430 |
davealaw/kokoro-electron
Kokoro TTS GUI - a user-friendly Electron application for local neural... |
|
Experimental |
| 3431 |
Avatar-Home-Automation/A.V.A.T.A.R-Server
Agnostic Virtual Assistant for The Automated Residences |
|
Experimental |
| 3432 |
ltphen/martha
Free text to speech synthesizer made with coqui-ai/TTS and flask |
|
Experimental |
| 3433 |
ryuuji06/keyword-spotting
In this repository, I implement a system for detecting specific spoken words... |
|
Experimental |
| 3434 |
answersolutionsapps/runandread-android
Ultimate Text-to-Speech and Audiobook Player for Android |
|
Experimental |
| 3435 |
Sls0n/desktop-assistant
A python-based desktop assistant that can perform a few mundane tasks! |
|
Experimental |
| 3436 |
wavekat/wavekat-turn
Turn detection library for Rust with a unified trait interface over multiple... |
|
Experimental |
| 3437 |
InuInu2022/NodoAme.Home
An official website for NodoAme |
|
Experimental |
| 3438 |
cmirnow/Google-Cloud-TTS-Rails
Using the power of Google Cloud Text-to-Speech API and ruby here is a simple... |
|
Experimental |
| 3439 |
yousefhany77/tts-ai
The Text-to-Speech Library provides a simple unified interface for... |
|
Experimental |
| 3440 |
nclv/RecoVoc
Projet de reconnaissance vocale |
|
Experimental |
| 3441 |
rajjitlai/MimicTTS
MimicTTS is a tool for Voice cloning from a short audio clip. Powered by... |
|
Experimental |
| 3442 |
Ilikepizza2/localspeech-AI
A one command Voice AI deployment script for MacOS. Supports Sesame, Kokoro,... |
|
Experimental |
| 3443 |
Rushi128/voice_assistance
The application is built using Python with Flask for the backend,... |
|
Experimental |
| 3444 |
djelia-org/djelia-js-sdk
Javascript client for interaction with djelia models throught it's API |
|
Experimental |
| 3445 |
hchiam/please
An experimental programming language (transpiler) to make it easier to write... |
|
Experimental |
| 3446 |
yash2410/Avon
A speech recognition based home automation system |
|
Experimental |
| 3447 |
luan78zaoha/TTS_tflite_cpp
TTS inference in C++ based on TFlite model |
|
Experimental |
| 3448 |
khaykingleb/hifi-gan
Neural vocoder for high-fidelity speech synthesis (implementation of the... |
|
Experimental |
| 3449 |
nacerbaaziz/nbsapi
a python library that helps you to control the sapi5 TTS |
|
Experimental |
| 3450 |
r9y9/jsut-lab
HTS-style full-context labels for JSUT v1.1 |
|
Experimental |
| 3451 |
vietai/ASR
End-to-End Vietnamese Speech Recognition using wav2vec 2.0 |
|
Experimental |
| 3452 |
oloflarsson/whisper-spoon
🎙️ Whisper STT Shortcut for Hammerspoon (macOS) |
|
Experimental |
| 3453 |
itsanuragkumarjha/Voice-chat-enabled-RAG-chatbot-with-real-time-internet-access
An open-source project that uses cutting-edge NLP models and real-time web... |
|
Experimental |
| 3454 |
zemags/golang-yandex-speech-kit
SDK for converting text to audio by Yandex premium voices |
|
Experimental |
| 3455 |
GmEsoft/CTS256A-AL2
Commented disassembly of the GI(tm) CTS256A-AL2(tm) Code-To-Speech Processor |
|
Experimental |
| 3456 |
KelvinCampelo/open-aiudio-client
This Next.js application provides a user interface for interacting with... |
|
Experimental |
| 3457 |
deepgram-starters/csharp-live-text-to-speech
Get started using Deepgram's Live Text-to-Speech with this C# demo app |
|
Experimental |
| 3458 |
zolomohan/speech-recognition-in-javascript-starter
Starter Code for Speech Recognition in JavaScript tutorial. |
|
Experimental |
| 3459 |
mochi-neko/VOICEVOX-API-unity
Binds VOICEVOX text to speech API to pure C# on Unity. |
|
Experimental |
| 3460 |
led-mirage/CoeiroClip
COEIROINKでクリップボードに貼り付けられたテキストを読み上げるアプリです。 |
|
Experimental |
| 3461 |
matievisthekat/MyOnlyFriend
A program I made so I could talk to someone ;( |
|
Experimental |
| 3462 |
nsoojin/VoiceControlSample-iOS
Creating a stateful UI with GameplayKit - Voice Control |
|
Experimental |
| 3463 |
smswg/callwg
语音呼叫系统-外呼系统,2026年真正可商用CALLWG语音呼叫系统,语音呼叫系统功能:机器人话术外呼系统|呼叫中心|VIP队列|来电记忆|ASR语音识别... |
|
Experimental |
| 3464 |
guozhonghao1994/Voice_Activity_Detection_V1
2018 Lenovo AI Lab Summer Intern |
|
Experimental |
| 3465 |
StanGirard/quivr-whisper
Talk to your second brain personal assistant using speech 🧠 |
|
Experimental |
| 3466 |
MycroftAI/ZZZ-RETIRED__openstt
RETIRED - OpenSTT is now retired. If you would like more information on... |
|
Experimental |
| 3467 |
UserBeingOfficial/ai-dictionary-koreader
📖 Enhance your reading experience with AI Dictionary, a KOReader plugin that... |
|
Experimental |
| 3468 |
LohChiaHeung/TechTutor
TechTutor is an Augmented Reality (AR) and AI-assisted mobile learning... |
|
Experimental |
| 3469 |
eujuliu/anki-deck-generator
This tool allows users to create Anki cards with words, meanings, examples,... |
|
Experimental |
| 3470 |
mmerlyn/asl-translator
Empowering the deaf and speech-impaired with a real-time ASL translator that... |
|
Experimental |
| 3471 |
Langhalsdino/StageMate
StageMate is the smart assistant for your presentation. It will cover all... |
|
Experimental |
| 3472 |
botbahlul/VOSK-Powered-LIVE-SUBTITLE
ANDROID APP that can RECOGNIZE ANY LIVE AUDIO/VIDEO STREAMING (using VOSK... |
|
Experimental |
| 3473 |
Siemko/boar
boarBot :boar: voice assistant |
|
Experimental |
| 3474 |
primepake/learnable-speech
This repo is text to speech with learnable audio encoder without alignment... |
|
Experimental |
| 3475 |
oasisnoehub/OsisnoeAISpeech
English Text to Speech AI web app: You can better practice your english... |
|
Experimental |
| 3476 |
bykemalh/S2ST
Speech to Speech Translation Python |
|
Experimental |
| 3477 |
makeuseofcode/PDF-to-Audiobook
Python project to convert an eBook pdf to an audiobook. |
|
Experimental |
| 3478 |
DillionLowry/NeuralCodecs
Neural Audio Codecs implemented in C# - DAC, SNAC, Encodec, Dia |
|
Experimental |
| 3479 |
rohankishore/Submind
🎧 Submind is a modern PyQt6 app for generating subtitles (SRT) using Whisper... |
|
Experimental |
| 3480 |
TheIncredibleVee/sqlized
Easy to use, flexible, and user-friendly SQL running app with voice command support |
|
Experimental |
| 3481 |
hezhizheng/cantonese-cool
一个能讲广东话(粤语)的小程序 |
|
Experimental |
| 3482 |
6Morpheus6/IndexTTS2
[NVIDIA, MAC, ROCM] Emotionally Expressive and Duration-Controlled... |
|
Experimental |
| 3483 |
dalehumby/openWakeWord-rhasspy
openWakeWord for Rhasspy |
|
Experimental |
| 3484 |
stefanbringuier/youtube-transcripts
Pass in a YouTube URL and to generate a transcript of the audio |
|
Experimental |
| 3485 |
Qappevox/Voice-Assistant
I'ts just a voice asistant for windows. |
|
Experimental |
| 3486 |
OpenVoiceOS/ovos-docker-tts
Open Voice OS TTS Docker images |
|
Experimental |
| 3487 |
riedemannai/parakeet-mlx-server
OpenAI-compatible FastAPI server for German neurology and neuro-oncology... |
|
Experimental |
| 3488 |
Hexanol777/Kikiyomu
聞き読む. real-time text-to-speech tool for VNs |
|
Experimental |
| 3489 |
pschatzmann/arduino-flite
A small fast portable speech synthesis system |
|
Experimental |
| 3490 |
mpoyraz/wav2vec2-turkish
Turkish Speech Recognition using Facebook's Wav2vec 2.0 models |
|
Experimental |
| 3491 |
sooftware/jasper
PyTorch implementation of "Jasper: An End-to-End Convolutional Neural... |
|
Experimental |
| 3492 |
stitchng/infobip
A NodeJS Wrapper for InfoBip |
|
Experimental |
| 3493 |
german-asr/megs
A merged version of multiple open-source German speech datasets. |
|
Experimental |
| 3494 |
bharathraj-v/fastconformer-ctc-telugu
NVIDIA NeMo's stt_en_fastconformer_ctc_large finetuned on open-source telugu... |
|
Experimental |
| 3495 |
Syedjunaid30/Video_Dubbing_with_ML_driven_Lip_Synchronization
AI-powered video dubbing tool that translates and synchronizes speech with... |
|
Experimental |
| 3496 |
Tugaytalha/NarraPhon
NarraPhon: Advanced Text-to-Speech Conversion Pipeline NarraPhon is a... |
|
Experimental |
| 3497 |
deepgram-starters/go-text-to-speech
Get started using Deepgram's Text-to-Speech with this Go demo app |
|
Experimental |
| 3498 |
chinasilva/MySmartPc
利用微信文件助手,进行语音或者文字控制电脑 |
|
Experimental |
| 3499 |
Br3n0k/transcriber
AI-powered transcription for audio & video with Whisper — self-hosted, fast,... |
|
Experimental |
| 3500 |
ictnlp/SLED-TTS
Streamable Text-to-Speech model using a language modeling approach, without... |
|
Experimental |