All Voice AI Tools
8,165 tools ranked by quality score · Page 32 of 82
| # | Tool | Score | Tier |
|---|---|---|---|
| 3101 |
AlasdairKing/Calendar-VB6
Simple, accessible Calendar for screenreader and blind users. |
|
Emerging |
| 3102 |
tigjaw/remyme
ReMyMe - a basic "Read My Messages" Android application (old) |
|
Emerging |
| 3103 |
Infineon/i2s-microphone
A collection of documentation and examples for Infineon's I2S microphones. |
|
Emerging |
| 3104 |
The-Data-Dilemma/Medibeng-Orpheus-3b-0.1-ft-Fine-Tuning
Medibeng-Orpheus-3b-0.1-ft- A TTS model for bilingual Bengali-English... |
|
Emerging |
| 3105 |
BenjaminPoncet/bobby-snips-tts
bobby-snips-tts is an implementation of snips-tts written in Node.js with... |
|
Emerging |
| 3106 |
Abhradipta/OCR-With-Read-Out-Loud-Using-Python
An Optical Character Recognition (OCR) System designed using Python to read... |
|
Emerging |
| 3107 |
taeefnajib/Vocazee
A voice cloning and text-to-speech application that can generate speech in any voice. |
|
Emerging |
| 3108 |
viig99/esolafast
Fast C++ implementation of ESOLA using KFRLib, can be used for online... |
|
Emerging |
| 3109 |
koesan/Evoars
A multi-model AI platform for comics, manga, and videos. It colorizes... |
|
Emerging |
| 3110 |
PiasRoY/Bangla-Spoken-Number-Recognition
recognizing spoken Bangla numbers using MFCCs and CNN. |
|
Emerging |
| 3111 |
suzumushi0/SoundObject_binary
SoundObject binary distribution. |
|
Emerging |
| 3112 |
palahsu/Greeting-PC
Greeting PC, made with simple Visual Basic Script. Run file it will executes... |
|
Emerging |
| 3113 |
dhdaines/soundswallower-demo
Simple demo of client-side speech recognition |
|
Emerging |
| 3114 |
TCL606/Speech-Number-Recognition
基于数字信号处理的语音数字识别器 |
|
Emerging |
| 3115 |
baocin/hugging_face_example_STT_api
Demonstration of Hugging Face's (https://huggingface.co/) newly released... |
|
Emerging |
| 3116 |
vinbhaskara/Digit-Speech-Recognition
Using MFCC features on Speech Signals to classify Digits after matching... |
|
Emerging |
| 3117 |
idiap/TIDIGITSRecipe.jl
A Julia recipe for training an ASR system using the TIDIGITS database |
|
Emerging |
| 3118 |
marvinborner/CTC-LSTM
Spoken word recognition using CTC LSTMs for SWR2 Tübingen |
|
Emerging |
| 3119 |
vectominist/rspin
Official inference code for NAACL 2024 paper "R-Spin: Efficient Speaker and... |
|
Emerging |
| 3120 |
SzLeaves/asr-model-ctc
ASR deep learning models (use BiGRU & WaveNet & CTC), use Tensorflow2... |
|
Emerging |
| 3121 |
loglux/FlexAudioPrint
FlexAudioPrint is a Python-based app for transcribing audio to text using... |
|
Emerging |
| 3122 |
SEPIA-Framework/sepia-web-audio
Create modular, cross-browser, web audio pipelines to record and process... |
|
Emerging |
| 3123 |
oeschsec/Sidekick---voice-controlled-keyboard-and-mouse
Voice controlled keyboard and mouse that is lightweight (minimal... |
|
Emerging |
| 3124 |
aeleraqi/gTTS---Arabic-text-to-multiple-languages
Converting Arabic text to speech in various languages with the versatile... |
|
Emerging |
| 3125 |
BobRandomNumber/ComfyUI-KyutaiTTS
A non real-time ComfyUI implementation of Kyutai TTS |
|
Emerging |
| 3126 |
papercast-dev/papercast
A Python pipeline tool and plugin ecosystem for processing technical... |
|
Emerging |
| 3127 |
deepgram/deepgram-js-captions
This package is the JavaScript implementation of Deepgram's WebVTT and SRT... |
|
Emerging |
| 3128 |
khanld/Wav2vec2-Pretraining
Wav2vec 2.0 Self-Supervised Pretraining |
|
Emerging |
| 3129 |
heptacode/interactivekiosk
다양한 사용자를 위한 키오스크 개선 프로젝트 ✨ |
|
Emerging |
| 3130 |
elie-atia/talk-to-chat-gpt
Enable to talk to ChatGPTusing voice-to-text (record and recognize the... |
|
Emerging |
| 3131 |
X-LANCE/VoiceFlow-TTS
[ICASSP 2024] This is the official code for "VoiceFlow: Efficient... |
|
Emerging |
| 3132 |
tsengia/SphinxTrainHelper
A Bash script designed to make training sphinx4 and pocketsphinx acoustic... |
|
Emerging |
| 3133 |
Phe0nix/Speech-Email-Sender
Send email with speech recognition means just start talking and send emails.... |
|
Emerging |
| 3134 |
Philipinho/ThreadVoice
Source code for https://twitter.com/threadvoice |
|
Emerging |
| 3135 |
yeyupiaoling/VITS-PaddlePaddle
本项目是基于PaddlePaddle的语音合成项目,使用的是VITS,VITS是一种语音合成方法,这种时端到端的模型使用起来非常简单,不需要文本对齐等太复... |
|
Emerging |
| 3136 |
bookbot-hive/OpenBible-TTS
Building Text-to-Speech Systems using OpenBible! |
|
Emerging |
| 3137 |
falabrasil/cmusphinx-br
Scripts e recursos para ASR em Português Brasileiro |
|
Emerging |
| 3138 |
arcb01/g-narrator
A screen reading accessibility tool |
|
Emerging |
| 3139 |
kofemann/streetguide
An Android app to discover where you drive |
|
Emerging |
| 3140 |
Ryan5453/lyricscribe
Automated Lyric Transcription Research |
|
Emerging |
| 3141 |
pragmatrix/context-switch
Audio Streaming for FreeSWITCH with backends powered by Azure, OpenAI, and Aristech |
|
Emerging |
| 3142 |
ASR-project/Multilingual-PR
Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM.... |
|
Emerging |
| 3143 |
savg92/voice-cloning
This project provides a comprehensive testing and comparison platform for... |
|
Emerging |
| 3144 |
repodiac/espeak-ng_german_loan_words
Brief tutorial with code where you can automatically create a dictionary... |
|
Emerging |
| 3145 |
tongplw/ASR-web-based-restaurant
🍔 Foody, a smart voice-assistant web-based restaurant using Kaldi, React, and WebRTC |
|
Emerging |
| 3146 |
vishalnagda1/text-to-speech
Python program to convert text to speech. |
|
Emerging |
| 3147 |
KernelOverseer/caLLMe
Realtime voice conversation with llm models using an asynchronous Voice to... |
|
Emerging |
| 3148 |
USSLab/DolphinAttack
Inaudible Voice Commands |
|
Emerging |
| 3149 |
Arbazkhan4712/Text-To-Speech
A program that can convert Text into Speech using python |
|
Emerging |
| 3150 |
auroraapi/aurora-python
Aurora SDK for Python |
|
Emerging |
| 3151 |
belambert/asr-scripts
Lots of miscellaneous scripts to work with Sphinx ASR files and other... |
|
Emerging |
| 3152 |
mehdichaouch/nabstory
Let your Nabaztag 🐰 read you a story 📖 |
|
Emerging |
| 3153 |
hanifabd/voice-activity-detection-vad-realtime
Real-time Voice Activity Detection (VAD) with some example use case like... |
|
Emerging |
| 3154 |
visu123s/MimicKit
🤖 Learn motion imitation with MimicKit, a framework offering advanced... |
|
Emerging |
| 3155 |
Inviro/Illud
Illud is a smart text analyzer written in pure Java that displays different... |
|
Emerging |
| 3156 |
speechly/api
Speechly public API definitions and generated code |
|
Emerging |
| 3157 |
lpkpaco/Bocchi-The-Rock-GPT-SoVITS-Models
Contains voice models based on the GPT-SoVITS architecture of different... |
|
Emerging |
| 3158 |
ggh-png/EMOTIBOT
emotion robot using gpt model3.5 EMOTIBOT |
|
Emerging |
| 3159 |
nikkiw/realtime_translator
Python tool for real-time voice recognition and multilingual translation |
|
Emerging |
| 3160 |
SEPIA-Framework/sepia-docs
Documentation and Wiki for SEPIA. Please post your questions and bug-reports... |
|
Emerging |
| 3161 |
m1n1v1rus/futuristic-calculator
A futuristic, AI-powered advanced calculator with voice control, graph... |
|
Emerging |
| 3162 |
wamich/personal-vocabulary
「个人词库」是一款浏览器插件。 用于英文阅读时,不断记住生词,构建个人词库。 |
|
Emerging |
| 3163 |
in03/squawk
Automatic subtitles for DaVinci Resolve with OpenAI Whisper |
|
Emerging |
| 3164 |
indri-voice/audiotoken
Audio tokenization, in the fastest way possible! |
|
Emerging |
| 3165 |
charlescao460/SpeechRecognitionByGoogleCloud
A .NET program that captures local audio and recognizes speech |
|
Emerging |
| 3166 |
milosgajdos/go-playht
PlayHT API client Go module |
|
Emerging |
| 3167 |
binglel/asr_baidu_web_server
asr web server based on flask |
|
Emerging |
| 3168 |
aks-devs/mod_whisper_asr
Freeswitch ASR module |
|
Emerging |
| 3169 |
theawless/sr-lib
Automatic Speech Recognition library for my BTech Project. |
|
Emerging |
| 3170 |
kouyt5/lightning-asr
基于pytorch-lighting框架搭建的端到端语音识别模型,目前还在实验中,性能在不断优化 |
|
Emerging |
| 3171 |
AppleHolic/FastSpeech2
Refactored version of https://github.com/ming024/FastSpeech2 |
|
Emerging |
| 3172 |
denizariyan/Real-Time-Auto-Transcriber
Automatic transcriber made with the Nvidia NeMo AI toolkit. Used to... |
|
Emerging |
| 3173 |
naturalDesign/fusion-remote
Chatbot for Autodesk Fusion 360 with speech recognition |
|
Emerging |
| 3174 |
cjh0613/vosk-android-demo-chinese
中文 vosk-android-demo |
|
Emerging |
| 3175 |
MatteoM95/Smart-Home-Vigilance-System
An indoor video surveillance system capable of recognizing the presence of a... |
|
Emerging |
| 3176 |
kehlawicode/audiblez
🎧 Create high-quality audiobooks from e-books with ease using Audiblez,... |
|
Emerging |
| 3177 |
guibranco/talabat-hackathon-2022
🏃 💡 Talabat Hackathon 2022 API project |
|
Emerging |
| 3178 |
egorsmkv/radtts-uk
🇺🇦 Ukrainian RAD-TTS++ models (decoder + models with 3 voices) and HiFiGAN model |
|
Emerging |
| 3179 |
zhurlik/smart-home
A multi-project that contains UDP server, MQTT broker and a few sub-projects... |
|
Emerging |
| 3180 |
1epalpyrgou/smartbell-server
Ένα έξυπνο κουδούνι για το σχολείο μας - 1ο Επαγγελματικό Λύκειο Πύργου |
|
Emerging |
| 3181 |
nisiddharth/TextToSpeech
A Simple Java based Text to Speech converter made using NetBeans 8.2 |
|
Emerging |
| 3182 |
burrmill/sph2pipe
sph2pipe v2.5. We do not maintain this, and/or accept pull requests; just... |
|
Emerging |
| 3183 |
MaikeMota/comando-voz
Utilizando HTML5 SpeechRecognizer para Reconhecimento de Comandos. |
|
Emerging |
| 3184 |
Zuhef/Text-to-Speech
USING HTML , CSS AND JAVASCRIPT I HAVE BUILD A SIMPLE TEXT TO SPEECH CONVERTER. |
|
Emerging |
| 3185 |
pkprajapati7402/Darvin-Chatbot
Darvin is a Python-based voice-activated chatbot that interacts with users... |
|
Emerging |
| 3186 |
GitPolyakoff/voice-assistant
Voice Assistant — приложение на C# для управления компьютером голосом.... |
|
Emerging |
| 3187 |
wukan1986/KWebSpeaker
保持原排版可选段的网页朗读神器 |
|
Emerging |
| 3188 |
Flux9665/ArticulatoryTextFrontend
This is a text-processing frontend that converts graphemes to phonemes and... |
|
Emerging |
| 3189 |
Ex094/VoiceCom
A Simple Voice Command Application powered by Java and Sphinx4 Speech... |
|
Emerging |
| 3190 |
ognistik/alfred-superwhisper
Use Alfred to Control Superwhisper - AI Powered Voice to Text |
|
Emerging |
| 3191 |
speechnotes/speechnotes-speech-recognizer
The speech recognition engine behind Speechnotes, based on the Webspeech-API |
|
Emerging |
| 3192 |
backpropper/DNN-Activation-Brain
Code repository for Dissecting the DNN Brain for a Better Insight (ICASSP 2016) |
|
Emerging |
| 3193 |
Alan-6666/chinese_asr
a demo of chinese asr |
|
Emerging |
| 3194 |
mayank-kumar-giri/Speech-Recognizer-cum-Voice-Typing-Editor
Speech Recognizer cum text editor that facilitates voice typing using Google... |
|
Emerging |
| 3195 |
CodingWithEnjoy/Speech-To-Text-Python
متن به صدا | Text To Speech 😊🤩 |
|
Emerging |
| 3196 |
HawksLab/narratify
e-book to audiobook convertor |
|
Emerging |
| 3197 |
PalabraAI/palabra-ai-java
Java SDK for Palabra AI's real-time speech-to-speech translation API. Break... |
|
Emerging |
| 3198 |
grayhatdevelopers/deepdub
🗣️ Videos for everyone. Implementation of "Automated Dubbing and Facial... |
|
Emerging |
| 3199 |
mallorbc/brillibot-client
Easy to use voice commands API python client. Create your own commands in... |
|
Emerging |
| 3200 |
VisionBrain/Neural_Voice_Cloning
Open Source Implementation of Neural Voice Cloning with Few Audio Samples... |
|
Emerging |