All Voice AI Tools
8,165 tools ranked by quality score · Page 26 of 82
| # | Tool | Score | Tier |
|---|---|---|---|
| 2501 |
sandy1990418/ChineseTaiwaneseWhisper
This repository focuses on leveraging OpenAI's Whisper model for speech... |
|
Emerging |
| 2502 |
nhaouari/local11labs
Local11Labs allows generating high-quality text-to-speech and podcast... |
|
Emerging |
| 2503 |
LucaDe/text_to_speech_api
A simple wrapper for Google's Text-To-Spech API for Dart and Flutter projects. |
|
Emerging |
| 2504 |
Gaurav890/vocal-stack
vocal-stack is a high-performance utility library for developers building... |
|
Emerging |
| 2505 |
alkhimey/esp32-flite
Speech synthesis running on ESP32 based on Flite engine. |
|
Emerging |
| 2506 |
jiwidi/DeepSpeech-pytorch
Pytorch implementation for DeepSpeech 2.0 |
|
Emerging |
| 2507 |
jianchang512/gemini-speech2srt
使用 Gemini AI 转写音视频为 SRT 字幕 |
|
Emerging |
| 2508 |
ale-grassi/discord-elevenlabs-tts-bot
A simple Discord TTS bot that uses the Eleven Labs API |
|
Emerging |
| 2509 |
medokin/soundpad-text-to-speech
Text-To-Speech for Soundpad |
|
Emerging |
| 2510 |
hwk06023/SONATA
SONATA (SOund and Narrative Advanced Transcription Assistant): An advanced... |
|
Emerging |
| 2511 |
simalexan/speechy
Voice command tool for an easy web speech recognition for your web... |
|
Emerging |
| 2512 |
EuleMitKeule/speaker-recognition
Speaker recognition service for Home Assistant using voice embeddings. Train... |
|
Emerging |
| 2513 |
sskorol/respeaker-websockets
This project reveals full Respeaker Core V2 potential by using bundled... |
|
Emerging |
| 2514 |
JensBorrisholt/GoogleSpeak
This repository demonstrates how to Use Google for implementing Text to... |
|
Emerging |
| 2515 |
r1di/neutts-fastapi
OpenAI-compatible Text-to-Speech API server powered by NeuTTS. Drop-in... |
|
Emerging |
| 2516 |
makeabilitylab/ProtoSound
ProtoSound is a deployable interactive system for personalizing a sound... |
|
Emerging |
| 2517 |
ZhuoZhuoCrayon/AcousticKeyBoard-Web
❓声学键盘|脑洞大开:做一个能听懂键盘敲击键位的「玩具」,学习信号处理 / 深度学习 / 安卓 / Django。 |
|
Emerging |
| 2518 |
Pallas1303/FestPB
FestPB é um projeto com objetivo de oferecer suporte ao Português Brasileiro... |
|
Emerging |
| 2519 |
Speech-to-text-Kafka-Airflow-Spark/StoTkas
Data engineering pipeline that allows recording millions of Amharic and... |
|
Emerging |
| 2520 |
Supremolink81/TTSCeleb
A TTS app where you can clone the voices of any person you wish. |
|
Emerging |
| 2521 |
felipefacundes/guglinatts
Guglina TTS é um sintetizador de voz, em português do Brasil, que lê telas... |
|
Emerging |
| 2522 |
teyang-lau/YOListenO
Building an AI-powered tool for auto converting audio from lectures/meetings... |
|
Emerging |
| 2523 |
laszukdawid/cracker
Usable GUI for text-to-speech services |
|
Emerging |
| 2524 |
freakingrocky/EmoCh
Emotion Analysis from Speech AI in Python using mfcc, mel, chroma |
|
Emerging |
| 2525 |
Jen-Hung-Ho/ros2_jetbot_voice
Jetbot Voice to Action Tools is a set of ROS2 nodes that utilize the Jetson... |
|
Emerging |
| 2526 |
ThisModernDay/f5-tts
F5-TTS is a web application that allows users to clone voices and generate... |
|
Emerging |
| 2527 |
nay-cat/LiveKit-PiperTTS-Plugin
Quick integration of Piper TTS (super lightweight, high-quality model) with LiveKit |
|
Emerging |
| 2528 |
shaheennabi/Multi-lingual-AI-Assistant-with-gTTS-and-Gemini-Pro
An end-to-end AI assistant using gTTS for multi-lingual text-to-speech and... |
|
Emerging |
| 2529 |
adrxLV/J.A.R.V.I.S.AI
A AI-powered voice assistant based on JARVIS using ollama. |
|
Emerging |
| 2530 |
sudonitin/Audio-book-generator
Convert your ebooks to audiobooks. 📖->🎧 |
|
Emerging |
| 2531 |
TharanaBope/whisper-v3-diarization
Production-ready audio transcription & speaker diarization CLI & GUI using... |
|
Emerging |
| 2532 |
ctkqiang/ZhuYing
竹影是一款创新的视频语音转录与翻译工具,专注于提供高质量的视频音频转文字服务和多语言翻译功能。本项目采用先进的人工智能技术,为用户提供便捷的视频内容处理解决方案。 |
|
Emerging |
| 2533 |
Dark2C/Viral-Faceless-Shorts-Generator
Automatically generate faceless YouTube Shorts from trending topics using AI... |
|
Emerging |
| 2534 |
ARAI-Telegram/teledash-backend-processing
Optional AI-powered features of Teledash, an open-source software for... |
|
Emerging |
| 2535 |
boochow/TFLite_Micro_MicroSpeech_M5Stack
M5Stack (ESP32) port of TensorFlow Lite for Microcontrollers demo "Micro Speech" |
|
Emerging |
| 2536 |
kaituoxu/Tacotron2
A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS)... |
|
Emerging |
| 2537 |
dfop02/auto-sub
Automatically subtitle a video from almost any language to your native... |
|
Emerging |
| 2538 |
rezkyatinnov/capetangjs
A JavaScript library for text to speech vice versa using Web Speech API |
|
Emerging |
| 2539 |
DePasqualeOrg/swift-tiktoken
A pure Swift implementation of OpenAI's tiktoken tokenizer |
|
Emerging |
| 2540 |
twangodev/speak-mintlify
Automatically generate voice narration for your Mintlify documentation. |
|
Emerging |
| 2541 |
upskyy/Paper-Review
Paper Review about Speech Recognition · NLP |
|
Emerging |
| 2542 |
vibhasdutta/PC-ASSISTANT
A voice-operated PC assistant for Windows , enabling hands-free control for... |
|
Emerging |
| 2543 |
tuanio/nextformer
PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For... |
|
Emerging |
| 2544 |
GENIVI/VCIVING-SpeechRecognition
GENIVI GSoC 2018 and 2019 |
|
Emerging |
| 2545 |
GeoHaberC/Story-to-Video
Create a Movie animation plus Audio plus Subtitle from a text file |
|
Emerging |
| 2546 |
spandan114/AI-realtime-voice-agent
A Python-based real-time voice-to-voice conversation system that lets you... |
|
Emerging |
| 2547 |
Llamacha/asr-htk-quechua
ASR for quechua language is an open source which can run in real time using... |
|
Emerging |
| 2548 |
anooptoffy/DLJeju2018CodeRepoASR
Details on my work on using GANs for speech synthesis for improving Speech... |
|
Emerging |
| 2549 |
eazhary/dctts2
Deep Convolution Text to Speech |
|
Emerging |
| 2550 |
nowickam/facial-animation
Audio-driven facial animation generator with BiLSTM used for transcribing... |
|
Emerging |
| 2551 |
lucasnewman/e2-tts-mlx
Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive... |
|
Emerging |
| 2552 |
Bassamejlaoui/Voice-Cloning-Translation-Transcription
Voice cloning, a revolutionary technology, allows us to replicate and... |
|
Emerging |
| 2553 |
zoebchhatriwala/CamWord
CamWord Is an android application that uses character recognition and voice... |
|
Emerging |
| 2554 |
victor369basu/End2EndAutomaticSpeechRecognition
In this repository, I have developed an end to end Automatic speech... |
|
Emerging |
| 2555 |
aishoot/Multi-Hotword_Spotting
Won't it be cool to build a speech assistant like Alexa or Siri yourself... |
|
Emerging |
| 2556 |
pnkvalavala/digitaltwin
Using a single image and just 10 seconds of sample audio, our project... |
|
Emerging |
| 2557 |
prathamsolanki/gender-recognition-by-voice
Identify a voice as male or female. |
|
Emerging |
| 2558 |
tabahi/WebSpeechAnalyzer
JS speech analyzer for fast speech analysis and labeling |
|
Emerging |
| 2559 |
CypherousSkies/reading-for-listeners
A deep-learning powered accessibility application which turns pdfs into... |
|
Emerging |
| 2560 |
AASHISHAG/DeepSpeech-API
The code enables users to use Mozilla's Deep Speech model over the Web Browser. |
|
Emerging |
| 2561 |
bhattbhavesh91/speech-python-demos
pyttsx3 is a text-to-speech conversion library in Python. Its a Python-based... |
|
Emerging |
| 2562 |
Issac-Moses/Beacon
Beacon – A lightweight voice-controlled AI assistant using Whisper.cpp. ... |
|
Emerging |
| 2563 |
Enforcer03/voice-cloning
Voice cloning with tortoise-tts |
|
Emerging |
| 2564 |
HerambVD/spoken2written
A source of python package which converts language styles in speech to its... |
|
Emerging |
| 2565 |
MrAliHasan/Sophia-AI-Assistant
Sophia AI Assistant is a Python-based desktop AI that performs a variety of... |
|
Emerging |
| 2566 |
Ishan7390/Jarvis_AI
This is my attempt at building a not so much of an AI, Jarvis |
|
Emerging |
| 2567 |
Zuellni/Orpheus-GGUF
Orpheus-TTS inference. |
|
Emerging |
| 2568 |
thewh1teagle/vad-rs
Speech detection using silero vad in Rust |
|
Emerging |
| 2569 |
The-Data-Dilemma/MediBeng-Whisper-Tiny
MediBeng Whisper Tiny improves doctor-patient transcription by training the... |
|
Emerging |
| 2570 |
RF5/transfusion-asr
Transcribing Speech with Multinomial Diffusion, training code and models. |
|
Emerging |
| 2571 |
stellarloop/bitbat.ai
My father, a journalist, used to painstakingly transcribe interviews from a... |
|
Emerging |
| 2572 |
yakhyo/kokoro-onnx
Kokoro-82m TTS ONNX Runtime inference | Gradio Demo | HuggingFace Demo | Docker |
|
Emerging |
| 2573 |
rhulha/Speech2Speech
A web application that converts speech to speech 100% private |
|
Emerging |
| 2574 |
mravanelli/pytorch_MLP_for_ASR
This code implements a basic MLP for speech recognition. The MLP is trained... |
|
Emerging |
| 2575 |
orhun/dialogflowbot
Google's Dialogflow implementation on Android with additional features. |
|
Emerging |
| 2576 |
gogyzzz/beamformit_matlab
A MATLAB implementation of CHiME4 baseline Beamformit |
|
Emerging |
| 2577 |
neosapience/n8n-nodes-typecast
Integrate Typecast AI TTS into your n8n workflows with this community node. |
|
Emerging |
| 2578 |
agentvoiceresponse/avr-tts-deepgram
This project demonstrates the integration of Agent Voice Response with... |
|
Emerging |
| 2579 |
aydinnyunus/LinuxVoiceAssistant
Linux Voice Assistant for to Make Your Work Easier |
|
Emerging |
| 2580 |
Serkali-sudo/auto-subtitle-generator
An Android app that automatically generates subtitles for videos locally,... |
|
Emerging |
| 2581 |
KathyReid/opensource-voice-tools
A repo listing known open source voice tools, ordered by where they sit in... |
|
Emerging |
| 2582 |
pschatzmann/arduino-simple-tts
A simple TTS solution based on pre-recorded audio |
|
Emerging |
| 2583 |
Madhur215/Chatbot-cum-voice-Assistant
An AI chatbot with features like conversation through voice, fetching events... |
|
Emerging |
| 2584 |
va-kiet/Voice-Assistant-wake-word-detection-model
Build a Wake Word Detection model for Voice Assistant using PyTorch |
|
Emerging |
| 2585 |
daanzu/wav2vec2_stt_python
Simple Python library, distributed via binary wheels with few direct... |
|
Emerging |
| 2586 |
codename0og/codename-rvc-fork-3
Codename's rvc fork version 3, based on Applio. |
|
Emerging |
| 2587 |
theoomoregbee/paysense-backend
This is our paysense backend , a sails app |
|
Emerging |
| 2588 |
lucadellalib/audiocodecs
A collections of audio codecs with a standardized API |
|
Emerging |
| 2589 |
mtokar3v/ReversoAPI-NET
🌐 An API Client for the reverso.net, written in C#/.NET (Based on Site API... |
|
Emerging |
| 2590 |
ignabelitzky/easy-subber
A Python-based tool that that takes video files and generates .srt subtitle... |
|
Emerging |
| 2591 |
hanxiao/mls
MLX Local Serving (MLS) - Unified ASR, TTS, and Translation on Apple Silicon |
|
Emerging |
| 2592 |
gunarakulangunaretnam/voice-typer
A voice recognition based typing tool for English, Tamil, Sinhala languages. |
|
Emerging |
| 2593 |
shawnrushefsky/talky-talky
MCP server for Audio Generation and Analysis with a Variety of Open Models. |
|
Emerging |
| 2594 |
revsic/tf-glow-tts
Tensorflow implementation of Glow-TTS |
|
Emerging |
| 2595 |
echo8795/react-native-android-text-to-speech
React Native Text-To-Speech wrapper module for android |
|
Emerging |
| 2596 |
Animator617/jasper
Jasper is a AI asistence programm based on deeplearning |
|
Emerging |
| 2597 |
m0wer/aibot
Telegram bot powered by Ollama, capable of handling text and voice messages,... |
|
Emerging |
| 2598 |
fquirin/speech-recognition-experiments
Experiments to test different speech recognition systems for SEPIA Framework |
|
Emerging |
| 2599 |
Ahmed5attab/Qaf-QuranSearchAndMemorization
iOS Islamic application for the holy Quran, helps the Muslims to have the... |
|
Emerging |
| 2600 |
rt400/ReversoTTS-HA
ReversoTTS component for HomeAssistant |
|
Emerging |