All Voice AI Tools
8,165 tools ranked by quality score · Page 27 of 82
| # | Tool | Score | Tier |
|---|---|---|---|
| 2601 | Arvind2903/Accent-Classification-And-Conversion: Tackle accent classification and conversion using audio data, leveraging... | | Emerging |
| 2602 | matthijsvk/TIMITspeech: Speech recognition on the TIMIT (or any other) dataset | | Emerging |
| 2603 | jayesh15111988/SpeechRecognitionLibrary: A pluggable library for speech recognition on iOS - Requires iOS 10.0+ | | Emerging |
| 2604 | tollwerk/speakable: Simple and privacy friendly on-page screenreader / text-to-speech player... | | Emerging |
| 2605 | xinjli/ucla-phonetic-corpus: Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH... | | Emerging |
| 2606 | ryhorv/tf-flowavenet: TensorFlow implementation of "FloWaveNet: A Generative Flow for Raw Audio" | | Emerging |
| 2607 | TheMonocledHamster/Hamster-Bot-Prototype: Rudimentary Chatterbot written in Python | | Emerging |
| 2608 | RomainLLC/booking-openai-chatbot: Booking chatbot example app with Django, OpenAI and text to speech | | Emerging |
| 2609 | robmsmt/CommonCorrections: Easily fix common corrections in speech! | | Emerging |
| 2610 | mateusz-kow/auto-subs: Generate, edit and apply subtitles locally using Whisper or any ASR backend | | Emerging |
| 2611 | creafz/kaggle-speech-recognition: Solution for TensorFlow Speech Recognition Challenge on Kaggle (125th place, top 10%) | | Emerging |
| 2612 | lesleyrs/clipboard-narrator: Turn any web page into an audiobook, works in the background on desktop! | | Emerging |
| 2613 | SuryanshNaugraiya/AI-JARVIS: AI JARVIS, an intelligent personal assistant, is a software agent that can... | | Emerging |
| 2614 | pkprajapati7402/JARVIS-voice-assistant: JARVIS Voice Assistant is a powerful and intuitive voice-activated assistant... | | Emerging |
| 2615 | Uni-Creator/Jarvis-Desktop-Assistance: A powerful desktop assistant built in Python that combines voice commands,... | | Emerging |
| 2616 | techiaith/trawsgrifiwr-arlein: Website code for Trawsgrifiwr Ar-lein (Online Transcriber) by the Language Technologies Unit, Bangor University... | | Emerging |
| 2617 | Lucasfrota/pyssistant: Pyssistant is designed to be a conversational interface builder. | | Emerging |
| 2618 | royshil/obs-squawk: Real-time Text-to-Speech AI Engine built-in OBS, integrative and intuitive | | Emerging |
| 2619 | team-listnr/text-to-speech-api: Listnr Text to speech API | | Emerging |
| 2620 | HelloChatterbox/speech2text: Chatterbox STT engines | | Emerging |
| 2621 | rcdalj/speech2speech: Full speech-to-speech workflow (can be customized to user's requirements) | | Emerging |
| 2622 | angangwa/azure-speech-to-text: Azure speech to text capabilities including OpenAI models. Gradio demo. | | Emerging |
| 2623 | agentvoiceresponse/avr-tts-google-speech-tts: This project demonstrates the integration of Agent Voice Response with... | | Emerging |
| 2624 | charslab/Home-Assistant: Home assistant inspired by Amazon Echo, based on wit.ai with speech recognition | | Emerging |
| 2625 | holgern/ttsforge: Convert EPUB files to audiobooks using Kokoro ONNX TTS | | Emerging |
| 2626 | totalvoice/totalvoice-java: Java client for the TotalVoice API | | Emerging |
| 2627 | 6-robot/xfyun_waterplus: A xfyun ROS package for Waterplus Robots | | Emerging |
| 2628 | Aditya1Jhaveri/AI-Video-Dubbing: AI video dubbing using Google APIs automates translation and dubbing by... | | Emerging |
| 2629 | yandex-cloud-examples/yc-speechkit-streams-recognizer: SpeechKit Streaming Recognizer. | | Emerging |
| 2630 | Gemeri/Discord-Voice-Channel-Bot: A bot that can join voice channels using the OpenAI API and Microsoft's free... | | Emerging |
| 2631 | 9jaswag/speechrec: A simple speech recognition app using the Web Speech API interfaces | | Emerging |
| 2632 | Tinkoff/asterisk-voicekit-modules: Non-blocking Asterisk modules for accessing VoiceKit services for speech... | | Emerging |
| 2633 | haydonryan/epub2audiobook: Blazingly fast EPUB to Audiobook converter | | Emerging |
| 2634 | Drakonis96/whispad: WhisPad is a note management tool where you can write or dictate your notes... | | Emerging |
| 2635 | jawebada/piper-audio-example-streaming-web-worker: Simple piper-js example | | Emerging |
| 2636 | Ayushverma135/Whisper-Hindi-ASR-model-IIT-Bombay-Internship: The Whisper Hindi ASR (Automatic Speech Recognition) model utilizes the... | | Emerging |
| 2637 | huytd/speech: A tool to practice English speaking | | Emerging |
| 2638 | ajaygujja/Kahani-Storytelling-App-For-Children-With-Hearing-Impairment: Storytelling App For Children With Hearing Impairment | | Emerging |
| 2639 | m1el/nemotron-asr.cpp: Nemotron ASR rewrite to GGML | | Emerging |
| 2640 | igorbezsmertnyi/speech: Speech recognition and speech synthesis | | Emerging |
| 2641 | jakob-stoeck/speechToText: iOS speech recognition app for voice messages and general audio files | | Emerging |
| 2642 | kanttouchthis/text_generation_webui_xtts: XTTSv2 Extension for oobabooga text-generation-webui | | Emerging |
| 2643 | revsic/tf-mlptts: TensorFlow implementation of MLP-Mixer based TTS | | Emerging |
| 2644 | speechly/ios-client: The iOS client library for Speechly API | | Emerging |
| 2645 | orbitalsonic/Speech-Recognition-SpeechToTextConverter: The Speech Recognition or Speech-to-Text Converter module in Android,... | | Emerging |
| 2646 | TheDeathDragon/LiveTranslate: Real-time audio translation overlay for Windows - captures system audio +... | | Emerging |
| 2647 | agentvoiceresponse/avr-tts-kokoro: The application sets up an Express.js server that accepts a text string from... | | Emerging |
| 2648 | vdutts7/ai-rapper: Talking Head of your favorite rapper using Transformers, PyTorch, Tortoise... | | Emerging |
| 2649 | musa11971/manhuw: Recognizing and identifying Quran reciters from audio recordings. | | Emerging |
| 2650 | jasonclark/voice-user-interface: Prototypes for voice assistance and UI design based on voice interactions | | Emerging |
| 2651 | prathamesh-mandavkar/AutoTalker: The project focuses on leveraging technology to create new courses,... | | Emerging |
| 2652 | jhudsl/text2speech: Text to Speech | | Emerging |
| 2653 | graphiteSWE/DeSpeect: Code for the product "DeSpeect: a graphical interface for Speect" | | Emerging |
| 2654 | Ziyodullodev/useful-codes: @ziyodev | | Emerging |
| 2655 | MaxMax2016/Grad-TTS-Chinese: Huawei Grad-TTS for Chinese | | Emerging |
| 2656 | pingfury108/book2tts: Audiobook production tool | | Emerging |
| 2657 | radkoder/qt-whisper: A Qt & QML wrapper for whisper.cpp | | Emerging |
| 2658 | stefantaubert/tacotron-cli: Command-line interface to train Tacotron 2 using .wav <=> .TextGrid pairs. | | Emerging |
| 2659 | olami-developers/olami-android-hotword-detect-sdk: Hotword Detection (Wake Word Detection) Android library and sample codes | | Emerging |
| 2660 | renaudjenny/swift-tts: A straightforward package containing version for Swift modern concurrency,... | | Emerging |
| 2661 | Mobile-Artificial-Intelligence/maise: Maise is an open-source android speech engine designed to provide a powerful... | | Emerging |
| 2662 | AmSh4/gemini-live-app: A real-time voice AI web app using Google Gemini Live API. Features... | | Emerging |
| 2663 | markokosticdev/cloud_text_to_speech_nodejs: Single interface to Google, Microsoft, and Amazon Text-To-Speech. | | Emerging |
| 2664 | hanxi/epub2mp3: A tool that uses the Microsoft Edge TTS service to convert EPUB e-books into MP3 audio files. | | Emerging |
| 2665 | parzibyte/reconocimiento-voz-javascript: Use webkitSpeechRecognition to convert speech to text on the web with JavaScript | | Emerging |
| 2666 | masayoshi-louis/microsoft-speech-rs: Rust wrapper for Microsoft speech recognition | | Emerging |
| 2667 | SABER-labs/SABER: Semi-Supervised Audio Baseline for Easy Reproduction | | Emerging |
| 2668 | happyf-weallareeuropean/cC: Auto-speak the latest ChatGPT stream responses, plus more room to display chat content | | Emerging |
| 2669 | crazymidnight/speech-recognition: [WIP] Speech recognition microservice | | Emerging |
| 2670 | Mildemelwe/Non-English-Tacotron-2-Training-Notebook: Tacotron 2 training notebook supporting Japanese, French, and Mandarin | | Emerging |
| 2671 | adrianmfi/gpt-tutor: Generate personalized audio lessons for learning languages with GPT and... | | Emerging |
| 2672 | tomik395/ESP32-AI: Speak to your ESP32 and it speaks back! Your new personal assistant is... | | Emerging |
| 2673 | dyazincahya-blog/k-speech: A simple "text to speech" component | | Emerging |
| 2674 | zero-nnkn/vision-assistant-services: 👁🗨 Vision Assistant (Backend): Smart Assistant for Visually Impaired People | | Emerging |
| 2675 | HristovB/Speech_Recognition_Macedonian: Speech recognition model for recognising Macedonian spoken language. | | Emerging |
| 2676 | nafiuny/ICRCycleGAN-VC: Non-parallel voice conversion called ICRCycleGAN-VC based on CycleGAN and... | | Emerging |
| 2677 | PMO-IT/voiceassistant: Nova, a Java based voice assistant. Runnable on Raspberry Pi. | | Emerging |
| 2678 | Pooventhiran/VSR: Speaker-Independent Speech Recognition using Visual Features | | Emerging |
| 2679 | minji-o-j/AI-Speaker-for-Senior-Citizen: AI speaker for seniors living alone - beyond the role of an ordinary AI speaker, it periodically measures the temperature and humidity of the user's surroundings and, when needed, the environment... | | Emerging |
| 2680 | LM-Kit/LynxTranscribe: LynxTranscribe is a comprehensive, professional-grade audio transcription... | | Emerging |
| 2681 | Hassi34/NLP-Hub: The NLP Hub consists of multiple NLP services, each providing specific... | | Emerging |
| 2682 | bobo52310/TypeLate: Voice-to-text for macOS and Windows. 100% free - fork it, make it yours, and... | | Emerging |
| 2683 | ramizeid/Discord-Voice-Chat-Text-to-Speech: A text to speech bot for Discord using IBM Watson | | Emerging |
| 2684 | devnamdev2003/PC_Assistant: The virtual assistant is a general-purpose desktop-based application... | | Emerging |
| 2685 | jaywcjlove/TextSoundSaver: Using the TextSoundSaver application, you can convert text into realistic... | | Emerging |
| 2686 | gokulakannant/text-to-speech: An experimental project for a React JS and Electron app. Download binaries here:... | | Emerging |
| 2687 | danielclough/parler-tts-wasm: A Rust and Wasm Demo to generate and play speech from text using Parler-TTS. | | Emerging |
| 2688 | seungwonpark/awesome-tts-samples: Awesome list of TTS papers with audio samples | | Emerging |
| 2689 | streamer45/streamkit: StreamKit is a self-hosted real-time media processing engine with pluggable... | | Emerging |
| 2690 | ywatanabe1989/scitex-notification: Give your AI agents a voice - TTS, phone calls, SMS, email, webhooks. One... | | Emerging |
| 2691 | FlorianEagox/WeeaBlind: A program to dub non-English media with modern AI speech synthesis,... | | Emerging |
| 2692 | jumadi59/android-game-teka-teki-silang: Simple game Teka-Teki Silang (Word Cross). Available on the play store! | | Emerging |
| 2693 | ikram-shah/iris-fhir-transcribe-summarize-export: A full-stack application that allows practitioners to record voice notes and... | | Emerging |
| 2694 | ddlBoJack/MT4SSL: [INTERSPEECH 2023 Best Paper Shortlist] Official implementation for MT4SSL:... | | Emerging |
| 2695 | winstxnhdw/CapGen: A fast CPU-first video/audio transcriber for generating caption files with... | | Emerging |
| 2696 | hoishing/speech-recog: Speech recognition web app powered by Google Speech API | | Emerging |
| 2697 | tometoproject/tometo: :zzz: A text to speech social network. [mirror] | | Emerging |
| 2698 | MotazSabri/Hanami-release: Live translator that captures any audio that comes from a WINDOWS speaker or... | | Emerging |
| 2699 | tirsky/speechpro_wrapper: Wrapper for text to speech speechpro (Russian only) | | Emerging |
| 2700 | yyaadet/autosrt_page: AutoSRT is a macOS app that automatically generates dual language subtitles... | | Emerging |