All Voice AI Tools

8,165 tools ranked by quality score · Page 21 of 82

Showing 2001–2100 of 8,165
# Tool Score Tier
2001 brailcom/speechd-el

Emacs speech and Braille output interface

37
Emerging
2002 Julia-Roman/pepega-tts

Discord bot for Google and Polly Text-to-Speech

37
Emerging
2003 01-vyom/End_2_End_Automatic_Speech_Recognition_For_Gujarati

[ICON 2020] TensorFlow Code for "End-to-End Automatic Speech Recognition...

37
Emerging
2004 Abhishek-op/SR

💡Kivy-android speech recognition

37
Emerging
2005 IndieCoderMM/smart-one-ai

🤖 AI assistant that can listen to user input and provide responses. It...

37
Emerging
2006 soniqo/speech-android

On-device speech SDK for Android — ASR, TTS, VAD, and noise cancellation...

37
Emerging
2007 artcore-c/AI-Voice-Clone-with-Qwen3-TTS

Free voice cloning and TTS for creators using Qwen3-TTS on Google Colab....

37
Emerging
2008 jonelo/jAdapterForNativeTTS

A simple pure Java library that allows you to use the native Text To Speech...

37
Emerging
2009 ScottishFold007/Cosyvoice_DPO_NOTES

CosyVoice_DPO_NOTES: Supercharge Your Cosyvoice model with Cutting-Edge DPO...

37
Emerging
2010 calinalexandru/pericles

A browser extension offering intuitive text-to-speech functionality, making...

37
Emerging
2011 nchudleigh/sc2-ultra

Voice-controlled StarCraft II - command Zerg, Protoss, or Terran using...

37
Emerging
2012 aks-devs/mod_openai_tts

Freeswitch Speech-To-Text module

37
Emerging
2013 shafaypro/PYSHA

A Simple Virtual Assistant Build in Python 3.5

37
Emerging
2014 scripty-bot/scripty

Speech to text bot for Discord

37
Emerging
2015 iron-mukakin/Emoji-TTS

Irodori-TTSのフォーク、echo-TTSのwebuiになります。

37
Emerging
2016 Martouta/speech_processor

Speech-to-text from videos and audios (including youtube and tiktok links)

37
Emerging
2017 rishikksh20/iSTFT-Avocodo-pytorch

Ultrafast GAN based Vocoder for Text to Speech

37
Emerging
2018 parthgupta1208/VoiceCraft

Voice Craft is a desktop AI assistance tool designed to help people with...

37
Emerging
2019 deepily/genie-in-the-box

Genie in the Box: Distill Whisper STT => Mistral-7B =>...

37
Emerging
2020 mozi1924/Qwen3-TTS-EasyFinetuning

Easy fine-tuning for Qwen3-TTS: Fast voice cloning and high-quality...

37
Emerging
2021 kurianbenoy/malayalam_asr_benchmarking

A study to benchmark whisper based ASRs in Malayalam

37
Emerging
2022 audioku/cross-accent-maml-asr

Meta-learning model agnostic (MAML) implementation for cross-accented ASR

37
Emerging
2023 williamxhero/ttsmaker

TTSMaker: A Python library for interacting with the TTSMaker API to easily...

37
Emerging
2024 loushou/flutter_tts_improved

A fork of the Flutter_TTS (https://github.com/dlutton/flutter_tts) plugin,...

37
Emerging
2025 skit-ai/speech-recognition

SDKs and docs for Skit's speech to text service

37
Emerging
2026 superU-ai/voice-agent-QA

A unified benchmarking framework for evaluating Voice AI agents across...

37
Emerging
2027 jfainberg/lattice_combination

Lattice combination algorithm to combine inaccurate transcripts with...

37
Emerging
2028 phineas-pta/speech-synthesis-ngngngan

python script to download & process data to train a speech-synthesis model...

37
Emerging
2029 chameleon-ai/vevo

Simple GUI for Amphion Vevo

37
Emerging
2030 acyclics/speech-to-speech-translator

Enables a device to input speech from a microphone, translate speech to a...

37
Emerging
2031 mirfan899/CTTS

Cantonese TTS frontend

37
Emerging
2032 frrobledo/AutoDub

An advanced AI-powered tool that automatically translates and dubs YouTube...

37
Emerging
2033 hcoles/voices

Fast, in-process text to speech for Java

37
Emerging
2034 ferosai/feros

Open-source voice agent OS. Rust runtime, AI-driven builder, sub second...

37
Emerging
2035 qiujiali/lattice_rnn

Bi-directional Lattice Recurrent Neural Networks for Confidence Estimation

37
Emerging
2036 liou666/audiread

📻 A simple and user-friendly online TTS tool. (简单易用的在线文本转语音工具)

37
Emerging
2037 stevenhillis/awesome-asr-contextualization

A curated list of awesome papers on contextualizing E2E ASR outputs

37
Emerging
2038 mishrababhishek/chatbot

AI Chatbot answers students' queries about their college program using...

37
Emerging
2039 botbahlul/js-live-audio-video-translate

HTML Web template that can RECOGNIZE any live audio/video streaming (using...

37
Emerging
2040 ameerbadri/twilio-asr-realtime-dashboard

Twilio ASR and Intent Realtime Dashboard

37
Emerging
2041 ndenicolais/SpeechAndText

Android application built with Kotlin and Jetpack Compose that shows how to...

37
Emerging
2042 OpenASR/idiolect

🎙️ Handsfree Audio Development Interface

37
Emerging
2043 SaptakBhoumik/easySpeech

easySpeech is an open-source Python wrapper for google speech to text API...

37
Emerging
2044 weespin/RequestifyTF2

Client side commands for mic spamming and more!

37
Emerging
2045 clloret/speaking-practice

An Android application to practice English pronunciation

37
Emerging
2046 theaifutureguy/Vocal-Agent

A sophisticated real-time voice assistant that seamlessly integrates speech...

37
Emerging
2047 Helow19274/aiogTTS

Async Python library to interface with Google Translate's text-to-speech API

37
Emerging
2048 SkyDocs/speaker-identification

Speaker Identification using Neural Net.

37
Emerging
2049 haiodo/oaitt

An OpenAI compatible transcriber using transformers and whisperx.

37
Emerging
2050 LibraryOfCongress/speech-to-text-viewer

AWS Transcribe evaluation pipeline: bulk-process audio files and view the results

37
Emerging
2051 DrAchernar/location-based-AR-app

This Flutter project is an example for a location based AR app with...

37
Emerging
2052 abinashmeher999/voice-data-extract

A command line interface to combine text information from subtitles with...

37
Emerging
2053 LuluW8071/Conformer

End-to-End Speech Recognition Training with Conformer CTC using PyTorch Lightning⚡

37
Emerging
2054 cmsflash/deep-learning-sota

State-of-the-art results for deep learning tasks in various fields.

37
Emerging
2055 linto-ai/linto-diarization

Speaker diarization service

37
Emerging
2056 ORI-Muchim/One-Click-MB-iSTFT-VITS2

MB-iSTFT-VITS2(Data Preprocessing + Whisper + Text Preprocessing + Making...

37
Emerging
2057 niteshsharmacodes/neutts-ultimate

NeuTTS-Ultimeate - Advanced Text-to-Speech generation with unlimited...

36
Emerging
2058 Mohamed-samy2/Video-Interview-Analysis

PRVIA is an AI-powered system that automates the evaluation of pre-recorded...

36
Emerging
2059 csyan5/AttnGAN-Audio-to-image-geneation

CMPT726 Machine Learning Final Project

36
Emerging
2060 nate-russell/Scholar2Go

Make MP3 albums out of Academic PDFs. Works by gluing together Grobid and...

36
Emerging
2061 arora-r/chatapp-with-voice-and-openai

This project uses OpenAI's GPT-3 model to create a simple assistant that can...

36
Emerging
2062 javichur/fitness-voice

AI voice-controlled trainer in your web browser, using NLP (wit.ai), body...

36
Emerging
2063 speechly/browser-client-example

A demo app showcasing Speechly browser-client and detailed api responses.

36
Emerging
2064 Fraunhofer-AISEC/towards-resistant-audio-adversarial-examples

Generation tool for offset-resistant audio adversarial examples against Deepspeech

36
Emerging
2065 nixonyh/UnityASR

Automatic Speech Recognition in Unity.

36
Emerging
2066 KoalaV2/K.A.I

Home automation program controlled by your voice.

36
Emerging
2067 nheidloff/unity-watson-vr-sample

Virtual Reality Sample using IBM Watson, Unity and Google Cardboard

36
Emerging
2068 piotrkawa/deepfake-whisper-features

Implementation of the paper "Improved DeepFake Detection Using Whisper Features"

36
Emerging
2069 mike-nott/smart-announcements

Intelligent context-aware voice announcements for Home Assistant....

36
Emerging
2070 Vishnu-tppr/NEXORA-AI

Made with Python, crafted by Vishnu 💻✨ Nexora AI – A smart Python voice...

36
Emerging
2071 Franck-Dernoncourt/ASR_benchmark

Program to benchmark various speech recognition APIs

36
Emerging
2072 chirag127/WebSpeak-TextToSpeech-Browser-Extension

High-fidelity browser extension leveraging the Web Speech API for precise,...

36
Emerging
2073 Hagsten/Talkify

Javascript Text to speech library

36
Emerging
2074 arham-kk/openai-tts

This repository features a Gradio interface designed to leverage the OpenAI...

36
Emerging
2075 manab-kb/Voice-Based-Translator

A Voice Based Translator - Speak in English or any of the available selected...

36
Emerging
2076 chattylabs/conversational-flow

The Conversational Flow combines both native built-in resources and cloud...

36
Emerging
2077 gaborvecsei/whisper-live-transcription

Live-Transcription (STT) with Whisper PoC

36
Emerging
2078 thc1006/whisper-colab-tpu-transcriber

High-performance Google Colab Notebook for fast & accurate audio...

36
Emerging
2079 richardassar/SampleRNN_torch

Torch implementation of SampleRNN: An Unconditional End-to-End Neural Audio...

36
Emerging
2080 neurlang/gospeak

A Golang Text to Speech System

36
Emerging
2081 b4rtaz/voice-assistant

Voice assistant for Visual Studio Code.

36
Emerging
2082 yh1008/speech-to-text

mixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras

36
Emerging
2083 resemble-ai/resemble-unity-text-to-speech

Resemble's voice cloning engine within Unity

36
Emerging
2084 jvandenaardweg/ssml-split

Splits SSML strings into batches AWS Polly ánd Google's Text to Speech API...

36
Emerging
2085 bdim404/Qwen3-TTS-WebUI

基于阿里巴巴 Qwen3-TTS 模型(17 亿参数)的全栈文本转语音 Web 应用,支持语音定制、语音设计和语音克隆,有声书生成功能。A...

36
Emerging
2086 ArchitParnami/Few-Shot-KWS

Few-Shot Keyword Spotting

36
Emerging
2087 ohmstone/pocket-tts-deno

WASM ONNX build of Pocket TTS with voice cloning adapted from...

36
Emerging
2088 aperepel/claude-mlx-tts

Voice-cloned smart attention TTS notifications for Claude Code. AI...

36
Emerging
2089 azu/vscode-read-aloud-text

VSCode extension that read aloud text like Markdown and text etc...

36
Emerging
2090 AceCentre/TextAloud

iOS app. Built in Swift. Reads out text - sentence by sentence, paragraph by...

36
Emerging
2091 alecokas/BiLatticeRNN-Confidence

Confidence Estimation for Black Box Automatic Speech Recognition Systems...

36
Emerging
2092 manish-4007/YT-video-Transcription

An AI tools which helps to analyze any YouTube video, give the sentiment of...

36
Emerging
2093 ga642381/FastSpeech2

Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to...

36
Emerging
2094 bhashini-ai/g2p

Grapheme-to-phoneme (G2P) conversion for Tamil / Kannada languages - a...

36
Emerging
2095 prateekralhan/Speech2Text-for-Long-Audio-Files

Perform SOTA Speech2Text on Long Audio Files with/without diarization Using...

36
Emerging
2096 vijethph/Insight

A Flutter app to help blind people.

36
Emerging
2097 anwar-gazi/ivrworks

Build IVR, run voice campaign, with machine detection, speech recognition...

36
Emerging
2098 asus4/unity-speech-recognizer

iOS Speech Recognizer for Unity

36
Emerging
2099 marcominerva/TranslatorService

A lightweight library that uses Cognitive Translator Service for text...

36
Emerging
2100 kwebby/Qwen3-TTS-Voice-Studio

A Text to Speech App for Qwen3-TTS Family Models to create custom voices,...

36
Emerging
« Prev 1 2 3 19 20 21 22 23 80 81 82 Next »