All Voice AI Tools

8,165 tools ranked by quality score · Page 22 of 82

Showing 2101–2200 of 8,165
# Tool Score Tier
2101 ElsebaiyMohamed/Modablag

This project presents a comprehensive study on video dubbing techniques and...

36
Emerging
2102 nidi3/swiss-wowbagger

Let yourself be insulted in swiss german. Schöner fluchen auf Berndeutsch.

36
Emerging
2103 jefflai108/Semi-Supervsied-Spoken-Language-Understanding-PyTorch

Semi-supervised spoken language understanding (SLU) via self-supervised...

36
Emerging
2104 ayutaz/uCosyVoice

CosyVoice3 text-to-speech for Unity using ONNX inference. Supports zero-shot...

36
Emerging
2105 gokhaneraslan/XTTS_V2-finetuning

Training XTTS V2 and PEFT LORA Text-to-Speech (TTS)

36
Emerging
2106 crimson0829/RecordVoiceView

录音控件 for Android,支持实时语音转化为文字

36
Emerging
2107 GuruCharan94/az-podcast-transcriber

A podcast transcription service built on Azure that transcribes any new...

36
Emerging
2108 d-kavinraja/MouthMap

MouthMap is a deep learning-based lip reading system that converts silent...

36
Emerging
2109 TejasQ/praise

Do stuff with your voice in the browser.

36
Emerging
2110 shervinemami/practice_speechrec_mappings

A game to help design a better character mapping and to learn the mapping...

36
Emerging
2111 StachePL/ExcelToAmazonPolly

Simple text-to-speech tool combining powers of Excel and Amazon Polly.

36
Emerging
2112 rudra00434/SoulPlayer

My own music application build with Django , Tailwind CSS and Spacy...

36
Emerging
2113 deeheber/text-to-speech-converter

A serverless application that converts blobs of text to speech in an audio file

36
Emerging
2114 Yuan-ManX/ComfyUI-ChatterboxTTS

ComfyUI-ChatterboxTTS is now available in ComfyUI, Chatterbox is the first...

36
Emerging
2115 techiaith/docker-huggingface-stt-cy

Adnabod lleferydd Cymraeg i'r Gymraeg gyda HuggingFace // Speech...

36
Emerging
2116 heyseth/Piper_TTS

Use Piper TTS in Visual Studio Code

36
Emerging
2117 Malith-Rukshan/whisper-transcriber-bot

🎙️ AI-powered Telegram bot for voice-to-text transcription using OpenAI...

36
Emerging
2118 hay/audio2text

Python command line utility wrappers for Whispercpp and other speech-to-text...

36
Emerging
2119 wulee510505/Text2Speach

一句代码搞定语音合成,文字转语音

36
Emerging
2120 uzbekvoice/UzbekVoiceBot

Current and Live Telegram bot for collecting dataset

36
Emerging
2121 ducnt18121997/Viet-Text-Normalization

A Python library for text normalization, specifically designed for...

36
Emerging
2122 Jugendhackt/synthi-tts

Hackathon project to digitize your own voice and have it speak for you!...

36
Emerging
2123 playerony/TensorFlowTTS-ts

This project implements TensorflowTTS in Tensorflow.js using Typescript,...

36
Emerging
2124 poretsky/rulex

Russian pronunciation dictionary

36
Emerging
2125 Harshit-Raj-14/JARVIS-Python-Voice-Assistant

J.A.R.V.I.S - Python Smart AI Voice Assistant

36
Emerging
2126 momalekiii/VTT

Extract Speech/Text from Video

36
Emerging
2127 nishantnnb/spectrolipi

A tool designed to manage annotations for bioacoustics.

36
Emerging
2128 MitchellAW/Discord-Bot

My own Discord chat bot built in Python using the discord.py API. Has been...

36
Emerging
2129 theinlinaung2010/Azure_speech_to_test

Sample code for testing speech recognition (speech-to-text) of Burmese...

36
Emerging
2130 ismailperim/reportcast

Transform reports into podcasts with AI - Nobody reads your reports. But...

36
Emerging
2131 aflr-archive/apiaudio-python

api.audio Python SDK

36
Emerging
2132 cloudcommunity/Text-to-Speech-Engines

A list of different text to speech engines.

36
Emerging
2133 LWalone/fish-speech

🐟 Enhance communication with Fish Speech, a powerful multilingual...

36
Emerging
2134 MontrealAI/sign2text-v0

Sign Language to Text (A to Z) with Artificial Intelligence | Pre-Alpha Demo

36
Emerging
2135 neosun100/Step-Audio-R1.1

Step-Audio-R1.1: The First Audio Language Model with Test-Time Compute...

36
Emerging
2136 sahu-adarsh/intervyu

Practice job interviews with Neerja, an AI interviewer powered by Claude....

36
Emerging
2137 jcsilva/docker-kaldi-android

Dockerfile for compiling Kaldi for Android.

36
Emerging
2138 parzibyte/conversor-imagen-a-texto-js

Extraer texto de imagen utilizando JavaScript y Tesseract.js

36
Emerging
2139 ThePlasmak/faster-whisper

An OpenClaw skill that uses faster-whisper (a faster implementation of the...

36
Emerging
2140 syb0rg/Khronos

The open source intelligent personal assistant

36
Emerging
2141 morfeusys/porfir

Голосовой ассистент Порфирьевич

36
Emerging
2142 Voice-Privacy-Challenge/Voice-Privacy-Challenge-2020

Baseline Recipe for VoicePrivacy Challenge 2020:...

36
Emerging
2143 CodersCreative/faster-whisper-rs

a rust crate for easily implementing faster-whisper stt into your rust programs.

36
Emerging
2144 LinqLover/simple-openai-tts-playground

Try out the OpenAI Text to Speech API in your browser.

36
Emerging
2145 LearnedVector/Wav2Letter

Speech Recognition model based off of FAIR research paper built using Pytorch.

36
Emerging
2146 egorsmkv/tts_uk

High-fidelity speech synthesis for Ukrainian using modern neural networks.

36
Emerging
2147 ontypehq/mlx-swift-asr

On-device speech recognition for Apple Silicon, powered by MLX.

36
Emerging
2148 atosystem/SpeechCLIP

SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model,...

36
Emerging
2149 rafalimadev/piper-tts-call

Python wrapper for Piper TTS with real-time CLI/GUI, global hotkeys, and...

36
Emerging
2150 NeoKazuya/qwen3-tts-enhanced

Enhanced Qwen3-TTS voice cloning GUI with multi-reference samples, variation...

36
Emerging
2151 Degon3399/XTTS_V2

This repository offers a framework for fine-tuning the XTTS_V2 model,...

36
Emerging
2152 aviaryan/Very-Fast-Dictation

Instant dictation app for Mac

36
Emerging
2153 mikex86/DeepSpeech-Java-Bindings

Java Bindings for the C++ library DeepSpeech

36
Emerging
2154 QuantiusBenignus/blurt

Gnome shell extension for accurate OFFLINE speech to text input in Linux...

36
Emerging
2155 MahtaFetrat/ManaTTS-Persian-Tacotron2-Model

Tacotron2 Persian Text-to-Speech Model trained on ManaTTS, the largest open...

36
Emerging
2156 daslearning-org/text-to-speech-offline

A lightweight cross-platform Text-To-Speech application which works on...

36
Emerging
2157 oleksandr-g-rock/speech2text

speech2text

36
Emerging
2158 Saganaki22/ComfyUI-KugelAudio

🗣️ ComfyUI nodes for KugelAudi- Open-source text-to-speech with voice...

36
Emerging
2159 winedarkmoon/ElevenGUI

A user-friendly interface for ElevenLabs' API with added audio transcription...

36
Emerging
2160 1038lab/ComfyUI-VoxCPMTTS

A clean, efficient ComfyUI custom node for VoxCPM TTS (Text-to-Speech)...

36
Emerging
2161 greg-kennedy/p5-NRL-TextToPhoneme

Perl implementation of the Naval Research Laboratory text-to-phoneme...

36
Emerging
2162 wildminder/ComfyUI-KaniTTS

ComfyUI node for modular, human‑like Kani TTS. Generate natural,...

36
Emerging
2163 mu-hashmi/personaplex-mlx

PersonaPlex on Apple Silicon: an MLX port of NVIDIA’s full-duplex...

36
Emerging
2164 tim-gromeyer/VoiceAssistant

Empower Your Voice, Secure Your Privacy - Experience VoiceAssistant, Your...

36
Emerging
2165 echonoshy/tingshu

Tingshu 听舒 | Bringing the author’s voice directly to you

36
Emerging
2166 llami-team/wake-me

AI-based React component library that detects clapping sounds or finger...

36
Emerging
2167 Robofied/Voicenet

Comprehensive Python library for speech and voice.

36
Emerging
2168 stefantaubert/mean-opinion-score

Python library for calculating the mean opinion score and 95% confidence...

36
Emerging
2169 kaloprojects/KALO-ESP32-Voice-Assistant

Code snippets showing how to record I2S audio and store as .wav file on...

36
Emerging
2170 fernicar/Parakeet_GUI_TINS_Edition

A desktop application built using the TINS paradigm for transcribing audio...

36
Emerging
2171 sydkwests/kwest-whisper-analysis

Conducted a comprehensive technical analysis of the Whisper model on...

36
Emerging
2172 Oct4Pie/persian-stt

A Text-To-Speech Model Developed Using 🐸STT

36
Emerging
2173 Ma-Dan/asr-decode

从Kaldi中裁剪的轻量级语音识别解码推理框架,目前实现了MFCC+GMM+Viterbi,不依赖OpenFST、OpenBLAS等库

36
Emerging
2174 wblgers/hmm_speech_recognition_demo

A demo for simple isolated Chinese speech word recognition using GMMHMM in Python

36
Emerging
2175 htn-l/htn-l.github.io

Takes in audio feed from lectures or meetings, performs speech to text...

36
Emerging
2176 supershaneski/openai-chatterbox

A sample Nuxt 3 application that listens to chatter in the background and...

36
Emerging
2177 tsengia/JSGFKit_Plus_Plus

A C++ library for parsing and manipulating JSGF grammar files.

36
Emerging
2178 bundlab/voice-stream

🎙️ Lightweight offline Python TTS engine. Thread-safe, CLI-ready, and...

36
Emerging
2179 MahtaFetrat/ManaTTS-Persian-Speech-Dataset

ManaTTS is the largest open Persian speech dataset with 114+ hours of...

36
Emerging
2180 sooftware/lightning-asr

Modular and extensible speech recognition library leveraging...

36
Emerging
2181 sayyedrizwan/TextConvertor

Convert Text into Voice(Speech) and Speech into Text..

36
Emerging
2182 edouardpoitras/eva

Open source voice-enabled personal assistant

36
Emerging
2183 vigonotion/tts.astromech

Text to Astromech integration for Home Assistant (R2D2 Beep Boop Sounds)

36
Emerging
2184 notebook-nexus/chatterbox-tts-colab

Transform any text into natural-sounding speech, clone voices from audio...

36
Emerging
2185 smartgic/docker-mycroft

Mycroft AI Voice Assistant Docker images and docker-compose.yml files for...

36
Emerging
2186 amitpatil321/VoiceForm

Voice Controlled Form, Which can be filled, cleared, submitted using only...

36
Emerging
2187 maemreyo/omnivoice-server

OpenAI-compatible HTTP server for OmniVoice text-to-speech

36
Emerging
2188 cottongeeks/podscript

Generate podcast transcripts using language and speech-to-text models

36
Emerging
2189 Sundy1219/ctc_beam_search_lm

CTC+Beam_Search+kenlm 是用于以汉字为声学模型建模单元的解码系统

36
Emerging
2190 shanghaimoon888/mod_vadasr

This is FreeSwitch module that can do VAD and ASR with IFLYTEK websocket api.

36
Emerging
2191 mahimairaja/openrtc-python

OpenRTC lets developers run multiple LiveKit voice agents in one Python...

36
Emerging
2192 DKMitt/speech-to-text-js

The Voice Note App's purpose is to experiment with the Web Speech API by...

36
Emerging
2193 Sri-Krishna-V/Elu

AI-powered Chrome extension that makes any web article accessible —...

36
Emerging
2194 vectominist/MiniASR

A mini, simple, and fast end-to-end automatic speech recognition toolkit.

36
Emerging
2195 lucko515/Speech-commands-recognition

Recognizing common speech commands using Keras and Tensorflow.

36
Emerging
2196 Zoomicon/SpeechLib

Library for Speech Synthesis and Recognition using Windows.Speech or...

36
Emerging
2197 GuangChen2333/FindUrVoicesPJSK

《世界计划 : 缤纷舞台》单角色语音数据集一键获取小工具 | 无需手动打标 | wav无压缩 | A simple tool for obtaining...

36
Emerging
2198 aks-devs/mod_google_tts

Freeswitch Text-To-Speech module

36
Emerging
2199 hmeutzner/kaldi-avsr

Kaldi-based audio-visual speech recognition

36
Emerging
2200 lissettecarlr/kuon

久远:一个开发中的大模型语音助手,当前关注易用性,简单上手,支持对话选择性记忆和Model Context Protocol (MCP)服务。...

36
Emerging
« Prev 1 2 3 20 21 22 23 24 80 81 82 Next »