All Voice AI Tools

8,165 tools ranked by quality score · Page 11 of 82

Showing 1001–1100 of 8,165
# Tool Score Tier
1001 shenasa-ai/speech2text

A Deep-Learning-Based Persian Speech Recognition System

44
Emerging
1002 maum-ai/assem-vc

Official Code for Assem-VC @ICASSP2022

44
Emerging
1003 siva-sub/NekoSpeak

Private, offline AI Text-to-Speech for Android with Kokoro, KittenTTS,...

44
Emerging
1004 wangz-code/legado-edge-tts

edge大声朗读微软TTS服务, 在阅读legado中配置语音引擎方式收听微软TTS / Edge大声朗读, 如果没有 vps 部署可以看看阅读内置...

44
Emerging
1005 SynHub/syn-speech

Syn.Speech is a flexible speaker independent continuous speech recognition...

44
Emerging
1006 talin190/Qwen3-TTS-Daggr-UI

🎤 Create dynamic voice experiences with Qwen3-TTS-Daggr-UI, a Gradio app for...

44
Emerging
1007 husniadil/cc-hooks

Audio feedback plugin for Claude Code with TTS announcements, sound effects,...

44
Emerging
1008 jim-schwoebel/download_audioset

📁 This repo makes it easy to download the raw audio files from AudioSet...

44
Emerging
1009 DrDroidLab/voicesummary

Open Source AI Database for Voice Agent Transcripts | Call Analysis &...

44
Emerging
1010 OpenMOSS/MOSS-Speech

MOSS-Speech is a true speech-to-speech large language model without text guidance.

44
Emerging
1011 bookbot-kids/speech-recognizer-bahasa-indonesian

A cross platform (Android/iOS/MacOS) Bahasa Indonesia speech recognizer...

44
Emerging
1012 cuinjune/text2video

A software tool that converts text to video for more engaging learning experience

44
Emerging
1013 yerfor/SyntaSpeech

SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022;...

44
Emerging
1014 Pikurrot/whisper-gui

A simple GUI to use Whisper.

44
Emerging
1015 r0227n/flutter_whisper_kit

🎤 A Flutter plugin for running WhisperKit speech-to-text models on-device,...

44
Emerging
1016 drmfinlay/pyjsgf

JSpeech Grammar Format (JSGF) compiler, matcher and parser package for Python.

44
Emerging
1017 djmango/obsidian-transcription

Obsidian plugin to create high-quality transcriptions from markdown linked...

44
Emerging
1018 lucasnewman/f5-tts-swift

Implementation of F5-TTS in Swift using MLX

44
Emerging
1019 murf-ai/murf-python-sdk

Python sdk for Murf text to speech API

44
Emerging
1020 holgern/kokorog2p

A unified multi-language G2P (Grapheme-to-Phoneme) library for Kokoro TTS.

44
Emerging
1021 algolia/voice-overlay-android

🗣 An overlay that gets your user’s voice permission and input as text in a...

44
Emerging
1022 BernieTv/ElevenLabs-Clone

A self-hosted ElevenLabs clone for text-to-speech, voice conversion, and AI...

44
Emerging
1023 Candida18/Virtual-Assistance-For-The-Blind

The proposed Voice-based Email System uses AI (voice commands) that will...

44
Emerging
1024 nixonyh/UnityTTS

Text to Speech in Unity.

44
Emerging
1025 isaiahbjork/expo-kokoro-onnx

Run Kokoro TTS locally on device using Expo & ONNX Runtime

44
Emerging
1026 jpescada/TwitterPiBot

A Python based bot for Raspberry Pi that grabs tweets with a specific...

44
Emerging
1027 mozilla-ai/speech-to-text-finetune

Blueprint by Mozilla.ai for finetuning a Speech-To-Text model in your own language

44
Emerging
1028 rishikksh20/TFGAN

TFGAN: Time and Frequency Domain Based Generative Adversarial Network for...

44
Emerging
1029 zycv/awesome-keyword-spotting

This repository is a curated list of awesome Speech Keyword Spotting...

44
Emerging
1030 tonesto7/echo-speaks

Integrate your Amazon Echo devices into your Hubitat environment to create...

44
Emerging
1031 travisvn/edge-tts-extension

Chrome extension to generate free, high-quality text-to-speech using...

44
Emerging
1032 Amirrezahmi/Zozo-Assistant

Zozo Assistant is a voice-activated chatbot that performs tasks based on...

44
Emerging
1033 Berkeley-Speech-Group/sylber

Sylber: Syllabic Embedding Representation of Speech from Raw Audio

44
Emerging
1034 verbio-technologies/python-verbio-speech-center

Python integration with the Verbio Speech Center Cloud....

44
Emerging
1035 kosich/rxjs-tts

RxJS wrapper for Text-to-Speech Web API

44
Emerging
1036 ttaoREtw/Tacotron-pytorch

A Pytorch Implementation of Tacotron: End-to-end Text-to-speech Deep-Learning Model

44
Emerging
1037 pufanyi/GenderRecognitionByVoice

NTU SC1015 Group Project - Gender Recognition by Voice

44
Emerging
1038 matteo-convertino/vosk-build-model

How to create your own model for vosk

44
Emerging
1039 hirofumi0810/asr_preprocessing

Python implementation of pre-processing for End-to-End speech recognition

44
Emerging
1040 apaar97/translate

Android app to translate text conversations, supporting 90+ languages with...

44
Emerging
1041 momysnow/Momy-Desk-Robot

Smart desktop robot.

44
Emerging
1042 CheshireCC/faster-whisper-GUI

faster_whisper GUI with PySide6

44
Emerging
1043 m3hrdadfi/soxan

Wav2Vec for speech recognition, classification, and audio classification

44
Emerging
1044 Azure-Samples/sonic-brief

Sonic Brief Project is an Azure-based system that transcribes and...

44
Emerging
1045 JosefAlbers/e2tts-mlx

Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX

44
Emerging
1046 seven-io/home-assistant

HACS supporting Home Assistant integration for seven

44
Emerging
1047 Aratako/MioTTS-Inference

Inference server for MioTTS, a lightweight and fast LLM-based TTS model.

44
Emerging
1048 resemble-ai/resemble-alexa

This is sample code for an Alexa skill that uses realistic voice cloning...

44
Emerging
1049 Justmalhar/open-audio

Open-Audio TTS: A robust web app leveraging OpenAI's powerful Text-to-Speech...

44
Emerging
1050 n0th1ng-else/voice-to-text-bot

Telegram bot that converts Voice messages into text

44
Emerging
1051 vieledatengutedaten/better-teletask-extension

Browser extension that adds useful features like subtitles to HPI Tele-Task.

44
Emerging
1052 ycyy/faster-whisper-webui

a gradio webui for faster whisper

44
Emerging
1053 syntithenai/hermod

voice services stack from audio hardware through hotword, ASR, NLU, AI...

44
Emerging
1054 subho406/TF-Speech-Recognition-Challenge-Solution

Source code of the model used in Tensorflow Speech Recognition Challenge...

44
Emerging
1055 iamjanvijay/rnnt_decoder_cuda

An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.

44
Emerging
1056 bytectlgo/edge-tts

Edge TTS is a command-line tool based on Microsoft Edge's text-to-speech...

44
Emerging
1057 Gr122lyBr/voicetag

Speaker identification powered by pyannote and resemblyzer

44
Emerging
1058 just-ai/aimybox-android-sdk

Voice assistant SDK for Android

44
Emerging
1059 am-sokolov/videodubber

The program for automatic dubbing any video file for a lot of languages.

44
Emerging
1060 nl8590687/ASRT_SDK_Java

ASRT Speech Recognition SDK for Java. 用于ASRT语音识别系统的Java SDK

44
Emerging
1061 ShaerWare/AI_Secretary_System

📞 Локальный AI-секретарь, тех. поддержка и менеджер по продажам с...

44
Emerging
1062 PABannier/bark.cpp

Suno AI's Bark model in C/C++ for fast text-to-speech generation

44
Emerging
1063 botbahlul/vosk_autosrt

A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using...

44
Emerging
1064 huschen/kaggle_speech_recognition

Conv-LSTM-CTC speech recognition network (end-to-end), written in TensorFlow.

44
Emerging
1065 igorshmukler/kokoro-ruslan

Kokoro Language Model Training Script for Russian (Ruslan Corpus)

44
Emerging
1066 rajkishorbgp/JARVIS-AI-Assistant

JARVIS AI Assistant 🤖 A virtual assistant project inspired by Tony Stark's...

44
Emerging
1067 byhow/yanyu

A Text-to-Speech node package with pinyin audio library.

44
Emerging
1068 mobilepadawan/Speakit-JS

Elevate your web applications with the power of JavaScript speech synthesis.

43
Emerging
1069 bakaburg1/minutemaker

Generate meeting minutes starting from an audio recording or a transcripts...

43
Emerging
1070 BobRandomNumber/ComfyUI-DiaTTS

ComfyUI Dia safetensors implementation

43
Emerging
1071 huakunyang/SummerTTS

SummerTTS...

43
Emerging
1072 ryanleary/patter

speech-to-text in pytorch

43
Emerging
1073 beyondwords-io/wordpress-plugin

BeyondWords is the AI voice platform that brings frictionless audio...

43
Emerging
1074 keonlee9420/VAENAR-TTS

PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based...

43
Emerging
1075 caizexin/tf_multispeakerTTS_fc

the Tensorflow version of multi-speaker TTS training with feedback constraint

43
Emerging
1076 gladiaio/normalization

A lightweight library for normalizing speech transcripts before computing WER

43
Emerging
1077 asticode/go-astideepspeech

Golang bindings for Mozilla's DeepSpeech speech-to-text library

43
Emerging
1078 andresayac/edge-tts-php

Edge TTS is a PHP package that allows access to the online text-to-speech...

43
Emerging
1079 jianchang512/zh_recogn

将音频或视频中的中文语音识别并导出为srt字幕,基于魔塔社区Paraformer模型

43
Emerging
1080 mgonzs13/tts_ros

Text-to-Speech for ROS 2

43
Emerging
1081 lukaszliniewicz/Pandrator

Turn PDFs and EPUBs into audiobooks, subtitles or videos into dubbed videos...

43
Emerging
1082 metame-ai/awesome-audio-plaza

Daily tracking of awesome audio papers, including music generation,...

43
Emerging
1083 Kardbord/hfapigo

Unofficial (Golang) Go bindings for the Hugging Face Inference API

43
Emerging
1084 nodef/extra-amazontts

Generate speech audio from super long text through machine (via "Amazon...

43
Emerging
1085 Sgvkamalakar/Azure-Talking-Avatar

Explore the power of Azure Text-to-Speech with interactive talking avatar,...

43
Emerging
1086 agentvoiceresponse/avr-tts-elevenlabs

This repository demonstrates the integration between Agent Voice Response...

43
Emerging
1087 mgonzs13/piper_ros

piper Text-to-Speech for ROS 2

43
Emerging
1088 hi-paris/Prosody-Control-French-TTS

An End-to-End Pipeline for Enhanced French Text-to-Speech with SSML Prosody Control

43
Emerging
1089 Jakobovski/free-spoken-digit-dataset

A free audio dataset of spoken digits. An audio version of MNIST.

43
Emerging
1090 meemalabs/laravel-text-to-speech

💬 A wrapper for popular TTS services to create a more simple & uniform API....

43
Emerging
1091 cdimascio/watson-html5-speech-recognition

Speech Recognition for Browsers via Webkit, HTML5, and Watson

43
Emerging
1092 mush42/sonata

A cross-platform inference engine for neural TTS models.

43
Emerging
1093 bjoernkarmann/project_alias

Alias is a teachable “parasite” that is designed to give users more control...

43
Emerging
1094 agan-j/xiaoniu

小牛视频翻译 是一款支持本地视频翻译、字幕翻译和 YouTube 视频翻译下载的 AI...

43
Emerging
1095 p0p4k/pflowtts_pytorch

Unofficial implementation of NVIDIA P-Flow TTS paper

43
Emerging
1096 xxbb1234021/speech_recognition

中文语音识别

43
Emerging
1097 garvys-org/rustfst

Rust re-implementation of OpenFST - library for constructing, combining,...

43
Emerging
1098 devnen/Kitten-TTS-Server

Self-host the ultra-lightweight Kitten TTS model with this enhanced API...

43
Emerging
1099 sdip15fa/safecantonese.ai.app

Free, open-source, offline, safe and secure AI Cantonese transcription, in...

43
Emerging
1100 algolia/voice-overlay-ios

🗣 An overlay that gets your user’s voice permission and input as text in a...

43
Emerging
« Prev 1 2 3 9 10 11 12 13 80 81 82 Next »