All Voice AI Tools

8,165 tools ranked by quality score · Page 12 of 82

Showing 1101–1200 of 8,165
# Tool Score Tier
1101 fedden/RenderMan

Command line C++ and Python VSTi Host library with MFCC, FFT, RMS and audio...

43
Emerging
1102 stefantaubert/en-tts

Command-line interface and Python library for synthesizing English texts into speech.

43
Emerging
1103 alexpinel/Dot

Text-To-Speech, RAG, and LLMs. All local!

43
Emerging
1104 tema6120/ForgetMeNot

A flashcard app for Android.

43
Emerging
1105 OpenCOVID19CoughCheck/CoughCheckApp

Development of AI audio app to compare the cough of a Coronavirus (COVID-19)...

43
Emerging
1106 bold-ronin/lira

A Voice-First AI Companion

43
Emerging
1107 superstarryeyes/lue

Terminal eBook Reader with Audiobook-Quality Text-to-Speech — Supports EPUB,...

43
Emerging
1108 stefantaubert/mel-cepstral-distance

A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral...

43
Emerging
1109 pnlpal/pnl-reader

PNL Reader: read quietly or read aloud

43
Emerging
1110 nobody132/masr

中文语音识别; Mandarin Automatic Speech Recognition;

43
Emerging
1111 kurianbenoy/Indic-Subtitler

Open source subtitling platform 💻 for transcribing and translating...

43
Emerging
1112 keonlee9420/PortaSpeech

PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative...

43
Emerging
1113 Rongjiehuang/GenerSpeech

PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model...

43
Emerging
1114 AASHISHAG/deepspeech-german

Automatic Speech Recognition (ASR) - German

43
Emerging
1115 benmaster82/writher

Voice-powered productivity for Windows

43
Emerging
1116 TimoBolkart/voca

This codebase demonstrates how to synthesize realistic 3D character...

43
Emerging
1117 deepgram-starters/django-voice-agent

Get started using Deepgram's Voice Agent with this Django demo app

43
Emerging
1118 DmitryRyumin/OpenAV

An open-source library for recognition of speech commands in the user...

43
Emerging
1119 sai9640nayak/StreamingKokoroJS

Unlimited text-to-speech in the Browser using Kokoro-JS, 100% local, 100%...

43
Emerging
1120 goodmike31/pl-asr-bigos-tools

Extendable toolkit for comprehensive evaluation of ASR systems. Currently...

43
Emerging
1121 mikopbx/ModuleSmartIVR

Модуль умной маршрутизации для 1C:Предприятия

43
Emerging
1122 huawei-noah/Speech-Backbones

This is the main repository of open-sourced speech technology by Huawei...

43
Emerging
1123 t0mer/tts-stt

Small pyhon flask container allowing us to convert Text to Speech and Speech to Text

43
Emerging
1124 sp-nitech/DNN-HSMM

pytorch implementation of DNN-HSMM for TTS

43
Emerging
1125 sovaai/sova-asr

SOVA ASR (Automatic Speech Recognition)

43
Emerging
1126 rhulha/StreamingKokoroJS

Unlimited text-to-speech in the Browser using Kokoro-JS, 100% local, 100%...

43
Emerging
1127 ponlponl123/-Prototype-AIVTuber

a open-source Artificial Intelligence Virtual Youtuber (AI VTuber), (this...

43
Emerging
1128 novoic/surfboard

Novoic's audio feature extraction library

43
Emerging
1129 EricBatlle/UnityAndroidSpeechRecognizer

🗣️ Speech recognition on Unity and Android without the annoying google popup!

43
Emerging
1130 timmo001/home-assistant-assist-desktop

Use Home Assistant Assist on the desktop. Compatible with Windows, MacOS, and Linux

43
Emerging
1131 AIFSH/ComfyUI-XTTS

a custom comfyui node for coqui-ai/TTS's xtts module! support 17 languages...

43
Emerging
1132 soundhound/hound-sdk-web-example

An example of how to work with text and voice requests using the Houndify...

43
Emerging
1133 hujingshuang/MTrans

Multi-source Translation

43
Emerging
1134 rishikksh20/melgan

MelGAN implementation with Multi-Band and Full Band supports...

43
Emerging
1135 JosefAlbers/WTM

Blazing fast whisper turbo for ASR (speech-to-text) tasks

43
Emerging
1136 wangkaisine/mrcp-plugin-with-freeswitch

使用FreeSWITCH接受用户手机呼叫,通过UniMRCP...

43
Emerging
1137 FireRedTeam/FireRedASR2S

A SOTA Industrial-Grade All-in-One ASR system with ASR, VAD, LID, and Punc...

43
Emerging
1138 SamYuan1990/flet_sherpa_onnx

flet_sherpa_onnx an ASR/STT library for flet basing on sherpa-onnx

43
Emerging
1139 Picovoice/speech-to-intent-benchmark

benchmark for Speech-to-Intent engines

43
Emerging
1140 George0828Zhang/torch_cif

A fast parallel PyTorch implementation of the "CIF: Continuous...

43
Emerging
1141 qianchang/zici

字词:收集国学/汉语字词拼音相关资源

43
Emerging
1142 Appen/UHV-OTS-Speech

A data annotation pipeline to generate high-quality, large-scale speech...

43
Emerging
1143 chandran-jr/Noteify

🔎A Currency Detection app for the visually impaired which automatically...

43
Emerging
1144 tomasz-oponowicz/spoken_language_identification

Identify a spoken language using artificial intelligence (LID).

43
Emerging
1145 keonlee9420/WaveGrad2

PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement...

43
Emerging
1146 zceng/LVCNet

LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation

43
Emerging
1147 haguro/elevenlabs-go

A Go API client library for the ElevenLabs speech synthesis platform

43
Emerging
1148 Ezdokz1337/sunona-v0.001

🎤 Build and deploy intelligent voice AI agents in minutes with Sunona, your...

43
Emerging
1149 mitchib1440/SpeakThat

The world's most comprehensive notification reader for Android devices.

43
Emerging
1150 darkautism/sensevoice-rs

A Rust-based, SenseVoiceSmall

43
Emerging
1151 xyqfer/reader

毕业设计-基于智能手机的报纸阅读器

43
Emerging
1152 GinoShun/Accent-Activation-Steering

Official code for "Activation Steering for Accent Adaptation in Speech...

43
Emerging
1153 HachiroSan/google-pronouncer

🔊 Download pronunciation audio files from Google's dictionary service....

43
Emerging
1154 jonatasgrosman/asrecognition

ASRecognition: just an easy-to-use library for Automatic Speech Recognition.

43
Emerging
1155 ai-learning-tools/viva-translate

Real-time translation copilot for your browser

43
Emerging
1156 karim23657/Persian-tts-coqui

Persian/Farsi text to speech(TTS) training using coqui tts

43
Emerging
1157 felixchenfy/Speech-Commands-Classification-by-LSTM-PyTorch

Classification of 11 types of audio clips using MFCCs features and LSTM....

43
Emerging
1158 sevangelatos/py-ttspico

Python svox picotts wrapper

43
Emerging
1159 thetobysiu/Deepstory

Deepstory turns a text/generated text into a video where the character is...

43
Emerging
1160 thewh1teagle/piper-onnx

Use piper TTS with onnxruntime

43
Emerging
1161 aws-solutions/content-localization-on-aws

Automatically generate multi-language subtitles using AWS AI/ML services....

43
Emerging
1162 MohammedRashad/FPGA-Speech-Recognition

Expiremental Speech Recognition System using VHDL & MATLAB.

43
Emerging
1163 R1ckShi/AESRC2020

[ICASSP2021] Data preperation scripts, training pipeline and baseline...

43
Emerging
1164 rorpage/openfaas-text-to-speech

Generate an MP3 of text using Google's Text-to-Speech

43
Emerging
1165 dbklim/Voice_ChatBot

Chatbot in russian with speech recognition using PocketSphinx and speech...

43
Emerging
1166 wit-ai/android-voice-demo

Example on how to build a voice-enabled Android app with Wit.ai

43
Emerging
1167 lablab-ai/OpenAI_Whisper_Streamlit

A minimalistic automatic speech recognition streamlit based webapp powered...

43
Emerging
1168 gooofy/py-marytts

Python MaryTTS HTTP client library

43
Emerging
1169 rainygirl/rspeaker

말귀를 알아듣고 뉴스도 요약해 읽어줍니다

43
Emerging
1170 yl4579/StyleTTS-VC

Official Implementation of StyleTTS-VC

43
Emerging
1171 upskyy/Transformer-Transducer

PyTorch implementation of "Transformer Transducer: A Streamable Speech...

43
Emerging
1172 LiberSonora/LiberSonora

LiberSonora,寓意“自由的声音”,是一个 AI 赋能的、强大的、开源有声书工具集,包含智能字幕提取、AI标题生成、多语言翻译等功能,支持...

43
Emerging
1173 developers-cosmos/Mimasa

Real time multilingual face translator

43
Emerging
1174 keonlee9420/Cross-Speaker-Emotion-Transfer

PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based...

43
Emerging
1175 opensource-spraakherkenning-nl/Kaldi_NL

Code related to the Dutch instance and user groups of the KALDI speech...

43
Emerging
1176 hopkira/k9

Latest main K9 robot repository with 3D vision, local STT/TTS with GPT-3 and...

43
Emerging
1177 Gmzxdotzz/Dia-TTS-Server

Self-host the powerful Dia TTS model. This server offers a user-friendly Web...

43
Emerging
1178 taresh18/TTSizer

🎙️ Automatically transcribe audio/video into high-quality, speaker-specific...

43
Emerging
1179 pandeydivesh15/AVSR-Deep-Speech

Google Summer of Code 2017 Project: Development of Speech Recognition Module...

43
Emerging
1180 yuhr/langue

A modern platform for conlanging. Currently in the planning stage.

43
Emerging
1181 mozilla/DeepSpeech-examples

Examples of how to use or integrate DeepSpeech

43
Emerging
1182 niker/EdgeTtsSharp

EdgeTTS Sharp is a library that provides an easy-to-use, realtime-streaming,...

43
Emerging
1183 alex-vt/WhisperInput

Offline voice input panel & keyboard with punctuation for Android.

43
Emerging
1184 candlewill/Speech-Corpus-Collection

A Collection of Speech Corpus for ASR and TTS

43
Emerging
1185 Hecate2/sukasuka-vocal-dataset-builder

すかすかアニメボカロデータセット。1st anime vocal dataset. Extract audio (vocal) files from...

43
Emerging
1186 AmphionTeam/FlexiCodec

[ICLR2026] FlexiCodec: A Dynamic Neural Audio Codec for Low Frame Rates

43
Emerging
1187 jtkim-kaist/VAD

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM...

43
Emerging
1188 kaituoxu/Speech-Transformer

A PyTorch implementation of Speech Transformer, an End-to-End ASR with...

43
Emerging
1189 Pankaj-Baranwal/pocketsphinx

Updated ROS bindings to pocketsphinx

43
Emerging
1190 ttop32/coqui_tts_korea

Korean TTS using coqui TTS (glowtts and multiband melgan) - 한국어 TTS

43
Emerging
1191 bawangxx/XZVoice

Free and open source text-to-speech software

43
Emerging
1192 journey-ad/CosyVoice2-Ex

CosyVoice2 功能扩充(预训练音色推理/3s极速复刻/自然语言控制/自动识别/音色模型保存/API)

43
Emerging
1193 tover0314-w/opentypeless

Talkmore with Opentypeless. Type with your voice. Anywhere. Talk -...

43
Emerging
1194 nyrahealth/CrisperWhisper

Verbatim Automatic Speech Recognition with improved word-level timestamps...

43
Emerging
1195 chenmingxiang110/Chinese-automatic-speech-recognition

Chinese speech recognition

43
Emerging
1196 jojojaeger/whisper-streamlit

this master thesis project is based on OpenAI Whisper with the goal to...

43
Emerging
1197 flogy/gatsby-mdx-tts

🗣 Adds speech output to your Gatsby site using Amazon Polly.

43
Emerging
1198 jsugg/ser

The AI-powered ser Python package is a tool for recognizing and analyzing...

43
Emerging
1199 linux-speakup/espeakup

a light weight connector for espeak-ng and speakup

43
Emerging
1200 seanghay/KLEA

An open-source Khmer Word to Speech Model. Just single word not sentence!

43
Emerging
« Prev 1 2 3 10 11 12 13 14 80 81 82 Next »