All Voice AI Tools

8,165 tools ranked by quality score · Page 23 of 82

Showing 2201–2300 of 8,165
# Tool Score Tier
2201 deepgram-devs/flask-live-chatgpt-text-to-speech

Get started using Deepgram's Live ChatGPT Text-to-Speech with this Flask demo app

36
Emerging
2202 silenterus/deepspeech-cleaner

Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Framework

36
Emerging
2203 parzibyte/tts-js

Demostración de speechSynthesis con JavaScript: TTS o Síntesis de habla

36
Emerging
2204 Hamahmi/kaldi-tut

This is a Kaldi tutorial for beginners

36
Emerging
2205 OssiaAI/OssiaVoice

Ossia is an accessibility tool for those unable to speak or type; Ossia...

36
Emerging
2206 nico-byte/whisper-web

The Whisper Web Transcription Server is a Python-based real-time...

36
Emerging
2207 BayramAnnakov/gmail-to-podcast

Transform Gmail newsletters into AI-generated podcast conversations using...

36
Emerging
2208 LonePheasantWarrior/TalkifyTTS

云端大模型驱动的 Android 语音合成应用(TTS引擎)。支持豆包、腾讯、微软、千问等模型。An Android text-to-speech...

36
Emerging
2209 LonePheasantWarrior/VolcengineTTS

基于火山引擎豆包语音服务的在线TTS安卓应用 (An online TTS Android application based on the...

36
Emerging
2210 MiguelsPizza/local-transcription-mcp--parakeet-tdt-0.6b-v2--

Local MCP server that converts and transcribes video and audio files 100% on device

36
Emerging
2211 rishikksh20/LightSpeech

LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search

36
Emerging
2212 prohetamine/tor-speech

🔉 Yandex & Google + Tor

36
Emerging
2213 ankushbhatia2/django-speech-to-text

A small API for speech to text made in Django.

36
Emerging
2214 6Morpheus6/Chattered

All in one Gradio interface for chatterbox. Voice cloning from uploaded...

35
Emerging
2215 ikfly/java-tts

java-tts 文本转语音

35
Emerging
2216 golemfactory/g-flite

g-flite: flite app distributed over Golem Network

35
Emerging
2217 purvanshjoshi/IndiVoice-DeepASR

Deep Learning framework for Indian-accented Speech-to-Text using Whisper and...

35
Emerging
2218 Lightning-Universe/Echo

Production-ready audio and video transcription app that can run on your...

35
Emerging
2219 adhadse/Deepdubpy

A complete end-to-end Deep Learning system to generate high quality human...

35
Emerging
2220 innovatorved/whisper-openai-gradio-implementation

Whisper is an automatic speech recognition (ASR) system Gradio Web UI Implementation

35
Emerging
2221 jaoafa/ChatWatcher

🗣 Discord voice-chat speech recognition

35
Emerging
2222 timoil/whisper-subtitles

🎬 AI-powered localhost subtitle generator for hearing-impaired users....

35
Emerging
2223 M86xKC/edge-tts

Simple TTS using MS Edge built-in voices

35
Emerging
2224 PareekshithPalat/Transcriptor

The Transcriptor is a subtitle extractor, lightweight web application built...

35
Emerging
2225 jim11662418/General_Instrument_CTS256_SP0256_Speech_Synthesizer

Vintage General Instrument Speech Synthesizer CTS256 with SP0256

35
Emerging
2226 samsad35/source-filter-vae

[SpeechCom Journal] Learning and controlling the source-filter...

35
Emerging
2227 BenLubar/espeak

Package espeak is a wrapper around espeak-ng that works both natively and in...

35
Emerging
2228 Kaljurand/Diktofon

An Android app, a dictaphone with Estonian speech-to-text

35
Emerging
2229 nexxeln/spotify-voice-control

Voice control for Spotify through the terminal

35
Emerging
2230 junjie-xyz/whisper-video

Generate subtitles for all the videos in a folder with OpenAI's Whisper...

35
Emerging
2231 heartsuit/BaiduASRAndTTS

Using Baidu API. ASR: Automatic Speech Recognition;TTS: Text To Speech;...

35
Emerging
2232 jx1100370217/DFCNN-master

这是一个基于全卷积神经网络的语音识别系统

35
Emerging
2233 Yukaii/gakuon

Review Anki cards using Generative AI voice

35
Emerging
2234 JustinGOSSES/spoken-floodplain

Website that verbally tells users when they enter or leave a floodplain in...

35
Emerging
2235 Babakinha/Dectalk

A Simple package for using Dectalk

35
Emerging
2236 zerospeech/benchmarks

A command line tool that helps use the "Zero Ressource Challenge" benchmarks

35
Emerging
2237 MelvilQ/stacksrs

A simple Spaced Repetition app for Android.

35
Emerging
2238 vectominist/spin

Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for...

35
Emerging
2239 Leonard2310/LibrAI

iOS app with AI for an immersive audiobook experience, text-to-speech and...

35
Emerging
2240 ikarago/Talkinator

Talkinator is an easy to use text-to-speech-app for Windows 10-devices

35
Emerging
2241 lelosaiyan/J.A.R.V.I.S.

A voice virtual desktop assistant for Windows 7/10

35
Emerging
2242 matusstas/openai-whisper-microservice

This is an OpenAI Whisper automatic speech recognition microservice

35
Emerging
2243 noir-neo/UniSpeech

iOS speech framework native plugin for Unity

35
Emerging
2244 qkl9527/voice-assistant

基于Funasr的[实时]AI语音助手

35
Emerging
2245 orianemartin/WhispGrid

A Whisper to TextGrid script that I use to automatize Corpus Annotation on...

35
Emerging
2246 charstorm/vilberta

Voice chatbot with voice+screen output to show that "not everything needs to...

35
Emerging
2247 dcavar/ELAN2split

Split ELAN Annotation Files and corresponding speech files into a corpus...

35
Emerging
2248 systoolz/dosbtalk

unofficial API implementation for Text-to-Speech Engine by First Byte

35
Emerging
2249 alisolphp/EchoTalk

A browser-based language training app using Shadowing technique with...

35
Emerging
2250 tuhinpal/text-to-speech

Text to Speech using Google's Library (Made for Fun)

35
Emerging
2251 SupernovifieD/FreeSpeechToText

A python program that extracts text from audio files - .mp3 or .wav - for free!

35
Emerging
2252 MazueraAlvaro/speech-recognition-asterisk

A script for speech recognition in asterisk

35
Emerging
2253 ORI-Muchim/One-Click-VITS-Training

VITS(Data Preprocessing + Whisper ASR + Text Preprocessing + Modification...

35
Emerging
2254 chienhsiang-hung/voice-and-wav-cloning

通過少量語音與影片樣本生成高質量的語音與影片克隆 ( AI 人像口白生成 ),並提供多種音頻處理技術來提升音質和真實感。

35
Emerging
2255 codekraft-studio/vue-speech

Vue integration and components for the Web Speech API

35
Emerging
2256 yc9701/pansori-tedxkr-corpus

Korean ASR Corpus generated from TEDx talks

35
Emerging
2257 dialpad/mucs_2021_dialpad

Dialpad team's submission to the MUCS 2021 workshop

35
Emerging
2258 huckiyang/QuantumSpeech-QCNN

IEEE ICASSP 21 - Quantum Convolution Neural Networks for Speech Processing...

35
Emerging
2259 hebbihebb/MBook

EPUB to M4B using Maya1

35
Emerging
2260 nhut-ngnn/Voice-Based-Age-and-Gender-Recogniton

[ICTC'24] - "Voice-Based Age and Gender Recognition: A Comparative Study of...

35
Emerging
2261 HarunoriKawano/BEST-RQ

Implementation of the paper "Self-supervised Learning with Random-projection...

35
Emerging
2262 placebokkk/e6870

assignments for e6870 ASR class

35
Emerging
2263 maetshju/flux-blstm-implementation

An implementation of the Graves & Schmidhuber (2005) bidirectional LSTM in Flux.

35
Emerging
2264 mattzzz/rick-voice

Give any bot the voice of Rick Sanchez

35
Emerging
2265 indonesian-nlp/multilingual-asr

Multilingual Speech Recognition for Indonesian Languages

35
Emerging
2266 HuuHuy227/XphoneBert_Vits2

VITS2 extended with XPhoneBERT encoder

35
Emerging
2267 markhliu/mpt

Code repository for the book Make Python Talk

35
Emerging
2268 darsh-1010/Jarvis-A-Voice-Based-Assistant-Powered-by-LLaMA

Jarvis is a voice-based assistant built in Python that simplifies daily...

35
Emerging
2269 kostas2370/Video-Creator

This project is to automate the video creation.

35
Emerging
2270 thevickypedia/Jarvis_UI

Light weight UI to interact with Jarvis via API calls

35
Emerging
2271 yanorei32/winrt-tts-server

A simple Web Based Windows Runtime (WinRT) Speech Synthesis API

35
Emerging
2272 mo7amedaliEbaid/run-tracker

A flutter run tracker app - clean architecture

35
Emerging
2273 go-restream/supertts

🎧 Supertonic TTS ONNX Inference Openai Speech REST API

35
Emerging
2274 opensource-spraakherkenning-nl/asr_nl

Dutch Speech Recognition webservice

35
Emerging
2275 Vaibhavs10/ml-with-audio

HF's ML for Audio study group

35
Emerging
2276 botbahlul/Live-Subtitle

ANDROID APP that can RECOGNIZE VLC LIVE AUDIO/VIDEO STREAMING (using free...

35
Emerging
2277 void-xtreme/audible-text-editor

An automated Sinhala audio Text Editor for visually impaired and blind students

35
Emerging
2278 drivendataorg/childrens-speech-recognition-benchmark-pub

Tutorial code for the On Top of Pasketti: Children’s Speech Recognition Challenge

35
Emerging
2279 shreyasnisal/SpeechProgrammer

The Speech Programmer writes code based on voice commands. Right now it only...

35
Emerging
2280 chimechallenge/chime-utils

Scripts for data generation, scoring and data manifest preparation for...

35
Emerging
2281 Tristan296/Universal-MacAssistant

Advanced Personal Assistant created for macOS that utilises AppleScripts,...

35
Emerging
2282 saurabhchalke/whisper-meta-quest

Running speech-to-text in a Meta Quest headset using OpenAI's Whisper tiny model

35
Emerging
2283 Hamtech-ai/wav2vec2-fa

fine-tune Wav2vec2. an ASR model released by Facebook

35
Emerging
2284 HaoQChen/iflytek_awaken_asr

use iflytek's technology to realize awaken and order recognition

35
Emerging
2285 pncnmnp/phoenix10.1

Creates personalized radio stations with your own radio jockey!

35
Emerging
2286 heyfoz/python-youtube-transcription

This repository contains Python scripts and a local Flask web application...

35
Emerging
2287 Ralireza/spoken-digit-recognition

Classifying English spoken digit by Hidden Markov Model

35
Emerging
2288 syntithenai/opensnips

Open source projects related to Snips https://snips.ai/.

35
Emerging
2289 yokawasa/vscode-translator-voice

VS Code extension for multi-language text translation and TTS...

35
Emerging
2290 AceCentre/pasco

Phrase Auditory Scanning COmmunicator - AAC App for iOS and the Web

35
Emerging
2291 theamazing0/global-subtitles-main

Closed Captioning Everywhere, With Assembly AI

35
Emerging
2292 candlewill/Ossian

Ossian: A simple language-independent Text-to-speech frontend

35
Emerging
2293 atomicoo/Tacotron2-PyTorch

PyTorch implementation of Tacotron-2. Tacotron-2 的 PyTorch 实现。

35
Emerging
2294 dokuniev/claude-voice

Hear which Claude Code session needs you — speaks the repo and branch name out loud

35
Emerging
2295 Helther/voice-pick-tbot

Text To Speech Synthesis Telegram Bot with voice customization

35
Emerging
2296 18F/tts-buy-challengegov-ideation

Market research documents related to the Challenge.gov Ideation Platform.

35
Emerging
2297 BullShark/JSpeak

A Text to Speech Reader Front-end that Reads from the Clipboard and with...

35
Emerging
2298 GetProjectsIdea/Convert-Text-to-Speech-in-Python

Text to speech is a process to convert any text into voice. Text to speech...

35
Emerging
2299 HasnainDarkNet/DarKVoice

DarKVoice is an open-source voice assistant and audio processing tool built...

35
Emerging
2300 AkojimaSLP/Frame-by-frame-closed-form-update-for-mask-based-adaptive-MVDR-beamforming

speech-enhacement

35
Emerging
« Prev 1 2 3 21 22 23 24 25 80 81 82 Next »