All Voice AI Tools

8,165 tools ranked by quality score · Page 6 of 82

Showing 501–600 of 8,165
# Tool Score Tier
501 AlexandreSajus/JARVIS

Your own personal voice assistant: Voice to Text to LLM to Speech, displayed...

50
Established
502 keshavbhatt/glate

Open Source Google Translator and TTS App for Linux Desktop

50
Established
503 sveinbjornt/hear

Command line interface for the built-in speech recognition and transcription...

50
Established
504 yl4579/StarGANv2-VC

StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for...

50
Established
505 goodatlas/zeroth

Kaldi-based Korean ASR (한국어 음성인식) open-source project

50
Established
506 amanvirparhar/chaplin

A real-time silent speech recognition tool.

50
Established
507 zzw922cn/Automatic_Speech_Recognition

End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow

50
Established
508 VRCWizard/TTS-Voice-Wizard

Speech to Text to Speech. Song now playing. Sends text as OSC messages to...

50
Established
509 Finrandojin/alexandria-audiobook

AI-powered multi-voice audiobook generator — LLM script annotation, voice...

50
Established
510 Azure-Samples/Cognitive-Services-Voice-Assistant

Welcome to the Microsoft Voice Assistant samples repository! Here you will...

50
Established
511 moeru-ai/unspeech

🗣️🔊 Your Text-to-Speech Services, All-in-One.

50
Established
512 svc-develop-team/so-vits-svc

SoftVC VITS Singing Voice Conversion

50
Established
513 gustavostz/whisper-clip

WhisperClip simplifies your life by automatically transcribing audio...

50
Established
514 deepgram-starters/flask-transcription

Get started using Deepgram's Pre-Recorded Transcription with this Flask demo app

50
Established
515 NaomiProject/Naomi

The Naomi Project is an open source, technology agnostic platform for...

50
Established
516 SamirPaulb/real-time-voice-translator

A desktop application that uses AI to translate voice between languages in...

50
Established
517 travisvn/openai-edge-tts

Free, high-quality text-to-speech API endpoint to replace OpenAI, Azure, or...

50
Established
518 XnneHangLab/XnneHangLab

不会聊天的字幕提取器不是一个好 B 站下载器~

50
Established
519 davidmartinrius/speech-dataset-generator

🔊 Create labeled datasets, enhance audio quality, identify speakers, support...

50
Established
520 ekwek1/soprano-factory

Soprano-Factory: Train your own 2000x realtime text-to-speech model

50
Established
521 FunAudioLLM/Fun-ASR

Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab.

50
Established
522 sergenes/runandread-audiobook

🚀 Open-source project for creating high-quality AI TTS-narrated audiobooks...

49
Emerging
523 Lex-au/Vocalis

Speech-to-speech AI assistant with natural conversation flow, mid-speech...

49
Emerging
524 junzew/HanTTS

Chinese Text-to-Speech web service

49
Emerging
525 PriesiaMioShirakana/DragonianVoice

多个SVC/TTS的C++推理库

49
Emerging
526 tugstugi/pytorch-dc-tts

Text to Speech with PyTorch (English and Mongolian)

49
Emerging
527 NevilPatel01/RVC-WebUI-MacOS

Optimized Retrieval-based Voice Conversion WebUI for Apple Silicon Macs...

49
Emerging
528 DragonComputer/Dragonfire

the open-source virtual assistant for Ubuntu based Linux distributions

49
Emerging
529 dessa-oss/fake-voice-detection

Using temporal convolution to detect Audio Deepfakes

49
Emerging
530 dhruvapte26/B.E.N.J.I.

B.E.N.J.I.- The Impossible Missions Force's digital assistant

49
Emerging
531 techiaith/pyfestival

Amlapiwr Python C ar gyfer hwyluso rhaglennu gyda Festival | A Python C...

49
Emerging
532 p0p4k/vits2_pytorch

unofficial vits2-TTS implementation in pytorch

49
Emerging
533 OpenVoiceOS/ovos-buildroot

Open Voice Operating System - Buildroot edition is a minimalistic linux OS...

49
Emerging
534 gionanide/Speech_Signal_Processing_and_Classification

Front-end speech processing aims at extracting proper features from short-...

49
Emerging
535 botbahlul/PyAutoSRT

PySimpleGUI based DESKTOP APP to AUTO GENERATE SUBTITLE FILE (using free...

49
Emerging
536 arghyasur1991/Spark-TTS-Unity

Unity package for using Spark-TTS on-device models. This is a C# port of...

49
Emerging
537 juntaosun/ComeCut

「来剪」轻量级视频编辑器。网页版、桌面版等均可免费使用,功能灵感源自 CapCut 等编辑器。A Lightweight Video Editor....

49
Emerging
538 createcandle/voco

Privacy friendly voice control for the Candle Controller / WebThings...

49
Emerging
539 nitaiaharoni1/whisper-speech-to-text

Whisper Speech-to-Text is a JavaScript library for recording and...

49
Emerging
540 Poeschl/Hassio-Addons

The repository for my Home Assistant Supervisor Add-ons.

49
Emerging
541 Artrajz/vits-simple-api

A simple VITS HTTP API, developed by extending Moegoe with additional features.

49
Emerging
542 myshell-ai/OpenVoice

Instant voice cloning by MIT and MyShell. Audio foundation model.

49
Emerging
543 CodersCreative/natural-tts

A rust crate for easily implementing Text-To-Speech into your rust programs.

49
Emerging
544 vasistalodagala/whisper-finetune

Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR)...

49
Emerging
545 speechmatics/speechmatics-python-sdk

Python SDKs for Speechmatics APIs

49
Emerging
546 rishikksh20/iSTFTNet-pytorch

iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating...

49
Emerging
547 metavoiceio/metavoice-src

Foundational model for human-like, expressive TTS

49
Emerging
548 lixiangyu890601/EasyAICC-Easy-AI-Call-Center

外呼系统,智能外呼,自动外呼系统,人工外呼,呼叫中心

49
Emerging
549 OpenBMB/UltraEval-Audio

Your faithful, impartial partner for audio evaluation — know yourself, know...

49
Emerging
550 C-Loftus/QuickPiperAudiobook

With one command, create a natural-sounding audiobook from a variety of...

49
Emerging
551 thuhcsi/Crystal

Crystal - C++ implementation of a unified framework for multilingual TTS...

49
Emerging
552 snakers4/silero-stress

Silero Stress — pre-trained enterprise-grade automated stress and homograph...

49
Emerging
553 JJWRoeloffs/transcribe_align_textgrid

A small wrapper package around whisper-timestamped. Create force-aligned...

49
Emerging
554 ARBML/klaam

Arabic speech recognition, classification and text-to-speech.

49
Emerging
555 artibex/piper-http

Creates a docker image that runs the piper http service

49
Emerging
556 nullabork/talkbot

Text-to-speech and translation bot for Discord

49
Emerging
557 robmsmt/KerasDeepSpeech

A Keras CTC implementation of Baidu's DeepSpeech for model experimentation

49
Emerging
558 drankush/VoxRad

VOXRAD is a voice transcription application for radiologists leveraging...

49
Emerging
559 zzw922cn/awesome-speech-recognition-speech-synthesis-papers

Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis,...

49
Emerging
560 Steve0929/tiktok-tts

Provides a simple way to generate text-to-speech audio files using TikTok's...

49
Emerging
561 Audio-WestlakeU/VINP

Official PyTorch implementation of 'VINP: Variational Bayesian Inference...

49
Emerging
562 deepgram/deepgram-go-sdk

Official Go SDK for Deepgram.

49
Emerging
563 rakeshvar/rnn_ctc

Recurrent Neural Network and Long Short Term Memory (LSTM) with...

49
Emerging
564 google/tacotron

Audio samples accompanying publications related to Tacotron, an end-to-end...

49
Emerging
565 litagin02/rvc-tts-webui

Text-to-Speech Gradio webui using RVC and edge-tts

49
Emerging
566 SlapBot/stephanie-va

Stephanie is an open-source platform built specifically for voice-controlled...

49
Emerging
567 nvidia-riva/common

Protocol buffers and other common resources.

49
Emerging
568 ceuk/speech-recognition-aws-polyfill

Polyfill for the SpeechRecognition browser API using AWS Transcribe as a fallback

49
Emerging
569 iMicknl/azure-podcast-generator

Generate an engaging podcast based on your document using Azure OpenAI and...

49
Emerging
570 santi-pdp/pase

Problem Agnostic Speech Encoder

49
Emerging
571 NeonGeckoCom/neon-tts-plugin-coqui

Coqui AI TTS plugin

49
Emerging
572 Picovoice/leopard

On-device speech-to-text engine powered by deep learning

49
Emerging
573 woheller69/whisperIME

Android Input Method Editor (IME) based on Whisper

49
Emerging
574 seungwonpark/melgan

MelGAN vocoder (compatible with NVIDIA/tacotron2)

49
Emerging
575 stimm-ai/stimm

The Open Source Voice Agent Platform. Orchestrate ultra-low latency AI...

49
Emerging
576 belambert/asr-evaluation

Python module for evaluating ASR hypotheses (e.g. word error rate, word...

49
Emerging
577 modal-labs/quillman

A voice chat app

49
Emerging
578 mozilla/DeepSpeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text...

49
Emerging
579 pedroetb/tts-api

Text to speech REST API for multiple TTS engines

49
Emerging
580 hetpandya/youtube_tts_data_generator

A python library to generate speech dataset from Youtube videos

49
Emerging
581 eheikes/tts

Tools to convert text to speech :books::speech_balloon:

49
Emerging
582 voice-cloning-app/Voice-Cloning-App

A Python/Pytorch app for easily synthesising human voices

49
Emerging
583 thevickypedia/py3-tts

Offline Text To Speech library for python

49
Emerging
584 davidamacey/OpenTranscribe

Self-hosted AI-powered transcription platform with speaker diarization,...

49
Emerging
585 jim-schwoebel/voicebook

🗣️ A book and repo to get you started programming voice computing...

49
Emerging
586 savbell/whisper-writer

💬📝 A small dictation app using OpenAI's Whisper speech recognition model.

49
Emerging
587 ddPn08/rvc-webui

liujing04/Retrieval-based-Voice-Conversion-WebUI reconstruction project

49
Emerging
588 opendilab/CleanS2S

High-quality and streaming Speech-to-Speech interactive agent in a single...

49
Emerging
589 ActiveNick/HoloBot

HoloBot is a reusable 3D interface that allows HoloLens & VR users to...

48
Emerging
590 keonlee9420/STYLER

Official repository of STYLER: Style Factor Modeling with Rapidity and...

48
Emerging
591 lucoiso/UEAzSpeech

This plugin integrates Azure Speech Cognitive Services in Unreal Engine.

48
Emerging
592 liangstein/Chinese-speech-to-text

Chinese Speech To Text Using Wavenet

48
Emerging
593 avinashvarna/sanskrit_tts

Sanskrit text to speech

48
Emerging
594 advanced-media-inc/amivoice-api-client-library

AmiVoice API Client Library and the sample programs

48
Emerging
595 travisvn/edge-tts-client

Client-side (web browser) implementation of Edge TTS package — Microsoft...

48
Emerging
596 albirrkarim/react-speech-highlight-demo

React / Vanilla JS Text to Speech with highlighting the words and sentences...

48
Emerging
597 ModelTC/LightTTS

LightTTS is a lightweight TTS inference framework optimized for CosyVoice2...

48
Emerging
598 zlargon/google-tts

Google TTS (Text-To-Speech) for node.js

48
Emerging
599 enhuiz/vall-e

An unofficial PyTorch implementation of the audio LM VALL-E

48
Emerging
600 Aivis-Project/AIVM-Generator

Aivis Voice Model File (.aivm/.aivmx) Generator / Editor

48
Emerging
« Prev 1 2 3 4 5 6 7 8 80 81 82 Next »