All Voice AI Tools

8,165 tools ranked by quality score · Page 54 of 82

Showing 5301–5400 of 8,165
# Tool Score Tier
5301 phil1px/voice-message-transcriber

An iOS share-action extension that transcribes voice messages using Google...

21
Experimental
5302 voothi/20251228104300-subtitles

This repository is dedicated to preparing subtitles as part of working with...

21
Experimental
5303 malob/serverless-tts-podcast

WIP rewrite of article-to-audio-cloud-function and...

21
Experimental
5304 cheeweijie/qwen3-tts-lora-finetuning

Qwen3‑TTS LoRA fine‑tuning tools (companion repo) for custom voice adaptation

21
Experimental
5305 itsmemotivist/qwen-tts2api

🗣️ Enable text-to-speech with Qwen TTS, a simple API solution that...

21
Experimental
5306 arrdel/voice-assistant

Python script that utilizes natural language processing (NLP) and machine...

21
Experimental
5307 mfirozahmed/iTranslator

Project using OCR and TTS

21
Experimental
5308 BertanDogancay/Multi-Functional-AI-Assistant

An advanced AI assistant that can perform object detection and uses DialoGPT...

21
Experimental
5309 safikhanSoofiyani/VoicePrescription

An Android application that uses speech-to-text functionality to produce...

21
Experimental
5310 naver/multilingual-distilwhisper

This repository contains all the code necessary for running the multilingual...

21
Experimental
5311 lyle-mlengineer/timesnap

A web service for extracting timestamps from YouTube videos.

21
Experimental
5312 marklubin/kairix

Voice-first AI agent with persistent memory, background reflection, and...

21
Experimental
5313 wyatt-avilla/discord-tiktok-tts-bot

Discord bot that can play TikTok TTS in voice

21
Experimental
5314 chalotrasahil/AI-Lecture-Studio

AI Lecture Studio is an NLP-driven system that transforms audio and video...

21
Experimental
5315 krishn1122/voice-agent-local

Specially designed for AI Team

21
Experimental
5316 ryanfb/ancientgreekspeak

Transliterate Ancient Greek to Apple phonemes for text-to-speech synthesis

21
Experimental
5317 ADT109119/WhisperX-GUI

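A user-friendly graphical interface for easily invoking WhisperX, an automatic speech recognition (ASR) tool that provides accurate transcription, robust speaker diarization, and word-level timestamp alignment. This GUI...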

21
Experimental
5318 incubated-geek-cc/whisper-onnx

A Vite-ReactJS setup to run Whisper OpenAI models locally to transcribe...

21
Experimental
5319 samuelebh/CNN-Spoken-Digit-Classifier

Repository containing Python code of a classifier that recognizes spoken...

21
Experimental
5320 PhysisVerse/physis-vad-swift

Modular Swift package for on-device voice activity detection on Apple...

21
Experimental
5321 SuJun-Hub/voiceId

A Windows voice input tool adapted from CapsWriter

21
Experimental
5322 8G6/rtts

rtts is an open source JavaScript package for text to speech conversion

21
Experimental
5323 fann1993814/whisper.cpy

Python wrapper for Whisper.cpp

21
Experimental
5324 terkelg/utters

Small (257B) promise wrapper for SpeechSynthesisUtterance

21
Experimental
5325 MahtaFetrat/Mana-Forced-Aligner

A robust forced alignment tool for low-resource languages using multiple ASR...

21
Experimental
5326 zhangmei126/TextToSpeech

TTS (text-to-speech) integration for UE4, using SAPI 5.3

21
Experimental
5327 1abhishekpandey/FastScribe

Fast parallel video-to-text transcription powered by OpenAI's Whisper AI.

21
Experimental
5328 aristech-de/tts-clients

Clients to communicate with the Aristech TTS service

21
Experimental
5329 leanhtech/TextToSpeech_EN_VN

Text-to-speech course project (Operating Systems course, PTITHCM)

21
Experimental
5330 mym-br/gnuspeech_sa

Articulatory speech synthesizer

21
Experimental
5331 wenhuahuo/Cross-Device-Acoustic-Communication-Python-Implementation

Digital acoustic communication tools using QFSK and convolutional encoding. Cross-device acoustic communication.

21
Experimental
5332 cowdude/flapi

FLAPI is an offline, containerized speech recognition websocket API

21
Experimental
5333 1ytic/edit-distance-papers

A curated list of papers dedicated to edit-distance as objective function

21
Experimental
5334 Wonbin-Jung/e3-vits

Official GitHub page of E3-VITS

21
Experimental
5335 iamarunbrahma/smart-voice-assistant

A simple voice assistant to get your queries in speech format and generate...

21
Experimental
5336 marttirandma/tipi

Tipi Web v2

21
Experimental
5337 cjbayron/audiate

Ear training game using machine learning models in the browser

21
Experimental
5338 ChrisRobinT/realtime-translation

Real-time WebRTC voice translation using Whisper STT, Azure Translate, and...

21
Experimental
5339 asrajeh/kaldi-arabic

HMM-based Arabic ASR using the Kaldi engine

21
Experimental
5340 kowaalczyk/reformer-tts

An adaptation of Reformer: The Efficient Transformer for text-to-speech task.

21
Experimental
5341 IRSPlays/ProjectCortexV2

A $300 wearable that gives visually impaired users real-time scene...

21
Experimental
5342 kevinjalbert/spellspoon

Spellspoon is a macOS tool built using Hammerspoon that enables...

21
Experimental
5343 WaelShaikh/OmniVerse-Desktop

OmniVerse-Desktop is your local LLM based AI assistant that integrates...

21
Experimental
5344 anubhav-n-mishra/xtts-api

Production-ready Text-to-Speech API with XTTS-v2, voice cloning,...

21
Experimental
5345 jp1924/HF_builders

A repo collecting builder scripts for 🤗 Datasets

21
Experimental
5346 marcogenna/epub2audiobook

Convert EPUB books to M4B audiobooks with AI-powered TTS (Edge TTS, Kokoro, Piper)

21
Experimental
5347 fulviodenza/go-gladia-client

Go client for Gladia APIs

21
Experimental
5348 AryanVBW/AiVoiceClone

Transform Your Voice: Replicate Your Unique Sound in a Pristine Pre-Trained...

21
Experimental
5349 SyedHuzaifa007/Robbie-12.20-Personal-Virtual-Assistant

It is a Speech Recognition Personal Virtual Assistant made with Python that...

21
Experimental
5350 sandeepswain54/Yukti-Care

Yukti Care is a mobile app that enables pharmacies, medical distributors,...

21
Experimental
5351 cydanix/voice-agent

Real-time voice AI assistant

21
Experimental
5352 Aketirani/audio-mnist

Gender Recognition By Voice Analysis

21
Experimental
5353 theablemo/Voice-Captcha-Verification

This repository contains the code for the Captcha Verification by voice...

21
Experimental
5354 Nexdata-AI/100-Hours-Thai-Children-Spontaneous-Speech-Data

Thai Child's Spontaneous Speech Data

21
Experimental
5355 Fdr3iZzz/YoutubeVideoTranslate

Get a translated YouTube video with AI voiceover

21
Experimental
5356 RumitPatel/android-continues-speech-recognition

This project is a demonstration of continuous speech recognition using...

21
Experimental
5357 traderpedroso/xphoneBR

XphoneBR is a Brazilian Portuguese transformer-based grapheme-to-phoneme and...

21
Experimental
5358 CrispStrobe/CrispTTS

(WIP) Python command-line text-to-speech (TTS) tool, esp. for German,...

21
Experimental
5359 chihakuro/attendance-check

Face recognition for attendance checking system

21
Experimental
5360 NhanPhamThanh-IT/Vietnamese-Voice-Search-Engine

🔎 Vietnamese Voice Search Engine - Vietnamese news search app with voice...

21
Experimental
5361 Davi20044/Chat-de-Voz-GPT-3.5

This project consists of a conversational assistant that uses...

21
Experimental
5362 kundan-6646/Musica

Musica is an online audio splitter. It works with the power of AI which...

21
Experimental
5363 WinsDominoes/sanskrit-tts

Sanskrit Text-To-Speech Web-App - Made this for my Sanskrit Learning Journey

21
Experimental
5364 HKAB/vietnamese-rnnt-tutorial

A tutorial on how to train RNN-T from scratch with Whisper encoder

21
Experimental
5365 shesuyo/isi

Go SDK for Alibaba Intelligent Speech Interaction

21
Experimental
5366 uigiporc/icon-sr

Knowledge Engineering project; authors: Porcelli Luigi, Nicolo Cucinotta.

21
Experimental
5367 rgychiu/docbot

Personal doctor bot for all your common medical needs.

21
Experimental
5368 IHKYoung/AhaTTS

TTS Fast Web, a simple and elegant frontend and API for local text-to-speech. A localized, cross-platform,...

21
Experimental
5369 khakhasshi/myOwnTTS

A lightweight, high-performance voice cloning TTS system based on Coqui TTS...

21
Experimental
5370 ayutaz/uZipVoice

Unity implementation of ZipVoice - lightweight zero-shot text-to-speech...

21
Experimental
5371 andreehrlich/Daily-Briefing-Voice-Assistant

Conversational voice agent to brief you on your schedule for the day....

21
Experimental
5372 corbinr40/RTCC

A piece of software that converts voice to text in a visual output, as an...

20
Experimental
5373 vislupus/Bulgarian-TTS-dataset

LibriVox dataset for Bulgarian language TTS

20
Experimental
5374 AppleHolic/2020AIChallengeSpeechRecognition

2020 AI Challenge speech recognition code

20
Experimental
5375 pika-online/Foreign_Pronunciation_Generator_for_Code-Switch_ASR

A socket script to obtain a Chinese phone sequence for any English word

20
Experimental
5376 atharva9167j/Sign-Language-Translator

Sign Language Recognition Platform - A real-time American Sign Language...

20
Experimental
5377 Kavindu-Rankothge/tiktok-bot

TikTok video generation from scraping Reddit community posts

20
Experimental
5378 shahad-mahmud/incremental_learning_for_asr

Incremental learning for automatic speech recognition (ASR)

20
Experimental
5379 voidful/whisper-live-asr-demo

Run Whisper on a CPU/GPU server

20
Experimental
5380 4over7/SpeakOut

Offline-first AI voice input for macOS. Hold-to-speak or tap-to-toggle,...

20
Experimental
5381 timothypesi/Speech-to-Text-Converter

This GitHub repository contains a Python Streamlit app that utilizes machine...

20
Experimental
5382 bfackland/replica_dialog_generator

🗣 Auto-generate dialog audio files using the Replica Studios 'AI Voices' API...

20
Experimental
5383 oscurprof/Realtime-Subtitles-Generator-using-Python

LiveScript: Real-time Live Captioning Software, generates subtitles in...

20
Experimental
5384 maziac/currah_uspeech_tests

Tests for the ZX Spectrum's speech synthesizer peripheral: Currah uSpeech...

20
Experimental
5385 gerlaxrex/parrot

PARRoT: Precise Audio Recognition and Recap over Transcription

20
Experimental
5386 SSobol77/Say-Salomon-AI

Asynchronous text-to-speech conversion, asynchronous speech-to-text...

20
Experimental
5387 xingchensong/ASR-Wavnet

Some ASR system implementations (via TensorFlow 1.x)

20
Experimental
5388 morikeli/Xcalibur

A speech recognition and translation website built with Django in addition...

20
Experimental
5389 MorrisXu-Driving/Improving_DeepSpeech_2_by_RNN_Transducer_Pytorch_Implementation

This repository compares two losses, CTC and RNN-T, using Deep Speech 2 as the base model.

20
Experimental
5390 Androz2091/Cicero

Great speaker, Cicero is a text-to-speech Discord Bot!

20
Experimental
5391 rossriserose/Real-time-Voice-cloning

Clone a voice to generate arbitrary speech in real-time

20
Experimental
5392 marcosfelt/latex2speech

Convert LaTeX to speech

20
Experimental
5393 shreyashghag/OfflineSpeechRecognition

Offline Speech Recognition For Android Library

20
Experimental
5394 eray-yuztyurk/python-ai-voice-chatbot

AI-powered voice chatbot with Gradio web interface. Talk or type your...

20
Experimental
5395 Sec-ant/etts

edge-tts in Bun.

20
Experimental
5396 HarunoriKawano/Conformer

Implementation of the paper "Conformer: Convolution-augmented Transformer...

20
Experimental
5397 dibbed/TTSKit-multi-engine-tts

Python Text-to-Speech toolkit (multi-engine) with FastAPI, CLI and Telegram...

20
Experimental
5398 technout/tts_gtk

Graphical interface for Coqui TTS (Text to Speech) command line. Made in...

20
Experimental
5399 Tombarr/TranscriberApp

Local-first macOS Tahoe Transcription App & CLI Tool

20
Experimental
5400 Dalia-Sher/Speech-Emotion-Recognition-using-BLSTM-with-Attention

We present a study of a neural network based method for speech emotion...

20
Experimental