All Voice AI Tools

8,165 tools ranked by quality score · Page 35 of 82

Showing 3401–3500 of 8,165
# Tool Score Tier
3401 shockless/asr-transformer

Transformer for Automatic Speech Recognition

30
Emerging
3402 aishoot/DTWSpeech

A simple application of DTW Algorithm in isolate word speech recognition.

30
Emerging
3403 hackzilla/SpeechRecognition

A simple yet powerful SwiftUI app for iOS that demonstrates speech...

30
Emerging
3404 rainu/wow-quest-reader

A World of Warcraft Addon which can read the quest text with meant of AI...

30
Emerging
3405 avarayr/yap-for-cursor

Yap for Cursor - Voice To Text integration for Cursor IDE

30
Emerging
3406 France-Travail/TradEmploi-FrontEnd

Frontend of TradEmploi

30
Emerging
3407 DanielLin94144/Test-time-adaptation-ASR-SUTA

Test-time adaptation for speech recognition model by single utterance. The...

30
Emerging
3408 Audio-WestlakeU/UMA-ASR

This repository is the official implementation of unimodal aggregation (UMA)...

30
Emerging
3409 AndreCoutinhom/voice_translator

With the help from a Youtube channel tutorial video, Chat GPT instructions...

30
Emerging
3410 kindo-tk/virtual_assistant

Personal Voice assistant using python

30
Emerging
3411 b4rtaz/voice-assistant-net-server

Voice Assistant Server for VSCode

30
Emerging
3412 AWAS666/Pngify.me

Pngtuber app build on Avalonia.UI with twitch integration and a ttspet

30
Emerging
3413 victormgross/RealVideo

📹 Create engaging video calls with RealVideo, a WebSocket-based system that...

30
Emerging
3414 poretsky/freespeech

English text preprocessor for MBROLA speech synthesizer

30
Emerging
3415 lingualogic/speech-framework

Javascript/Typescript Framework für Spracheingabe/ausgaben und Dialogverarbeitung.

30
Emerging
3416 mohanchandrass/Sentient-NPC-Lightweight-Offline-AI-Voice-Dialogue-Framework

A research-oriented lightweight offline AI NPC dialogue and voice...

30
Emerging
3417 Ryan-M-Smith/Quinton-VoiceAssistant

A simple voice assistant

30
Emerging
3418 allpaqa-jgk/twitch_text_to_speech_bot

Text to Speech bot using Twitch IRC for mac and (linux and windows

30
Emerging
3419 AEJays/edge-tts-nodejs

Node version of edge-tts / Node版本的edge-tts

30
Emerging
3420 SaranshKejriwal/Harold_Finch

Face recognition via voice Commands (OpenCV Python + SpeechRecognition 3.1.3)

30
Emerging
3421 mohaimenulislamshawon/text-to-voice-speech-converter

The program is created based on google text to speech or voice converter...

30
Emerging
3422 CMsmartvoice/Unet-TTS

One-shot TTS with Improved Unseen Speaker and Style Transfer

30
Emerging
3423 jharrilim/RasaDocker

Docker image with Rasa + Anaconda + Tensorflow + portaudio + PyAudio +...

30
Emerging
3424 Aditya-Mishra799/NLP-Speech-Translator-Website

A modern web application for translating and converting speech to text in...

30
Emerging
3425 dilukshann7/Vocaluxe

Python program to extract vocals from YouTube videos for free

30
Emerging
3426 nikolaStanojkovski/Assistive_Bus_Helper

An Android application that allows visually impaired people to hear which...

30
Emerging
3427 svarlamov/aws-polly-node-typescript-demo

Demo of how to use AWS Polly text-to-speech in a web app using NodeJS,...

30
Emerging
3428 SzLeaves/asr-webapp

ASR Web APP 中文语音识别实验室APP,使用Django构建,包含中文语音转文字与中文语音聊天机器人模块

30
Emerging
3429 netcookies/Edge-TTS-Proxy

Edge-TTS-Proxy 插件将 Microsoft Edge TTS(文本到语音)服务集成到 Home Assistant...

30
Emerging
3430 davealaw/kokoro-electron

Kokoro TTS GUI - a user-friendly Electron application for local neural...

29
Experimental
3431 Avatar-Home-Automation/A.V.A.T.A.R-Server

Agnostic Virtual Assistant for The Automated Residences

29
Experimental
3432 ltphen/martha

Free text to speech synthesizer made with coqui-ai/TTS and flask

29
Experimental
3433 ryuuji06/keyword-spotting

In this repository, I implement a system for detecting specific spoken words...

29
Experimental
3434 answersolutionsapps/runandread-android

Ultimate Text-to-Speech and Audiobook Player for Android

29
Experimental
3435 Sls0n/desktop-assistant

A python-based desktop assistant that can perform a few mundane tasks!

29
Experimental
3436 wavekat/wavekat-turn

Turn detection library for Rust with a unified trait interface over multiple...

29
Experimental
3437 InuInu2022/NodoAme.Home

An official website for NodoAme

29
Experimental
3438 cmirnow/Google-Cloud-TTS-Rails

Using the power of Google Cloud Text-to-Speech API and ruby here is a simple...

29
Experimental
3439 yousefhany77/tts-ai

The Text-to-Speech Library provides a simple unified interface for...

29
Experimental
3440 nclv/RecoVoc

Projet de reconnaissance vocale

29
Experimental
3441 rajjitlai/MimicTTS

MimicTTS is a tool for Voice cloning from a short audio clip. Powered by...

29
Experimental
3442 Ilikepizza2/localspeech-AI

A one command Voice AI deployment script for MacOS. Supports Sesame, Kokoro,...

29
Experimental
3443 Rushi128/voice_assistance

The application is built using Python with Flask for the backend,...

29
Experimental
3444 djelia-org/djelia-js-sdk

Javascript client for interaction with djelia models throught it's API

29
Experimental
3445 hchiam/please

An experimental programming language (transpiler) to make it easier to write...

29
Experimental
3446 yash2410/Avon

A speech recognition based home automation system

29
Experimental
3447 luan78zaoha/TTS_tflite_cpp

TTS inference in C++ based on TFlite model

29
Experimental
3448 khaykingleb/hifi-gan

Neural vocoder for high-fidelity speech synthesis (implementation of the...

29
Experimental
3449 nacerbaaziz/nbsapi

a python library that helps you to control the sapi5 TTS

29
Experimental
3450 r9y9/jsut-lab

HTS-style full-context labels for JSUT v1.1

29
Experimental
3451 vietai/ASR

End-to-End Vietnamese Speech Recognition using wav2vec 2.0

29
Experimental
3452 oloflarsson/whisper-spoon

🎙️ Whisper STT Shortcut for Hammerspoon (macOS)

29
Experimental
3453 itsanuragkumarjha/Voice-chat-enabled-RAG-chatbot-with-real-time-internet-access

An open-source project that uses cutting-edge NLP models and real-time web...

29
Experimental
3454 zemags/golang-yandex-speech-kit

SDK for converting text to audio by Yandex premium voices

29
Experimental
3455 GmEsoft/CTS256A-AL2

Commented disassembly of the GI(tm) CTS256A-AL2(tm) Code-To-Speech Processor

29
Experimental
3456 KelvinCampelo/open-aiudio-client

This Next.js application provides a user interface for interacting with...

29
Experimental
3457 deepgram-starters/csharp-live-text-to-speech

Get started using Deepgram's Live Text-to-Speech with this C# demo app

29
Experimental
3458 zolomohan/speech-recognition-in-javascript-starter

Starter Code for Speech Recognition in JavaScript tutorial.

29
Experimental
3459 mochi-neko/VOICEVOX-API-unity

Binds VOICEVOX text to speech API to pure C# on Unity.

29
Experimental
3460 led-mirage/CoeiroClip

COEIROINKでクリップボードに貼り付けられたテキストを読み上げるアプリです。

29
Experimental
3461 matievisthekat/MyOnlyFriend

A program I made so I could talk to someone ;(

29
Experimental
3462 nsoojin/VoiceControlSample-iOS

Creating a stateful UI with GameplayKit - Voice Control

29
Experimental
3463 smswg/callwg

语音呼叫系统-外呼系统,2026年真正可商用CALLWG语音呼叫系统,语音呼叫系统功能:机器人话术外呼系统|呼叫中心|VIP队列|来电记忆|ASR语音识别...

29
Experimental
3464 guozhonghao1994/Voice_Activity_Detection_V1

2018 Lenovo AI Lab Summer Intern

29
Experimental
3465 StanGirard/quivr-whisper

Talk to your second brain personal assistant using speech 🧠

29
Experimental
3466 MycroftAI/ZZZ-RETIRED__openstt

RETIRED - OpenSTT is now retired. If you would like more information on...

29
Experimental
3467 UserBeingOfficial/ai-dictionary-koreader

📖 Enhance your reading experience with AI Dictionary, a KOReader plugin that...

29
Experimental
3468 LohChiaHeung/TechTutor

TechTutor is an Augmented Reality (AR) and AI-assisted mobile learning...

29
Experimental
3469 eujuliu/anki-deck-generator

This tool allows users to create Anki cards with words, meanings, examples,...

29
Experimental
3470 mmerlyn/asl-translator

Empowering the deaf and speech-impaired with a real-time ASL translator that...

29
Experimental
3471 Langhalsdino/StageMate

StageMate is the smart assistant for your presentation. It will cover all...

29
Experimental
3472 botbahlul/VOSK-Powered-LIVE-SUBTITLE

ANDROID APP that can RECOGNIZE ANY LIVE AUDIO/VIDEO STREAMING (using VOSK...

29
Experimental
3473 Siemko/boar

boarBot :boar: voice assistant

29
Experimental
3474 primepake/learnable-speech

This repo is text to speech with learnable audio encoder without alignment...

29
Experimental
3475 oasisnoehub/OsisnoeAISpeech

English Text to Speech AI web app: You can better practice your english...

29
Experimental
3476 bykemalh/S2ST

Speech to Speech Translation Python

29
Experimental
3477 makeuseofcode/PDF-to-Audiobook

Python project to convert an eBook pdf to an audiobook.

29
Experimental
3478 DillionLowry/NeuralCodecs

Neural Audio Codecs implemented in C# - DAC, SNAC, Encodec, Dia

29
Experimental
3479 rohankishore/Submind

🎧 Submind is a modern PyQt6 app for generating subtitles (SRT) using Whisper...

29
Experimental
3480 TheIncredibleVee/sqlized

Easy to use, flexible, and user-friendly SQL running app with voice command support

29
Experimental
3481 hezhizheng/cantonese-cool

一个能讲广东话(粤语)的小程序

29
Experimental
3482 6Morpheus6/IndexTTS2

[NVIDIA, MAC, ROCM] Emotionally Expressive and Duration-Controlled...

29
Experimental
3483 dalehumby/openWakeWord-rhasspy

openWakeWord for Rhasspy

29
Experimental
3484 stefanbringuier/youtube-transcripts

Pass in a YouTube URL and to generate a transcript of the audio

29
Experimental
3485 Qappevox/Voice-Assistant

I'ts just a voice asistant for windows.

29
Experimental
3486 OpenVoiceOS/ovos-docker-tts

Open Voice OS TTS Docker images

29
Experimental
3487 riedemannai/parakeet-mlx-server

OpenAI-compatible FastAPI server for German neurology and neuro-oncology...

29
Experimental
3488 Hexanol777/Kikiyomu

聞き読む. real-time text-to-speech tool for VNs

29
Experimental
3489 pschatzmann/arduino-flite

A small fast portable speech synthesis system

29
Experimental
3490 mpoyraz/wav2vec2-turkish

Turkish Speech Recognition using Facebook's Wav2vec 2.0 models

29
Experimental
3491 sooftware/jasper

PyTorch implementation of "Jasper: An End-to-End Convolutional Neural...

29
Experimental
3492 stitchng/infobip

A NodeJS Wrapper for InfoBip

29
Experimental
3493 german-asr/megs

A merged version of multiple open-source German speech datasets.

29
Experimental
3494 bharathraj-v/fastconformer-ctc-telugu

NVIDIA NeMo's stt_en_fastconformer_ctc_large finetuned on open-source telugu...

29
Experimental
3495 Syedjunaid30/Video_Dubbing_with_ML_driven_Lip_Synchronization

AI-powered video dubbing tool that translates and synchronizes speech with...

29
Experimental
3496 Tugaytalha/NarraPhon

NarraPhon: Advanced Text-to-Speech Conversion Pipeline NarraPhon is a...

29
Experimental
3497 deepgram-starters/go-text-to-speech

Get started using Deepgram's Text-to-Speech with this Go demo app

29
Experimental
3498 chinasilva/MySmartPc

利用微信文件助手,进行语音或者文字控制电脑

29
Experimental
3499 Br3n0k/transcriber

AI-powered transcription for audio & video with Whisper — self-hosted, fast,...

29
Experimental
3500 ictnlp/SLED-TTS

Streamable Text-to-Speech model using a language modeling approach, without...

29
Experimental
« Prev 1 2 3 33 34 35 36 37 80 81 82 Next »