All Voice AI Tools

8,165 tools ranked by quality score · Page 73 of 82

Showing 7201–7300 of 8,165
# Tool Score Tier
7201 Arushi-Srivastava-16/SpatialAudio

SpatialAudio detects key objects using YOLOv8, identifies their location in...

12
Experimental
7202 NusrathFarheen/talkbuddy

An AI-powered voice chatbot built with JavaScript and Node.js to help you...

12
Experimental
7203 partha-09/VenomX-Windows-Voice-Assistant

VenomX is an advanced voice assistant for Windows, utilizing Python and AI...

12
Experimental
7204 djdhairya/J.A.R

JAR represents a paradigm shift in desktop interaction, optimizing task...

12
Experimental
7205 jayeshbhandarkar/GlobalSpeak

GlobalSpeak is a Python Flask web application designed to overcome language...

12
Experimental
7206 BatuhanYilmaz26/Youtube-Transcriber

Input a YouTube video link and get a transcription as a .txt, .vtt or .srt file.

12
Experimental
7207 zeeshanmahar007/Sign-Talk---Bridge-the-Gap-of-Communication

SignTalk is an android based application in which Hearing, or speech...

12
Experimental
7208 thuantn210823/ASR

This repo utilizes the popular and highly effective Conformer Encoder from...

12
Experimental
7209 louis030195/book2audiobook

text to speech public domain / free audio books

12
Experimental
7210 Alkohole/machine-reading-text

A small extension that adds the ability to voice text from YaBrowser to...

12
Experimental
7211 phil1px/voice-cloner

Flask/FastAPI + Gradio app for voice cloning with Resemble AI — upload,...

12
Experimental
7212 Rulikkk/digit-tutor

Digit Tutor is a simple online game for kids in Svelte, which uses speech...

12
Experimental
7213 Harsha0431/News-Scraper-Summarizer-Text-to-Speech

This project scrapes news articles, summarizes content using BERT or Gemini...

12
Experimental
7214 AbdulGani11/Vocably

Text-to-speech web application built with React, FastAPI, JWT...

12
Experimental
7215 VinnyVanGogh/cli-whisperer

🎤 Professional Voice-to-Text TUI Application - OpenAI Whisper + GPT with...

12
Experimental
7216 leeorhelps/SpeechBird

Speech Bird is a speech recognition system which makes complete hands-free...

12
Experimental
7217 dhcgn/ai-sample-scripts

Simple shell scripts for AI tasks (image description, transcription, TTS,...

12
Experimental
7218 DJJ547/CMPE273-Book-Reader-Django

An AI-powered book reader app with search, library management, and...

12
Experimental
7219 HuyDang05/Ringurooma

AI-powered Japanese speaking practice platform using n8n, and Azure AI for...

12
Experimental
7220 muhammadazhariqbal/schedule-ai-backend

serverless AI-powered service that helps convert audio to text and extract...

12
Experimental
7221 Mamrez/speech-recognition

Analogue speech recognition based on physical computing

12
Experimental
7222 manasgandhi99/Lappy-Voice-Assistant

A Laptop Voice assistant built using python that can perform multiple...

12
Experimental
7223 Cyan903/zundamon-yomitan

Fallback audio source for Yomitan which uses ずんだもん TTS.

11
Experimental
7224 verbio-technologies/cpp-verbio-speech-center

C++ integration with the Verbio Speech Center Cloud. https://speechcenter.verbio.com/

11
Experimental
7225 bryanrandell/ChatGPT_speech_to_speech

OpenAI and Google Cloud for a speech answer to speech respond from OpenAI ChatGPT

11
Experimental
7226 myselfaryan/uchchaaran

A sophisticated text-to-speech (TTS) system specifically designed for...

11
Experimental
7227 shubhomoydas/ai_raspberrypi

Voice instructions with AI for controlling LEGO motor connected to Raspberry Pi 5

11
Experimental
7228 sungjae-cho/ICASSP2020_STDemo

Show and Tell demonstration homepage

11
Experimental
7229 seven-io/StackStorm

Send SMS and make text-to-speech calls via StackStorm

11
Experimental
7230 MohamedNabill7/Alex-Virtual-Assistant-in-Smart-Home

Machine Learning Behind AI Smart Assistant in Smart Home

11
Experimental
7231 sj2tpgk/voiceroid-docker

Voiceroid+ in docker on X64/Arm linux + web interface (mirrored from...

11
Experimental
7232 RichFesler/node-red-tts-flask

Fast local text-to-speech system for Node-RED using Flask, Coqui TTS, and FFmpeg

11
Experimental
7233 driftingruby/395-transcribing-with-artificial-intelligence

In this episode, we look at creating an audio transcription service which...

11
Experimental
7234 Rtiwary-1/Voice-Based-Music-Playlist-Generator

This project was done as coursework for the subject of Database Management System

11
Experimental
7235 iamvon/AudioRead

Turn PDFs into audio with chunked LLMs and OpenAI TTS

11
Experimental
7236 aghezzafmohamed/Chatbot-with-Python-and-Deep-Learning

ChatBot that will help students in university. In order to reduce the...

11
Experimental
7237 taresh18/livekit-orpheus

LiveKit TTS plugin with Orpheus streaming support

11
Experimental
7238 test-dan-run/squim-report

Using TorchAudio-SQUIM to create dataset quality reports

11
Experimental
7239 trautodiag/Free-Local-AI-Voice-Cloning-Unlimited

Stop paying monthly fees. High-fidelity voice cloning on your own GPU. No...

11
Experimental
7240 polojudayamani-crypto/Shashitha-voice-assistant

Python voice assistant project

11
Experimental
7241 wasabina67/openai-tts-example

Openai tts example

11
Experimental
7242 fabianzimber/atelier-of-synthetic-voice

A professional, iOS-inspired studio for high-fidelity voice cloning and...

11
Experimental
7243 mrdiamonddirt/python-llm-interpreter

A python script that listens for a request plans a response tells you the...

11
Experimental
7244 Sunkware/notthatstuff

Large-language-model + Text-to-speech + Voice-cloning doppelgangers, trained...

11
Experimental
7245 alifarrokh/asr-from-scratch

ASR models implemented from scratch in PyTorch

11
Experimental
7246 Nexdata-AI/347-Hours-Italian-Speech-Data-Collected-by-Mobile-Phone

Italian Speech Dataset

11
Experimental
7247 rwmicro/voice-backend

Voice backend that provides acces to Kokoro, Chatterbox and F5-TTS.

11
Experimental
7248 AleefBilal/tts_srt_gen

A runpod serverless docker that generates TTS using chatterbox-tts along with .srt

11
Experimental
7249 JacobCoffee/dpo-reader

For those too lazy to read a DPO thread that is far too long. Option of good...

11
Experimental
7250 KunalSingh5431/smartPDF

AI-powered PDF summarizer with text-to-speech built using MERN stack.

11
Experimental
7251 r-shafi/bangla-speech-to-text

Automatic speech recognition for the Bangla language, one of the world's...

11
Experimental
7252 agdm/chatterbox-api

Fast API in front of Chatterbox

11
Experimental
7253 0x3EF8/Unified-API-Server

A modular, auto-loading REST API server built with FastAPI. Drop a service...

11
Experimental
7254 roperi/Podcast2Wordpress

Podcast2Wordpress automates podcast-to-blog post conversion on WordPress. It...

11
Experimental
7255 thomasthaddeus/TTSSolution

TTS Application written in C#

11
Experimental
7256 ittia-research/speak

Education oriented TTS inference server

11
Experimental
7257 codysnider/kokoro

Dockerized Kokoro TTS

11
Experimental
7258 jpdiazpardo/gutural_nlp

Gutural and scream automatic speech recognition (ASR) system using a...

11
Experimental
7259 GermanCentralLibraryForTheBlind/TTSOnDemand

Text to speech technology to speech-enable web sites

11
Experimental
7260 cookerwatcher/ChopItUp

Python scripts to perform speech recognition on video files, then chop them...

11
Experimental
7261 laafeiak/ai_text_reader

text

11
Experimental
7262 ayushirastogi15/Flask-Application-Development

This repository tells you how to develop a flask application for the speech...

11
Experimental
7263 jefflai108/scale

Some of my public work at https://hltcoe.jhu.edu/research/scale/scale-2017/

11
Experimental
7264 ReadieFur/AWS-Polly-for-SpeechChat

Reads out twitch, youtube and mixer chat from Speechchat using AWS Polly.

11
Experimental
7265 coco-whisper/Voice-Conversation-Audio-Generation-Platform-TTS-

A self-hosted platform for text-to-speech, voice conversion, and AI audio...

11
Experimental
7266 bliptron/Google-TTS-Server

A FastAPI server for Google Gemini Text-to-Speech with modern web interface....

11
Experimental
7267 Tailmc/Syaberunoda

VoiceVoxを使ったシンプルな読み上げボット

11
Experimental
7268 t1seo/karina-voice-notification

Clone any voice from YouTube to create custom Claude Code notification...

11
Experimental
7269 rounayak/Virtual-assistant

Python based virtual assistant that can understand speech,respond via speech...

11
Experimental
7270 dwain-barnes/vui-fastapi-server

A OpenAI-compatible Text-to-Speech API server powered by VUI - a small...

11
Experimental
7271 roopesharch/EchoSonic

Built and deployed a full-stack AI text-to-speech platform using FastAPI and...

11
Experimental
7272 ohboundless/HeyWindows

A basic voice command interface for Windows.

11
Experimental
7273 MrBlueBird2/jarvis-in-python

An amazing AI which will talk with you and, wikipedia, questions.

11
Experimental
7274 SyedSohail786/SaaS-Website

This project supports Text to image and Text to speech functionality which...

11
Experimental
7275 saroshfarhan/story-teller

Story-Teller

11
Experimental
7276 apribeiro/TextToSpeechApp

A simple C# console application that converts user input text to speech.

11
Experimental
7277 jokio/sdk

SDK for building decentralised localfirst web apps. Provides tts ai model...

11
Experimental
7278 E-Asrar-Haghighi/farsi-tts-generator-with-music

Convert Farsi text to speech using OpenAI TTS, with optional background...

11
Experimental
7279 Sarasadeghii/Sharif-Wav2vec2

This repo shows how to finetune the wav2vec2.0 model along with its prerequisites.

11
Experimental
7280 alokbhateshwar/virtual-assistant

"Python-based virtual assistant with voice recognition and text-to-speech...

11
Experimental
7281 chasmack/translate

Translation and Text-to-Speech for Anki Card Decks

11
Experimental
7282 Matthias84/speech2josm

JOSM presets via voice control

11
Experimental
7283 G3VV/Twine

🌿 A tool to automatically generate Reddit TTS, Comment Screenshots and JSON Data

11
Experimental
7284 nisheethjaiswal/Speech-to-Text

Speech to text implementation using transformers in PyTorch.

11
Experimental
7285 IJCS/Trainer-app

A lightweight and highly flexible tool designed to assist coaches....

11
Experimental
7286 nfreear/simple-speak

Power-tool wrapper around the browser Web Speech API —

11
Experimental
7287 shestaya-liniya/accentless

Shape your accent with AI

11
Experimental
7288 nafiuny/voice_conversion_dataset

top dataset for voice conversion models

11
Experimental
7289 caraleeqiu/mememeow

Practice English speaking with a carrot cat! Read along with YouTube/TikTok...

11
Experimental
7290 joeybronner/meeting-live-translation

🎤Live translation for your meetings using HTML5 Speech Recognition API

11
Experimental
7291 alecproj/microphone-module

Smart Home Microphone Module

11
Experimental
7292 aloukikjoshi/FinSpeak

🎙️ Voice-powered mutual fund assistant — Ask about NAV & returns in English,...

11
Experimental
7293 mateogon/Cadence

Cadence: immersive reading pipeline from EPUB to audiobook with synchronized...

11
Experimental
7294 speakingofdata/LJ2_Corpus

Single speaker, 26,200 transcribed audio recordings, 48 hours total

11
Experimental
7295 singleshade8/japanese-subtitle-generator

GPU-accelerated Japanese → English subtitle generator using faster-whisper...

11
Experimental
7296 bivex/voice_to_text

A Python application for real-time Russian voice-to-text transcription and...

11
Experimental
7297 D-Keqi/Implementation-for-ASR-by-API-of-Baidu

This is an open source code that you can use to connect to Baidu's API to...

11
Experimental
7298 balas-world/kitten-tts-web-demo

Kitten TTS Web Demo showcases the Kitten TTS Nano in your browser—a...

11
Experimental
7299 birros/dictations

Experimental progressive web application for dictations

11
Experimental
7300 ArshCypherZ/text-to-speech

Text to Speech API using kokoro.

11
Experimental
« Prev 1 2 3 71 72 73 74 75 80 81 82 Next »