All Voice AI Tools

8,165 tools ranked by quality score · Page 71 of 82

Showing 7001–7100 of 8,165
# Tool Score Tier
7001 traceypooh/audio2text

creates text from audio of A/V input file, using docker, sphinx. extracts...

13
Experimental
7002 BenjaminDanker/Audio-Cleaner-Web

AI-powered video audio noise reduction in the cloud using DeepFilterNet3 and...

13
Experimental
7003 littleAvel/voice-agent

End-to-end voice AI system demonstrating ASR, LLM-based planning, vector...

13
Experimental
7004 sridattb96/MeetingStory

A project I built while doing research for a professor in the Visual &...

13
Experimental
7005 Vlad1343/Sign-Wave

Real-time Ukrainian Sign Language translator using computer vision and...

13
Experimental
7006 ZET-Speech/ZET-Speech-Demo

ZET-Speech: Zero-shot adaptive Emotion-controllable Text-to-Speech Synthesis...

13
Experimental
7007 cser245086272/ComfyUI-FL-Qwen3TTS

🎤 Create realistic text-to-speech outputs with advanced voice cloning and...

13
Experimental
7008 stefanpietrusky/QUEST

Repository for the QUEST App prototype.

13
Experimental
7009 Rayyan9477/speech-app

AI Language Processor is a powerful application that leverages...

13
Experimental
7010 saharshmehrotra/Stutter-Detection-and-Classification

System for classifying stuttering in speech and identification of various...

13
Experimental
7011 avd1729/Textify

Textify is a privacy-first Android keyboard that learns from your typing...

13
Experimental
7012 aidayang/Spark-TTS-OneClick

Spark-TTS文字转语音及声音克隆软件免安装一键启动整合包

13
Experimental
7013 rendchevi/daisy-tts

🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding...

13
Experimental
7014 ZhichuCen/ChunJi

唇记-一种助盲语音文字编辑系统 A text editor with Chinese voice control

13
Experimental
7015 asiff00/Orpheus-TTS-Local

Run Orpheus TTS locally.

13
Experimental
7016 NafisRayan/AI-Voice-Assistant-ST

AI voice assistant made with Streamlit python and powered by Gemini, Mistral...

13
Experimental
7017 fclaeys/nix-nerd-dictation

🎤 Nix flake for offline French speech-to-text with nerd-dictation....

13
Experimental
7018 carlfm01/my-speech-datasets

My public domain speech index

13
Experimental
7019 kocharvishal/Fast-Speech-Transcription-Grammar-Scoring-Engine

Built a transcription system using OpenAI’s Whisper and Fine-tuned...

13
Experimental
7020 maum-ai/sane-tts

SANE-TTS: Stable And Natural End-to-End Multilingual Text-to-Speech

13
Experimental
7021 Sumit0ubey/TorvixAI

TorchAI is an Android app that combines AI chat and voice assistance with...

13
Experimental
7022 lymcho/story-to-video

Create a fully narrated YouTube audiobook channel in one command. AI...

13
Experimental
7023 vantix-code/VoiceSnap

AI-powered voice memo app that transforms recordings into bullet points,...

13
Experimental
7024 kavanatn/EchoVerse

AI-powered audiobook generator that converts text and documents into...

13
Experimental
7025 morelen17/tts-papers

List of papers about TTS / Список статей о TTS

13
Experimental
7026 danielrosehill/ASR-And-STT-AI-Notebook

Propmts and outputs (and some notes) on STT + ASR + fine-tuning. LLM: Claude

13
Experimental
7027 moego0/ai-assistant

A powerful AI assistant built with Python for desktop control, smart...

13
Experimental
7028 NimbleAINinja/swift-scribe-rs

Fast, on-device speech-to-text transcription for macOS using Apple's Speech framework

13
Experimental
7029 DaivikPatel0/Speech-to-Text_Traditional_Method

A speech recognition project using traditional methods like HMM (Hidden...

13
Experimental
7030 onwurahben/meeting-assistant

Transform raw meeting audio into speaker-aware transcripts, summaries, and...

13
Experimental
7031 FlyingFathead/huuda

Finnish TTS (text-to-speech) framework with Finglish capabilities

13
Experimental
7032 Bilal742/text-to-speech-converter

simple Text-to-Speech Converter web app built with HTML, CSS, and...

13
Experimental
7033 Gokila-S/smart-translate

Smart Translator is a modern MERN stack application that allows users to...

13
Experimental
7034 lucasvmigotto/emotion-analysis

Audio emotion classifier with fine tuned openai/whisper-large-v3

13
Experimental
7035 IshaanLabs/Text-to-Speech-TTS

Open Source Text-to-Speech (TTS) repository

13
Experimental
7036 thc1006/MTK-Breeze-ASR-25-colab-transcriptor

Taiwan Mandarin speech-to-text transcriber using MediaTek Breeze-ASR-25....

13
Experimental
7037 antarades/emotion-aware-automatic-speech-recognition

An intelligent speech recognition system that combines OpenAI's Whisper for...

13
Experimental
7038 apptornado/speechdown

Building a speech recognition app with three coding agents

13
Experimental
7039 ItxMatti/tts

🗣️ Deploy high-quality text-to-speech services with Gemini, OpenAI, and...

13
Experimental
7040 magdalena-trivina/goethe-zertifikat-b2-wortliste

Goethe Zertifikat B2 Vocabulary Companion

13
Experimental
7041 elloza/slides2video-pinokio-script

Pinokio script for installing the app slides2video

13
Experimental
7042 hwanyyy/preprocessing-of-speech

VAD + resampling | High resolution spectrogram

13
Experimental
7043 remsky/prebuilt_tts_wheels

Prebult wheels for dependencies of TTS service; Kokoro-FastAPI

13
Experimental
7044 deeplearningcafe/animespeechdataset

Dataset Generation for Language Model Training and Text-to-Speech Synthesis...

13
Experimental
7045 AdityaKshettri/Speech_Recognition_Using_MATLAB

Implementation of Speech Recognition System in MATLAB Environment using...

12
Experimental
7046 theshajha/whisper-realtime-speech-to-text-summary

Transcribe real-world speech with an API call. Based on Whisper(ASR by...

12
Experimental
7047 fr45201-collab/Jarvis-ai-assistance-Python

A Python-based voice assistance project using text and voice command

12
Experimental
7048 parthshiv/use-googleAI-with-python

AI-powered voice assistant built with Python, Google Gemini, and...

12
Experimental
7049 unnatii14/sleepytales-bedtime-stories

A beautiful bedtime story app for children with 35+ stories, sleep music,...

12
Experimental
7050 Kabilduke/VoiceBot

Voice-Bot using Streamlit, Groq, SpeechRegconition, Pyttsx3, Python.

12
Experimental
7051 adhikary97/Epub-Reader

Convert epub to text then read text

12
Experimental
7052 CatanduYago/Subtitler

[ESP] Aplicación para transcribir a texto el audio recibido por micrófono. |...

12
Experimental
7053 ragymorkos/Subtitle-Alignment-Algorithm

This repository contains tested pseudo-code for a subtitle alignment...

12
Experimental
7054 duonghieu7104/TikTok-Video-Scan

🤖 TikTok video analyzer using AI: Speech transcription, OCR, object...

12
Experimental
7055 ggalmury/ai-tutor-app

Smartphone education app for the elderly

12
Experimental
7056 Muzammil-crypto/KidsQueApp

PakQueKId is an interacting app for kids to learn about Pak history,...

12
Experimental
7057 anubhavparas/automated_minutes_of_meeting_generator

Developing a tool to convert audio calls into structured documents as...

12
Experimental
7058 Swathi-88/JARVIS-AI

A voice-controlled desktop AI assistant for Windows featuring OpenAI...

12
Experimental
7059 NicosNicolaou16/TextToSpeechSetup

This project demonstrates the setup for Text-to-Speech.

12
Experimental
7060 IkwhanChang/Twilio-IVR-Chatbot

Twilio + IVR-based chatbot with Google Speech-to-Text API

12
Experimental
7061 gogabs/pyscrout

Output text to speech and braille

12
Experimental
7062 Abhijit-71/Jarvis

A command line ai-chatbot with support for speech input and output , written...

12
Experimental
7063 DozenPartsOfTheKing/Kyutai-STT-TTS-service

Updating the original Kyutai with Docker service, fine-tuning with Russian...

12
Experimental
7064 Nahuel1819/Nexus-assistant

Nexus is a high-performance virtual assistant designed to run in the...

12
Experimental
7065 emmaly/elevenlabs-tts

ElevenLabs TTS for Home Assistant

12
Experimental
7066 oortur/text-to-speech

Text-to-speech system with Mel-spectrogram generator and duration predictor

12
Experimental
7067 WorldWideDevelop/text-to-video

Transform your text into captivating, lipsynced animated videos...

12
Experimental
7068 dj-ayush/MetaSynAI

MetaSynAI is an AI‑driven accessibility framework that enables seamless...

12
Experimental
7069 yqli2420/speech_synthesis_and_speech_recognition_papers

tts papers: http://yqli.tech/page/tts_paper.html

12
Experimental
7070 thangtran480/SpeechApp

speech to text

12
Experimental
7071 SheidaAbedpour/Smart-Home-Assistant

AI-powered multilingual smart home assistant with voice control using LLaMA...

12
Experimental
7072 MaximGorshunov/SaluteSpeechTools

Speech synthesis and recognition tools that uses SaluteSpeech API

12
Experimental
7073 Yangyangii/F0-DCTTS

DCTTS with F0

12
Experimental
7074 shark6438/SmartPartyLearning

基于大模型 (LLM) + RAG 检索增强 + TTS 语音合成的智慧党建理论学习系统。Smart Party Building Learning...

12
Experimental
7075 teekennedy/glados-piper-addon

GLaDOS as a Home Assistant TTS integration

12
Experimental
7076 Xeven777/supertonic-demo

Supertonic TTS is a text-to-speech system built for speed and efficiency. It...

12
Experimental
7077 jaygajera17/Text-Editor-React

Text editor,speech,analysis using react

12
Experimental
7078 matthew-trump/speech-synthesis-angular

Angular app providing demo of how to use SSML-based text-to-speech services...

12
Experimental
7079 MaharshPatelX/Speechitive

A Video analytics tool converting videos to M3U8 playlists using HLS and...

12
Experimental
7080 parham-ab/Voicy

text-to-speech & speech-to-text web application using vanilla JavaScript &...

12
Experimental
7081 hlorenzi/vowel-analysis

Vowel formant frequency synthesis and analysis on the browser --...

12
Experimental
7082 emjose/kboard

A virtual keyboard with English and Russian modes, with speech recognition...

12
Experimental
7083 JagratiVerma1408/GoogleAssistant

Andriod App clone Google Assistant

12
Experimental
7084 resatDev/Voirex

Voirex - Speech to Function API

12
Experimental
7085 Adibian/Persian-TTS-Zoo

A collection of Persian text-to-speech models using implementations and techniques.

12
Experimental
7086 DJJ547/CMPE273-Book-Reader-React

An AI-powered book reader app with search, library management, and...

12
Experimental
7087 bhalla98/LinguisticTagger

Segments natural language text and tags it with different parts of speech.

12
Experimental
7088 aryansh77/Voice_Assistant_Energy_Saver

A Java-based Voice Assistant that automates energy-saving tasks and...

12
Experimental
7089 Abhi5h3k/Android-project-ASR-Demo

Google Cloud Speech-to-Text

12
Experimental
7090 coderpawan/MultiTranslator

This is my Multi Language Translator App designed to ease the learning of...

12
Experimental
7091 Guuri11/ISI

ISI is a highly advanced artificial intelligence system designed to provide...

12
Experimental
7092 yehuohan/ln-asr

Automatic Speech Recognition

12
Experimental
7093 pboechat/psittsa

An offline Text-To-Speech service you can host at home

12
Experimental
7094 OZIOisgood/alfa

AI-powered educational video generator - transforms problems into animated...

12
Experimental
7095 muqadasejaz/Speech-Recognition-System-

Speech Recognition System is a Python-based project that converts speech to...

12
Experimental
7096 modernecotech/Automatic_puppet_theatre

An application to translate text to robot mouth movements including...

12
Experimental
7097 aJIEw/BaiduVoiceRecognitionDemo

百度语音识别 REST-API Demo (React-Native)

12
Experimental
7098 tjysdsg/aidatatang_force_align

Perform force alignment on Mandarin data using aidatatang pretrained model...

12
Experimental
7099 Sreyan88/Indic-ASR

Repository for pre-trained wav2vec 2.0 models on 7 Indian languages

12
Experimental
7100 Agrover112/Kaldi-notes

Resources helpful for Kaldi

12
Experimental
« Prev 1 2 3 69 70 71 72 73 80 81 82 Next »