All Voice AI Tools

8,165 tools ranked by quality score · Page 70 of 82

Showing 6901–7000 of 8,165
# Tool Score Tier
6901 egorsmkv/whisper-ukrainian

Trainer and Evaluation scripts for fine-tuning Whisper models for the...

14
Experimental
6902 mandar3051982/tiny-tts

Deliver natural English speech with an ultra-lightweight, end-to-end...

14
Experimental
6903 NAJL123/voice-ai-assistant

Local Voice AI Assistant — faster-whisper STT + Ollama LLM + pyttsx3 TTS

14
Experimental
6904 kyugakai/NeuraVoice

🗣️ Elevate your workflow with NeuraVoice, an AI desktop assistant that...

14
Experimental
6905 rutchanon17493/Sakura-Voice

Build real-time, low-latency voice assistants supporting 23 Indian languages...

14
Experimental
6906 H0NEYP0T-466/NeuralMate

NeuralMate 🤖 is your smart AI personal assistant 🧠, built to help you work...

14
Experimental
6907 RighteousW/sign_avatar

Real-time bidirectional translation between speech and Namibian Sign...

14
Experimental
6908 rizwiz104/voicely

Coach structured answers in real time during mock interviews with question...

14
Experimental
6909 mtepenner/vi

Meet Vi, a modular, voice-activated AI assistant built in Python. It...

14
Experimental
6910 ChanikyaSaiL/VoicePay

Voice & face-based secure payment and authentication platform with real-time...

14
Experimental
6911 Subhas6033/Talk2Hire

Talk2Hire is an AI-powered hiring platform for secure online interviews with...

14
Experimental
6912 89891383/Polish-Kick-TTS

🎙️ Darmowy system Text-to-Speech dla polskich streamerów Kick.com. Łatwa...

14
Experimental
6913 maritza310308/audiobook-toolkit

🎧 Manage your audiobooks efficiently with this toolkit that converts Audible...

14
Experimental
6914 naseem1amjad/Python-AI-VoiceChatGPT

Use ChatGpt (openAi) by Voice i.e. using text to speech and speech to text....

14
Experimental
6915 cameroncruz/dog-voicebot

Voice-enabled dog chatbot for emotional therapy. 🐶

14
Experimental
6916 adrianwedd/spark

SPARK — a Claude-powered robot companion for a neurodivergent kid. Built on...

14
Experimental
6917 MendoLeo/tts-dataset-pipeline

Democratizing speech technology: the simplest way to create custom TTS and...

14
Experimental
6918 deepgram-devs/dg-sagemaker

Example code to call Deepgram APIs on Amazon SageMaker

14
Experimental
6919 Emmanuel-PaulMaah/liguscribe

Real-time courtroom transcription

14
Experimental
6920 asainov1/voice-ai-agent

Voice cloning pipeline for AI agents — F5-TTS zero-shot inference, Whisper...

14
Experimental
6921 Lishadsza/my-city-speaks

My City Speaks is an innovative web application that combines AI-powered...

14
Experimental
6922 Fencelineanapsid199/music-scribe

Analyze any YouTube track's audio to extract key, BPM, chords, time...

14
Experimental
6923 manchenkoff/python-assistant

Simple GUI application to emulate voice assistant workflow [just for fun]

14
Experimental
6924 Shantika123/Jarvis

Developed a Python-based virtual assistant that performs voice-controlled...

14
Experimental
6925 Daliaalkilani/Sign-Language-Translator

A Python-based system for real-time two-way translation between sign...

14
Experimental
6926 bmwasaru/kiswahili-speech-normalization

Kiswahili text normalization utilities for speech datasets (ASR/TTS)

14
Experimental
6927 alijavid110/SeeSense-AI

👁️🗨️ Empower vision with SeeSense-AI, a browser-based tool that enhances...

14
Experimental
6928 Vasanth2005kk/VoxLibri

VoxLibri: The Ultimate AI-Powered eBook to Audiobook Converter. 🎧📚 Transform...

14
Experimental
6929 voothi/20250902105308-anki-no-tts

A simple Anki add-on to globally disable all Text-to-Speech (TTS) playback

14
Experimental
6930 iLuiz07/DesiYatra

✨ Streamline your travel with DesiYatra, an AI system that negotiates local...

14
Experimental
6931 Jaya30102003/Voice-Assistant-for-Blind

A web-based voice assistant that empowers visually impaired users to perform...

14
Experimental
6932 Verma-Siddharth/empathy-engine

AI-powered TTS that detects emotion and modulates voice — speed, pitch — to...

14
Experimental
6933 funkyfranky/TTS-Radio

Create voice overs with radio effects for DCS

13
Experimental
6934 metacore-stack/Voice-to-Insights

Enterprise AI platform that transforms audio meetings into structured...

13
Experimental
6935 codekraft-studio/react-speech

A simple React component to deal with browser SpeechRecognition

13
Experimental
6936 kingjethro999/silero-test

Made Silero Hostable for api requests

13
Experimental
6937 namphung134/ASR-Vietnamese

Fine-tuning the openai/whisper-small model on the 250h dataset for...

13
Experimental
6938 AnshGaikwad/Personal-Voice-Assistant

Personal Voice Assistant: Easy to change the code and making it suitable for...

13
Experimental
6939 Diluksha-Upeka/Voxis

Voxis is an intelligent voice assistant powered by Groq's AI models,...

13
Experimental
6940 jaychampaneri14/voice-to-video-avatar

Convert voice/text to animated avatar video

13
Experimental
6941 metacore-stack/AuraVoice

Production-grade on-device AI meeting assistant featuring real-time...

13
Experimental
6942 Rumeysakeskin/ASR-Quantization

Post-training quantization on Nvidia Nemo ASR model

13
Experimental
6943 siddbhatt18/30-days-of-voice-agents

Murf AI's 30 Days of AI Voice Agents Challenge

13
Experimental
6944 RedDotz20/speech-to-text-recognition

🎤 Effortlessly integrate speech recognition capabilities into your React...

13
Experimental
6945 harlanx/voice_recorder_recognizer

An audio recorder and speech to text with commands recognition created using...

13
Experimental
6946 allvoicelab/allvoicelab

AI-powered audio creation platform offering TTS, Voice Cloning, Voice...

13
Experimental
6947 joachimhodana/rtTranslator

Simple overlay for Windows, that listens for background sound and translates...

13
Experimental
6948 madebyaris/dsw-voice

Real-time voice noise reduction app for macOS with virtual microphone support

13
Experimental
6949 m-mohsin-ali/closed-captioning-azure-speech-ai

This project demonstrates how to use Azure Cognitive Services with a...

13
Experimental
6950 Shubham8831/Article-to-Audio

An AI-powered web application that converts articles and URLs into...

13
Experimental
6951 Her-mia/Imgspeaker

An Android app written in Kotlin that performs OCR on Simplified Chinese...

13
Experimental
6952 labestia2/Qwen3-Audiobook-Converter

🎧 Convert various document formats into high-quality audiobooks with Qwen3...

13
Experimental
6953 wangjialiang678/speaklow-macvoiceinput

SpeakLow — a lightweight macOS menu bar app for voice-to-text input. Press a...

13
Experimental
6954 quochuy242/VNAVC

Data Pipeline for Text to Speech Project

13
Experimental
6955 RamirJunior/idox-ia-project

Projeto MVP com processamento de áudio com IA local

13
Experimental
6956 nipponjo/tts-german-pytorch

🎙️ German TTS (FastPitch) with Thorsten voice / emotional

13
Experimental
6957 upskaling/voice-keyboard

an interface for nerd-dictation in gtk

13
Experimental
6958 duanxianpi/AI-Voice-Diary

Using voice to keep a journal.

13
Experimental
6959 max-lt/voxtral-cpp

Local implementation for voxtral

13
Experimental
6960 kjanjua26/HearPapers

HearPapers allows you to listen to PDFs (by converting them to audiobooks,...

13
Experimental
6961 sammwyy/chat-tts

Chat TTS for your streams.

13
Experimental
6962 rk-vashista/TTS-Story_Generator

A versatile app that converts images into short stories and lifelike audio...

13
Experimental
6963 mzhang027/Gemini-Live-TTS

🎤 Transform text into natural-sounding speech with Gemini-Live-TTS, offering...

13
Experimental
6964 SelimHorri/txt-to-speech-funny-random-jokes

Consume random jokes APIs and make them as a speech

13
Experimental
6965 appsdothingsiguess/LocalStream-Transcriber

Transcribe local files and browser streams (Canvas, YouTube, and more) using...

13
Experimental
6966 chandankumarm55/Evolve-ai

future - image based answer , UI Improvements , youtube link based summary

13
Experimental
6967 JonPark0/web_audio_splitter

AI-powered audio source separation using Meta Demucs - Split songs into...

13
Experimental
6968 StrawTe/Comfyui-HAIGC-QwenTTS

🎤 Generate and customize voices with ComfyUI HAIGC Qwen3TTS, integrating...

13
Experimental
6969 quangkhai5122/signlanguagetrans

The application is deployed on the web of the ASL_Pytorch project, with...

13
Experimental
6970 SMIL-SPCRAS/DAVIS

Official repo for "Audio-Visual Speech Recognition In-the-Wild: Multi-Angle...

13
Experimental
6971 DOLMA-NLP/asr

Automatic Speech Recognition for Low-Resourced Middle Eastern Languages -...

13
Experimental
6972 manhph2211/ViTTS

In this repo, I developed a step-by-step pipeline for a standard...

13
Experimental
6973 kiraping1337/ChatTwitchTTS

Twitch TTS бот с клонированием голоса через XTTS v2. Озвучивание сообщений...

13
Experimental
6974 strcoder4007/S2S-Lipsync-UnrealAvatar-Backend

Unreal Metahuman Conversation Speech to Speech backend and frontend.

13
Experimental
6975 Srinath-N-R/IPA-Wav2Vec2-Phoneme-Recognition

End-to-end IPA-based phoneme recognition pipeline using Wav2Vec2, featuring...

13
Experimental
6976 oddvoices/oddvoices

An indie singing synthesizer

13
Experimental
6977 Irham-Azka17/AI-Audio-Transcriber

Transcribe offline audio recordings quickly with AI-powered, privacy-focused...

13
Experimental
6978 Karan36k/text2speech

A Basic But Useful Online Text to Speech Converter with a male voice...

13
Experimental
6979 di37/speech-to-text-fine-tuning-on-unseen-language

This projects aims to show how whisper model can be fine-tuned on language...

13
Experimental
6980 HealSpeak/HealSpeak-App

A free of cost Triage Assistant, this is the HealSpeak app.

13
Experimental
6981 hannabdul/etf4asr

Official repo for the paper "An Effective Training Framework for...

13
Experimental
6982 LauraKokkarinen/AzureAI.TextToSpeech

A console application for converting long-form plain-text files into speech...

13
Experimental
6983 Aryan9inja/Krishi-Setu

Voice-based AI system helping farmers access agricultural guidance via phone...

13
Experimental
6984 jfainberg/sincnet_adapt

Raw waveform adaptation with SincNet

13
Experimental
6985 YossefMohamed/covid-app-api

An Api for testing covid using cough sound

13
Experimental
6986 dom96/texttospeech

A Nim client for the Google Cloud Text to Speech API.

13
Experimental
6987 RutronikSystemSolutions/RDK3_BLE_EnOcean

Project used to illustrate how to use a RDK3 to interact with EnOcean BLE...

13
Experimental
6988 QuantumBeto/chines

🎤 Convert spoken Chinese into pinyin with this simple voice recognition...

13
Experimental
6989 unicodeveloper/voicery

Play with voices. Speak any language. Clone your vibe.

13
Experimental
6990 vshmyhlo/listen-attend-and-speell-pytorch

Implementation of Automatic Speech Recognition inspired by "Listen, Attend...

13
Experimental
6991 Maidana0/My-App

FullStack App - NextJs 14 - Nest JS - Deployment

13
Experimental
6992 dgop92/speech2diet

FitVoice/Speech2Diet is an application that allows people to track their...

13
Experimental
6993 Giuseppe-Della-Corte/IESTAC

A corpus that can be used to train English-to-Italian End-to-End...

13
Experimental
6994 akhilachiju/AI-Audio-Transcriber

Audio transcription app using Whisper AI for accurate speech-to-text...

13
Experimental
6995 nakshatra-garg/rvc-no-gui

Headless RVC voice cloning & training pipeline - Train and run voice...

13
Experimental
6996 Kiran8053/Speech-Emotion-Recognition

This project focuses on real-time Speech Emotion Recognition (SER) using the...

13
Experimental
6997 Himanshi-2519/Speech-To-Text-API

Capturing the Rhythm of your words. Real-time AI transcription with a...

13
Experimental
6998 AnjaneyaBhardwaj/Deafine_Frontend

A real-time audio transcription web application designed to make...

13
Experimental
6999 pukaa900/reagana

Ko taqaku konqamatuqa mo nqaaqaku meqa.

13
Experimental
7000 karim23657/ParsiGoo

ParsiGoo is a Persian multispeaker dataset for text-to-speech purposes. It...

13
Experimental
« Prev 1 2 3 68 69 70 71 72 80 81 82 Next »