All Voice AI Tools

8,165 tools ranked by quality score · Page 48 of 82

Showing 4701–4800 of 8,165
# Tool Score Tier
4701 thewh1teagle/whisper.zig

Transcribe audio with whisper in zig

23
Experimental
4702 halisuyanik/speech-recognition-note-app-vue.js-regex

Note application that converts voice command to text and performs voice...

23
Experimental
4703 MichaelFeng87/CGN_speech_recognition

Speech recognition using DNNs, script to create features, use kaldi for...

23
Experimental
4704 keymastervn/htksupport

Minimal HTK for supporting HTK in Vietnamese.

23
Experimental
4705 sherry-exec/urdu-tts-lib

Microsoft Speech SDK 11 - C# .Net 4 - Urdu Text-to-Speech System

23
Experimental
4706 JmKanmo/VoiceRecognitionMemoApp

Speech recognition and memo application

23
Experimental
4707 brihijoshi/iterative-feature-normalisation-ICASSP-2011

This repository contains a Python implementation of the paper "Iterative...

23
Experimental
4708 balavenkatesh3322/speech_to_text

It will convert our voice into text using Google speech API

23
Experimental
4709 mict-zhaw/chall_e2e_stt

End-to-end ASR experiments for language learning, focusing on...

23
Experimental
4710 ACinesi/nao-strips-planner

Ai project work about NAO robot strips planner.

23
Experimental
4711 auralshin/python

python tryout projects

23
Experimental
4712 Shaashwat05/Smart_clock

A smart clock which understands voice command and performs tasks accordingy

23
Experimental
4713 rahulkarda/Speech-Recognition

A Speech Recognition web app that converts speech to text in real time.

23
Experimental
4714 arch-ith/voice_to_signLanguage

Voice to Sign Language Conversion

23
Experimental
4715 UltraInstinct0x/vlc-auto-dub

AI-powered automatic video dubbing and transcription extension for VLC....

23
Experimental
4716 restacksyj/speech-emotion-detection

Final Year Project on Speech Emotion Recognition with CNN and LSTM.

23
Experimental
4717 alexiusstrauss/AudioTopic

Aplicação que processa arquivos de áudio (.mp3 ou .wav), convertendo-os em...

23
Experimental
4718 Rajesh42/VoiceAssistant

Build your own AI personal assistant using Python (Alexa and Jarvis both are...

23
Experimental
4719 divyanshuio/GPT_App

its a smart assistant that can answer any question

23
Experimental
4720 tellang/sonote

AI 에이전트를 위한 소리 노트 — 실시간 한국어 음성 전사 CLI

23
Experimental
4721 zsl24/Speech-Processing-Doc

一个关于语音算法技术汇总的文档

23
Experimental
4722 chirag127/ComicSpeak-AI-Web-Comic-Dubber-Browser-Extension

Transforms web comics into audio with AI-powered OCR and TTS

23
Experimental
4723 the-bird-F/Expressive-Vectors

[ICASSP 2026] Task Vector in TTS: Toward Emotionally Expressive Dialectal...

23
Experimental
4724 dvamsidhar2002/Project-VIVA-Personal-Desktop-and-Voice-Assistant

This is a personal desktop assistant which will do few tasks for you. It is...

23
Experimental
4725 Heatwave114/wazobia-open-speech-mobile

This is an open-source mobile application that augments the wazobia...

23
Experimental
4726 yuhanwang14/ASR-Pipeline

Local GPU-accelerated speech transcription pipeline with speaker diarization...

23
Experimental
4727 chandong83/NaverTTS_with_CSharp

NaverTTS with C#

23
Experimental
4728 Vilhaem/Teams-Notification-Bot

Notification Bot that calls user via teams or phone number and plays a...

23
Experimental
4729 NikhilKalloli/Voice-Recognition

A Streamlit web application for Voice recognition using a pre-trained speech...

23
Experimental
4730 DuyguA/TSD2025-Mind-the-Gap

Innovative ASR model to keep named entities intact, offered as a conference paper.

23
Experimental
4731 lliWcWill/maVoice-Linux

🎙️ Lightning-fast voice dictation Desktop Web App powered by Groq's Whisper...

23
Experimental
4732 yulinliu101/ASR_ATC

speech recognition system to transcribe ATC voice data

23
Experimental
4733 sse-digital-man/TTS-Core

数字人项目-TTS部分

23
Experimental
4734 standing-o/Combined_Dataset_for_Speech_Emotion_Recognition

A collection of dataset consists of a total of 8 English speech datasets for SER

23
Experimental
4735 jaju/voissistant

Voiss Aceistant - Apple only, with mlx.

23
Experimental
4736 daniel-keogh/wwtbam

A voice-controlled spin on "Who Wants to Be a Millionaire?", made with Unity

23
Experimental
4737 mbailey/push2type

Turn CAPSLOCK key into Dictation Key

23
Experimental
4738 code-spirit-369/text-to-speech-yt

This AI TTS web application allows you to convert any text into realistic,...

23
Experimental
4739 andreluizsecco/IoTVoiceControl

Demonstração do acionamento de dispositivos IoT através de comandos de voz,...

23
Experimental
4740 nihal-5/ditch-speechify

Free Speechify alternative - Stop paying $139/year. Listen to PDFs,...

23
Experimental
4741 KubiakJakub01/Valle2

Implementation of TTS and ASR model based on VALL-E X architecture

23
Experimental
4742 WhaddaMakers/RPi-colour-checker-tutorial

A Raspberry Pi is useful in all kinds of ways, even if you are looking to...

23
Experimental
4743 suryanktiwari/Artlet

Concept of a multi-content sharing and reading social platform. In app...

23
Experimental
4744 Kadir-Atmaca/Asistan-STT-Vosk

Bu depo stt yani speech to text Türkçesiyle sesi yazıya çevirme Türkçe şekilde

23
Experimental
4745 lgpearson1771/openwakeword-trainer

Train custom wake word models with openWakeWord. A granular 13-step pipeline...

23
Experimental
4746 hash2004/conformer-fine-tuned-urdu

This repository includes all the essential scripts and notebooks required...

23
Experimental
4747 GioPicci/videowise

VideoWise is a video transcription and AI-powered analysis tool that helps...

23
Experimental
4748 suzumushi0/SoundObject_source

SoundObject source code distribution.

23
Experimental
4749 xaeksx/ComfyUI-AudioSR

🎶 Enhance audio quality with ComfyUI-AudioSR, a versatile tool for upscaling...

23
Experimental
4750 Monal5031/TextToSpeech-Converter

A Simple Text To Speech Converter in java

23
Experimental
4751 ankuragrwl/google-tts

Application to try out Google Text to Speech API

23
Experimental
4752 yujiliu/oresta

Oresta - is the first voice assistant in the Ukrainian language.

23
Experimental
4753 zvz23/vProfanity

A software solution that automates the detection and censorship of profanity...

23
Experimental
4754 anujsahani01/Classification-Project

Intent and Entity Extraction and Classification from audio files

23
Experimental
4755 chiragjoshi12/pdf-to-podcast

Convert any PDF into a podcast episode using Gemini and Elevenlabs!

23
Experimental
4756 nyumaya/libnyumaya_esp32

Experimental support for nyumaya audio recognition on ESP32

23
Experimental
4757 InboraStudio/Google-Cloud-Speech-Recognition-Unity

Unity Speech Recognition with Google Cloud A cross-platform speech...

23
Experimental
4758 pulkitsxn059/Jarvis-PC-Assistant-

Implemented a Desktop PC Assistant Application in Java. The Application can...

23
Experimental
4759 ldl805/QuickSpeechPi

Very, very lightweight and simple text to speech (TTS) program that outputs...

23
Experimental
4760 bonniepeng2002/Apollo

Apollo: your intuitive, virtual nurse.

23
Experimental
4761 brayden-s-haws/speak_easy_text_to_speech

A straightforward way to convert text to speech.

23
Experimental
4762 alvarosg88/Talk-to-the-Bot

A WebGL demo that combines virtual reality, speech recognition and synthetic...

23
Experimental
4763 yepicaiaaron/awesome-audio-generation-2026

🎙️ Curated collection of open-source audio generation models released in...

23
Experimental
4764 python019/subui-speech-assistant

Python AI project

23
Experimental
4765 Kaljurand/K6nele-service

Kõnele service is an Android app that offers a speech-to-text service to...

23
Experimental
4766 Simone-Convertini/Speech-Summarization-Demo

A Web Api written using Go and Gin capable to perform Speech Summarization...

23
Experimental
4767 nicolas-dufour/self-supervised-low-res-speech

This project transfert the self supervised Wav2vec2 representation to low...

23
Experimental
4768 supevil/SoulX-Singer-Eval

🎤 Evaluate zero-shot Singing Voice Synthesis systems for quality, accuracy,...

22
Experimental
4769 atmehedi/Speech-to-text-in-Assamese

TASK ORIENTED DIALOG SYSTEM IN NATIVE LANGUAGE(ASSAMESE)

22
Experimental
4770 gaelic-ghost/speak-to-user

Local FastMCP text-to-speech server for shared macOS playback, voice...

22
Experimental
4771 leszini/spoken-mcp

Voice interface for Claude Desktop — hands-free conversations using...

22
Experimental
4772 k1rk11/CriTTS

A modern, free Text-to-Speech (TTS) application using Microsoft Edge's TTS engine

22
Experimental
4773 smivv/python-vosk-trial

Vosk Speech Recognition Trial

22
Experimental
4774 donapart/klatsch

Klatsch 🐾 — OpenClaw Local Agent: always-on voice assistant, peer...

22
Experimental
4775 seanox/seanox-ai-podcast

Automated podcast generation pipeline using a YAML-defined structure and...

22
Experimental
4776 hwpoison/vosk-voice-recognition-c

Offline voice recognition using pure C and vosk lib. (from file and from...

22
Experimental
4777 Chrisisaac948/RealWonder

Generate real-time videos conditioned on physical actions from a single...

22
Experimental
4778 ouracademy/speech-to-text

A project that show input text with speech recognition trought angular directive

22
Experimental
4779 ArMohadWaseem90/text2epub

📚 Convert TXT files to EPUB quickly with this Python script, ensuring smooth...

22
Experimental
4780 abcname61/audiobook-creator

🎧 Convert MP3 files into professional-quality audiobooks in M4B format with...

22
Experimental
4781 edwindoremi/Asterisk

🎮 Streamline esports tournaments with Asterisk, a real-time management...

22
Experimental
4782 jibon57/nativescript-azure-cognitiveservices

Azure cognitive services implementation for NativeScript.

22
Experimental
4783 0x61space/pu-cit371-helicopter-commander

Control a helicopter in Grand Theft Auto: San Andreas using speech recognition

22
Experimental
4784 ivsergeev/voicer

Голосовой ввод, GigaAM v3 e2e, opencode-plugin, русский язык

22
Experimental
4785 Noor-khalid/Selena

🚀 Accelerate your .NET applications with Selena, a zero-dependency library...

22
Experimental
4786 orbxball/timit-preprocessor

Extract mfcc vectors and phones from TIMIT dataset

22
Experimental
4787 Mliviu79/cartesia-go

Go SDK for the Cartesia AI API — TTS, STT, voice cloning, agents, WebSocket streaming

22
Experimental
4788 yauhenipakala/Yandex.SpeechKit.Xamarin

Yandex SpeechKit Mobile SDK for Xamarin

22
Experimental
4789 Artavazd2009/yandex-speechkit-php

Provide easy PHP access to Yandex SpeechKit API for audio transcription,...

22
Experimental
4790 MarceloSalazarV/Multimodal_Med_Ai_with_Deployment

🩺 Enhance patient care with MediBot 2.0, an AI doctor assistant that...

22
Experimental
4791 Ashish-Patnaik/Sonya-TTS

High-fidelity AI speech with emotion, rhythm, and audiobook mode

22
Experimental
4792 A-AhkUser/Dictation-Interface

dictation interface using UI automation via a chrome extension

22
Experimental
4793 priyanshu-baran/Voice_Assistant_Using_Java

Tried to make JARVIS (Voice Assistant) using Java

22
Experimental
4794 denz-pro/CoAI-PCB

CoAI-PCB offers an AI-driven PCB inspection module that detects defects with...

22
Experimental
4795 gustavhartz/voxtir

Collaborative transcription service that keeps getting better

22
Experimental
4796 rshivam08/Deaf-Assistant

An Android application for assisting deaf people

22
Experimental
4797 aitoraznar/ionic2-speech-recognition

ionic2 JS Speech Recognition

22
Experimental
4798 zry98/pomumd

Wyoming Protocol TTS and STT & MLX LLM server for iOS/macOS

22
Experimental
4799 duongdz-create/Voicebot-Reservation-system-for-Hotels

🛏️ Explore and book hotels effortlessly with our AI-driven voicebot,...

22
Experimental
4800 lancetodjk14/react-native-sherpa-onnx-stt

🎤 Enable offline speech recognition in React Native using sherpa-onnx,...

22
Experimental
« Prev 1 2 3 46 47 48 49 50 80 81 82 Next »