All Voice AI Tools

8,165 tools ranked by quality score · Page 55 of 82

Showing 5401–5500 of 8,165
# Tool Score Tier
5401 nvjob/rain-voice-note

Rain Voice Note (Speech To Text). CW Frame App. JavaScript.

20
Experimental
5402 kdorichev/text2speech

Text-To-Speech Dataset Preparation and Architecture

20
Experimental
5403 Murlors/VITS_Japanese

VITS implementation of Japanese

20
Experimental
5404 AlinaBaber/Arabic-Speech-Recognition-by-Machine-learning-and-feature-extraction

This project implements an Arabic Speech Recognition system using an...

20
Experimental
5405 erendogan6/Translateify

An interactive English learning app with personalized daily word...

20
Experimental
5406 ruslanmv/VRSecretary

VRSecretary is a production-ready reference implementation for building...

20
Experimental
5407 jeswanthmukesh20/VocalText-Contrastive-Embedding

This repository features a CLIP-inspired contrastive model that aligns audio...

20
Experimental
5408 Obstacleee/StreamVoice

Ce projet permet de convertir des flux RSS en format audio, offrant ainsi la...

20
Experimental
5409 privapps/TTS-Parakeet

an easy to use English Text To Speech tool

20
Experimental
5410 theolepage/wavlm_ssl_sv

SOTA method for self-supervised speaker verification leveraging a...

20
Experimental
5411 Aqib121201/YOLO-R-CNN-Vision-Assistant-for-Visually-Impaired-Navigation

Edge-deployed assistive vision system with object detection + audio for...

20
Experimental
5412 eLearningHub/text2talk

Making training videos

20
Experimental
5413 lsa-pucrs-old/donnie-assistive-robot-sw

Donnie's software (Arduino Firmware, Player Drivers, Stage simulation...

20
Experimental
5414 d4rkmen/flatsphere

Clock with TTS for WaveShare ESP32-S3 Touch LCD 1.85"

20
Experimental
5415 volltin/xiaodou-bot

A simple voice-to-voice chatbot.

20
Experimental
5416 InuInu2022/LibSasara

The utility library for CeVIO project file (.ccs / .ccst) and timing label...

20
Experimental
5417 Soumo-git-hub/AI-News-Aggregator

An intelligent news aggregator (Python/JS) using spaCy for NLP topic...

20
Experimental
5418 arshc0der/Javscript-Mini-Projects

🧩 JavaScript Mini Projects – Beginner-Friendly Practice Projects This...

20
Experimental
5419 Nazmul0005/Personal_Voice_Assistant_Mili

Mili is a smart voice assistant built with Python and Google Gemini AI. It...

20
Experimental
5420 dreamerc/twitch-tts

Twitch Text-To-Speech Tool

20
Experimental
5421 dfgHiatus/NeosVoiceRecognition

Speech to text for NeosVR

20
Experimental
5422 NOime22/Web-listen

🎧 AI语音朗读助手 - Chrome浏览器扩展,支持划词朗读和截图OCR朗读

20
Experimental
5423 chicogong/ffvoice-engine

🎙️ 高性能 C++ 语音引擎 - 实时音频处理 + AI 语音识别 + 边录边转写 | High-performance C++ voice...

20
Experimental
5424 Nostalgiaaa/CyberClone

快速构建数字仿生人并存储在 Relic ( PC ) 中

20
Experimental
5425 petitwhito/Speech_to_text_project

Complete Speech-to-Text pipeline: from-scratch architectures (MLP, CNN, RNN,...

20
Experimental
5426 rodrigues-aline/wav2vec2_interpretation

Investigating wav2vec2 context representations and the effects of fine-tuning

20
Experimental
5427 mostafabahaa25/mediguide_MVP

AI-powered accessibility app that helps blind and low-vision users manage...

20
Experimental
5428 BillDuke13/cosyvoice-ray-serve-api

This project provides a Ray Serve-based HTTP API wrapper around CosyVoice, a...

20
Experimental
5429 danilop/easy-sonic

A simple, high-level Python SDK for Amazon Nova 2 Sonic speech-to-speech...

20
Experimental
5430 emre-guler/jarvis

A sophisticated AI-powered personal assistant inspired by Iron Man's JARVIS,...

20
Experimental
5431 thedigitalchief/voice-command-assistant

Powerful assistant performing powerful automated tasks from user’s voice...

20
Experimental
5432 jshperalta/ai-englishTutor

Artificial Intelligence English Tutor

20
Experimental
5433 alessandropec/data_driven_ai_voice_cloning

This repository contain the code of the main part of my master thesis degree...

20
Experimental
5434 Arnav3241/WebSpeechRecognition

v0.1.4 released: A Python library for speech-to-text integration using...

20
Experimental
5435 will-rice/diffwave

TensorFlow 2.0 Implementation of DiffWave: A Versatile Diffusion Model for...

20
Experimental
5436 anikashawarma/Silent-Voice-Lip-Reader

This is an AI enhanced lip reading application based on real-world videos...

20
Experimental
5437 webKing021/VoiceFlow-An-Automatic-NLP-Transcriber

VoiceFlow is a Windows push-to-talk voice-to-text application that...

20
Experimental
5438 LENSS/EMSAssist

This is the official artifact for EMSAssist paper on MobiSys'23. EMSAssist:...

20
Experimental
5439 elllusion/calibre

为linux发行版的Calibre添加Edge TTS | Add Edge TTS for calibre of linux

20
Experimental
5440 FioPio/pepper-language-grounding-system

This repo contains the implelemtation for a simple language grounding in...

20
Experimental
5441 lucylow/Yeezy-Taught-Me

Yeezy Taught Me Text Generation. Training next character predictions RNN...

20
Experimental
5442 giribabu22/assistant-Nikki-python

i developed this assistant using speech-recognition, selenium,...

20
Experimental
5443 ragibson/MFCC-speech-recognition

Real-time speech recognition via "Mel-Frequency Cepstral Coefficients"...

20
Experimental
5444 criadacasa/podcastfy-saas

SaaS platform for generating AI podcasts from multimodal content - Built...

20
Experimental
5445 lhg96/stt-demo-korean

Korean Speech-to-Text app with Whisper & Vosk | 한국어 음성인식 데모 애플리케이션

20
Experimental
5446 Oldes/Rebol-Speak

Rebol text-to-speech extension

20
Experimental
5447 awaseem/2day-api

Transform your writing into engaging AI-generated podcasts. Ditch the mics...

20
Experimental
5448 agent-whisper/grpc-whisper

gRPC server for OpenAI's Whisper Models

20
Experimental
5449 daisy/tobi

Tobi is a free, open source, multimedia book production authoring tool for...

20
Experimental
5450 ayzem88/text-to-speech-converter

أداة لتحويل النصوص العربية إلى ملفات صوتية باستخدام OpenAI TTS / Tool for...

20
Experimental
5451 cvcwebsolutions/vibe-local

Local voice-to-text with AI-powered text cleanup. Privacy-focused...

20
Experimental
5452 ugyenn-tsheringg/Image-Captioning-System-for-Visually-Impaired-Individals-using-CNN-LSTM-VQA-TTS

Developed a web-based image captioning system that evaluates feature...

20
Experimental
5453 Pendrokar/xVA-Synth-HFSpace

HuggingFace Space for xVASynth

20
Experimental
5454 dibasdauliya/better-speech-recognition

An improved speech recognition library with TypeScript support

20
Experimental
5455 Sxriptor/Whispra-Download

Whispra's Offical Download | AI-powered real-time voice and subtitle...

20
Experimental
5456 ShadowLp174/stt-example-bot

A basic discord bot but with voice commands

20
Experimental
5457 Entity047/Voice_AI_Creator

Python TTS and voice cloning framework for educational AI/ML demonstrations.

20
Experimental
5458 Sang-Buster/AeroLex-Editor

A powerful web-based editor for transcription and subtitle files with...

20
Experimental
5459 sanastasiou/dictation-service

GPU-accelerated speech-to-text service that types what you say, powered by...

20
Experimental
5460 neosun100/llasa-tts-8b-webui

🎙️ High-quality Text-to-Speech system based on Llasa-8B with intelligent GPU...

20
Experimental
5461 hd996/material-local

🎬 素材本地化

20
Experimental
5462 alpereee/SpeakerRecognition

🎙️ Makine öğrenmesi ile konuşmacı tanıma, sesten duygu analizi ve metne...

20
Experimental
5463 LucaBallan/wikipedia-aloud-reader

Read aloud wikipedia pages

20
Experimental
5464 gregunger-microsoft/Jarvis

AI-powered Microsoft Teams meeting assistant with voice interaction,...

20
Experimental
5465 wq2012/mdeval

Python implementation of the NIST md-eval.pl script for evaluating rich...

20
Experimental
5466 kolonist/edgetts

Use free Microsoft Edge's online text-to-speech service from golang

20
Experimental
5467 SirCryptic/cli-sms

use clicksend to send either sms or text to speech to a phone number via the...

20
Experimental
5468 Praneeth-Gandodi/Tars

TARS is a voice AI assistant that listens to your voice and responds in...

20
Experimental
5469 gikonyob/speake

Speake library provides a wrapper around Espeak to easily write efficient...

20
Experimental
5470 amirmohammadraei/cloud-services

Familiarity with some cloud services

20
Experimental
5471 RiteshGenAI/openai_whisper_transcribe_yt_videos

This project is a Streamlit-based application that allows users to download...

20
Experimental
5472 mastashake08/OCRTTS

Javascript package that uses the TextDetector API and Speech Synthesis to...

20
Experimental
5473 IG-onGit/TexeT

TexeT is the tool you need to take your interaction and content control to...

20
Experimental
5474 techiaith/docker-deepspeech-cy

Hyfforddi modelau adnabod lleferydd Cymraeg gyda Mozilla DeepSpeech // Train...

20
Experimental
5475 juancarlospaco/nim-espeak

Nim Espeak NG wrapper, for super easy Voice and Text-To-Speech

20
Experimental
5476 CSroseX/PizzAI-EmbeddableAI-Project

Experiments with building an AI-powered web app using Flask, integrating...

20
Experimental
5477 sap1119/voice_agent_0.02

An open‑source voice AI platform for building real‑time, scalable, and...

20
Experimental
5478 neosun100/orpheus-tts-docker

Production-ready Docker deployment for Orpheus TTS with GPU management,...

20
Experimental
5479 ArpitaChatterjee/Covid-19-Tracker-with-VoiceAssistant

Built a Covid-19 tracker in python, where data of total no. of cases, total...

20
Experimental
5480 matthiaaas/otto-assistant

Voice assistant called "Otto"

20
Experimental
5481 egorsmkv/flashlight-ukrainian

The Ukrainian Acoustic Model for Flashlight

20
Experimental
5482 osandadeshan/MySight

Android application for blind community to read books, papers, shopping...

20
Experimental
5483 TTomas65/Text-to-Speech-with-AI

A simple web application that uses OpenAI's GPT-4o mini TTS (text-to-speech)...

20
Experimental
5484 neosun100/fish-speech

🐟 Advanced multilingual Text-to-Speech system with speaker management,...

20
Experimental
5485 Vagabond-K/Speechabler

루게릭병 환우의 목소리 프로젝트

20
Experimental
5486 awesome-german/speaking

Resources and methods to improve spoken German, pronunciation, and real-life...

20
Experimental
5487 serkanyasr/vocavoice

AI-powered podcast generator for language learners. Creates custom scripts...

20
Experimental
5488 MohammadarefAhmadpoor/Speech-translation

Speech recognition, language detection, translation, and speech synthesis

20
Experimental
5489 sudarsan15/speech-sentiment-analyser

Speech Sentiment Analyser is a ML & AI based tool to help analyse the user...

20
Experimental
5490 lwdovico/zonos

Basic Zonos setup for seamless integration with multiple sentence inference tasks.

20
Experimental
5491 lukinhas-programando/ace-step-studio

🎵 Create and manage local-first AI-powered music with a fast, self-hosted...

20
Experimental
5492 kyegomez/AST

Implementation of AST from the paper: "AST: Audio Spectrogram Transformer'...

20
Experimental
5493 belambert/cl-asr

A (not entirely working) stand-alone speech recognizer written in Common Lisp

20
Experimental
5494 xAlpharax/whisper-stt-gradio

Gradio Interface for Transcription and Translation using the Whisper Large...

20
Experimental
5495 victorwoo/transcript-video

A PowerShell script that automatically generates subtitles in bulk for video...

20
Experimental
5496 darshkaushik/cough-it

Cough It is an android app that leverages deep learning and acoustics to...

20
Experimental
5497 dbry/skipper

Detection and selective purging of talk or music in audio streams

20
Experimental
5498 NeuralForge6000/steve-voice-assistant

Secure voice assistant powered by OpenAI Whisper & Google Gemini AI....

20
Experimental
5499 lzfelipe/discord-ai-tts-bot

Discord Bot that combines functionalities from Eleven Labs and OpenAI API.

20
Experimental
5500 kamtasingh27/minor

BAE - Being Assistant Eyes - An App for the Visually Impaired People with...

20
Experimental
« Prev 1 2 3 53 54 55 56 57 80 81 82 Next »