All Voice AI Tools

8,165 tools ranked by quality score · Page 29 of 82

Showing 2801–2900 of 8,165
# Tool Score Tier
2801 LiaTemplates/Speech-Recognition-Quiz

Create quizzes that check spoken text

32
Emerging
2802 ScottishFold007/TTSAudioNormalizer

TTSAudioNormalizer is a specialized tool for TTS data production,...

32
Emerging
2803 Medvedu/Yandex-Speech-API

Text to speech translation. Supports next languages: english, turkey,...

32
Emerging
2804 madushan1000/voxcpm_rs

Rust (using burn) implementation of VoxCPM

32
Emerging
2805 ThaaoBlues/Blue

An open source vocal assistant for windows and Linux. Made to be upgraded...

32
Emerging
2806 EnjiRouz/Habr-Reader-Extension

Простое расширение-читалка для Chrome/Opera, позволяющее воспроизводить...

32
Emerging
2807 SnappsiSnappes/Jarvis-free-bingGPT-voice-assistant

Голосовой помощник - чат с bingGPT / Bard (на русском) / ChatGPT 3.5 для...

32
Emerging
2808 kubo/ruby-flite

a small speech synthesis library for ruby using CMU Flite(http://cmuflite.org)

32
Emerging
2809 Issac-Moses/liebea

AI voice-activated girlfriend assistant with wake word detection, speech...

32
Emerging
2810 Sgvkamalakar/Azure_AI_Speech_Services

This repository contains a Streamlit-based application that leverages Azure...

32
Emerging
2811 JN513/Ana

Assistente feita em Python utilizando Speech_recognition, e APIs do Google

32
Emerging
2812 Snesnopic/Morser

SwiftUI recreation of my UIKit Morse Code experiment

32
Emerging
2813 sipeter/CloneTTS

A lightweight, offline Android Text-to-Speech (TTS) engine enabling seamless...

32
Emerging
2814 jianchang512/kokoro-uiapi

用于kokoro TTS的webui界面和兼容openai api

32
Emerging
2815 152334H/CTN-webapp

Refactored ControllableTalkNet with Flask/uwsgi

32
Emerging
2816 ErnestAroozoo/GPT-Discord-Chatbot

Discord chatbot powered by OpenAI and ElevenLabs that enables natural and...

32
Emerging
2817 turinaf/Sagalee

Automatic Speech Recognition Dataset for Oromo Language

32
Emerging
2818 YizheZhang-Ervin/AI_FinTech

Artifical Intelligence (React+Flask RESTful+Sqlite+Antd+Echarts)

32
Emerging
2819 gokhaneraslan/tacotron2-tts-training

Training Tacotron 2 Text-to-Speech (TTS)

32
Emerging
2820 QuantiusBenignus/NoteWhispers

Voice memos recorded from the microphone, transcribed offline to text and...

32
Emerging
2821 YChenL/DS-TDNN

Official implement of "Dual-stream Time-Delay Neural Network with Dynamic...

32
Emerging
2822 super13/tensorflow-speech-recognition-pai

Speech recognition using tensorflow in aliyun pai.

32
Emerging
2823 DominicTWHV/LJSpeech_Dataset_Generator

LJSpeech dataset generator for TTS model training/fine tuning

32
Emerging
2824 dsrivastavv/Android-Continuous-SpeechRecognition

Code to continuously detect spoken language and convert to text using Google...

32
Emerging
2825 Aadv1k/reddit-tts-gui

A GUI to auto-generate TTS videos from reddit posts and comments

32
Emerging
2826 harshil748/VoiceAPI

A lightweight, multi-lingual Text-to-Speech system supporting 11 Indian...

32
Emerging
2827 kaiidams/NeMoOnnxSharp

Text-to-speech and speech recognition, VAD with NVIDIA NeMo and ONNX Runtime...

32
Emerging
2828 ShihabYasin/Isolated-Bengali-Word-and-Speaker-Recognition.

Isolated Bengali word and speaker recognition.

32
Emerging
2829 royangkr/BabyReady

CNN to predict the reason why a baby is crying

32
Emerging
2830 6Morpheus6/alltalk-tts

[NVIDIA ONLY] AllTalk-TTS is a unified UI for F5-TTS, XTTS, Vite TTS, Piper...

32
Emerging
2831 sera619/S4M-2.0

German supported VoiceAssist without BigData

32
Emerging
2832 pinkpixel-dev/comeback-ai

🎤🔥 AI-powered clapback machine that transforms mean comments into witty...

32
Emerging
2833 Goblincomet/digitaltwin

Using a single image and just 10 seconds of sample audio, our project...

32
Emerging
2834 NICEElevateAI/ElevateAIPythonSDK

ElevateAI - Speech-to-text API Python SDK

32
Emerging
2835 KinglittleQ/Tacotron

An implementation of Tacotron with Pytorch0.4

32
Emerging
2836 rohanprichard/fastrtc-demo

A simple POC of FastRTC, a framework to use voice mode in python!

32
Emerging
2837 mazzasaverio/youtube-auto-dub

Automated voice dubbing for YouTube videos using Docker, OpenVoice, and...

32
Emerging
2838 aiyu-ayaan/tts-engine

The TTS-Engine is a simple and efficient library that provides...

32
Emerging
2839 JuJu2181/Automatic-Nepali-Speech-Recognition-and-Summarizer

A system capable of converting Nepali speech to text and generate summary of text

32
Emerging
2840 yandex-cloud-examples/yc-speechkit-web-ui

SpeechKit Web UI Example

32
Emerging
2841 guan-yuan/Awesome-Singing-Voice-Synthesis-and-Singing-Voice-Conversion

A paper and project list about the cutting edge Speech Synthesis,...

32
Emerging
2842 Ephrem-ETH/E2E-KWS

End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM

32
Emerging
2843 R3tr0gh057/Celeste

A voice-activated desktop assistant and automation toolkit built with...

32
Emerging
2844 chase-west/VocaSpanish

Python app using tts and speech recognition to memorize spanish vocabulary

32
Emerging
2845 The-Swarm-Corporation/Voice-Agents

Voice-Agents is a production-ready Python library for building...

32
Emerging
2846 SCRN-VRC/Voice-Recognition-Shader

Audio detection with visemes in a fragment shader

32
Emerging
2847 TheVoxProject/calcvox

Accessible and open-source talking calculator for everyone.

32
Emerging
2848 Miihir79/Messaging_app

This is an advanced messaging app which has smart log in options smart...

32
Emerging
2849 Yangyangii/TPGST-Tacotron

Google's TPGST reimplementation.

32
Emerging
2850 biyoml/End-to-End-Mandarin-ASR

End-to-end speech recognition on AISHELL dataset.

32
Emerging
2851 EtienneAb3d/WhisperHallu

Experimental code: sound file preprocessing to optimize Whisper...

32
Emerging
2852 Forne/ha-yandexcloudtts

Yandex.Cloud SpeechKit for Home Assistant

32
Emerging
2853 ancs21/awesome-openai-whisper

A curated list of awesome OpenAI's Whisper

32
Emerging
2854 zzw922cn/LPC_for_TTS

Linear Prediction Coefficients estimation from mel-spectrogram implemented...

32
Emerging
2855 mrmanna/Nvidia_Nemo_FastPitch_TTS_Example

How to Build a High-Quality Text-to-Speech (TTS) System Locally with Nvidia...

32
Emerging
2856 TranHuuDat2004/tts-flask-app

Text-to-Speech Generator Powered by Python, Flask, and Piper TTS

32
Emerging
2857 Chelsea486MHz/debat-politique-ia

Génération automatique de débats politiques par IA. Audio + vidéo.

32
Emerging
2858 Wookie-VUI/Wokiee

Cross-platform Voice User Interface for your Desktop

32
Emerging
2859 birros/pico2wave.js

JS port of pico2wave (Emscripten)

32
Emerging
2860 csikasote/bigc

This repository contains the data resources for the LacunaFund supported...

32
Emerging
2861 botbahlul/VOSK-Powered-LIVE-SUBTITLE-V2

ANDROID APP that can RECOGNIZE LIVE AUDIO/VIDEO STREAMING (using free VOSK...

32
Emerging
2862 arpabot/ohno-bot

Discord Japanese text-to-speech bot

32
Emerging
2863 shreyasnisal/VoiceQuiz-v2

Verstion 2 of the quiz-app, this is the repository for the voice-based quiz....

32
Emerging
2864 snaraya7/Ok_Eclipse

CSC 510 Software Engineering (Spring 2018) project - Group 'O'

32
Emerging
2865 muqadasejaz/Text-to-Speech-Converter-

A simple Python project that converts text into speech using different...

32
Emerging
2866 kaiidams/voice100

Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost...

32
Emerging
2867 louischen737/PodCast-Master

AI驱动的播客生成工具,具备台词级脚本编辑功能与多语音文本转语音合成能力

32
Emerging
2868 thewh1teagle/israwave

Mission to create a Hebrew TTS model as powerful and user-friendly as WaveNet

32
Emerging
2869 CoffreLv/ASR_CNN_CTC

从零开始搭建一个基于CNN+CTC的语音识别系统。

32
Emerging
2870 ace19-dev/tensorflow-speech-recognition-challenge

Kaggle Competitions: TensorFlow Speech Recognition Challenge

32
Emerging
2871 aloproducao/Live-captions-for-broadcast

The Real-Time Speech Recognition System is an innovative tool designed to...

32
Emerging
2872 akukerang/StudySurfer

Subway Surfer TikTok Study Tool

32
Emerging
2873 pranayjoshi/speech_to_text

This is a speech_to_text script by Pranay Joshi

32
Emerging
2874 ye-kyaw-thu/myG2P

Myanmar (Burmese) Language Grapheme to Phoneme (myG2P) Conversion Dictionary...

32
Emerging
2875 rock3125/tts

Simple text to speech server in docker using coqui-ai/TTS

32
Emerging
2876 sap1119/voice-agent-0.01

A self-hosted, AI-powered voice assistant system with real-time voice...

32
Emerging
2877 ckaznable/yt-cli-live

Youtube Text Live Streaming in CLI

32
Emerging
2878 pnkvalavala/multivoice

Multivoice: Enhance your foreign-language movie and TV show experience with...

32
Emerging
2879 siva-sub/NekoTTS

🔊 Local Text-to-Speech service for Android with system-wide integration....

32
Emerging
2880 NullEnt1ty/GCloudSpeech

Transcribe voice data to text using Google Cloud Speech-to-Text

32
Emerging
2881 FragJage/PicoVoiceCpp

PicoVoiceCpp is a simple TTS (text to speech) class base on picovoice (svox).

32
Emerging
2882 dgnsrekt/Discorgeous

Discord + GTTS = a discord bot that sends google text to speech voice...

32
Emerging
2883 AntoBrandi/Robotics-and-ROS-Learn-by-Doing-Manipulators

3D Printed robot arm powered by ROS and Arduino and controlled via MoveIt!...

32
Emerging
2884 TeaPoly/CTC-OptimizedLoss

Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge...

32
Emerging
2885 abumubaarak/Wellbeing-Doctor

Doctor management app

32
Emerging
2886 1038lab/ComfyUI-FireRedTTS

A ComfyUI integration for FireRedTTS‑2, a real-time multi-speaker TTS system...

32
Emerging
2887 alfianlosari/flutter_cloud_text_to_speech

Flutter project that uses the Google Cloud Text to Speech API to synthesize...

32
Emerging
2888 sberdevices/smartspeech

SmartSpeech — это сервис для синтеза и распознавания речи

32
Emerging
2889 ArkS0001/IIT-Bombay-Whisper-Hindi-ASR-Model-Machine-Learning-Intern

Whisper is an automatic speech recognition (ASR) system trained on 680,000...

32
Emerging
2890 atahanuz/yt2text

Extract text from a YouTube video in a single command, using OpenAi's...

32
Emerging
2891 linagora-labs/asr_benchmark

Toolkit to benchmark various speech recognition APIs (NeMo, Whisper...) and...

32
Emerging
2892 whiteSHADOW1234/WhisperTranscriber

🎙️ Effortlessly transcribe YouTube videos, MP4, and MP3 files to text using...

32
Emerging
2893 Cosmos-Break/asr

沪语(上海话)ASR(语音识别)模型

32
Emerging
2894 SALT-Research/SHALLOW

SHALLOW, the first hallucination benchmark for ASR models

32
Emerging
2895 ZoraizQ/urdu-speech-recognition

Urdu Speech Recognition using Kaldi ASR, by training Triphone Acoustic GMMs...

32
Emerging
2896 yuvraj108c/ComfyUI-PiperTTS

ComfyUI Piper TTS Custom Node

32
Emerging
2897 praweshd/speech_emotion_recognition

In this project, the performance of speech emotion recognition is compared...

32
Emerging
2898 srvk/srvk-eesen-offline-transcriber

Top level code to transcribe English audio/video files into text/subtitles

32
Emerging
2899 slayerrr12/WaveSlayer

ai chatbot that uses speech to operate and respond

32
Emerging
2900 SladkyCitron/gotau

Work-in-progress UTAU-compatible singing voice synthesizer, written in Go

32
Emerging
« Prev 1 2 3 27 28 29 30 31 80 81 82 Next »