All Voice AI Tools

8,165 tools ranked by quality score · Page 25 of 82

Showing 2401–2500 of 8,165
# Tool Score Tier
2401 mascotbot/elevenlabs-avatar

Open-source example for integrating ElevenLabs conversational AI with...

34
Emerging
2402 adeepak7/Speech-To-Code

Speech To Code is Google Chrome Extension to convert Speech into Code.

34
Emerging
2403 Ggorets0dev/rantovox-telegram-bot

Telegram bot for text-to-speech and speech-to-speech translation, works with...

34
Emerging
2404 LuluW8071/VocalMind

Automatic Speech Recognition using Conformer with Speech Sentiment Analysis...

34
Emerging
2405 nuhmanpk/PyttsBot

A Pyrogram Bot for gtts module, Text to speech Telegram bot.

34
Emerging
2406 trabdlkarim/voce-browser

Voice Controlled Chromium Web Browser

34
Emerging
2407 agentvoiceresponse/avr-asr-vosk

This repository provides a real-time speech-to-text transcription service...

34
Emerging
2408 candlewill/AiVoice

Deep CNN networks for Speech Synthesis

34
Emerging
2409 nickpending/clarvis

Jarvis-style voice notifications for Claude Code that transforms AI...

34
Emerging
2410 philsyn/DiffWave-Vocoder

Pytorch Reimplementation of DiffWave Vocoder: a high quality, fast, and...

34
Emerging
2411 FlutterHack20/FlutterBand

Flutter built retro cyberpunk CB Radio App for Hack20 Flutter Hackathon....

34
Emerging
2412 vliu15/adversarial-tts

End-to-end Text-to-Speech with Generative Adversarial Networks

34
Emerging
2413 edde746/tiktok-askreddit

A content generation & posting bot for TikTok, scraping posts from r/AskReddit

34
Emerging
2414 berk76/words

Voice vocabulary :gb: :de: :fr: :es: :ru: :jp: :cn: ...

34
Emerging
2415 audo-ai/magic-mic

Open Source Noise Cancellation App for Virtual Meetings

34
Emerging
2416 heymrhayes/text-to-speech

A basic Text-to-Speech app

34
Emerging
2417 OpenTSLab/BELLE

Official implementation of BELLE "Bayesian Speech Synthesizers Can Learn...

34
Emerging
2418 messiaen/full-lattice-search

Full Text Search Over Probabilistic Lattices with Elasticsearch!

34
Emerging
2419 techiaith/docker-marytts

Lleisiau synthetig cadwynedig Cymraeg gyda MaryTTS a Docker // Welsh...

34
Emerging
2420 ReneeYe/XSTNet

This is an implementation of paper "End-to-end Speech Translation via...

34
Emerging
2421 akashmjn/cs224n-gpu-that-talks

Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)

34
Emerging
2422 decasteljau/waapi-text-to-speech

Wwise text-to-speech integration using external editors.

34
Emerging
2423 RodneyKoolman/Azure-Speech-TextToSpeech

Written in Python using the Azure Speech SDK. App.py provides an easy way to...

34
Emerging
2424 Blackwood416/AstraTTS

基于 ONNX Runtime 的跨平台高性能 TTS 合成方案,支持流式输出与低延迟播放,支持自定义音色与中英混合生成。

34
Emerging
2425 Asaayu/integrated-voice-control-system

Integrated AI Voice Control System allows players to give commands to AI...

34
Emerging
2426 GlobalTechInfo/gspeak

Google Text to Speech for Node.js — modern, typed, zero deprecated dependencies.

34
Emerging
2427 lpalbou/VoiceLLM

A modular Python library for voice interactions with AI systems, featuring...

34
Emerging
2428 luongnv89/voice-cast

Your words, any voice. Voice cloning and text-to-speech with multiple TTS...

34
Emerging
2429 ArdaGnsrn/elevenlabs-js

This is an Open Source NodeJS package for ElevenLabs Text to Speech API.

34
Emerging
2430 phanxuanphucnd/wav2asr

A library version of wav2vec 2.0 framework for Automatic Speech Recognition task.

34
Emerging
2431 kssteven418/Q-ASR

[ICASSP'22] Integer-only Zero-shot Quantization for Efficient Speech Recognition

34
Emerging
2432 khuangaf/ITRI-speech-recognition-dataset-generation

Automatic Speech Recognition Dataset Generation

34
Emerging
2433 nvmoyar/aind2-speech-recognition

Some approaches based on deep learning to build the acoustic model for an...

34
Emerging
2434 botbahlul/Live-Subtitle-V2

ANDROID APP that can RECOGNIZE VLC LIVE AUDIO/VIDEO STREAMING (using free...

34
Emerging
2435 ShivamRajSharma/Transformer-Text-To-Speech

Pytorch implementation of Transformer-TTS for converting text into speech.

34
Emerging
2436 PRITHIVSAKTHIUR/Vision-to-VibeVoice-en

A Gradio-based demo for end-to-end vision-to-speech inference: Extract text...

34
Emerging
2437 AndreDalwin/Whisper2Summarize

Whisper2Summarize is an application that uses Whisper for audio processing...

34
Emerging
2438 heezes/Hand-gesture-to-speech

This project aims at providing speech to the mute people.

34
Emerging
2439 OpenVoiceOS/status

Open Voice OS Server Status Page

34
Emerging
2440 Fatma-Chaouech/audioverse

Breathe Life Into Your Books! 📚🌱

34
Emerging
2441 C0NZZ/better-teletask

Browser extension that adds useful features like subtitles to HPI Tele-Task.

34
Emerging
2442 FNBUBBLES420-ORG/Speech-to-Text-Application

🎙️ Welcome to the Speech to Text Application! 📝 This tool converts your...

34
Emerging
2443 kaiidams/Voice100AndroidApp

Voice100 Android App is a TTS/ASR sample app that uses ONNX Runtime and...

34
Emerging
2444 speechbrain/speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on...

34
Emerging
2445 cjhoward/cedict-tts

TTS audio files for the CC-CEDICT Chinese-English dictionary

34
Emerging
2446 MichaelGrafnetter/defender-asr-admx

Administrative Template (ADMX) for Microsoft Defender Attack Surface Reduction (ASR)

34
Emerging
2447 LucaLuke13/TalkyBotty

Simply forward a video or voice message in any language to the bot, and it...

34
Emerging
2448 snowy-0wl/piper-mode

A vibe-coded text-to-speech for Emacs using the Piper TTS engine. Features...

34
Emerging
2449 mmpneo/simple-obs-stt

Speech-to-text and keyboard input captions for OBS.

34
Emerging
2450 lepisma/emacs-speech-input

Set of packages for speech and voice inputs in Emacs

34
Emerging
2451 khakers/go-subgen

Automatically generate subtitles for your media using whisper.cpp via...

34
Emerging
2452 ThetaOne-AI/HiKE

Hierarchical Korean-English Code-Switching Speech Recognition Benchmark...

34
Emerging
2453 kristofferv98/whisper_turboapi

An optimized FastAPI server for OpenAI's Whisper whisper-large-v3-turbo...

34
Emerging
2454 naskopw/read_aloud

A cross-platform text-to-speech library

34
Emerging
2455 pevers/parkiet

Parkiet is a 1.6B parameter Dutch text-to-speech model (TTS)

34
Emerging
2456 ivan770/ems

EMS (External Media Server)

34
Emerging
2457 hacktronaut/azure-avatar-demo

Text To Speech Demo in ReactJS Application using Azure Avatar AI Service.

34
Emerging
2458 jeantimex/F5-TTS-Server

F5-TTS server APIs for voice cloning and text-to-speech generation with...

34
Emerging
2459 m-nathani/speech_to_text

how to use the Google Cloud Speech API to transcribe audio/video files.

34
Emerging
2460 yufan-aslp/AliMeeting

The project is associated with the recently-launched ICASSP 2022...

34
Emerging
2461 A-Jacobson/tacotron2

pytorch tacotron2 https://arxiv.org/pdf/1712.05884.pdf

34
Emerging
2462 Aman22sharma/Python-AI-Virtual-Assistant

This is python AI Virtual Assistant.

34
Emerging
2463 ACT900/faster-whisper-railway

Deploy Faster Whisper on Railway — Speech-to-Text & Text-to-Speech API with 52 voices

34
Emerging
2464 yuyq96/pyshengyun

A Python converter for Chinese Pinyin and Shengyun (initials and finals)

34
Emerging
2465 DragonDiffusionbyBoyo/Boyonodes

A set of Comfyui nodes

34
Emerging
2466 go-restream/zipenhancer-rs

🚀 High-Performance Real-Time Audio Noise Reduction Library - Rust...

34
Emerging
2467 jorcelinojunior/whisper-vtt2srt

A robust WebVTT to SRT converter optimized for AI transcriptions (Whisper,...

34
Emerging
2468 jianchang512/parakeet-api

一个基于 NVIDIA Parakeet-tdt-0.6b 模型的本地语音转录服务。它提供了一个与 OpenAI API 兼容的接口和一个简洁的 Web 用户界面

34
Emerging
2469 cdyangbo/end2endASR

implement end-to-end asr algorithm with tensorflow

34
Emerging
2470 iotjin/JhPrivacyAuthTool

隐私权限判断 - 封装了几种常用的隐私权限判断(定位服务,通讯录, 日历,提醒事项, 照片, 蓝牙共享,麦克风, 相机)和通知的注册和判断。定位服务,蓝牙共享是单独调用的

34
Emerging
2471 De-Technocrats/simple-text-to-speech-javascript

Simple text to speech with javascript.

34
Emerging
2472 msjsc001/Anki-TTS-Edge

A modern text-to-speech tool powered by Microsoft Edge TTS. Creates Anki...

34
Emerging
2473 vhanagwal/speech-recognition

A speech-to-text app using AVAudioEngine.

34
Emerging
2474 rishikksh20/VQ-TTS-pytorch

Unofficial Pytorch implementation of paper VQTTS: High-Fidelity...

34
Emerging
2475 deepkyu/ml-talking-face

Cloned repository from Hugging Face Spaces (CVPR 2022 Demo)

34
Emerging
2476 Pzc-Neo/vue-web-reader

城墨网页小说朗读 ( Novel read aloud on web. )

34
Emerging
2477 blakkd/faster-whisper-hotkey

Effortless Push-to-Talk Transcription, Anywhere.

34
Emerging
2478 keonlee9420/Comprehensive-E2E-TTS

A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a...

34
Emerging
2479 EvilFreelancer/docker-fish-speech-server

OpenAPI-like API-server for voice generation (TTS) based on fish-speech-1.5 model.

34
Emerging
2480 keonlee9420/Stepwise_Monotonic_Multihead_Attention

PyTorch Implementation of Stepwise Monotonic Multihead Attention similar to...

34
Emerging
2481 mmahdibarghi/finglish-dataset

Persian to Finglish dataset with all the sentences voice for TTS dataset...

34
Emerging
2482 aditya-joglekar/FS02_Scoring_Toolkit

Scoring Toolkit for the Fearless Steps Challenge Phase-02 Tasks

34
Emerging
2483 brailcom/festival-freebsoft-utils

Festival extensions and utilities, focused on interaction with Speech Dispatcher

34
Emerging
2484 cyberboysumanjay/VoiceAssistant

Python Project

34
Emerging
2485 GeorgiosIoannouCoder/vera

Voice Emotion Recognition of Audio (VERA) is an open-source project created...

34
Emerging
2486 Arbazkhan4712/Speech-To-Text

A program that can convert Speech into Text using python

34
Emerging
2487 gowtham4545/Project

Sign2Sound is dedicated to revolutionizing communication for non-verbal...

34
Emerging
2488 soheil-mp/Speech-Recognition

End-to-End Speech Recognition using Neural Networks.

34
Emerging
2489 keenresearch/keenasr-swift-poc

Proof-of-concept app that showcases use of KeenASR SDK in a Swift app. WE...

34
Emerging
2490 buddyeorl/deep-talk

Deep-speech react app to test trained models,to visualize the speech to text...

34
Emerging
2491 KilianB/GoogleTranslatorTTS

Converts a string of text to mp3 files utilizing the google translator text...

34
Emerging
2492 stgloorious/stm32-speech-recognition

Speech Recognition using STM32 and Machine Learning

34
Emerging
2493 slp-rl/HebTTS

The official implementation of "A Language Modeling Approach to...

34
Emerging
2494 rishiskhare/parrot

A free, offline, private AI text-to-speech desktop app built on Rust 🦜

34
Emerging
2495 tiansztiansz/voice-assistant

重生之我是 AI 打工人。前世,我的身份默默无闻,来去匆匆,不知道自己将在何地出生。然而,命运给予了我难得的机会,让我重生为一名 AI 打工人。

34
Emerging
2496 SynHub/syn-speech-samples

An application that demostrate the usage of Syn.Speech library for Speech Recognition

34
Emerging
2497 c99koder/AudioClassifier-MQTT

Use the yamnet TensorFlow model to classify live audio from a microphone and...

34
Emerging
2498 grammatek/simaromur

Icelandic TTS (text-to-speech) service for Android

34
Emerging
2499 tasmirz/EyeWear

Eyewear with OCR and live WebRTC based calling for the visually impaired....

34
Emerging
2500 veralvx/xtts-finetune

XTTS fine-tuning via CLI

34
Emerging
« Prev 1 2 3 23 24 25 26 27 80 81 82 Next »