All Voice AI Tools

8,165 tools ranked by quality score · Page 28 of 82

Showing 2701–2800 of 8,165
# Tool Score Tier
2701 JhaAyush01/Multimodal-AI-Assistant

Multimodal AI Assistant with Google Gemini-1.5-pro, gTTS, PIL, and...

33
Emerging
2702 jindongwang/EasyEspnet

Making Espnet easier to use

33
Emerging
2703 aks-devs/mod_piper_tts

Freeswitch Text-to-Speech module

33
Emerging
2704 mvanzulli/MeetingAssistant.py

A local deployable version of an AI meeting assitant

33
Emerging
2705 b7s/whisper-php

State-of-the-art speech recognition to your PHP/Laravel applications

33
Emerging
2706 KickerMix/Discord-Local-LLM-VoiceChat-Bot

Saya Voice Assistant for Discord AI voice bot: listens, detects keywords,...

33
Emerging
2707 hwRG/FastSpeech2-Pytorch-Korean-Multi-Speaker

Multi-Speaker FastSpeech2 applicable to Korean. Description about train and...

33
Emerging
2708 TheProfessorsLab/Oracle-VocalAI-Interface-DISCONTINUED

A custom version of J.A.R.V.I.S. made to be my personal digital assistant...

33
Emerging
2709 MatheusKProt/SpeechToText

Este bot para telegram tem a função principal de transformar os seus áudios...

33
Emerging
2710 MysteryPancake/Discord-Lyrebird

[DEPRECATED] Text to speech Discord bot using the Lyrebird API

33
Emerging
2711 vroomai/vst

🎹 Generate sounds from words. Directly in your DAW.

33
Emerging
2712 TBETool/ibm-watson-tts-php

IBM Watson Text to Speech PHP Library to convert written text into...

33
Emerging
2713 kapi2800/qwen3-tts-mac

Optimized implementation of Qwen3-TTS for Apple Silicon (M1-M4)

33
Emerging
2714 TSG405/Automated-Email--BOT

This Bot can send emails to anyone, any number of times from a USER's...

33
Emerging
2715 t0mer/ttsbot

ttsbot is a Telepot powerd, easy to use Telegram bot allowing you to convert...

33
Emerging
2716 fvarrui/PowerPointToVideo

:clapper: PowerPoint to MP4 converter with synthesized interlocutor voice.

33
Emerging
2717 teamtee/LLM-ASR-Error-Correction

This is a framework for using large language models to improve ASR...

33
Emerging
2718 EvilFreelancer/docker-canary-serve

Canary-Serve is a FastAPI server with Docker support that provides an HTTP...

33
Emerging
2719 madzadev/voice-cue

📣 Find sentiments, tags, entities, and actions in your voice recordings instantly

33
Emerging
2720 Workplace-Futurists/DiScribe

An automated meeting transcriber which autonomously connects to scheduled...

33
Emerging
2721 surfaceyu/edge-tts-go

Use Microsoft Edge's online text-to-speech service from golang WITHOUT...

33
Emerging
2722 hannabdul/ldasr

Official repo for the paper "LDASR: An Experimental Study on Layer Drop...

33
Emerging
2723 bauyrzhanospan/VirtualAssistant

Virtual Assistant project done in the Middlesex University with Dr. Nawaz...

33
Emerging
2724 Rubiksman78/RenAI-Chat

VN Like Interface for Chatbots

33
Emerging
2725 koudounasalkis/Audio-Speech-Tutorial

This repository contains a short introduction on the topic of audio and...

33
Emerging
2726 egorsmkv/asr-corpus-creator

This app is intended to automatically create a corpus for ASR systems using...

33
Emerging
2727 DannyBen/voicemaker

Create Text to Speech files with the Voicemaker API from Ruby or the command line

33
Emerging
2728 Acelogic/Retrieval-based-Voice-Conversion-MLX

A pure MLX implementation of RVC for Apple Silicon, delivering 8.71x faster...

33
Emerging
2729 AI-TOOLKIT/VoiceData

Automatic Speech Recognition (ASR) Data Generator Toolkit

33
Emerging
2730 Tech-Cravers/Gesture-Speech

To develop an application which could be used by especially abled person to...

33
Emerging
2731 Yashkapure06/TextToSpeech-ChromeExtension

Text To Speech - Chrome Extension

33
Emerging
2732 arnobt78/In-Browser-ML-Speech-Transcription-Translation--NextJS-Frontend

An open-source, educational app for speech-to-text & text translation that...

33
Emerging
2733 Lev-etd/Multimodal-emotion-recognition

Audio-Visual Group Emotion Recognition in the wild using cross-modal attention

33
Emerging
2734 Ordyns/TextToSpeech-TikTokAPI

Small program that uses the TikTok API to convert text to speech

33
Emerging
2735 rn0x/TelegramWhisperer

بوت تيليجرام يعمل على تحويل الصوت إلى نص باستخدام نموذج Whisper، مع تحسينات...

33
Emerging
2736 lord-lethris/ComfyUI-lethris-dia2

ComfyUI custom nodes for the Dia2 TTS model — generate speech, timestamps,...

33
Emerging
2737 swarms/mozilla-common-voice

Swarms supports the Common Voice Project from Mozilla! This repo contains...

33
Emerging
2738 PranavMishra17/VoicePersona-Dataset

A comprehensive voice persona dataset for character consistency in voice...

33
Emerging
2739 amritsinghcse/Say-Hi

This Android app pronounces a word in different languages using TTS and...

33
Emerging
2740 zozonteq/yomiage-bot

RVCをサポートしたテキスト読み上げDiscordBot

33
Emerging
2741 speechly/slu-client

Interact with Speechly SLU API from the command line

33
Emerging
2742 ShawnPi233/SynParaSpeech

Official Repository of Paper: "SynParaSpeech: Automated Synthesis of...

33
Emerging
2743 Gust4voSales/Marvin-VirtualAssistent

A dinamic virtual assistent made with Python, you can easily add more voice...

33
Emerging
2744 gheyret/uyghur-asr-ctc

Speech Recognition for Uyghur using deep learning

33
Emerging
2745 tsukumijima/TarakoTalk

Cross-platform CLI TTS Tools for Hiroyuki's Voice

33
Emerging
2746 ALERTua/styletts2-ukrainian-openai-tts-api

OpenAI TTS Compatible Ukrainian TTS StyleTTS2 Pipeline

33
Emerging
2747 habitual69/speakify

Speakify is a web application that uses Edge TTS to convert text to speech...

33
Emerging
2748 Foxify52/RVG_tts

A retrieval based voice generation text to speech

33
Emerging
2749 RapDoodle/Web-Real-Time-Speech-Recognition-with-Azure

An example project that provides a web interface to real-time speech-to-text...

33
Emerging
2750 DeepSwissVoice/DeepVoice

A TensorFlow implementation of Baidu's DeepSpeech architecture

33
Emerging
2751 kaiidams/Voice100Sharp

Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost...

33
Emerging
2752 Shyguy99/Whatsapp-bot

A simple WhatsApp Bot made using open-wa library with some additional features.

33
Emerging
2753 kromme/Teams-Notetaker

Let AI create the notes of your Teams Meeting

33
Emerging
2754 Unicorn-Commander/Unicorn-Orator

🦄 Text-to-Speech offloaded to iGPU and/or NPU

32
Emerging
2755 JesusGautamah/chatgpt_assistant

ChatGPT Virtual Assistant to Telegram and Discord with Voice Recognition

32
Emerging
2756 EX3exp/MiriVoice

Open-Free TTS Platform For All

32
Emerging
2757 arunk140/serve-piper-tts

Go Lang API Wrapper around Piper TTS - Supports TTS Inference and List of Voices

32
Emerging
2758 Leapward-Koex/Namida-OCR

A purely browser based OCR tool designed recognizing, copying, and...

32
Emerging
2759 amanda-emerick/guess-the-animal

:monkey_face: Guess the Animal :frog: is a didactic game developed for...

32
Emerging
2760 speechpro/speechpro-cloud-asr-examples

Примеры использования Beta-версии gRPC API потокового распознавания речи в ЦРТ Облаке

32
Emerging
2761 Jor02/DectalkNET

Use the Dectalk voice sythesizer directly in .NET applications

32
Emerging
2762 Syduan0921/Muliti-Role_Cosyvoice2

🤖一键部署,利用TTS与LLM将长文本小说转化为多角色音/视频。

32
Emerging
2763 Cabbagito/Fine-Tuning-Whisper-on-LibriSpeech

The code for fine-tuning OpenAI's Whisper model on the LibriSpeech dataset.

32
Emerging
2764 codejs-kr/stt.js

Speech To Text library for browser 🎤

32
Emerging
2765 arthurfortes/speech2text_keras

This repository reports how to build a speech to text model to recognize...

32
Emerging
2766 shinchanat/Py

Pyreader is a python project created for reading pdf and text files by applying tts.

32
Emerging
2767 mush42/leanspeech

Unofficial pytorch implementation of LeanSpeech: The Microsoft Lightweight...

32
Emerging
2768 mathquis/node-picotts

SVOX PicoTTS binding for Node.js

32
Emerging
2769 biaji/kokoro-tts

基于Kokoro的Android TTS引擎

32
Emerging
2770 osteele/speech-provider

A unified TypeScript interface for browser speech synthesis and Eleven Labs...

32
Emerging
2771 zhongyuchen/speech-classification

CNN and VGG speech classification with interactive website for testing

32
Emerging
2772 ArthurBabkin/Parimate

A Telegram bot for validating audio and video content using CV models, SR...

32
Emerging
2773 Anwarvic/Arabic-Speech-Recognition

This repository contains my attempt to use two famous speech recognition...

32
Emerging
2774 Deimos-M/DL-Virtual-Assistant

It is a virtual assistant for visually impaired which include models like...

32
Emerging
2775 arthurxlw/cytonNss

Cyton Online Neural Sentence Segmentation for Simultaneous Interpretation

32
Emerging
2776 KiLJ4EdeN/Persian_Speech_To_Text

Simple Speech to text prototype using google api

32
Emerging
2777 manascb1344/zonos-api

Production-ready FastAPI wrapper for Zonos TTS models with GPU acceleration,...

32
Emerging
2778 egorsmkv/speech-recognition-uk

🇺🇦 Speech Recognition & Synthesis for Ukrainian

32
Emerging
2779 cyrta/broadcast-news-videos-dataset

Collection of broadcast news video clips

32
Emerging
2780 MahtaFetrat/VirgoolInformal-Speech-Dataset

A dataset of informal Persian audio and text chunks, along with a fully open...

32
Emerging
2781 debelopumento/phaser-test

A voice controlled runner game for Chrome

32
Emerging
2782 IbrokhimN/IJAI

IJAI is a modular AI assistant that supports text and voice interactions...

32
Emerging
2783 KennethanCeyer/awesome-audio-speech

Awesome list of Audio, Speech, and DSP(Digital signal processing)

32
Emerging
2784 ibelgin/Text-To-Speech-App

This App is Made Using React Native.

32
Emerging
2785 ShawnPi233/HQ-SVC

Official Repository of Paper: "Towards High-Quality Zero-Shot Singing Voice...

32
Emerging
2786 consulfedor/VoiceGrab

🎙️ Voice-to-Text Bridge for AI & Any Application. Record voice → Get text →...

32
Emerging
2787 bougieL/tts-fluent

Text to speech

32
Emerging
2788 DavidBradbury/tts-assistant

TTS Assistant: A front-end app utilizing OpenAI's TTS API. Easily input text...

32
Emerging
2789 Baibhav-nag/SER-using-MLP-and-CNN

Speech emotion recognition using MLP and CNN on four benchmark datasets...

32
Emerging
2790 csikasote/bembaspeech-exps

Bemba ASR model obtained by fine-tuning a well performing DeepSpeech English...

32
Emerging
2791 karkranikhil/voice-notes

Voice Note taking app using Svelte.

32
Emerging
2792 n0an/VivaDicta

Voice Transcription, Reimagined

32
Emerging
2793 korniichuk/google-speech

QuickStart. Google Cloud Speech-to-Text API with Python

32
Emerging
2794 helemanc/ambient-intelligence

Application for Disruptive Situations Detection in public transports through...

32
Emerging
2795 isthistechsupport/tts_for_discord

Using Discord.py and the Azure Cognitive Services Python SDK to bring Azure...

32
Emerging
2796 nilakshdas/ADAGIO

Adversarial Defense for Audio in a Gadget with Interactive Operations

32
Emerging
2797 daymade/chattts-seed-example

这是一个 ChatTTS 音频仓库,包含用不同 seed 生成的不同音色,你可以方便地挑选你喜欢的 seed。

32
Emerging
2798 SharkyRawr/go-tiktok-tts

Go library for TikToks Text2Speech engine

32
Emerging
2799 othneildrew/open-whisperer

AI Video Translator and Subtitler

32
Emerging
2800 jarmitage/tts-cli

Simple CLI app for TTS

32
Emerging
« Prev 1 2 3 26 27 28 29 30 80 81 82 Next »