All Voice AI Tools

8,165 tools ranked by quality score · Page 26 of 82

Showing 2501–2600 of 8,165
# Tool Score Tier
2501 sandy1990418/ChineseTaiwaneseWhisper

This repository focuses on leveraging OpenAI's Whisper model for speech...

34
Emerging
2502 nhaouari/local11labs

Local11Labs allows generating high-quality text-to-speech and podcast...

34
Emerging
2503 LucaDe/text_to_speech_api

A simple wrapper for Google's Text-To-Spech API for Dart and Flutter projects.

34
Emerging
2504 Gaurav890/vocal-stack

vocal-stack is a high-performance utility library for developers building...

34
Emerging
2505 alkhimey/esp32-flite

Speech synthesis running on ESP32 based on Flite engine.

34
Emerging
2506 jiwidi/DeepSpeech-pytorch

Pytorch implementation for DeepSpeech 2.0

34
Emerging
2507 jianchang512/gemini-speech2srt

使用 Gemini AI 转写音视频为 SRT 字幕

34
Emerging
2508 ale-grassi/discord-elevenlabs-tts-bot

A simple Discord TTS bot that uses the Eleven Labs API

34
Emerging
2509 medokin/soundpad-text-to-speech

Text-To-Speech for Soundpad

34
Emerging
2510 hwk06023/SONATA

SONATA (SOund and Narrative Advanced Transcription Assistant): An advanced...

34
Emerging
2511 simalexan/speechy

Voice command tool for an easy web speech recognition for your web...

34
Emerging
2512 EuleMitKeule/speaker-recognition

Speaker recognition service for Home Assistant using voice embeddings. Train...

34
Emerging
2513 sskorol/respeaker-websockets

This project reveals full Respeaker Core V2 potential by using bundled...

34
Emerging
2514 JensBorrisholt/GoogleSpeak

This repository demonstrates how to Use Google for implementing Text to...

34
Emerging
2515 r1di/neutts-fastapi

OpenAI-compatible Text-to-Speech API server powered by NeuTTS. Drop-in...

34
Emerging
2516 makeabilitylab/ProtoSound

ProtoSound is a deployable interactive system for personalizing a sound...

34
Emerging
2517 ZhuoZhuoCrayon/AcousticKeyBoard-Web

❓声学键盘|脑洞大开:做一个能听懂键盘敲击键位的「玩具」,学习信号处理 / 深度学习 / 安卓 / Django。

34
Emerging
2518 Pallas1303/FestPB

FestPB é um projeto com objetivo de oferecer suporte ao Português Brasileiro...

34
Emerging
2519 Speech-to-text-Kafka-Airflow-Spark/StoTkas

Data engineering pipeline that allows recording millions of Amharic and...

34
Emerging
2520 Supremolink81/TTSCeleb

A TTS app where you can clone the voices of any person you wish.

34
Emerging
2521 felipefacundes/guglinatts

Guglina TTS é um sintetizador de voz, em português do Brasil, que lê telas...

34
Emerging
2522 teyang-lau/YOListenO

Building an AI-powered tool for auto converting audio from lectures/meetings...

34
Emerging
2523 laszukdawid/cracker

Usable GUI for text-to-speech services

34
Emerging
2524 freakingrocky/EmoCh

Emotion Analysis from Speech AI in Python using mfcc, mel, chroma

34
Emerging
2525 Jen-Hung-Ho/ros2_jetbot_voice

Jetbot Voice to Action Tools is a set of ROS2 nodes that utilize the Jetson...

34
Emerging
2526 ThisModernDay/f5-tts

F5-TTS is a web application that allows users to clone voices and generate...

34
Emerging
2527 nay-cat/LiveKit-PiperTTS-Plugin

Quick integration of Piper TTS (super lightweight, high-quality model) with LiveKit

34
Emerging
2528 shaheennabi/Multi-lingual-AI-Assistant-with-gTTS-and-Gemini-Pro

An end-to-end AI assistant using gTTS for multi-lingual text-to-speech and...

34
Emerging
2529 adrxLV/J.A.R.V.I.S.AI

A AI-powered voice assistant based on JARVIS using ollama.

34
Emerging
2530 sudonitin/Audio-book-generator

Convert your ebooks to audiobooks. 📖->🎧

34
Emerging
2531 TharanaBope/whisper-v3-diarization

Production-ready audio transcription & speaker diarization CLI & GUI using...

34
Emerging
2532 ctkqiang/ZhuYing

竹影是一款创新的视频语音转录与翻译工具,专注于提供高质量的视频音频转文字服务和多语言翻译功能。本项目采用先进的人工智能技术,为用户提供便捷的视频内容处理解决方案。

34
Emerging
2533 Dark2C/Viral-Faceless-Shorts-Generator

Automatically generate faceless YouTube Shorts from trending topics using AI...

34
Emerging
2534 ARAI-Telegram/teledash-backend-processing

Optional AI-powered features of Teledash, an open-source software for...

34
Emerging
2535 boochow/TFLite_Micro_MicroSpeech_M5Stack

M5Stack (ESP32) port of TensorFlow Lite for Microcontrollers demo "Micro Speech"

34
Emerging
2536 kaituoxu/Tacotron2

A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS)...

34
Emerging
2537 dfop02/auto-sub

Automatically subtitle a video from almost any language to your native...

34
Emerging
2538 rezkyatinnov/capetangjs

A JavaScript library for text to speech vice versa using Web Speech API

34
Emerging
2539 DePasqualeOrg/swift-tiktoken

A pure Swift implementation of OpenAI's tiktoken tokenizer

34
Emerging
2540 twangodev/speak-mintlify

Automatically generate voice narration for your Mintlify documentation.

34
Emerging
2541 upskyy/Paper-Review

Paper Review about Speech Recognition · NLP

34
Emerging
2542 vibhasdutta/PC-ASSISTANT

A voice-operated PC assistant for Windows , enabling hands-free control for...

34
Emerging
2543 tuanio/nextformer

PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For...

34
Emerging
2544 GENIVI/VCIVING-SpeechRecognition

GENIVI GSoC 2018 and 2019

34
Emerging
2545 GeoHaberC/Story-to-Video

Create a Movie animation plus Audio plus Subtitle from a text file

34
Emerging
2546 spandan114/AI-realtime-voice-agent

A Python-based real-time voice-to-voice conversation system that lets you...

34
Emerging
2547 Llamacha/asr-htk-quechua

ASR for quechua language is an open source which can run in real time using...

33
Emerging
2548 anooptoffy/DLJeju2018CodeRepoASR

Details on my work on using GANs for speech synthesis for improving Speech...

33
Emerging
2549 eazhary/dctts2

Deep Convolution Text to Speech

33
Emerging
2550 nowickam/facial-animation

Audio-driven facial animation generator with BiLSTM used for transcribing...

33
Emerging
2551 lucasnewman/e2-tts-mlx

Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive...

33
Emerging
2552 Bassamejlaoui/Voice-Cloning-Translation-Transcription

Voice cloning, a revolutionary technology, allows us to replicate and...

33
Emerging
2553 zoebchhatriwala/CamWord

CamWord Is an android application that uses character recognition and voice...

33
Emerging
2554 victor369basu/End2EndAutomaticSpeechRecognition

In this repository, I have developed an end to end Automatic speech...

33
Emerging
2555 aishoot/Multi-Hotword_Spotting

Won't it be cool to build a speech assistant like Alexa or Siri yourself...

33
Emerging
2556 pnkvalavala/digitaltwin

Using a single image and just 10 seconds of sample audio, our project...

33
Emerging
2557 prathamsolanki/gender-recognition-by-voice

Identify a voice as male or female.

33
Emerging
2558 tabahi/WebSpeechAnalyzer

JS speech analyzer for fast speech analysis and labeling

33
Emerging
2559 CypherousSkies/reading-for-listeners

A deep-learning powered accessibility application which turns pdfs into...

33
Emerging
2560 AASHISHAG/DeepSpeech-API

The code enables users to use Mozilla's Deep Speech model over the Web Browser.

33
Emerging
2561 bhattbhavesh91/speech-python-demos

pyttsx3 is a text-to-speech conversion library in Python. Its a Python-based...

33
Emerging
2562 Issac-Moses/Beacon

Beacon – A lightweight voice-controlled AI assistant using Whisper.cpp. ...

33
Emerging
2563 Enforcer03/voice-cloning

Voice cloning with tortoise-tts

33
Emerging
2564 HerambVD/spoken2written

A source of python package which converts language styles in speech to its...

33
Emerging
2565 MrAliHasan/Sophia-AI-Assistant

Sophia AI Assistant is a Python-based desktop AI that performs a variety of...

33
Emerging
2566 Ishan7390/Jarvis_AI

This is my attempt at building a not so much of an AI, Jarvis

33
Emerging
2567 Zuellni/Orpheus-GGUF

Orpheus-TTS inference.

33
Emerging
2568 thewh1teagle/vad-rs

Speech detection using silero vad in Rust

33
Emerging
2569 The-Data-Dilemma/MediBeng-Whisper-Tiny

MediBeng Whisper Tiny improves doctor-patient transcription by training the...

33
Emerging
2570 RF5/transfusion-asr

Transcribing Speech with Multinomial Diffusion, training code and models.

33
Emerging
2571 stellarloop/bitbat.ai

My father, a journalist, used to painstakingly transcribe interviews from a...

33
Emerging
2572 yakhyo/kokoro-onnx

Kokoro-82m TTS ONNX Runtime inference | Gradio Demo | HuggingFace Demo | Docker

33
Emerging
2573 rhulha/Speech2Speech

A web application that converts speech to speech 100% private

33
Emerging
2574 mravanelli/pytorch_MLP_for_ASR

This code implements a basic MLP for speech recognition. The MLP is trained...

33
Emerging
2575 orhun/dialogflowbot

Google's Dialogflow implementation on Android with additional features.

33
Emerging
2576 gogyzzz/beamformit_matlab

A MATLAB implementation of CHiME4 baseline Beamformit

33
Emerging
2577 neosapience/n8n-nodes-typecast

Integrate Typecast AI TTS into your n8n workflows with this community node.

33
Emerging
2578 agentvoiceresponse/avr-tts-deepgram

This project demonstrates the integration of Agent Voice Response with...

33
Emerging
2579 aydinnyunus/LinuxVoiceAssistant

Linux Voice Assistant for to Make Your Work Easier

33
Emerging
2580 Serkali-sudo/auto-subtitle-generator

An Android app that automatically generates subtitles for videos locally,...

33
Emerging
2581 KathyReid/opensource-voice-tools

A repo listing known open source voice tools, ordered by where they sit in...

33
Emerging
2582 pschatzmann/arduino-simple-tts

A simple TTS solution based on pre-recorded audio

33
Emerging
2583 Madhur215/Chatbot-cum-voice-Assistant

An AI chatbot with features like conversation through voice, fetching events...

33
Emerging
2584 va-kiet/Voice-Assistant-wake-word-detection-model

Build a Wake Word Detection model for Voice Assistant using PyTorch

33
Emerging
2585 daanzu/wav2vec2_stt_python

Simple Python library, distributed via binary wheels with few direct...

33
Emerging
2586 codename0og/codename-rvc-fork-3

Codename's rvc fork version 3, based on Applio.

33
Emerging
2587 theoomoregbee/paysense-backend

This is our paysense backend , a sails app

33
Emerging
2588 lucadellalib/audiocodecs

A collections of audio codecs with a standardized API

33
Emerging
2589 mtokar3v/ReversoAPI-NET

🌐 An API Client for the reverso.net, written in C#/.NET (Based on Site API...

33
Emerging
2590 ignabelitzky/easy-subber

A Python-based tool that that takes video files and generates .srt subtitle...

33
Emerging
2591 hanxiao/mls

MLX Local Serving (MLS) - Unified ASR, TTS, and Translation on Apple Silicon

33
Emerging
2592 gunarakulangunaretnam/voice-typer

A voice recognition based typing tool for English, Tamil, Sinhala languages.

33
Emerging
2593 shawnrushefsky/talky-talky

MCP server for Audio Generation and Analysis with a Variety of Open Models.

33
Emerging
2594 revsic/tf-glow-tts

Tensorflow implementation of Glow-TTS

33
Emerging
2595 echo8795/react-native-android-text-to-speech

React Native Text-To-Speech wrapper module for android

33
Emerging
2596 Animator617/jasper

Jasper is a AI asistence programm based on deeplearning

33
Emerging
2597 m0wer/aibot

Telegram bot powered by Ollama, capable of handling text and voice messages,...

33
Emerging
2598 fquirin/speech-recognition-experiments

Experiments to test different speech recognition systems for SEPIA Framework

33
Emerging
2599 Ahmed5attab/Qaf-QuranSearchAndMemorization

iOS Islamic application for the holy Quran, helps the Muslims to have the...

33
Emerging
2600 rt400/ReversoTTS-HA

ReversoTTS component for HomeAssistant

33
Emerging
« Prev 1 2 3 24 25 26 27 28 80 81 82 Next »