All Voice AI Tools

8,165 tools ranked by quality score · Page 32 of 82

Showing 3101–3200 of 8,165
# Tool Score Tier
3101 AlasdairKing/Calendar-VB6

Simple, accessible Calendar for screenreader and blind users.

31
Emerging
3102 tigjaw/remyme

ReMyMe - a basic "Read My Messages" Android application (old)

31
Emerging
3103 Infineon/i2s-microphone

A collection of documentation and examples for Infineon's I2S microphones.

31
Emerging
3104 The-Data-Dilemma/Medibeng-Orpheus-3b-0.1-ft-Fine-Tuning

Medibeng-Orpheus-3b-0.1-ft: a TTS model for bilingual Bengali-English...

31
Emerging
3105 BenjaminPoncet/bobby-snips-tts

bobby-snips-tts is an implementation of snips-tts written in Node.js with...

31
Emerging
3106 Abhradipta/OCR-With-Read-Out-Loud-Using-Python

An Optical Character Recognition (OCR) System designed using Python to read...

31
Emerging
3107 taeefnajib/Vocazee

A voice cloning and text-to-speech application that can generate speech in any voice.

31
Emerging
3108 viig99/esolafast

Fast C++ implementation of ESOLA using KFRLib, can be used for online...

31
Emerging
3109 koesan/Evoars

A multi-model AI platform for comics, manga, and videos. It colorizes...

31
Emerging
3110 PiasRoY/Bangla-Spoken-Number-Recognition

Recognizing spoken Bangla numbers using MFCCs and a CNN.

31
Emerging
3111 suzumushi0/SoundObject_binary

SoundObject binary distribution.

31
Emerging
3112 palahsu/Greeting-PC

Greeting PC, made with a simple Visual Basic script. Run the file and it will execute...

31
Emerging
3113 dhdaines/soundswallower-demo

Simple demo of client-side speech recognition

31
Emerging
3114 TCL606/Speech-Number-Recognition

A spoken-digit recognizer based on digital signal processing

31
Emerging
3115 baocin/hugging_face_example_STT_api

Demonstration of Hugging Face's (https://huggingface.co/) newly released...

31
Emerging
3116 vinbhaskara/Digit-Speech-Recognition

Using MFCC features on Speech Signals to classify Digits after matching...

31
Emerging
3117 idiap/TIDIGITSRecipe.jl

A Julia recipe for training an ASR system using the TIDIGITS database

31
Emerging
3118 marvinborner/CTC-LSTM

Spoken word recognition using CTC LSTMs for SWR2 Tübingen

31
Emerging
3119 vectominist/rspin

Official inference code for NAACL 2024 paper "R-Spin: Efficient Speaker and...

31
Emerging
3120 SzLeaves/asr-model-ctc

ASR deep learning models (using BiGRU, WaveNet, and CTC), built with TensorFlow 2...

31
Emerging
3121 loglux/FlexAudioPrint

FlexAudioPrint is a Python-based app for transcribing audio to text using...

31
Emerging
3122 SEPIA-Framework/sepia-web-audio

Create modular, cross-browser, web audio pipelines to record and process...

31
Emerging
3123 oeschsec/Sidekick---voice-controlled-keyboard-and-mouse

Voice controlled keyboard and mouse that is lightweight (minimal...

31
Emerging
3124 aeleraqi/gTTS---Arabic-text-to-multiple-languages

Converting Arabic text to speech in various languages with the versatile...

31
Emerging
3125 BobRandomNumber/ComfyUI-KyutaiTTS

A non real-time ComfyUI implementation of Kyutai TTS

31
Emerging
3126 papercast-dev/papercast

A Python pipeline tool and plugin ecosystem for processing technical...

31
Emerging
3127 deepgram/deepgram-js-captions

This package is the JavaScript implementation of Deepgram's WebVTT and SRT...

31
Emerging
3128 khanld/Wav2vec2-Pretraining

Wav2vec 2.0 Self-Supervised Pretraining

31
Emerging
3129 heptacode/interactivekiosk

A kiosk improvement project for diverse users ✨

31
Emerging
3130 elie-atia/talk-to-chat-gpt

Enables talking to ChatGPT using voice-to-text (record and recognize the...

31
Emerging
3131 X-LANCE/VoiceFlow-TTS

[ICASSP 2024] This is the official code for "VoiceFlow: Efficient...

31
Emerging
3132 tsengia/SphinxTrainHelper

A Bash script designed to make training sphinx4 and pocketsphinx acoustic...

31
Emerging
3133 Phe0nix/Speech-Email-Sender

Send email with speech recognition: just start talking and send emails....

31
Emerging
3134 Philipinho/ThreadVoice

Source code for https://twitter.com/threadvoice

31
Emerging
3135 yeyupiaoling/VITS-PaddlePaddle

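A PaddlePaddle-based speech synthesis project that uses VITS, a speech synthesis method. This end-to-end model is very simple to use and does not require text alignment or other overly comp...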

31
Emerging
3136 bookbot-hive/OpenBible-TTS

Building Text-to-Speech Systems using OpenBible!

31
Emerging
3137 falabrasil/cmusphinx-br

Scripts and resources for ASR in Brazilian Portuguese

31
Emerging
3138 arcb01/g-narrator

A screen reading accessibility tool

31
Emerging
3139 kofemann/streetguide

An Android app to discover where you drive

31
Emerging
3140 Ryan5453/lyricscribe

Automated Lyric Transcription Research

31
Emerging
3141 pragmatrix/context-switch

Audio Streaming for FreeSWITCH with backends powered by Azure, OpenAI, and Aristech

31
Emerging
3142 ASR-project/Multilingual-PR

Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM....

31
Emerging
3143 savg92/voice-cloning

This project provides a comprehensive testing and comparison platform for...

31
Emerging
3144 repodiac/espeak-ng_german_loan_words

Brief tutorial with code where you can automatically create a dictionary...

31
Emerging
3145 tongplw/ASR-web-based-restaurant

🍔 Foody, a smart voice-assistant web-based restaurant using Kaldi, React, and WebRTC

31
Emerging
3146 vishalnagda1/text-to-speech

Python program to convert text to speech.

31
Emerging
3147 KernelOverseer/caLLMe

Real-time voice conversation with LLMs using an asynchronous Voice to...

31
Emerging
3148 USSLab/DolphinAttack

Inaudible Voice Commands

31
Emerging
3149 Arbazkhan4712/Text-To-Speech

A program that converts text into speech using Python

31
Emerging
3150 auroraapi/aurora-python

Aurora SDK for Python

31
Emerging
3151 belambert/asr-scripts

Lots of miscellaneous scripts to work with Sphinx ASR files and other...

31
Emerging
3152 mehdichaouch/nabstory

Let your Nabaztag 🐰 read you a story 📖

31
Emerging
3153 hanifabd/voice-activity-detection-vad-realtime

Real-time Voice Activity Detection (VAD) with some example use cases like...

31
Emerging
3154 visu123s/MimicKit

🤖 Learn motion imitation with MimicKit, a framework offering advanced...

31
Emerging
3155 Inviro/Illud

Illud is a smart text analyzer written in pure Java that displays different...

31
Emerging
3156 speechly/api

Speechly public API definitions and generated code

31
Emerging
3157 lpkpaco/Bocchi-The-Rock-GPT-SoVITS-Models

Contains voice models based on the GPT-SoVITS architecture of different...

31
Emerging
3158 ggh-png/EMOTIBOT

EMOTIBOT: an emotion robot using the GPT-3.5 model

31
Emerging
3159 nikkiw/realtime_translator

Python tool for real-time voice recognition and multilingual translation

31
Emerging
3160 SEPIA-Framework/sepia-docs

Documentation and Wiki for SEPIA. Please post your questions and bug-reports...

31
Emerging
3161 m1n1v1rus/futuristic-calculator

A futuristic, AI-powered advanced calculator with voice control, graph...

31
Emerging
3162 wamich/personal-vocabulary

"Personal Vocabulary" is a browser extension for steadily memorizing new words while reading English and building a personal word bank.

31
Emerging
3163 in03/squawk

Automatic subtitles for DaVinci Resolve with OpenAI Whisper

31
Emerging
3164 indri-voice/audiotoken

Audio tokenization, in the fastest way possible!

31
Emerging
3165 charlescao460/SpeechRecognitionByGoogleCloud

A .NET program that captures local audio and recognizes speech

31
Emerging
3166 milosgajdos/go-playht

PlayHT API client Go module

31
Emerging
3167 binglel/asr_baidu_web_server

ASR web server based on Flask

31
Emerging
3168 aks-devs/mod_whisper_asr

FreeSWITCH ASR module

31
Emerging
3169 theawless/sr-lib

Automatic Speech Recognition library for my BTech Project.

31
Emerging
3170 kouyt5/lightning-asr

An end-to-end speech recognition model built on the PyTorch Lightning framework; still experimental, with performance being continually optimized

31
Emerging
3171 AppleHolic/FastSpeech2

Refactored version of https://github.com/ming024/FastSpeech2

31
Emerging
3172 denizariyan/Real-Time-Auto-Transcriber

Automatic transcriber made with the Nvidia NeMo AI toolkit. Used to...

31
Emerging
3173 naturalDesign/fusion-remote

Chatbot for Autodesk Fusion 360 with speech recognition

31
Emerging
3174 cjh0613/vosk-android-demo-chinese

Chinese vosk-android-demo

31
Emerging
3175 MatteoM95/Smart-Home-Vigilance-System

An indoor video surveillance system capable of recognizing the presence of a...

31
Emerging
3176 kehlawicode/audiblez

🎧 Create high-quality audiobooks from e-books with ease using Audiblez,...

31
Emerging
3177 guibranco/talabat-hackathon-2022

🏃 💡 Talabat Hackathon 2022 API project

31
Emerging
3178 egorsmkv/radtts-uk

🇺🇦 Ukrainian RAD-TTS++ models (decoder + models with 3 voices) and HiFiGAN model

31
Emerging
3179 zhurlik/smart-home

A multi-project that contains UDP server, MQTT broker and a few sub-projects...

31
Emerging
3180 1epalpyrgou/smartbell-server

A smart bell for our school - 1st Vocational High School of Pyrgos

31
Emerging
3181 nisiddharth/TextToSpeech

A simple Java-based text-to-speech converter made using NetBeans 8.2

31
Emerging
3182 burrmill/sph2pipe

sph2pipe v2.5. We do not maintain this or accept pull requests; just...

31
Emerging
3183 MaikeMota/comando-voz

Using the HTML5 SpeechRecognizer for command recognition.

31
Emerging
3184 Zuhef/Text-to-Speech

A simple text-to-speech converter built using HTML, CSS, and JavaScript.

31
Emerging
3185 pkprajapati7402/Darvin-Chatbot

Darvin is a Python-based voice-activated chatbot that interacts with users...

31
Emerging
3186 GitPolyakoff/voice-assistant

Voice Assistant: a C# application for controlling a computer by voice....

31
Emerging
3187 wukan1986/KWebSpeaker

A web page read-aloud tool that preserves the original layout and lets you select paragraphs

31
Emerging
3188 Flux9665/ArticulatoryTextFrontend

This is a text-processing frontend that converts graphemes to phonemes and...

31
Emerging
3189 Ex094/VoiceCom

A Simple Voice Command Application powered by Java and Sphinx4 Speech...

31
Emerging
3190 ognistik/alfred-superwhisper

Use Alfred to Control Superwhisper - AI Powered Voice to Text

31
Emerging
3191 speechnotes/speechnotes-speech-recognizer

The speech recognition engine behind Speechnotes, based on the Webspeech-API

31
Emerging
3192 backpropper/DNN-Activation-Brain

Code repository for Dissecting the DNN Brain for a Better Insight (ICASSP 2016)

31
Emerging
3193 Alan-6666/chinese_asr

A demo of Chinese ASR

31
Emerging
3194 mayank-kumar-giri/Speech-Recognizer-cum-Voice-Typing-Editor

Speech Recognizer cum text editor that facilitates voice typing using Google...

31
Emerging
3195 CodingWithEnjoy/Speech-To-Text-Python

Text to Speech 😊🤩

31
Emerging
3196 HawksLab/narratify

E-book to audiobook converter

31
Emerging
3197 PalabraAI/palabra-ai-java

Java SDK for Palabra AI's real-time speech-to-speech translation API. Break...

30
Emerging
3198 grayhatdevelopers/deepdub

🗣️ Videos for everyone. Implementation of "Automated Dubbing and Facial...

30
Emerging
3199 mallorbc/brillibot-client

Easy to use voice commands API python client. Create your own commands in...

30
Emerging
3200 VisionBrain/Neural_Voice_Cloning

Open Source Implementation of Neural Voice Cloning with Few Audio Samples...

30
Emerging