All Voice AI Tools
8,165 tools ranked by quality score · Page 36 of 82
| # | Tool | Score | Tier |
|---|---|---|---|
| 3501 |
mobassir94/Multilingual-Speech-to-Speech-Translator
Multilingual Speech to Speech (STS) Translator is the First Ever Code-mixed... |
|
Experimental |
| 3502 |
amitybell/srcvox
Companion app for Valve's Source Engine based games with voice clips and... |
|
Experimental |
| 3503 |
Iiridayn/pico-tts
Android PicoTTS w/C calling application using submodule |
|
Experimental |
| 3504 |
aios-ai/voxta-providers
Adds functionality to Voxta by utilizing the SDK. |
|
Experimental |
| 3505 |
QinHsiu/BiCLTTS
Bi-level Contrastive Learning for Text-to-Speech |
|
Experimental |
| 3506 |
nkpro2000sr/Word-to-AudioFile
Generates audio files from given words. Useful for generating... |
|
Experimental |
| 3507 |
1999AZZAR/sakuraAI_Bloom
SakuraAI Bloom is an advanced chatbot with versatile capabilities, designed... |
|
Experimental |
| 3508 |
stutstev/pimp
Hackable music player optimized for use on screenless SBCs in cars. |
|
Experimental |
| 3509 |
huuhka/t-pain
T-Pain Bot is a Telegram bot that helps the user track their daily pain... |
|
Experimental |
| 3510 |
furushchev/ros_gtts
Text-to-Speech service for ROS using the Python gTTS library as the backend. |
|
Experimental |
| 3511 |
Ultan-Kearns/GestureBasedUIProject
Gesture Based UI Project 4th Year |
|
Experimental |
| 3512 |
echocatzh/GTCNN
Personalized AEC |
|
Experimental |
| 3513 |
KrishnaDN/BERTphone
Implementation of the paper "BERTphone: Phonetically-aware Encoder... |
|
Experimental |
| 3514 |
amityalwar/snoofus
Generative AI based speech analyzer |
|
Experimental |
| 3515 |
dannycrief/python-voice-assistant
Sarah Voice Assistant (SVA) is a Python voice assistant project on... |
|
Experimental |
| 3516 |
Unovamata/Neopets-Shop-And-Attic-Autobuyer-Cracked
An Auto Item Buyer and Pricer Bot for Neopets.com |
|
Experimental |
| 3517 |
GoodSpeech/good-speech-web-client
Practice your speech level in any language using speech recognition |
|
Experimental |
| 3518 |
VanModers/oostfraeisk_ooversetter
First AI-based translator for East Frisian Low Saxon... |
|
Experimental |
| 3519 |
mattt/supertone-swift
A Swift wrapper for the Supertone text-to-speech model |
|
Experimental |
| 3520 |
Appfairy/speech-tree
An events tree which lets you define a sequence of voice commands. |
|
Experimental |
| 3521 |
s-l-h/cat
A basic toolkit for speech analytics, using GPT and Whisper-X |
|
Experimental |
| 3522 |
robinhad/voice-recognition-ua
Training scripts for Speech-To-Text models for Ukrainian language |
|
Experimental |
| 3523 |
farjadilyas/MUKALMA
MUKALMA is a human-like chatbot which incorporates correct, relevant... |
|
Experimental |
| 3524 |
rajatgoyal715/Awaaz
🎙 An Android project with some features like text to speech, speech to text... |
|
Experimental |
| 3525 |
vasilevp/sam
SAM: Software Automatic Mouth (Ported from https://github.com/vidarh/SAM) |
|
Experimental |
| 3526 |
logisticinfotech/Laravel-text-to-speech
Laravel text to speech |
|
Experimental |
| 3527 |
grantCelley/Shout-Scribe
A completely free and open source dictation program |
|
Experimental |
| 3528 |
thiswillbeyourgithub/Voice2Anki
A powerful tool that converts voice recordings into high-quality Anki... |
|
Experimental |
| 3529 |
dobby-seo/korean-speech-recognition-quartznet
Korean speech recognition with Quartznet, a quantized model based on Jasper |
|
Experimental |
| 3530 |
jekyll2014/VoiceAssistant
Locally hosted voice assistant with plugin extension feature |
|
Experimental |
| 3531 |
mecparts/Talker
A code-to-speech board based on the General Instrument 1980s chip set |
|
Experimental |
| 3532 |
SARIT42/image-Annotation-Speech
Explaining the contents of an image in the form of speech through caption... |
|
Experimental |
| 3533 |
dkurt/audio_recognition_android
Audio recognition on Android with OpenVINO |
|
Experimental |
| 3534 |
vadimkantorov/discordspeechtotext
Discord Speech-To-Text bot in Python using Google Cloud Speech-To-Text API |
|
Experimental |
| 3535 |
VIKASRAPARTHI/Jarvis-Voice-Assistant
Jarvis is a powerful desktop voice assistant designed to enhance... |
|
Experimental |
| 3536 |
Dada-Tech/speech-to-code
Limited Keyword Speech Recognition using Transfer Learning |
|
Experimental |
| 3537 |
wayne214/react-native-baidu-vtts
react-native-baidu-vtts |
|
Experimental |
| 3538 |
gnat/text-to-speech-ubuntu
🙊 Text to speech GUI / TTS on Ubuntu Linux 26.04 25.10 25.04 24.10 24.04... |
|
Experimental |
| 3539 |
technicianted/msspeech-gbridge
Bridge service to enable using Google Cloud Speech client SDKs with... |
|
Experimental |
| 3540 |
yp2211/gTTS4j
gTTS4j (Google Text to Speech): Java version of an interface to Google's... |
|
Experimental |
| 3541 |
erzaozi/vits-plugin
A speech synthesis plugin based on Yunzai |
|
Experimental |
| 3542 |
Yangyangii/DeepConvolutionalTTS-pytorch
Deep Convolutional TTS PyTorch implementation |
|
Experimental |
| 3543 |
yuanhao-chen-nyoeghau/shanghainese-tts
Shanghainese TTS |
|
Experimental |
| 3544 |
isaacgounton/awesome-tts
A unified Text-to-Speech gateway combining multiple TTS providers (Kokoro... |
|
Experimental |
| 3545 |
yanorei32/aitalked-server
Simple GynoidTalk / VOICEROID Web Server based on aitalked library |
|
Experimental |
| 3546 |
abdnh/anki-asr
Anki add-on for speech recognition |
|
Experimental |
| 3547 |
HiMeditator/wfts-chinese-tool
Play the game "Whisper from the Stars" in Chinese. |
|
Experimental |
| 3548 |
ynop/NTSpeechRecognition
NTSpeechRecognition is an iOS/macOS framework, written in Objective-C,... |
|
Experimental |
| 3549 |
rusiaaman/PCPM
Presenting Collection of Pretrained Models. Links to pretrained models in... |
|
Experimental |
| 3550 |
thotnd173389/tdnn_with_swsa
Create a keyword spotting model using Time Delay Neural Network and Shared... |
|
Experimental |
| 3551 |
the-avyakta/Speech-to-GCode
I created a speech-to-Gcode generator using speech recognition and... |
|
Experimental |
| 3552 |
samnaveenkumaroff/Indic-F5
IndicF5: High-Quality Text-to-Speech for Indian Languages, including voice cloning |
|
Experimental |
| 3553 |
stefantaubert/tts-mos-test-mturk
Command-line interface (CLI) and Python library to evaluate text-to-speech... |
|
Experimental |
| 3554 |
onuratakan/ONUR_Voice_Assistant
A modular and expandable voice assistant. |
|
Experimental |
| 3555 |
X-LANCE/UniCATS-CTX-txt2vec
[AAAI 2024] CTX-txt2vec, the acoustic model in UniCATS |
|
Experimental |
| 3556 |
Op27/meeting_minutes_generator
This Python application automates the process of generating meeting minutes... |
|
Experimental |
| 3557 |
turtlehacks/speechportal
(1st place at HopHacks) A dynamic webVR memory palace for speech training,... |
|
Experimental |
| 3558 |
idiap/zff_vad
Unsupervised Voice Activity Detection by Modeling Source and System... |
|
Experimental |
| 3559 |
Prem-kumar27/Fast-KTSpeechCrawler
Parallelized automatic corpus collection for ASR. Forked from... |
|
Experimental |
| 3560 |
nttcslab-sp/torchain
WIP: PyTorch FFI wrapper for Kaldi chain loss (a.k.a. Lattice Free MMI) |
|
Experimental |
| 3561 |
virnarula/speechful
A speech-based document editing tool intended for those who cannot use keyboards. |
|
Experimental |
| 3562 |
JeanCaro/Babelin
Babelin Speach, for voice recognition and real-time translation, services... |
|
Experimental |
| 3563 |
reservamos/speech-to-text-demo
Flutter app with implementation of OpenAI tools (ChatGPT & Whisper) |
|
Experimental |
| 3564 |
nmstoker/SimpleSpeechLoop
A very basic demonstration connecting speech recognition and text-to-speech |
|
Experimental |
| 3565 |
Mmiglio/SpeechRecognition
Small-footprint Keyword Spotting |
|
Experimental |
| 3566 |
zhang-tuo-pdf/FedAudio
[ICASSP 2023] FedAudio: A Federated Learning Benchmark for Audio and Speech Tasks |
|
Experimental |
| 3567 |
Msparihar/Transcriber
Developed an AI tool to automatically generate captions and transcripts for... |
|
Experimental |
| 3568 |
jurihock/remucs
Demucs wrapper for remixing audio files with additional customizations |
|
Experimental |
| 3569 |
nutdanai-kpjr/L03-Speech-Recognition-Python
Python Speech-To-Text projects using AssemblyAI API |
|
Experimental |
| 3570 |
jsvir/sparknet
[Tiny KWS] SparkNet: Sparse Binarization for Fast Keyword Spotting |
|
Experimental |
| 3571 |
jm12138/iFLYTEK-MSC-Python-SDK
A third-party Python SDK for the iFLYTEK MSC intelligent speech platform, supporting voice wake-up, speech recognition, speech synthesis, speech evaluation, and more... |
|
Experimental |
| 3572 |
AbdulBasit-MrRobo/Real-Time-Speech-Emotion-Recognition
Code for the paper "Real Time Speech Emotion Recognition using Machine Learning" |
|
Experimental |
| 3573 |
Siddhant-Ray/SlideEZ
Automated presentation generation software using direct speech. Hands free... |
|
Experimental |
| 3574 |
soohyunme/foreigner_speech
Foreigner Korean speech voice recognition hackathon - CSLEE |
|
Experimental |
| 3575 |
HRN-Projects/AVA---Accessibility-Virtual-Assistant
It is an open source accessibility tool created for better usability and... |
|
Experimental |
| 3576 |
easonlai/ms-speech-services-demo-web-tts
Microsoft Azure Speech Services (Text-to-Speech, TTS) Web Demo with Node.js... |
|
Experimental |
| 3577 |
loneicewolf/AI-SNN
AI SNN - or Artificial Intelligence Stuttering Neural Network - a Project I... |
|
Experimental |
| 3578 |
TCBOMC/audio-book-TTS-tool
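Software that quickly applies AI character labeling and speech synthesis to large volumes of long text (on the order of millions of characters), such as articles, novels, and scripts |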
|
Experimental |
| 3579 |
mhagglun/Speech-Recognition
TensorFlow implementation for Speech Recognition using Convolutional Neural... |
|
Experimental |
| 3580 |
p337r/Efes
Proof of concept demo for a tool that listens for keywords, and records... |
|
Experimental |
| 3581 |
ORI-Muchim/BERT-MB-iSTFT-VITS
High-quality Multilingual(Korean, Japanese, Chinese, English, French and... |
|
Experimental |
| 3582 |
lyncisdev/voco
Create a speech recognition system for programming by voice using Kaldi |
|
Experimental |
| 3583 |
vivek-nexus/listen
Multilingual reading companion that helps you listen to any written material... |
|
Experimental |
| 3584 |
tifaniwarnita/indonesian-asr
Automatic speech recognition (ASR) for Indonesian language built by using... |
|
Experimental |
| 3585 |
Gopi-Durgaprasad/Speech-To-Text
End-to-End Speech Recognition |
|
Experimental |
| 3586 |
Forced-Alignment-and-Vowel-Extraction/fave-asr
Interface for automated transcription and time alignment of conversational... |
|
Experimental |
| 3587 |
dense-analysis/vim-speech
Vim Speech Recognition Experiments |
|
Experimental |
| 3588 |
chameleon82/avatar-ai
OpenAI Avatar for real-time API |
|
Experimental |
| 3589 |
Badri467/DubFlow
DubFlow lets you effortlessly dub YouTube videos into any language with... |
|
Experimental |
| 3590 |
Bangla-Language-Processing/Katha-Bangla-TTS
The first Bangla Text To Speech System for Bangladeshi Bangla (Katha) |
|
Experimental |
| 3591 |
01-SayantanI/Assistant
This Python Voice Assistant with GUI uses Tkinter to enable users to... |
|
Experimental |
| 3592 |
funmaker/4voiced
4chan voiced |
|
Experimental |
| 3593 |
aleksandarbos/Sound-Recognition-Convo2D-Neural-Network
Tools: Python (OpenCV 3.0 + Keras lib-Convolution 2D Neural Network). Desc:... |
|
Experimental |
| 3594 |
lukasjakobi/ha-sync-announcement
Broadcast synchronized TTS announcements across multiple media players in... |
|
Experimental |
| 3595 |
edisonneza/image-to-text
PWA - Convert Image to Text - A small multi language project built to use... |
|
Experimental |
| 3596 |
moimart/geppetto
GPT-Whisper-based Voice Assistant for Home Assistant (Experimental) |
|
Experimental |
| 3597 |
alexjsteffen/ttsrs
The ai-tts.rs project provides a command-line tool for generating spoken... |
|
Experimental |
| 3598 |
verrannt/snn_speechrec
Convolutional Spiking Neural Network to recognize speech utterances using... |
|
Experimental |
| 3599 |
zhenye234/FlashSpeech
ACM MM 2024 FlashSpeech: Efficient Zero-Shot Speech Synthesis |
|
Experimental |
| 3600 |
lucadellalib/ts-asr
Target speaker automatic speech recognition (TS-ASR) |
|
Experimental |