All Voice AI Tools

8,165 tools ranked by quality score · Page 27 of 82

Showing 2601–2700 of 8,165
# Tool Score Tier
2601 Arvind2903/Accent-Classification-And-Conversion

Tackle accent classification and conversion using audio data, leveraging...

33
Emerging
2602 matthijsvk/TIMITspeech

Speech recognition on the TIMIT (or any other) dataset

33
Emerging
2603 jayesh15111988/SpeechRecognitionLibrary

A pluggable library for speech recognition on iOS - Requires iOS 10.0+

33
Emerging
2604 tollwerk/speakable

Simple and privacy friendly on-page screenreader / text-to-speech player...

33
Emerging
2605 xinjli/ucla-phonetic-corpus

Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH...

33
Emerging
2606 ryhorv/tf-flowavenet

Tensorflow implementation of "FloWaveNet: A Generative Flow for Raw Audio"

33
Emerging
2607 TheMonocledHamster/Hamster-Bot-Prototype

Rudimentary Chatterbot written in Python

33
Emerging
2608 RomainLLC/booking-openai-chatbot

Booking chatbot example app with Django, OpenAI and text to speech

33
Emerging
2609 robmsmt/CommonCorrections

Easily fix common corrections in speech!

33
Emerging
2610 mateusz-kow/auto-subs

Generate, edit and apply subtitles locally using Whisper or any ASR backend

33
Emerging
2611 creafz/kaggle-speech-recognition

Solution for TensorFlow Speech Recognition Challenge on Kaggle (125th place, top 10%)

33
Emerging
2612 lesleyrs/clipboard-narrator

Turn any web page into an audiobook, works in the background on desktop!

33
Emerging
2613 SuryanshNaugraiya/AI-JARVIS

AI JARVIS, an intelligent personal assistant is a software agent that can...

33
Emerging
2614 pkprajapati7402/JARVIS-voice-assistant

JARVIS Voice Assistant is a powerful and intuitive voice-activated assistant...

33
Emerging
2615 Uni-Creator/Jarvis-Desktop-Assistance

A powerful desktop assistant built in Python that combines voice commands,...

33
Emerging
2616 techiaith/trawsgrifiwr-arlein

Cod gwefan Trawsgrifiwr Ar-lein gan Uned Technolegau Iaith, Prifysgol Bangor...

33
Emerging
2617 Lucasfrota/pyssistant

Pyssistant is designed to be an conversational interface builder.

33
Emerging
2618 royshil/obs-squawk

Real-time Text-to-Speech AI Engine built-in OBS, integrative and intuitive

33
Emerging
2619 team-listnr/text-to-speech-api

Listnr Text to speech API

33
Emerging
2620 HelloChatterbox/speech2text

Chatterbox STT engines

33
Emerging
2621 rcdalj/speech2speech

Full speech-to-speech workflow (can be customized to user's requirements)

33
Emerging
2622 angangwa/azure-speech-to-text

Azure speech to text capabilities including OpenAI models. Gradio demo.

33
Emerging
2623 agentvoiceresponse/avr-tts-google-speech-tts

This project demonstrates the integration of Agent Voice Response with...

33
Emerging
2624 charslab/Home-Assistant

Home assistant inspired by Amazon Echo, based on wit.ai with speech recognition

33
Emerging
2625 holgern/ttsforge

Convert EPUB files to audiobooks using Kokoro ONNX TTS

33
Emerging
2626 totalvoice/totalvoice-java

Client Java pra API da TotalVoice

33
Emerging
2627 6-robot/xfyun_waterplus

A xfyun ros package for Waterplus Robots

33
Emerging
2628 Aditya1Jhaveri/AI-Video-Dubbing

AI video dubbing using Google APIs automates translation and dubbing by...

33
Emerging
2629 yandex-cloud-examples/yc-speechkit-streams-recognizer

SpeechKit Streaming Recognizer.

33
Emerging
2630 Gemeri/Discord-Voice-Channel-Bot

A bot that can join voice channels using the OpenAI api and Microsoft's free...

33
Emerging
2631 9jaswag/speechrec

a simple speech recognition app using the Web Speech API Interfaces

33
Emerging
2632 Tinkoff/asterisk-voicekit-modules

Non-blocking Asterisk modules for accessing VoiceKit services for speech...

33
Emerging
2633 haydonryan/epub2audiobook

Blazingly fast EPUB to Audiobook converter

33
Emerging
2634 Drakonis96/whispad

WhisPad is a note management tool where you can write or dictate your notes...

33
Emerging
2635 jawebada/piper-audio-example-streaming-web-worker

Simple piper-js example

33
Emerging
2636 Ayushverma135/Whisper-Hindi-ASR-model-IIT-Bombay-Internship

The Whisper Hindi ASR (Automatic Speech Recognition) model utilizes the...

33
Emerging
2637 huytd/speech

A tool to practice English speaking

33
Emerging
2638 ajaygujja/Kahani-Storytelling-App-For-Children-With-Hearing-Impairment

Storytelling App For Children With Hearing Impairment

33
Emerging
2639 m1el/nemotron-asr.cpp

Nemotron ASR rewrite to GGML

33
Emerging
2640 igorbezsmertnyi/speech

speech recognition and speech synthesis

33
Emerging
2641 jakob-stoeck/speechToText

iOS speech recognition app for voice messages and general audio files

33
Emerging
2642 kanttouchthis/text_generation_webui_xtts

XTTSv2 Extension for oobabooga text-generation-webui

33
Emerging
2643 revsic/tf-mlptts

Tensorflow implementation of MLP-Mixer based TTS

33
Emerging
2644 speechly/ios-client

The iOS client library for Speechly API

33
Emerging
2645 orbitalsonic/Speech-Recognition-SpeechToTextConverter

The Speech Recognition or Speech-to-Text Converter module in Android,...

33
Emerging
2646 TheDeathDragon/LiveTranslate

Real-time audio translation overlay for Windows — captures system audio +...

33
Emerging
2647 agentvoiceresponse/avr-tts-kokoro

The application sets up an Express.js server that accepts a text string from...

33
Emerging
2648 vdutts7/ai-rapper

Talking Head of your favorite rapper using Transformers, PyTorch, Tortoise...

33
Emerging
2649 musa11971/manhuw

Recognizing and identifying Quran reciters from audio recordings.

33
Emerging
2650 jasonclark/voice-user-interface

Prototypes for voice assistance and UI design based on voice interactions

33
Emerging
2651 prathamesh-mandavkar/AutoTalker

The project focuses on leveraging technology to create new courses,...

33
Emerging
2652 jhudsl/text2speech

Text to Speech

33
Emerging
2653 graphiteSWE/DeSpeect

Codice per il prodotto "DeSpeect: un'interfaccia grafica per Speect"

33
Emerging
2654 Ziyodullodev/useful-codes

@ziyodev

33
Emerging
2655 MaxMax2016/Grad-TTS-Chinese

Huawei Grad-TTS for Chinese

33
Emerging
2656 pingfury108/book2tts

有声书制作工具

33
Emerging
2657 radkoder/qt-whisper

A Qt & QML wrapper for whisper.cpp

33
Emerging
2658 stefantaubert/tacotron-cli

Command-line interface to train Tacotron 2 using .wav <=> .TextGrid pairs.

33
Emerging
2659 olami-developers/olami-android-hotword-detect-sdk

Hotword Detection (Wake Word Detection) Android library and sample codes

33
Emerging
2660 renaudjenny/swift-tts

A straightforward package containing version for Swift modern concurrency,...

33
Emerging
2661 Mobile-Artificial-Intelligence/maise

Maise is an open-source android speech engine designed to provide a powerful...

33
Emerging
2662 AmSh4/gemini-live-app

A real-time voice AI web app using Google Gemini Live API. Features...

33
Emerging
2663 markokosticdev/cloud_text_to_speech_nodejs

Single interface to Google, Microsoft, and Amazon Text-To-Speech.

33
Emerging
2664 hanxi/epub2mp3

这是一个使用 Microsoft Edge TTS 服务将 EPUB 电子书转换为 MP3 音频文件的工具。

33
Emerging
2665 parzibyte/reconocimiento-voz-javascript

Usar webkitSpeechRecognition para convertir voz a texto en la web con JavaScript

33
Emerging
2666 masayoshi-louis/microsoft-speech-rs

Rust wrapper for microsoft speech recognition

33
Emerging
2667 SABER-labs/SABER

Semi-Supervised Audio Baseline for Easy Reproduction

33
Emerging
2668 happyf-weallareeuropean/cC

auto Speak lastest chatgpt stream responses. & more room for display chat content

33
Emerging
2669 crazymidnight/speech-recognition

[WIP] Speech recognition microservice

33
Emerging
2670 Mildemelwe/Non-English-Tacotron-2-Training-Notebook

Tacotron 2 training notebook supporting Japanese, French, and Mandarin

33
Emerging
2671 adrianmfi/gpt-tutor

Generate personalized audio lessons for learning languages with GPT and...

33
Emerging
2672 tomik395/ESP32-AI

Speak to your ESP32 and it speaks back! Your new personal assistance is...

33
Emerging
2673 dyazincahya-blog/k-speech

a simple component "text to speech"

33
Emerging
2674 zero-nnkn/vision-assistant-services

👁‍🗨 Vision Assistant (Backend): Smart Assistant for Visually Impaired People

33
Emerging
2675 HristovB/Speech_Recognition_Macedonian

Speech recognition model for recognising Macedonian spoken language.

33
Emerging
2676 nafiuny/ICRCycleGAN-VC

Non-parallel voice conversion called ICRCycleGAN-VC based on CycleGAN and...

33
Emerging
2677 PMO-IT/voiceassistant

Nova, a Java based voice assistant. Runnable on Raspberry Pi.

33
Emerging
2678 Pooventhiran/VSR

Speaker-Independent Speech Recognition using Visual Features

33
Emerging
2679 minji-o-j/AI-Speaker-for-Senior-Citizen

독거노인을 위한 AI스피커 - 일반적인 AI 스피커의 역할 뿐만 아니라 사용자가 있는 환경의 온·습도를 주기적으로 측정하여 필요시 환경...

33
Emerging
2680 LM-Kit/LynxTranscribe

LynxTranscribe is a comprehensive, professional-grade audio transcription...

33
Emerging
2681 Hassi34/NLP-Hub

The NLP Hub consists of multiple NLP services, each providing specific...

33
Emerging
2682 bobo52310/TypeLate

Voice-to-text for macOS and Windows. 100% free — fork it, make it yours, and...

33
Emerging
2683 ramizeid/Discord-Voice-Chat-Text-to-Speech

A text to speech bot for Discord using IBM Watson

33
Emerging
2684 devnamdev2003/PC_Assistant

The virtual assistant is a general-purpose desktop-based application...

33
Emerging
2685 jaywcjlove/TextSoundSaver

Using the TextSoundSaver application, you can convert text into realistic...

33
Emerging
2686 gokulakannant/text-to-speech

A experiment project for react js and electron app. Download binaries here:...

33
Emerging
2687 danielclough/parler-tts-wasm

A Rust and Wasm Demo to generate and play speech from text using Parler-TTS.

33
Emerging
2688 seungwonpark/awesome-tts-samples

Awesome list of TTS papers with audio samples

33
Emerging
2689 streamer45/streamkit

StreamKit is a self-hosted real-time media processing engine with pluggable...

33
Emerging
2690 ywatanabe1989/scitex-notification

Give your AI agents a voice — TTS, phone calls, SMS, email, webhooks. One...

33
Emerging
2691 FlorianEagox/WeeaBlind

A program to dub non-english media with modern AI speech synthesis,...

33
Emerging
2692 jumadi59/android-game-teka-teki-silang

Simple game Teka-Teki Silang (Word Cross). Available on the play store!

33
Emerging
2693 ikram-shah/iris-fhir-transcribe-summarize-export

A full-stack application that allows practitioners to record voice notes and...

33
Emerging
2694 ddlBoJack/MT4SSL

[INTERSPEECH 2023 Best Paper Shortlist] Official implementation for MT4SSL:...

33
Emerging
2695 winstxnhdw/CapGen

A fast CPU-first video/audio transcriber for generating caption files with...

33
Emerging
2696 hoishing/speech-recog

Speech recognition web app powered by Google Speech API

33
Emerging
2697 tometoproject/tometo

:zzz: A text to speech social network. [mirror]

33
Emerging
2698 MotazSabri/Hanami-release

Live translator that captures any audio that comes from a WINDOWS speaker or...

33
Emerging
2699 tirsky/speechpro_wrapper

Wrapper for text to speech speechpro (only russian)

33
Emerging
2700 yyaadet/autosrt_page

AutoSRT is an macOS app that automatically generates dual language subtitles...

33
Emerging
« Prev 1 2 3 25 26 27 28 29 80 81 82 Next »