ibotplus/kbase-media
Video, audio, and image content recognition, speech transcription, and speech synthesis / easily convert video, audio, and images to text, and convert text back to audio (base64)
This tool helps you quickly extract text from videos, audio recordings, or images, or turn written text into spoken audio. You can input various media files and receive text transcripts, image descriptions, or audio files (in base64 format). It's designed for anyone who needs to convert media content for analysis, archiving, or accessibility purposes.
Use this if you need to process large volumes of multimedia content to extract text, create audio versions of text, or perform content analysis.
Not ideal if you require highly specialized, domain-specific recognition for complex, low-quality, or uncommon media types, or if you need advanced video editing capabilities.
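Since synthesized audio is returned as base64, a caller has to decode it before it can be played or stored. The following is a minimal sketch of that step only. It assumes the base64 string has already been pulled out of whatever response the tool produces; the function name `save_base64_audio` and the output file name are hypothetical and not part of kbase-media.

```python
import base64
from pathlib import Path


def save_base64_audio(audio_b64: str, out_path: str) -> int:
    """Decode a base64 audio payload, write it to out_path, return the byte count.

    Hypothetical helper for illustration; kbase-media's actual response
    fields and audio container format are not described in this listing.
    """
    # validate=True raises binascii.Error on characters outside the base64 alphabet
    audio_bytes = base64.b64decode(audio_b64, validate=True)
    Path(out_path).write_bytes(audio_bytes)
    return len(audio_bytes)
```

For example, `save_base64_audio(payload, "speech.wav")` would write the decoded bytes to `speech.wav`, where the `.wav` extension is itself an assumption about the audio format.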
Stars: 24
Forks: 11
Language: Java
License: MIT
Category
Last pushed: Dec 03, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/ibotplus/kbase-media"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
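The same request can be made from a script. The sketch below uses only the Python standard library and makes unauthenticated calls. It assumes the endpoint returns JSON and that the path follows a category/owner/repo pattern, which is inferred from the single URL shown above; `quality_url` and `fetch_quality` are illustrative names, not part of any published client.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example; the segment layout is an assumption.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL, percent-encoding each path segment."""
    segments = (quote(part, safe="") for part in (category, owner, repo))
    return BASE_URL + "/" + "/".join(segments)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch and parse the response, assuming the body is JSON."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.load(resp)
```

Calling `fetch_quality("voice-ai", "ibotplus", "kbase-media")` requests the URL from the curl example. Each call counts against the daily request limit, so a script that polls many repositories may want to cache results.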
Higher-rated alternatives
alphacep/vosk-android-demo
Offline speech recognition for Android with Vosk library.
marytts/marytts
MARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure Java
lkuza2/java-speech-api
The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines...
chenliangrui/EasyMrcp
Welcome to EasyMrcp! EasyMrcp is written in Java and currently provides integrations with a variety of ASR and TTS engines, making ASR and TTS truly simple to use...
goxr3plus/java-google-speech-api
🙊 Speech Recognition, Text To Speech, Google Translate