ibotplus/kbase-media

Content recognition for video, audio, and images, plus speech transcription and speech synthesis. Easily convert video, audio, and images to text, and convert text back to audio (base64).

45 / 100 (Emerging)

This tool helps you quickly extract text from videos, audio recordings, or images, or turn written text into spoken audio. You can input various media files and receive text transcripts, image descriptions, or audio files (in base64 format). It's designed for anyone who needs to convert media content for analysis, archiving, or accessibility purposes.
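Because synthesized audio comes back as base64 text, a caller typically has to decode it before it can be played or stored. The following is a minimal sketch of that step using only the Java standard library. The class name `Base64AudioDecoder`, the method names, and the handling of an optional `data:` URI prefix are illustrative assumptions; they are not part of kbase-media, and the exact shape of its response is not documented here.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Base64;

// Hypothetical helper, not part of kbase-media: turns a base64 audio string into a file.
final class Base64AudioDecoder {

    // Decodes base64 text into raw bytes.
    // Assumption: the payload may carry a "data:...;base64," prefix, which is stripped first.
    static byte[] decode(String base64Audio) {
        String payload = base64Audio.trim();
        int comma = payload.indexOf(',');
        if (payload.startsWith("data:") && comma >= 0) {
            payload = payload.substring(comma + 1);
        }
        // The MIME decoder ignores line breaks, which long base64 payloads often contain.
        return Base64.getMimeDecoder().decode(payload);
    }

    // Writes the decoded bytes to the target path and returns that path.
    static Path writeAudio(String base64Audio, Path target) throws IOException {
        return Files.write(target, decode(base64Audio));
    }

    public static void main(String[] args) throws IOException {
        // Placeholder payload (the bytes of "hello"), not real audio.
        String sample = "aGVsbG8=";
        Path out = writeAudio(sample, Files.createTempFile("speech", ".bin"));
        System.out.println("Wrote " + Files.size(out) + " bytes to " + out);
    }
}
```

In practice the file extension should match whatever audio format the service actually returns, which this sketch does not try to detect.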

Use this if you need to process large volumes of multimedia content to extract text, create audio versions of text, or perform content analysis.

Not ideal if you require highly specialized, domain-specific recognition for complex, low-quality, or uncommon media types, or if you need advanced video editing capabilities.

content-analysis transcription accessibility-services digital-archiving multimedia-processing
No Package, No Dependents
Maintenance 6 / 25
Adoption 6 / 25
Maturity 16 / 25
Community 17 / 25


Stars: 24
Forks: 11
Language: Java
License: MIT
Last pushed: Dec 03, 2025
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/ibotplus/kbase-media"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
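The same request can be made from Java. The sketch below uses the standard `java.net.http.HttpClient` (Java 11+) to send the keyless GET shown in the curl command. The class name `QualityApiClient` and the reading of the path as category/owner/repository are assumptions inferred from that single URL. The response format is not documented here, so the body is returned as raw text rather than parsed.

```java
import java.io.IOException;
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.time.Duration;

// Hypothetical client for the quality endpoint shown in the curl example.
final class QualityApiClient {

    static final String BASE = "https://pt-edge.onrender.com/api/v1/quality";

    // Builds the endpoint URI.
    // Assumption: the path segments are category, owner, and repository name.
    static URI qualityUri(String category, String owner, String repo) {
        return URI.create(BASE + "/" + category + "/" + owner + "/" + repo);
    }

    // Sends a keyless GET and returns the raw response body.
    static String fetch(URI uri) throws IOException, InterruptedException {
        HttpClient client = HttpClient.newBuilder()
                .connectTimeout(Duration.ofSeconds(10))
                .build();
        HttpRequest request = HttpRequest.newBuilder(uri)
                .timeout(Duration.ofSeconds(30))
                .GET()
                .build();
        HttpResponse<String> response =
                client.send(request, HttpResponse.BodyHandlers.ofString());
        if (response.statusCode() != 200) {
            // Any non-200 status (for example when the daily limit is exceeded) is treated as an error.
            throw new IOException("Unexpected HTTP status " + response.statusCode());
        }
        return response.body();
    }

    public static void main(String[] args) throws Exception {
        URI uri = qualityUri("voice-ai", "ibotplus", "kbase-media");
        System.out.println(fetch(uri));
    }
}
```

With the keyless limit of 100 requests/day, caching the response locally is a sensible choice for anything that polls this endpoint regularly.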