ahmedbesbes/audiolizr
A bentoML-powered API to transcribe audio and make sense of it
Audiolizr helps content creators, educators, and researchers quickly understand audio and video content. It takes a YouTube video URL or audio file and provides a full transcript, a concise summary, key topics, and identified entities like names or locations. This is ideal for anyone who needs to quickly grasp the core insights of spoken content without watching or listening to the full piece.
No commits in the last 6 months.
Use this if you need to extract the core insights, keywords, and entities from spoken content, especially from YouTube videos, without spending time watching or transcribing them manually.
Not ideal if you require highly nuanced analysis of tone or visual cues, or if your primary need is real-time processing of live audio streams.
Stars
39
Forks
2
Language
Python
License
—
Category
Last pushed
Dec 21, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/ahmedbesbes/audiolizr"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
Arkapravo-Ghosh/speech-to-text
Speech to Text Transcription using OpenAI Whisper v3 and FastAPI
biodatlab/thonburian-whisper
Thonburian Whisper: Open models for fine-tuned Whisper in Thai. Try our demo on Huggingface space:
scalable-ml-deep-learning/fine_tune_whisper
Fine-Tune Whisper for Italian ASR with transformers
Arnav-Sharmaa/Multilingual-Speech-to-Text-and-Speech-to-Speech-Content-Summarization-for-Indian-Languages
This project presents a multilingual pipeline for both speech-to-text and speech-to-speech...
EdVince/whisper-trtllm
Whisper in TensorRT-LLM