ahmedbesbes/audiolizr

A bentoML-powered API to transcribe audio and make sense of it

/ 100

Experimental

Audiolizr helps content creators, educators, and researchers quickly understand audio and video content. It takes a YouTube video URL or audio file and provides a full transcript, a concise summary, key topics, and identified entities like names or locations. This is ideal for anyone who needs to quickly grasp the core insights of spoken content without watching or listening to the full piece.

No commits in the last 6 months.

Use this if you need to extract the core insights, keywords, and entities from spoken content, especially from YouTube videos, without spending time watching or transcribing them manually.

Not ideal if you require highly nuanced analysis of tone or visual cues, or if your primary need is real-time processing of live audio streams.

content-analysis video-summarization media-research knowledge-extraction podcast-analysis

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 7 / 25

Maturity 8 / 25

Community 6 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Higher-rated alternatives

Arkapravo-Ghosh/speech-to-text

Speech to Text Transcription using OpenAI Whisper v3 and FastAPI

biodatlab/thonburian-whisper

Thonburian Whisper: Open models for fine-tuned Whisper in Thai. Try our demo on Huggingface space:

scalable-ml-deep-learning/fine_tune_whisper

Fine-Tune Whisper for Italian ASR with transformers

Arnav-Sharmaa/Multilingual-Speech-to-Text-and-Speech-to-Speech-Content-Summarization-for-Indian-Languages

This project presents a multilingual pipeline for both speech-to-text and speech-to-speech...

EdVince/whisper-trtllm

Whisper in TensorRT-LLM

Explore Transformer Models

All categories Trending Transformer directory Insights