Arnav-Sharmaa/Multilingual-Speech-to-Text-and-Speech-to-Speech-Content-Summarization-for-Indian-Languages

This project presents a multilingual pipeline for both speech-to-text and speech-to-speech summarization in Indian languages. It transcribes audio using a fine-tuned Whisper ASR model, summarizes text with mT5, and optionally synthesizes the summary back into speech using Indic Parler-TTS.
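The three stages described above can be sketched as a chain of pluggable callables. This is only an illustrative outline, not the repository's actual code: `run_pipeline`, `PipelineResult`, and the stage signatures are hypothetical names, and in practice each callable would wrap the fine-tuned Whisper, mT5, and Indic Parler-TTS models respectively.

```python
from dataclasses import dataclass
from typing import Callable, Optional

# Hypothetical stage signatures (assumptions for illustration only).
Transcriber = Callable[[bytes], str]   # audio -> transcript (e.g. a Whisper wrapper)
Summarizer = Callable[[str], str]      # transcript -> summary (e.g. an mT5 wrapper)
Synthesizer = Callable[[str], bytes]   # summary -> audio (e.g. an Indic Parler-TTS wrapper)


@dataclass
class PipelineResult:
    transcript: str
    summary: str
    summary_audio: Optional[bytes] = None  # stays None in speech-to-text mode


def run_pipeline(
    audio: bytes,
    transcribe: Transcriber,
    summarize: Summarizer,
    synthesize: Optional[Synthesizer] = None,
) -> PipelineResult:
    """Run ASR, then summarization, then (optionally) speech synthesis."""
    transcript = transcribe(audio)
    summary = summarize(transcript)
    # The TTS stage is optional: skip it when no synthesizer is supplied.
    summary_audio = synthesize(summary) if synthesize is not None else None
    return PipelineResult(transcript, summary, summary_audio)
```

Passing `synthesize=None` gives the speech-to-text summarization mode; supplying a synthesizer gives the speech-to-speech mode.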

/ 100

Experimental

No commits in the last 6 months.

No License · Stale 6m · No Package · No Dependents

Maintenance 2 / 25

Adoption 3 / 25

Maturity 7 / 25

Community 12 / 25

Stars

Forks

Language: Jupyter Notebook

License: none

Category: whisper-speech-transcription

Last pushed: Aug 31, 2025

Commits (30d)

GitHub

Whisper Speech Transcription · 17 models

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/Arnav-Sharmaa/Multilingual-Speech-to-Text-and-Speech-to-Speech-Content-Summarization-for-Indian-Languages"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
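The same request can be made from Python with the standard library. The sketch below rests on assumptions: the helper names `quality_url` and `fetch_quality` are hypothetical, and the response is assumed to be a JSON body, which the page does not state.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/transformers"


def quality_url(owner: str, repo: str) -> str:
    """Build the quality-endpoint URL for an owner/repo pair."""
    # Percent-encode each path segment so unusual characters stay valid.
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record. Assumes the endpoint returns JSON."""
    with urlopen(quality_url(owner, repo), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Only `fetch_quality` touches the network, so each call counts against the daily request limit mentioned above.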

Higher-rated alternatives

Arkapravo-Ghosh/speech-to-text

Speech to Text Transcription using OpenAI Whisper v3 and FastAPI

biodatlab/thonburian-whisper

Thonburian Whisper: Open models for fine-tuned Whisper in Thai. Try our demo on Huggingface space:

scalable-ml-deep-learning/fine_tune_whisper

Fine-Tune Whisper for Italian ASR with transformers

EdVince/whisper-trtllm

Whisper in TensorRT-LLM

mahiiyh/asr-primer

A complete implementation of an Automatic Speech Recognition (ASR) system using OpenAI's Whisper...
