Arkapravo-Ghosh/speech-to-text
Speech to Text Transcription using OpenAI Whisper v3 and FastAPI
This tool converts speech in audio recordings into written text: you provide an audio file, and it returns a text transcript. It suits anyone who needs written records from audio quickly, such as journalists, researchers, or content creators transcribing interviews, lectures, or podcasts.
Use this if you need a straightforward way to turn audio recordings into accurate text transcripts without manual typing.
Not ideal if you need an interactive, real-time transcription service for live conversations or meetings.
Stars: 8
Forks: 7
Language: Python
License: AGPL-3.0
Last pushed: Feb 21, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/Arkapravo-Ghosh/speech-to-text"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
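The same request can be made from Python. The following is a minimal sketch using only the standard library. The helper names `build_quality_url` and `fetch_quality` are hypothetical, and the code assumes the endpoint returns a JSON body, which the listing does not confirm; the response fields are not documented here, so the sketch simply returns whatever is decoded.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/transformers"


def build_quality_url(repo: str) -> str:
    """Build the endpoint URL for an 'owner/name' repository slug (hypothetical helper)."""
    owner, name = repo.split("/", 1)
    # Percent-encode each path segment separately so the slash between them is kept.
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(name, safe='')}"


def fetch_quality(repo: str, timeout: float = 10.0):
    """Fetch the endpoint and decode the body, ASSUMING it is JSON (hypothetical helper)."""
    with urllib.request.urlopen(build_quality_url(repo), timeout=timeout) as resp:
        return json.load(resp)


# Example call (performs a network request and counts against the daily limit):
# data = fetch_quality("Arkapravo-Ghosh/speech-to-text")
```

Keeping URL construction separate from the network call makes it possible to check the URL without spending any of the 100 keyless requests per day.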
Related models
biodatlab/thonburian-whisper
Thonburian Whisper: Open models for fine-tuned Whisper in Thai. Try our demo on Huggingface space:
scalable-ml-deep-learning/fine_tune_whisper
Fine-Tune Whisper for Italian ASR with transformers
Arnav-Sharmaa/Multilingual-Speech-to-Text-and-Speech-to-Speech-Content-Summarization-for-Indian-Languages
This project presents a multilingual pipeline for both speech-to-text and speech-to-speech...
EdVince/whisper-trtllm
Whisper in TensorRT-LLM
mahiiyh/asr-primer
A complete implementation of an Automatic Speech Recognition (ASR) system using OpenAI's Whisper...