Martouta/speech_processor
Speech-to-text from videos and audios (including youtube and tiktok links)
This tool helps developers integrate speech-to-text functionality into their applications. It takes video or audio links (from YouTube, TikTok, or hosted URLs) or local files, processes them using various AI speech recognition services, and outputs the transcribed text. The primary users are software developers building systems that require automated transcription.
Use this if you are a developer looking for a component to extract text transcripts from diverse audio and video sources for your application, with options for different speech recognition engines.
Not ideal if you are an end-user needing a simple desktop application or web service for transcription, as this project requires development setup and integration.
Stars
20
Forks
1
Language
Python
License
GPL-3.0
Category
Last pushed
Feb 07, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/Martouta/speech_processor"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
speechmatics/speechmatics-python
Python library and CLI for Speechmatics
gooofy/py-nltools
A collection of basic python modules for spoken natural language processing
IBM/MAX-Speech-to-Text-Converter
Converts spoken words into text form.
ictnlp/StreamSpeech
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition,...
snakers4/open_stt
Open STT