Martouta/speech_processor

Speech-to-text for video and audio (including YouTube and TikTok links)

/ 100

Emerging

This tool helps developers integrate speech-to-text functionality into their applications. It takes video or audio links (from YouTube, TikTok, or hosted URLs) or local files, processes them using various AI speech recognition services, and outputs the transcribed text. The primary users are software developers building systems that require automated transcription.
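As a rough illustration of the input handling described above, the sketch below sorts an input string into one of the four source types the description mentions. It is not taken from the project: `SourceKind`, `classify_source`, and the host lists are hypothetical names and values chosen for this example, and the project's actual detection logic may differ.

```python
from enum import Enum
from urllib.parse import urlparse


class SourceKind(Enum):
    """The four input types named in the description (hypothetical enum)."""
    YOUTUBE = "youtube"
    TIKTOK = "tiktok"
    HOSTED_URL = "hosted_url"
    LOCAL_FILE = "local_file"


# Assumed host lists, for illustration only.
YOUTUBE_HOSTS = ("youtube.com", "youtu.be")
TIKTOK_HOSTS = ("tiktok.com",)


def _host_matches(host: str, domains: tuple) -> bool:
    """True if host equals one of the domains or is a subdomain of one."""
    return any(host == d or host.endswith("." + d) for d in domains)


def classify_source(source: str) -> SourceKind:
    """Classify an input string as a YouTube link, TikTok link,
    other hosted URL, or local file path."""
    parsed = urlparse(source)
    # Only http(s) strings with a hostname are treated as links;
    # everything else (including Windows paths such as C:\a.mp4)
    # falls through to LOCAL_FILE.
    if parsed.scheme in ("http", "https") and parsed.hostname:
        host = parsed.hostname.lower()
        if _host_matches(host, YOUTUBE_HOSTS):
            return SourceKind.YOUTUBE
        if _host_matches(host, TIKTOK_HOSTS):
            return SourceKind.TIKTOK
        return SourceKind.HOSTED_URL
    return SourceKind.LOCAL_FILE
```

A caller could branch on the returned value to choose a download step before handing the audio to a speech recognition service; that later step depends on the engine selected and is not sketched here.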

Use this if you are a developer looking for a component to extract text transcripts from diverse audio and video sources for your application, with options for different speech recognition engines.

Not ideal if you are an end user who needs a simple desktop application or web service for transcription, as this project requires development setup and integration.

developer-tooling data-pipeline backend-development media-processing

No Package, No Dependents

Maintenance 10 / 25

Adoption 6 / 25

Maturity 16 / 25

Community 5 / 25

Language: Python

License: GPL-3.0

Higher-rated alternatives

speechmatics/speechmatics-python

Python library and CLI for Speechmatics

gooofy/py-nltools

A collection of basic python modules for spoken natural language processing

IBM/MAX-Speech-to-Text-Converter

Converts spoken words into text form.

ictnlp/StreamSpeech

StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition,...

snakers4/open_stt

Open STT
