IBM/MAX-Speech-to-Text-Converter
Converts spoken words into text form.
This tool helps convert spoken English words into written text. You provide a short audio recording in a WAV file, and it outputs a string of text representing what was said. This is useful for anyone who needs to quickly transcribe audio, such as journalists, researchers, or content creators.
No commits in the last 6 months.
Use this if you need to transcribe short, single-channel English audio clips from WAV files into text.
Not ideal if you need to process long audio files, multiple speakers, or languages other than English.
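The input constraints above (short, single-channel WAV) can be checked before a clip is submitted for transcription. The following is a minimal sketch using only Python's standard `wave` module; the function name `check_wav` and the 30-second cutoff are illustrative assumptions, since the listing does not say how long "short" is.

```python
import wave

# Hypothetical cutoff: the listing only says "short", so 30 s is an assumption.
MAX_SECONDS = 30.0


def check_wav(path, max_seconds=MAX_SECONDS):
    """Return (ok, reason) for a WAV file at `path`.

    ok is True only when the file is single-channel and no longer than
    max_seconds. The language of the speech is not checked here.
    """
    with wave.open(path, "rb") as wav:
        channels = wav.getnchannels()
        rate = wav.getframerate()
        frames = wav.getnframes()

    # Duration in seconds is the frame count divided by the sample rate.
    duration = frames / rate if rate else 0.0

    if channels != 1:
        return False, f"expected 1 channel, found {channels}"
    if duration > max_seconds:
        return False, f"clip is {duration:.1f}s, longer than {max_seconds:.1f}s"
    return True, f"mono, {duration:.1f}s"
```

A caller might run `ok, reason = check_wav("clip.wav")` and skip or split any file for which `ok` is false.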
Stars: 76
Forks: 32
Language: Python
License: Apache-2.0
Category:
Last pushed: Sep 17, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/IBM/MAX-Speech-to-Text-Converter"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
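The same request can be made from Python with the standard library. This is a sketch under two assumptions: the path layout `category/owner/repo` is inferred from the single URL shown above, and the response is assumed to be JSON, with no particular fields relied on. The helper names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example; the segment layout is inferred from it.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category, owner, repo):
    """Build the endpoint URL, percent-encoding each path segment."""
    parts = (quote(p, safe="") for p in (category, owner, repo))
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category, owner, repo, timeout=10):
    """Fetch the record and decode it, assuming the body is JSON."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.load(resp)


# Example call (performs a network request and counts against the daily limit):
# data = fetch_quality("voice-ai", "IBM", "MAX-Speech-to-Text-Converter")
# print(json.dumps(data, indent=2))
```

Keeping URL construction separate from the network call makes the former easy to check without spending any of the daily request quota.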
Higher-rated alternatives
speechmatics/speechmatics-python
Python library and CLI for Speechmatics
gooofy/py-nltools
A collection of basic python modules for spoken natural language processing
ictnlp/StreamSpeech
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition,...
snakers4/open_stt
Open STT
verbio-technologies/python-verbio-speech-center
Python integration with the Verbio Speech Center Cloud. https://speechcenter.verbio.com/