xuchennlp/S2T
The project for speech translation
This toolkit helps researchers and developers working with speech-based data to convert spoken language into text, translate speech into other languages, or perform direct speech-to-text translation. It takes audio inputs and dataset configurations, then outputs transcribed or translated text. This is ideal for machine learning engineers, AI researchers, or NLP specialists focused on advanced speech processing applications.
No commits in the last 6 months.
Use this if you are developing or experimenting with cutting-edge models for automatic speech recognition, machine translation, or speech translation and need a comprehensive framework to streamline your workflow.
Not ideal if you are looking for a simple, off-the-shelf application to transcribe audio or translate speech without deep technical involvement in model training and development.
Stars
12
Forks
3
Language
Python
License
MIT
Category
Last pushed
Sep 28, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/xuchennlp/S2T"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
speechmatics/speechmatics-python
Python library and CLI for Speechmatics
gooofy/py-nltools
A collection of basic python modules for spoken natural language processing
IBM/MAX-Speech-to-Text-Converter
Converts spoken words into text form.
ictnlp/StreamSpeech
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition,...
snakers4/open_stt
Open STT