ReneeYe/XSTNet
This is an implementation of paper "End-to-end Speech Translation via Cross-modal Progressive Training" (Interspeech2021)
This project offers a robust system for accurately translating spoken English into several other languages, including German, Spanish, French, Italian, Dutch, Portuguese, Romanian, and Russian. It takes audio recordings in English and produces corresponding text translations in the target language. This is ideal for professionals in media, international communication, or anyone needing precise, automated speech-to-text translation.
No commits in the last 6 months.
Use this if you need to reliably translate English speech into one of the supported European languages.
Not ideal if you require real-time, instantaneous translation or translation for languages not listed.
Stars
19
Forks
3
Language
Python
License
—
Category
Last pushed
May 01, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/ReneeYe/XSTNet"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
speechmatics/speechmatics-python
Python library and CLI for Speechmatics
gooofy/py-nltools
A collection of basic python modules for spoken natural language processing
IBM/MAX-Speech-to-Text-Converter
Converts spoken words into text form.
ictnlp/StreamSpeech
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition,...
snakers4/open_stt
Open STT