lucadellalib/ts-asr
Target speaker automatic speech recognition (TS-ASR)
This project offers tools to build advanced speech recognition systems that can isolate and transcribe the speech of a specific person, even in noisy conversations. It takes mixed audio recordings and information about the target speaker, producing a clear transcript of only that speaker's words. It is designed for speech AI researchers and engineers developing next-generation voice technologies.
No commits in the last 6 months.
Use this if you are a researcher or engineer working on automatic speech recognition and need to train models capable of transcribing a specific speaker's voice in multi-speaker environments.
Not ideal if you are looking for a ready-to-use application to transcribe audio files without delving into model training and development.
Stars
12
Forks
6
Language
Python
License
—
Category
Last pushed
Oct 14, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/lucadellalib/ts-asr"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
TensorSpeech/TensorFlowASR
:zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2....
dangvansam/viet-asr
VietASR - Vietnamese Automatic Speech Recognition
wenet-e2e/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
xinjli/allosaurus
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
srvk/eesen
The official repository of the Eesen project