xinjli/allosaurus
Allosaurus is a pretrained universal phone recognizer for more than 2,000 languages.
This tool helps linguists, phoneticians, and speech researchers analyze spoken language by converting audio recordings into sequences of phonetic symbols. You provide a standard WAV audio file, and it outputs the International Phonetic Alphabet (IPA) representation of the speech sounds. It's designed for anyone needing to transcribe speech phonetically across a wide range of languages.
715 stars. No commits in the last 6 months. Available on PyPI.
Use this if you need to automatically transcribe spoken audio into phonetic symbols (IPA) for more than 2000 languages, particularly for linguistic analysis or speech research.
Not ideal if you need a word-for-word transcript in a standard writing system, rather than a phonetic breakdown of the sounds.
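The WAV-in, IPA-out workflow described above might look roughly like the sketch below. It relies on the `read_recognizer` and `recognize` calls from the project's README. The `phone_counts` helper and the file name `sample.wav` are illustrative additions, and the helper assumes the recognizer returns phones as a single space-separated string.

```python
from collections import Counter


def transcribe(wav_path: str) -> str:
    """Return the recognized phones for a WAV file as an IPA string."""
    # Imported lazily so phone_counts() can be used without allosaurus installed.
    from allosaurus.app import read_recognizer

    model = read_recognizer()  # loads the default pretrained model
    return model.recognize(wav_path)


def phone_counts(ipa: str) -> Counter:
    """Count phones in a space-separated IPA string (hypothetical helper)."""
    return Counter(ipa.split())


# Example usage (requires `pip install allosaurus` and a local WAV file):
#   phones = transcribe("sample.wav")
#   print(phones)
#   print(phone_counts(phones).most_common(5))
```

Counting phones is only one possible downstream step. The same string could equally be fed into alignment or inventory comparison, which is where a phonetic breakdown is more useful than a word-level transcript.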
Stars: 715
Forks: 100
Language: Python
License: GPL-3.0
Category:
Last pushed: Apr 26, 2024
Commits (30d): 0
Dependencies: 6
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/xinjli/allosaurus"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
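The same request can be made from Python using only the standard library. This is a minimal sketch: the function names are hypothetical, and it assumes the endpoint returns a JSON body, which the listing does not state. How a key is passed for the higher rate limit is not shown here, so the sketch covers only the keyless case.

```python
import json
import urllib.request

BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, repo: str) -> str:
    """Build the endpoint URL for a category slug and an owner/name repo."""
    return f"{BASE_URL}/{category}/{repo}"


def fetch_quality(category: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record; assumes the response body is JSON."""
    url = quality_url(category, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.load(resp)


# Example usage (makes a network call and counts against the daily limit):
#   data = fetch_quality("voice-ai", "xinjli/allosaurus")
#   print(data)
```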
Related tools
TensorSpeech/TensorFlowASR
TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2...
dangvansam/viet-asr
VietASR - Vietnamese Automatic Speech Recognition
wenet-e2e/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
srvk/eesen
The official repository of the Eesen project
hirofumi0810/neural_sp
End-to-end ASR/LM implementation with PyTorch