xinjli/allosaurus

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

/ 100

Established

This tool helps linguists, phoneticians, and speech researchers analyze spoken language by converting audio recordings into sequences of phonetic symbols. You provide a standard WAV audio file, and it outputs the International Phonetic Alphabet (IPA) representation of the speech sounds. It's designed for anyone needing to transcribe speech phonetically across a wide range of languages.

715 stars. No commits in the last 6 months. Available on PyPI.

Use this if you need to automatically transcribe spoken audio into phonetic symbols (IPA) for more than 2000 languages, particularly for linguistic analysis or speech research.

Not ideal if you need a word-for-word transcript in a standard writing system, rather than a phonetic breakdown of the sounds.

phonetics linguistics speech-analysis language-research audio-transcription

Stale 6m

Maintenance 0 / 25

Adoption 10 / 25

Maturity 25 / 25

Community 21 / 25

How are scores calculated?

Stars

715

Forks

100

Language

Python

License

GPL-3.0

Related tools

TensorSpeech/TensorFlowASR

:zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2....

dangvansam/viet-asr

VietASR - Vietnamese Automatic Speech Recognition

wenet-e2e/wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

srvk/eesen

The official repository of the Eesen project

hirofumi0810/neural_sp

End-to-end ASR/LM implementation with PyTorch

Explore Voice AI Tools

All categories Trending Voice AI directory Insights