xinjli/allosaurus

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

56
/ 100
Established

This tool helps linguists, phoneticians, and speech researchers analyze spoken language by converting audio recordings into sequences of phonetic symbols. You provide a standard WAV audio file, and it outputs the International Phonetic Alphabet (IPA) representation of the speech sounds. It's designed for anyone needing to transcribe speech phonetically across a wide range of languages.

715 stars. No commits in the last 6 months. Available on PyPI.

Use this if you need to automatically transcribe spoken audio into phonetic symbols (IPA) for more than 2000 languages, particularly for linguistic analysis or speech research.

Not ideal if you need a word-for-word transcript in a standard writing system, rather than a phonetic breakdown of the sounds.

phonetics linguistics speech-analysis language-research audio-transcription
Stale 6m
Maintenance 0 / 25
Adoption 10 / 25
Maturity 25 / 25
Community 21 / 25

How are scores calculated?

Stars

715

Forks

100

Language

Python

License

GPL-3.0

Last pushed

Apr 26, 2024

Commits (30d)

0

Dependencies

6

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/xinjli/allosaurus"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.