hhguo/SoCodec

Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications

37
/ 100
Emerging

SoCodec helps you compress speech audio into extremely small digital codes, allowing for efficient use in advanced text-to-speech systems. It takes in spoken audio (currently Chinese) and converts it into a highly compressed representation, which can then be used to synthesize speech. This is ideal for speech synthesis engineers or researchers working with language models for text-to-speech.

No commits in the last 6 months.

Use this if you need to dramatically reduce the file size of speech audio while maintaining quality for text-to-speech applications based on language models.

Not ideal if you need to process languages other than Chinese, as multi-lingual support is still under development.

speech-synthesis audio-compression language-modeling text-to-speech voice-technology
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 9 / 25
Maturity 16 / 25
Community 12 / 25

How are scores calculated?

Stars

90

Forks

9

Language

Python

License

MIT

Last pushed

Dec 20, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/hhguo/SoCodec"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.