modelscope/FunCodec

FunCodec is a research-oriented toolkit for audio quantization and downstream applications such as text-to-speech synthesis and music generation.

41 / 100 (Emerging)

This toolkit takes raw audio files (such as WAVs) and converts them into compact "audio codes", which can then be used to reconstruct the original audio or to generate new speech. It is aimed at researchers and developers building audio applications such as text-to-speech systems or music generation.

442 stars. No commits in the last 6 months.

Use this if you need to compress, reproduce, or synthesize speech and audio, especially for developing AI models like text-to-speech.

Not ideal if you are an end-user simply looking to convert text to speech or generate music without developing underlying models.

Speech Synthesis, Audio Processing, Voice AI, Deep Learning Research, Neural Audio
Stale 6m, No Package, No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 15 / 25


Stars: 442
Forks: 33
Language: Python
License: MIT
Last pushed: Jan 25, 2024
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/modelscope/FunCodec"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
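
The same request can be made from Python. The following is a minimal sketch using only the standard library. It assumes the endpoint returns a JSON body (the page does not document the response format) and that the URL path follows a category/owner/repository pattern, which is inferred from the single curl example above. The helper names quality_url and fetch_quality are hypothetical.

```python
import json
import urllib.request

# Base path copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL in the same shape as the curl example.

    Assumption: the path segments are category/owner/repo.
    """
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Send a keyless GET request and decode the body.

    Assumption: the response body is JSON; its fields are not documented here.
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # Keyless requests are limited to 100 per day, so avoid calling this in a loop.
    data = fetch_quality("voice-ai", "modelscope", "FunCodec")
    print(json.dumps(data, indent=2))
```

Because the response fields are not specified on this page, the sketch prints the decoded body as-is rather than reading particular keys.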