antouanbg/Bulgarian_Linguistic

Collection and resources for Bulgarian Corpus, Datasets and Models used in ASR, TTS or NLP tasks together with the links of corresponding tools/apps.

/ 100

Experimental

This project offers a collection of Bulgarian datasets and pre-trained models for tasks like speech recognition, text-to-speech, and natural language processing. It takes raw Bulgarian audio or text and provides structured data and linguistic models. This is ideal for linguists, researchers, or developers working on Bulgarian language technology applications.

No commits in the last 6 months.

Use this if you need readily available Bulgarian linguistic resources to build applications that understand or generate Bulgarian speech and text.

Not ideal if you are looking for a complete, out-of-the-box application rather than foundational data and models for Bulgarian language tasks.

Bulgarian-linguistics speech-recognition text-to-speech natural-language-processing language-model-development

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 7 / 25

Maturity 8 / 25

Community 7 / 25

How are scores calculated?

Stars

Forks

Language

Java

License

—

Higher-rated alternatives

ynop/audiomate

Python library for handling audio datasets.

reazon-research/ReazonSpeech

Massive open Japanese speech corpus

common-voice/cv-dataset

Metadata and versioning details for the Common Voice dataset

davidmartinrius/speech-dataset-generator

🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset...

EgorLakomkin/KTSpeechCrawler

Automatically constructing corpus for automatic speech recognition from YouTube videos

Explore Voice AI Tools

All categories Trending Voice AI directory Insights