lgessler/microbert

A tiny BERT for low-resource monolingual models

/ 100

Emerging

This tool provides compact language models for understanding and processing text in languages with limited digital resources, like Ancient Greek or Coptic. It takes raw text data in these languages and helps researchers, linguists, or cultural heritage professionals build smaller, more efficient models for tasks such as identifying parts of speech or grammatical relationships, even with scarce training data. The output is a specialized language model tailored for that specific low-resource language.

Use this if you are a linguist or researcher working with languages that have very few digital texts available and need to build effective language processing models without extensive computational resources.

Not ideal if you are working with well-resourced languages like English or Spanish, as standard, larger BERT models are likely more suitable.

linguistics-research low-resource-languages ancient-language-processing natural-language-processing digital-humanities

No License No Package No Dependents

Maintenance 6 / 25

Adoption 7 / 25

Maturity 8 / 25

Community 16 / 25

How are scores calculated?

Stars

Forks

Language

HTML

License

—

Higher-rated alternatives

codertimo/BERT-pytorch

Google AI 2018 BERT pytorch implementation

JayYip/m3tl

BERT for Multitask Learning

920232796/bert_seq2seq

pytorch实现 Bert 做seq2seq任务，使用unilm方案,现在也可以做自动摘要，文本分类，情感分析，NER，词性标注等任务,支持t5模型，支持GPT2进行文章续写。

sileod/tasknet

Easy modernBERT fine-tuning and multi-task learning

graykode/toeicbert

TOEIC(Test of English for International Communication) solving using pytorch-pretrained-BERT model.

Explore NLP Tools

All categories Trending NLP directory Insights