lgessler/microbert
A tiny BERT for low-resource monolingual models
This tool provides compact language models for understanding and processing text in languages with limited digital resources, such as Ancient Greek or Coptic. It takes raw text in one of these languages and helps researchers, linguists, and cultural heritage professionals build a smaller, more efficient model for tasks such as identifying parts of speech or grammatical relationships, even when training data is scarce. The output is a language model specialized for that low-resource language.
Use this if you are a linguist or researcher working with languages that have very few digital texts available and need to build effective language processing models without extensive computational resources.
Not ideal if you are working with well-resourced languages like English or Spanish, as standard, larger BERT models are likely more suitable.
Stars: 31
Forks: 6
Language: HTML
License: none listed
Category
Last pushed: Dec 24, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/lgessler/microbert"
Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000/day.
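The same request can be made from a script. The following is a minimal Python sketch, not an official client: the helper names `build_quality_url` and `fetch_quality` are hypothetical, the path layout (category, owner, repository) is inferred from the curl command above, and the response is assumed to be JSON, which this page does not confirm.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

# Base URL copied from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository (hypothetical helper)."""
    # Percent-encode each path segment so unusual characters stay valid.
    parts = [quote(p, safe="") for p in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the listing data; assumes the body is JSON (not confirmed here)."""
    url = build_quality_url(category, owner, repo)
    with urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_quality("nlp", "lgessler", "microbert")` sends a real request and counts against the daily limit, so it is worth caching the result when polling several repositories.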
Higher-rated alternatives
codertimo/BERT-pytorch
Google AI 2018 BERT pytorch implementation
JayYip/m3tl
BERT for Multitask Learning
920232796/bert_seq2seq
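PyTorch implementation of BERT for seq2seq tasks using the UniLM scheme. It now also handles automatic summarization, text classification, sentiment analysis, NER, part-of-speech tagging, and similar tasks, and supports T5 models as well as GPT-2 for article continuation.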
sileod/tasknet
Easy modernBERT fine-tuning and multi-task learning
graykode/toeicbert
TOEIC (Test of English for International Communication) problem solving using the pytorch-pretrained-BERT model.