AndyTheFactory/RO-Diacritics
Python package for Romanian diacritics restoration
This helps you automatically add the correct diacritics (like ș, ț, ă, î, â) back into Romanian text that's missing them. You feed in plain Romanian text, and it returns the same text with all the proper diacritical marks restored. It's ideal for anyone working with Romanian language data, such as content creators, linguists, or data entry specialists.
No commits in the last 6 months. Available on PyPI.
Use this if you have Romanian text that was typed or processed without diacritics and you need to correct it for readability, searchability, or further linguistic analysis.
Not ideal if you're working with languages other than Romanian or if your primary need is for general text cleanup beyond diacritic restoration.
Stars
9
Forks
—
Language
Python
License
MIT
Category
Last pushed
Jan 03, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/AndyTheFactory/RO-Diacritics"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
KoichiYasuoka/esupar
Tokenizer POS-Tagger and Dependency-parser with BERT/RoBERTa/DeBERTa/GPT models for Japanese and...
hellohaptik/multi-task-NLP
multi_task_NLP is a utility toolkit enabling NLP developers to easily train and infer a single...
old-wang-95/easy-bert
easy-bert是一个中文NLP工具,提供诸多bert变体调用和调参方法,极速上手;清晰的设计和代码注释,也很适合学习
taishi-i/nagisa_bert
A BERT model for nagisa
ant-louis/netbert
📶 NetBERT: a domain-specific BERT model for computer networking.