IgarashiAkatuki/CNMBERT
基于BERT的拼音缩写/汉字谐音翻译模型,在命名实体识别,情感分析等多种任务上有应用潜力。中文拼写纠错 Chinese Spelling Correction (CSC),互联网黑话,拼音缩写,汉字谐音
This tool helps convert internet slang, Pinyin abbreviations like 'bhys' to '不好意思', and homophones like '紫砂' to '自杀' in Chinese text. It takes a Chinese sentence with such informal language as input and outputs the corrected, standard Chinese characters. It's ideal for anyone who needs to clean up user-generated content, understand online conversations, or process informal Chinese text.
133 stars.
Use this if you need to translate informal Chinese internet slang, Pinyin abbreviations, or homophone misspellings into standard Chinese characters.
Not ideal if you need a tool that automatically detects all such instances in a sentence without specifying the words to be translated, as it currently requires you to identify the terms yourself.
Stars
133
Forks
4
Language
Python
License
AGPL-3.0
Category
Last pushed
Jan 10, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/IgarashiAkatuki/CNMBERT"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
hellohaptik/chatbot_ner
chatbot_ner: Named Entity Recognition for chatbots.
openeventdata/mordecai
Full text geoparsing as a Python library
Rostlab/nalaf
NLP framework in python for entity recognition and relationship extraction
mpuig/spacy-lookup
Named Entity Recognition based on dictionaries
NorskRegnesentral/skweak
skweak: A software toolkit for weak supervision applied to NLP tasks