JasonShao55/Chinese_Metaphor_Explanation
An annotated Chinese metaphor dataset
This project provides a comprehensive, expert-annotated dataset of over 27,000 Chinese metaphors. Each metaphor is broken down into its 'tenor' (the subject being described) and 'vehicle' (the image it is compared to), with an explanation of the 'ground' (the characteristic the two share). It is designed for AI researchers and natural language processing specialists developing large language models that can understand and generate sophisticated Chinese text.
No commits in the last 6 months.
Use this if you are developing or fine-tuning large language models and need high-quality, structured data to improve their ability to process and generate Chinese metaphors, especially for tasks requiring explicit metaphor explanations (grounds).
Not ideal if your primary goal is to analyze metaphorical language in English or if you need a dataset for simple Chinese text classification without a focus on complex semantic relationships like metaphor.
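To make the tenor / vehicle / ground breakdown concrete, here is a minimal Python sketch of how one annotated metaphor could be held in memory and turned into a prompt/target pair for fine-tuning. The field names, the `MetaphorRecord` class, the `to_training_pair` helper, and the example sentence are all illustrative assumptions; the dataset's actual file format and column names are not described on this page.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MetaphorRecord:
    """Hypothetical in-memory shape for one annotated metaphor.

    These field names are assumptions for illustration; they are not
    taken from the dataset's real schema.
    """
    sentence: str  # the full metaphorical sentence
    tenor: str     # the subject being described
    vehicle: str   # the image the tenor is compared to
    ground: str    # the characteristic shared by tenor and vehicle


def to_training_pair(record: MetaphorRecord) -> tuple[str, str]:
    """Build a (prompt, target) pair for an explanation-style task."""
    prompt = f"Explain the metaphor: {record.sentence}"
    target = (
        f"Tenor: {record.tenor}; "
        f"Vehicle: {record.vehicle}; "
        f"Ground: {record.ground}"
    )
    return prompt, target


# Made-up example ("time is money"), not a row from the dataset.
example = MetaphorRecord(
    sentence="时间就是金钱",
    tenor="时间",
    vehicle="金钱",
    ground="宝贵而有限",
)

prompt, target = to_training_pair(example)
print(prompt)
print(target)
```

Keeping the ground as its own field means the same records could serve both an identification task (predict tenor and vehicle) and an explanation task (predict the ground), depending on which fields go into the target.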
Stars: 22
Forks: (not shown)
Language: Python
License: MIT
Category: (not shown)
Last pushed: Feb 23, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/JasonShao55/Chinese_Metaphor_Explanation"
Open to everyone: 100 requests/day with no key needed. Get a free key for 1,000 requests/day.
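The same request can be made from Python using only the standard library. This is a sketch under assumptions: the reading of the path as category / owner / repository is inferred from the single URL shown above, the response is assumed to be JSON, and the helper names `build_url` and `fetch_quality` are hypothetical. The page does not say how an API key is supplied, so the sketch covers only the keyless request.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_url(category: str, owner: str, repo: str) -> str:
    """Assemble the endpoint URL; segment meanings are inferred, not documented."""
    parts = (quote(part, safe="") for part in (category, owner, repo))
    return f"{BASE_URL}/{'/'.join(parts)}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository.

    Assumes the body is JSON; json.loads raises ValueError if it is not.
    """
    url = build_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Usage (performs a network request, so it is left commented out):
# data = fetch_quality("nlp", "JasonShao55", "Chinese_Metaphor_Explanation")
# print(data)
```

With the keyless limit of 100 requests/day, it is worth caching responses locally rather than refetching the same repository repeatedly.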
Higher-rated alternatives
NateScarlet/holiday-cn
📅🇨🇳 Chinese statutory public holiday data, scraped automatically every day from State Council announcements
sagorbrur/bnlp
BNLP is a natural language processing toolkit for Bengali Language.
brightmart/nlp_chinese_corpus
Large Scale Chinese Corpus for NLP
houbb/sensitive-word
👮‍♂️ The sensitive word tool for Java (sensitive words / prohibited words / illegal words / profanity). Based on the DFA algorithm, a high-performance Java...
esbatmop/MNBVC
MNBVC(Massive Never-ending BT Vast Chinese...