Embedding/Chinese-Word-Vectors
100+ Chinese Word Vectors 上百种预训练中文词向量
This project provides over 100 pre-trained Chinese word vectors, also known as embeddings, that help computers understand the meaning and relationships between Chinese words. You input a Chinese word, and it provides a numerical representation that captures its semantic meaning and context. This is useful for anyone working with Chinese text data, such as natural language processing researchers or data scientists building text analysis applications.
12,188 stars. No commits in the last 6 months.
Use this if you need to integrate a nuanced understanding of Chinese words into your text analysis, search, or recommendation systems, especially if you want to save time on training your own word embeddings.
Not ideal if your project only deals with English text or other non-Chinese languages, as these vectors are specifically for Chinese words.
Stars
12,188
Forks
2,325
Language
Python
License
Apache-2.0
Category
Last pushed
Oct 30, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/Embedding/Chinese-Word-Vectors"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
MinishLab/model2vec
Fast State-of-the-Art Static Embeddings
AnswerDotAI/ModernBERT
Bringing BERT into modernity via both architecture changes and scaling
tensorflow/hub
A library for transfer learning by reusing parts of TensorFlow models.
twang2218/vocab-coverage
语言模型中文认知能力分析
Santosh-Gupta/SpeedTorch
Library for faster pinned CPU <-> GPU transfer in Pytorch