yukunqi/Chinese-Word2Vec-Model
搜狗新闻语料训练的word2vec中文模型
This provides a pre-trained Chinese Word2Vec model, built from a vast collection of Sogou news articles. It takes individual Chinese words or phrases as input and can output related words or numerical representations of words. This is useful for anyone working with Chinese text data, such as data scientists, NLP researchers, or product managers building search or recommendation features.
No commits in the last 6 months.
Use this if you need to understand semantic relationships between Chinese words, find synonyms, or convert Chinese text into numerical data for machine learning tasks.
Not ideal if you need a model trained on a highly specialized or domain-specific Chinese corpus, or if you prefer to train your own Word2Vec model from scratch.
Stars
69
Forks
10
Language
Python
License
—
Category
Last pushed
Apr 12, 2018
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/yukunqi/Chinese-Word2Vec-Model"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
Planeshifter/node-word2vec
Node.js interface to the Google word2vec tool.
nathanrooy/word2vec-from-scratch-with-python
A very simple, bare-bones, inefficient, implementation of skip-gram word2vec from scratch with Python
thunlp/paragraph2vec
Paragraph Vector Implementation
akoksal/Turkish-Word2Vec
Pre-trained Word2Vec Model for Turkish
RichDavis1/PHPW2V
A PHP implementation of Word2Vec, a popular word embedding algorithm created by Tomas Mikolov...