noobiegz/cw2vec

Implementation of the cw2vec model

/ 100

Emerging

This helps Chinese language practitioners create better semantic search, recommendation, or text analysis systems. It takes a large collection of Chinese text and produces numerical representations for each word, enhancing how well computers understand and group related Chinese terms based on both meaning and character structure. This is for data scientists or NLP engineers working with Chinese textual data.

No commits in the last 6 months.

Use this if you need to generate high-quality, context-aware word embeddings specifically for Chinese text, especially when traditional methods fall short due to the unique characteristics of Chinese characters.

Not ideal if you primarily work with English or other non-Chinese languages, or if you need the absolute fastest training time and are not concerned with leveraging character-level stroke information for Chinese.

Chinese NLP text embeddings semantic search natural language processing computational linguistics

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 7 / 25

Maturity 8 / 25

Community 18 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Higher-rated alternatives

shibing624/text2vec

text2vec, text to vector....

predict-idlab/pyRDF2Vec

🐍 Python Implementation and Extension of RDF2Vec

IntuitionEngineeringTeam/chars2vec

Character-based word embeddings model based on RNN for handling real world texts

IITH-Compilers/IR2Vec

Implementation of IR2Vec, LLVM IR Based Scalable Program Embeddings

ddangelov/Top2Vec

Top2Vec learns jointly embedded topic, document and word vectors.

Explore Embedding Tools

All categories Trending Embeddings directory Insights