shibing624/text2vec
text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。
This project helps you convert text, like words, sentences, or paragraphs, into numerical vectors. These vectors can then be used to calculate how similar different pieces of text are to each other. It's designed for data scientists, natural language processing engineers, or researchers who need to quantify semantic relationships between text for tasks like information retrieval or semantic search.
4,950 stars. Used by 1 other package. Available on PyPI.
Use this if you need to transform Chinese or English text into numerical representations and calculate their semantic similarity quickly and efficiently.
Not ideal if you are looking for a simple keyword search solution rather than a semantic understanding of text.
Stars
4,950
Forks
428
Language
Python
License
Apache-2.0
Category
Last pushed
Feb 14, 2026
Commits (30d)
0
Dependencies
7
Reverse dependents
1
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/shibing624/text2vec"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
predict-idlab/pyRDF2Vec
🐍 Python Implementation and Extension of RDF2Vec
IntuitionEngineeringTeam/chars2vec
Character-based word embeddings model based on RNN for handling real world texts
IITH-Compilers/IR2Vec
Implementation of IR2Vec, LLVM IR Based Scalable Program Embeddings
ddangelov/Top2Vec
Top2Vec learns jointly embedded topic, document and word vectors.
natasha/navec
Compact high quality word embeddings for Russian language