shibing624/text2vec

text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。

65
/ 100
Established

This project helps you convert text, like words, sentences, or paragraphs, into numerical vectors. These vectors can then be used to calculate how similar different pieces of text are to each other. It's designed for data scientists, natural language processing engineers, or researchers who need to quantify semantic relationships between text for tasks like information retrieval or semantic search.

4,950 stars. Used by 1 other package. Available on PyPI.

Use this if you need to transform Chinese or English text into numerical representations and calculate their semantic similarity quickly and efficiently.

Not ideal if you are looking for a simple keyword search solution rather than a semantic understanding of text.

natural-language-processing semantic-search text-analytics information-retrieval data-science
Maintenance 10 / 25
Adoption 11 / 25
Maturity 25 / 25
Community 19 / 25

How are scores calculated?

Stars

4,950

Forks

428

Language

Python

License

Apache-2.0

Last pushed

Feb 14, 2026

Commits (30d)

0

Dependencies

7

Reverse dependents

1

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/shibing624/text2vec"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.