AliOsm/simplerepresentations
Easy-to-use text representations extraction library based on the Transformers library.
This library helps natural language processing (NLP) engineers and data scientists quickly turn raw text into numerical representations (embeddings) using state-of-the-art Transformer models. You feed in sentences or documents, and it outputs numerical vectors that capture the meaning of your text, which can then be used for tasks like text classification, semantic search, or clustering.
No commits in the last 6 months. Available on PyPI.
Use this if you need an easy-to-use Python library to generate high-quality numerical representations of text for further analysis or machine learning tasks.
Not ideal if you need a visual interface, a no-code solution, or if you are not comfortable working with Python code.
Stars
32
Forks
4
Language
Python
License
Apache-2.0
Category
Last pushed
Oct 21, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/AliOsm/simplerepresentations"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
MinishLab/model2vec
Fast State-of-the-Art Static Embeddings
AnswerDotAI/ModernBERT
Bringing BERT into modernity via both architecture changes and scaling
tensorflow/hub
A library for transfer learning by reusing parts of TensorFlow models.
Embedding/Chinese-Word-Vectors
100+ Chinese Word Vectors 上百种预训练中文词向量
twang2218/vocab-coverage
语言模型中文认知能力分析