yannvgn/laserembeddings

LASER multilingual sentence embeddings as a pip package

/ 100

Established

This project helps anyone working with text across multiple languages who needs to compare or categorize sentences. It takes sentences in various languages as input and converts them into universal numerical codes (embeddings). These codes allow you to identify similar sentences, regardless of their original language, making it useful for researchers, analysts, or anyone building language-agnostic text applications.

224 stars. Used by 1 other package. No commits in the last 6 months. Available on PyPI.

Use this if you need to understand the semantic similarity between sentences written in different languages without having to translate them first.

Not ideal if you need to train or fine-tune the model for very specific domain-specific language tasks, as the pre-trained models are not designed for further training.

multilingual-text-analysis cross-lingual-information-retrieval natural-language-processing text-categorization semantic-search

Stale 6m

Maintenance 0 / 25

Adoption 11 / 25

Maturity 25 / 25

Community 16 / 25

How are scores calculated?

Stars

224

Forks

Language

Python

License

BSD-3-Clause

Featured in

Embeddings Are Easier Than Whatever You're Doing Instead You're Shipping AI You Can't Measure

Related tools

embeddings-benchmark/mteb

MTEB: Massive Text Embedding Benchmark

harmonydata/harmony

The Harmony Python library: a research tool for psychologists to harmonise data and...

embeddings-benchmark/results

Data for the MTEB leaderboard

Hironsan/awesome-embedding-models

A curated list of awesome embedding models tutorials, projects and communities.

fresh-stack/freshstack

This repository helps you evaluate your models on the FreshStack benchmark!

Explore Embedding Tools

All categories Trending Embeddings directory Insights