rug-compling/bimu

Bilingual Learning of Multi-sense Embeddings with Discrete Autoencoders

/ 100

Experimental

This project helps natural language processing researchers or developers to create word embeddings that understand multiple meanings of a word across two languages. You input parallel text in two languages (e.g., English and French) along with word alignments, and it outputs multi-sense word embedding matrices. This tool is for researchers in computational linguistics or NLP engineers working on multilingual understanding.

No commits in the last 6 months.

Use this if you need to generate high-quality, context-aware word embeddings for polysemous words in a bilingual setting to improve tasks like machine translation or cross-lingual information retrieval.

Not ideal if you only work with a single language or if you need general-purpose word embeddings that don't differentiate between word senses.

natural-language-processing computational-linguistics multilingual-text-analysis word-sense-disambiguation machine-translation

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 5 / 25

Maturity 8 / 25

Community 15 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Higher-rated alternatives

MilaNLProc/contextualized-topic-models

A python package to run contextualized topic modeling. CTMs combine contextualized embeddings...

vinid/cade

Compass-aligned Distributional Embeddings. Align embeddings from different corpora

spcl/ncc

Neural Code Comprehension: A Learnable Representation of Code Semantics

criteo-research/CausE

Code for the Recsys 2018 paper entitled Causal Embeddings for Recommandation.

vintasoftware/entity-embed

PyTorch library for transforming entities like companies, products, etc. into vectors to support...

Explore Embedding Tools

All categories Trending Embeddings directory Insights