SapienzaNLP/clubert
Distribution of word meanings in Wikipedia for English, Italian, French, German and Spanish.
This project offers pre-computed data that helps you understand the different meanings of words in English, Italian, French, German, and Spanish. It takes raw text data and provides a breakdown of each word's possible meanings, along with a score indicating how likely each meaning is. This is ideal for computational linguists or natural language processing researchers working on multilingual text analysis.
No commits in the last 6 months.
Use this if you need to analyze the specific meanings of words in text across multiple languages for research or application development.
Not ideal if you're looking for a tool to perform real-time word sense disambiguation on new, incoming text without pre-computed distributions.
Stars
10
Forks
—
Language
—
License
—
Category
Last pushed
Jan 04, 2021
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/SapienzaNLP/clubert"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
alvations/pywsd
Python Implementations of Word Sense Disambiguation (WSD) Technologies.
SapienzaNLP/ewiser
A Word Sense Disambiguation system integrating implicit and explicit external knowledge.
danlou/LMMS
Language Modelling Makes Sense - WSD (and more) with Contextual Embeddings
dustalov/watset
Watset: Automatic Induction of Synsets from a Graph of Synonyms
USC-NSL/sage
SAGE disambiguates protocol description in an IETF RFC document, then converts the disambiguated...