KennethEnevoldsen/scandinavian-embedding-benchmark
A Scandinavian Benchmark for sentence embeddings
This tool helps researchers and developers evaluate how well different language models understand and represent text in Scandinavian languages like Danish, Norwegian, and Swedish. It takes various sentence or document embedding models as input and produces performance scores on a range of tasks, allowing you to compare and select the best model for your specific needs. It's designed for anyone developing or applying natural language processing technologies for Scandinavian languages.
Use this if you need to objectively compare the quality of different text embedding models for Scandinavian languages across various real-world tasks.
Not ideal if you are looking for a tool to build or train a language model from scratch, as this focuses solely on evaluation.
Stars
46
Forks
9
Language
Python
License
MIT
Category
Last pushed
Dec 05, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/KennethEnevoldsen/scandinavian-embedding-benchmark"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
acl-org/acl-anthology
Data and software for building the ACL Anthology.
anoopkunchukuttan/indic_nlp_library
Resources and tools for Indian language Natural Language Processing
CLUEbenchmark/CLUECorpus2020
Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料
Separius/awesome-sentence-embedding
A curated list of pretrained sentence and word embedding models
SudhirGadhvi/open-vernacular-ai-kit
Clean Indian code-mixed text before it reaches your LLM.