PlanTL-GOB-ES/lm-spanish
Official source for Spanish language models and resources made at BSC-TEMU within the "Plan de las Tecnologías del Lenguaje" (Plan-TL).
This project offers ready-to-use Spanish language models and datasets for various natural language processing tasks. If you work with large volumes of Spanish text and need to perform tasks like answering questions, categorizing documents, or recognizing specific entities, these resources can provide highly accurate results. It helps researchers, data scientists, and language technology developers build applications that understand and generate Spanish.
262 stars. No commits in the last 6 months.
Use this if you need pre-trained language models, word embeddings, or datasets specifically tailored for high-quality natural language processing in Spanish.
Not ideal if your primary language is not Spanish or Catalan, or if you need to process extremely short, informal text like social media posts without further fine-tuning.
Stars: 262
Forks: 23
Language: Python
License: Apache-2.0
Category:
Last pushed: Jul 27, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/PlanTL-GOB-ES/lm-spanish"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
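For scripted use, the same request can be made from Python. The following is a minimal sketch using only the standard library. It assumes the endpoint follows the pattern visible in the curl command (`/quality/<category>/<owner>/<repo>`) and that the response body is JSON; neither is confirmed by this page, so the helper falls back to returning raw text. The names `build_quality_url`, `parse_body`, and `fetch_quality` are hypothetical helpers introduced for illustration, not part of any published client.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example; the path layout beyond it is an assumption.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, repo: str) -> str:
    """Build the endpoint URL for a repository given as 'owner/name'."""
    # Keep the "/" between owner and repository name; escape everything else.
    return f"{BASE_URL}/{quote(category, safe='')}/{quote(repo, safe='/')}"


def parse_body(body: str):
    """Return parsed JSON if the body is valid JSON, otherwise the raw text."""
    try:
        return json.loads(body)
    except json.JSONDecodeError:
        # The response format is not documented here, so do not assume JSON.
        return body


def fetch_quality(category: str, repo: str, timeout: float = 10.0):
    """Issue an unauthenticated GET (subject to the keyless daily limit)."""
    url = build_quality_url(category, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return parse_body(resp.read().decode("utf-8"))
```

Calling `fetch_quality("embeddings", "PlanTL-GOB-ES/lm-spanish")` would issue the same request as the curl command above. How a key is passed for the higher limit is not stated on this page, so the sketch covers keyless access only.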
Higher-rated alternatives
MinishLab/model2vec
Fast State-of-the-Art Static Embeddings
AnswerDotAI/ModernBERT
Bringing BERT into modernity via both architecture changes and scaling
tensorflow/hub
A library for transfer learning by reusing parts of TensorFlow models.
Embedding/Chinese-Word-Vectors
100+ Chinese Word Vectors: more than a hundred sets of pre-trained Chinese word vectors
twang2218/vocab-coverage
Analysis of the Chinese-language cognitive ability of language models