hppRC/simple-simcse-ja
Exploring Japanese SimCSE
This project offers pre-trained Japanese language models that convert sentences into numerical vectors, known as embeddings. Comparing these embeddings makes it possible to find semantically similar sentences, for tasks such as information retrieval or supplying relevant context to generative models. Several fine-tuned models are provided ready for use, for anyone working with Japanese text who needs to measure semantic similarity.
No commits in the last 6 months.
Use this if you need to efficiently find Japanese sentences that are semantically similar to each other, for applications such as search, question answering, or enhancing generative AI with relevant information.
Not ideal if your primary need is for tasks other than semantic similarity, or if you are working exclusively with languages other than Japanese.
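To make the idea of "finding similar sentences from embeddings" concrete, here is a minimal sketch of ranking by cosine similarity. It does not load any of the project's models: the three-dimensional vectors are invented stand-ins for real model output, and the function names cosine_similarity and rank_by_similarity are hypothetical helpers, not part of the repository.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def rank_by_similarity(query_vec, corpus):
    """Return (sentence, score) pairs sorted from most to least similar."""
    scored = [(s, cosine_similarity(query_vec, v)) for s, v in corpus.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Toy vectors invented for illustration only; real sentence embeddings
# typically have hundreds of dimensions and come from a trained model.
corpus = {
    "猫がソファで寝ている。": [0.9, 0.1, 0.0],
    "犬が公園を走っている。": [0.1, 0.9, 0.1],
    "株価が大きく下落した。": [0.0, 0.1, 0.9],
}

# Stand-in for the embedding of a query such as "子猫が眠っている。"
query_vec = [0.8, 0.2, 0.0]

ranked = rank_by_similarity(query_vec, corpus)
for sentence, score in ranked:
    print(f"{score:.3f}  {sentence}")
```

In practice the vectors would come from one of the fine-tuned models, and the same ranking step would then run over the real embeddings, usually with a vectorized library rather than pure Python.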
Stars: 69
Forks: 4
Language: Python
License: —
Category:
Last pushed: Oct 31, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/hppRC/simple-simcse-ja"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
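The same request can be sketched in Python using only the standard library. The URL pattern is taken from the curl command above; everything else is an assumption made for illustration: the helper names build_quality_url and fetch_quality are hypothetical, the parameter name collection is a guess at what the "transformers" path segment represents, and the response is assumed to be JSON, which the page does not state.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(owner, repo, collection="transformers"):
    """Build the endpoint URL; 'collection' is a hypothetical name for
    the path segment that precedes the owner in the curl example."""
    parts = [quote(p, safe="") for p in (collection, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(owner, repo, collection="transformers", timeout=10):
    """Fetch and decode the response, ASSUMING the body is JSON."""
    url = build_quality_url(owner, repo, collection)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Only the URL is built here; no network request is made.
url = build_quality_url("hppRC", "simple-simcse-ja")
print(url)
```

Calling fetch_quality("hppRC", "simple-simcse-ja") would perform the actual request and count against the daily limit described above.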
Higher-rated alternatives
LoicGrobol/zeldarose
Train transformer-based models.
CPJKU/wechsel
Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of...
yuanzhoulvpi2017/zero_nlp
Chinese NLP solutions (large models, data, models, training, inference)
minggnim/nlp-models
A repository for training transformer-based models
IntelLabs/nlp-architect
A model library for exploring state-of-the-art deep learning topologies and techniques for...