hppRC/simple-simcse-ja

Exploring Japanese SimCSE

/ 100

Experimental

This project offers pre-trained Japanese language models that can convert sentences into numerical representations, known as embeddings. These embeddings can then be used to find similar sentences for tasks like information retrieval or improving AI-generated text. It provides several fine-tuned models ready for use, catering to anyone working with Japanese text data who needs to understand semantic similarity.

No commits in the last 6 months.

Use this if you need to efficiently find Japanese sentences that are semantically similar to each other, for applications such as search, question answering, or enhancing generative AI with relevant information.

Not ideal if your primary need is for tasks other than semantic similarity, or if you are working exclusively with languages other than Japanese.

Japanese-language-processing semantic-search information-retrieval natural-language-understanding text-similarity

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 8 / 25

Maturity 8 / 25

Community 8 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Higher-rated alternatives

LoicGrobol/zeldarose

Train transformer-based models.

CPJKU/wechsel

Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of...

yuanzhoulvpi2017/zero_nlp

中文nlp解决方案(大模型、数据、模型、训练、推理)

minggnim/nlp-models

A repository for training transformer based models

IntelLabs/nlp-architect

A model library for exploring state-of-the-art deep learning topologies and techniques for...

Explore Transformer Models

All categories Trending Transformer directory Insights