dongjinleekr/beanpiece

A Java binding to Google SentencePiece

20
/ 100
Experimental

This is a tool for developers who work with text processing in Java. It helps integrate Google's SentencePiece tokenizer into Java applications. Developers can use it to break down raw text into meaningful subword units and reconstruct text from those units within their Java projects.

No commits in the last 6 months.

Use this if you are a Java developer needing to implement robust subword tokenization and detokenization in your applications.

Not ideal if you are not a Java developer or if you require SentencePiece functionality on Windows or macOS without compiling the native libraries yourself.

Java development text processing natural language processing software development
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 4 / 25
Maturity 16 / 25
Community 0 / 25

How are scores calculated?

Stars

7

Forks

Language

C++

License

Apache-2.0

Last pushed

Jun 28, 2018

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/dongjinleekr/beanpiece"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.