evalphobia/go-jp-text-ripper

Tokenize Japanese text and separate it into words

20 / 100 Experimental

This tool helps Japanese language data analysts and researchers prepare text data for further analysis. It takes a CSV or TSV file containing Japanese text, breaks down the text into individual words (tokenization), and adds new columns with the segmented words and word counts. This is useful for anyone who needs to process large volumes of Japanese text for tasks like sentiment analysis, keyword extraction, or linguistic research.

No commits in the last 6 months.

Use this if you need to quickly and accurately break down Japanese sentences into individual words from structured data files like spreadsheets.

Not ideal if you're working with languages other than Japanese, or if you need advanced natural language processing features beyond basic word segmentation and frequency analysis.

Japanese-linguistics text-analysis data-preparation market-research content-analysis
No License, Stale 6m, No Package, No Dependents
Maintenance 0 / 25
Adoption 5 / 25
Maturity 8 / 25
Community 7 / 25

Stars 11
Forks 1
Language Go
License None
Category go-nlp-libraries
Last pushed Jan 05, 2020
Commits (30d) 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/evalphobia/go-jp-text-ripper"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
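The same request can be made from Go. The sketch below assumes the URL follows a category/owner/repository pattern, which is inferred only from the single example URL above. The `qualityURL` and `fetchQuality` helpers are hypothetical names. Because the response schema is not documented here, the body is returned as raw bytes rather than decoded into a struct.

```go
package main

import (
	"fmt"
	"io"
	"net/http"
	"net/url"
	"time"
)

// baseURL is taken from the curl example; the path layout is an assumption.
const baseURL = "https://pt-edge.onrender.com/api/v1/quality"

// qualityURL builds the endpoint URL for one repository (hypothetical helper).
func qualityURL(category, owner, repo string) string {
	return fmt.Sprintf("%s/%s/%s/%s", baseURL,
		url.PathEscape(category), url.PathEscape(owner), url.PathEscape(repo))
}

// fetchQuality performs the GET request and returns the raw response body.
func fetchQuality(category, owner, repo string) ([]byte, error) {
	client := &http.Client{Timeout: 10 * time.Second}
	resp, err := client.Get(qualityURL(category, owner, repo))
	if err != nil {
		return nil, err
	}
	defer resp.Body.Close()
	if resp.StatusCode != http.StatusOK {
		return nil, fmt.Errorf("unexpected status: %s", resp.Status)
	}
	return io.ReadAll(resp.Body)
}

func main() {
	body, err := fetchQuality("nlp", "evalphobia", "go-jp-text-ripper")
	if err != nil {
		fmt.Println("request failed:", err)
		return
	}
	fmt.Println(string(body))
}
```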