cmccomb/rust-stop-words

Common stop words in a variety of languages

Score: 48 / 100 (Emerging)

This tool helps text analysts, data scientists, and researchers clean up written text by identifying and removing common stop words such as 'the', 'a', or 'is'. It works on text in a variety of languages; with the stop words stripped out, the words that carry the core meaning are easier to find and analyze. This is a common preprocessing step for tasks like sentiment analysis, topic modeling, and keyword extraction.

Use this if you need to preprocess text data in multiple languages to focus on important keywords and phrases by eliminating common, less meaningful words.

Not ideal if your analysis relies on the presence of common articles or prepositions, or if you only work with highly structured, non-textual data.
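
As a rough illustration of the stop-word removal described above, here is a minimal sketch using only the Rust standard library. The short `ENGLISH_STOP_WORDS` list and the `remove_stop_words` helper are hypothetical stand-ins written for this example; they are not this crate's API, and a real stop-word list would be much longer.

```rust
use std::collections::HashSet;

/// Hypothetical, heavily abbreviated stop-word list used only for this sketch.
const ENGLISH_STOP_WORDS: &[&str] = &[
    "a", "an", "the", "is", "are", "of", "on", "in", "and", "or", "to",
];

/// Splits `text` on whitespace, trims surrounding punctuation, lowercases
/// each token, and keeps only the tokens that are not in `stop_words`.
fn remove_stop_words(text: &str, stop_words: &HashSet<&str>) -> Vec<String> {
    text.split_whitespace()
        .map(|token| {
            token
                .trim_matches(|c: char| !c.is_alphanumeric())
                .to_lowercase()
        })
        .filter(|word| !word.is_empty() && !stop_words.contains(word.as_str()))
        .collect()
}

fn main() {
    // Build a set once so each membership check is a constant-time lookup.
    let stop_words: HashSet<&str> = ENGLISH_STOP_WORDS.iter().copied().collect();
    let kept = remove_stop_words("The cat is on the mat.", &stop_words);
    println!("{}", kept.join(" ")); // prints "cat mat"
}
```

Whitespace splitting is a simplification that suits space-delimited languages; text in languages without word spacing would need a proper tokenizer before filtering.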

text-analysis natural-language-processing data-cleaning information-retrieval content-analysis
No Package · No Dependents
Maintenance 10 / 25
Adoption 7 / 25
Maturity 16 / 25
Community 15 / 25

Stars: 25
Forks: 5
Language: Rust
License: MIT
Last pushed: Feb 21, 2026
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/cmccomb/rust-stop-words"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.