despawnerer/truecase
Restore correct letter casings in arbitrary text using a statistical model
This tool restores proper capitalization in text that is all uppercase or all lowercase. You train a statistical model on examples of correctly cased sentences, then feed it the badly cased text; it returns the text with sentence-initial capitals and proper-noun capitalization restored. It suits anyone who handles large volumes of text that must be cleaned up for readability or further processing.
No commits in the last 6 months.
Use this if you frequently receive text inputs, like transcripts or scraped data, that lack proper capitalization and need to be corrected automatically.
Not ideal if you only have a few sentences that you can correct by hand, or if you need precise, rule-based linguistic casing rather than statistical inference.
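The train-then-apply workflow described above can be illustrated with a toy example. This is a minimal sketch of the general idea (count the cased forms seen in training, then pick the most frequent one), not the truecase crate's actual API or model. The names `ToyTruecaser`, `train`, `truecase`, `best_form`, and `capitalize_first` are hypothetical, and lowercasing unknown words is an assumption made for simplicity.

```rust
use std::collections::HashMap;

/// Hypothetical toy model (not the truecase crate's API): for each lowercased
/// word, counts how often each cased form appeared in the training sentences.
#[derive(Default)]
pub struct ToyTruecaser {
    counts: HashMap<String, HashMap<String, usize>>,
}

impl ToyTruecaser {
    /// Learn casing statistics from one correctly cased sentence.
    pub fn train(&mut self, sentence: &str) {
        // Skip the first word: its capital may only mark the sentence start.
        for word in sentence.split_whitespace().skip(1) {
            let form = word.trim_matches(|c: char| !c.is_alphanumeric());
            if form.is_empty() {
                continue;
            }
            *self
                .counts
                .entry(form.to_lowercase())
                .or_default()
                .entry(form.to_string())
                .or_insert(0) += 1;
        }
    }

    /// Restore casing for one sentence of whitespace-separated words.
    pub fn truecase(&self, text: &str) -> String {
        let words: Vec<String> = text
            .split_whitespace()
            .map(|w| self.best_form(w))
            .collect();
        capitalize_first(&words.join(" "))
    }

    /// Most frequent cased form seen in training; unknown words are
    /// lowercased (an assumption of this sketch).
    fn best_form(&self, word: &str) -> String {
        let lower = word.to_lowercase();
        let core = lower.trim_matches(|c: char| !c.is_alphanumeric());
        let best = self.counts.get(core).and_then(|forms| {
            forms
                .iter()
                // Break count ties by comparing the forms, so the result
                // does not depend on hash map iteration order.
                .max_by(|a, b| a.1.cmp(b.1).then_with(|| a.0.cmp(b.0)))
                .map(|(form, _)| form.as_str())
        });
        match best {
            // Re-attach any surrounding punctuation around the chosen form.
            Some(form) => lower.replacen(core, form, 1),
            None => lower,
        }
    }
}

/// Uppercase the first character of the sentence, if there is one.
fn capitalize_first(s: &str) -> String {
    let mut chars = s.chars();
    match chars.next() {
        Some(first) => first.to_uppercase().chain(chars).collect(),
        None => String::new(),
    }
}
```

After training on a few sentences that contain "Paris" and "NASA" away from the sentence start, calling `truecase` on an all-uppercase sentence restores those forms and capitalizes the first word. A real statistical truecaser would typically also use context around each word, which this unigram sketch ignores.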
Stars: 7
Forks: 1
Language: Rust
License: Apache-2.0
Category:
Last pushed: Sep 29, 2023
Monthly downloads: 5
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/despawnerer/truecase"
Open to everyone: 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
PyThaiNLP/nlpo3
Thai natural language processing library in Rust, with Python and Node bindings.
forzagreen/n2words
Convert numerical numbers to written numbers, in 52+ languages.
greyblake/whatlang-rs
Natural language detection library for Rust. Try demo online: https://whatlang.org/
wikimedia/sentencex
A sentence segmentation library with wide language support optimized for speed and utility.
pemistahl/lingua-rs
The most accurate natural language detection library for Rust, suitable for short text and...