sebpuetz/lumberjack
Read and modify constituency trees in Rust.
This tool helps linguists and computational linguists work with constituency trees, which show the grammatical structure of sentences. You can input treebanks in various formats like NEGRA or simple bracketed text, and it outputs trees that are converted, filtered, or otherwise modified to suit your analysis. It's designed for researchers or practitioners who need to prepare linguistic data for further study or machine learning tasks.
No commits in the last 6 months.
Use this if you need to convert, filter, or modify constituency treebank data for linguistic analysis or natural language processing tasks.
Not ideal if you're looking for a tool to parse raw text into constituency trees, as this tool primarily processes existing tree data.
Stars
10
Forks
2
Language
Rust
License
Apache-2.0
Category
Last pushed
May 05, 2020
Monthly downloads
22
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/sebpuetz/lumberjack"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
PyThaiNLP/nlpo3
Thai natural language processing library in Rust, with Python and Node bindings.
forzagreen/n2words
Convert numerical numbers to written numbers, in 52+ languages.
greyblake/whatlang-rs
Natural language detection library for Rust. Try demo online: https://whatlang.org/
wikimedia/sentencex
A sentence segmentation library with wide language support optimized for speed and utility.
pemistahl/lingua-rs
The most accurate natural language detection library for Rust, suitable for short text and...