korpling/pepper
A highly extensible platform for conversion and manipulation of linguistic data between an unbound set of formats. Pepper can be used stand-alone as a command line interface, or be integrated as an API into other software products.
This tool helps linguistic researchers and corpus creators convert their language data between different annotation formats. You input a linguistic corpus in one format (such as EXMARaLDA or TCF), and it outputs the same data in a different, compatible format for your analysis tools. It is designed for anyone working with annotated linguistic datasets who runs into format incompatibilities.
No commits in the last 6 months.
Use this if you need to convert linguistic corpora between various annotation formats or merge data from multiple annotation tools into multilayer corpora.
Not ideal if you are working with non-linguistic data or require conversions for formats outside of linguistic annotation.
Stars: 24
Forks: 3
Language: XSLT
License: —
Category: (none listed)
Last pushed: Jan 03, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/korpling/pepper"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
winkjs/wink-nlp
Developer friendly Natural Language Processing ✨
LSYS/LexicalRichness
A module to compute textual lexical richness (aka lexical diversity).
mbejda/Node-OpenNLP
Apache OpenNLP wrapper for Nodejs
LanguageMachines/frog
Frog is an integration of memory-based natural language processing (NLP) modules developed for...
winkjs/wink-nlp-utils
NLP Functions for amplifying negations, managing elisions, creating ngrams, stems, phonetic...