BramVanroy/spacy_conll

Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Doc and its sentences and tokens. Can also be used as a command-line tool.

Score: 54 / 100 (Established)

This tool helps linguists, computational linguists, and NLP researchers convert raw text into the standardized CoNLL-U format. It runs the input text through a natural language processing model (spaCy, Stanza, or UDPipe) and outputs detailed linguistic annotations, such as parts of speech, lemmas, and dependencies, either as plain-text CoNLL-U or as a tabular Pandas DataFrame. The typical user needs precise grammatical and syntactic information about a text.
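The workflow described above can be sketched roughly as follows. This is a minimal illustration, not the package's documented recipe: `annotate` assumes that `spacy`, `spacy_conll`, and the `en_core_web_sm` model are installed, and that `init_parser` and the `doc._.conll_str` attribute behave as in the project's README. `parse_conllu`, `CONLLU_FIELDS`, and `SAMPLE` are hypothetical names introduced here, and `SAMPLE` is hand-written for illustration, not real parser output.

```python
from typing import Dict, List

# The ten CoNLL-U columns, in the order fixed by the Universal Dependencies format.
CONLLU_FIELDS = [
    "ID", "FORM", "LEMMA", "UPOS", "XPOS",
    "FEATS", "HEAD", "DEPREL", "DEPS", "MISC",
]


def annotate(text: str, model: str = "en_core_web_sm") -> str:
    """Return CoNLL-U text for `text`.

    Assumption: spacy, spacy_conll and the named model are installed.
    """
    # Imported lazily so the rest of this sketch works without spaCy.
    from spacy_conll import init_parser

    nlp = init_parser(model, "spacy")  # spaCy pipeline plus the CoNLL formatter
    doc = nlp(text)
    return doc._.conll_str


def parse_conllu(conll_str: str) -> List[List[Dict[str, str]]]:
    """Split CoNLL-U text into sentences, each a list of token dicts."""
    sentences: List[List[Dict[str, str]]] = []
    current: List[Dict[str, str]] = []
    for line in conll_str.splitlines():
        if not line.strip():
            # A blank line ends the current sentence.
            if current:
                sentences.append(current)
                current = []
            continue
        if line.startswith("#"):
            # Comment lines carry sentence metadata; skipped in this sketch.
            continue
        values = line.split("\t")
        if len(values) != len(CONLLU_FIELDS):
            raise ValueError(f"expected 10 tab-separated columns, got {len(values)}")
        current.append(dict(zip(CONLLU_FIELDS, values)))
    if current:
        sentences.append(current)
    return sentences


# Hand-written sample in CoNLL-U layout (illustrative, not actual tool output).
SAMPLE = (
    "1\tDogs\tdog\tNOUN\tNNS\tNumber=Plur\t2\tnsubj\t_\t_\n"
    "2\tbark\tbark\tVERB\tVBP\tTense=Pres\t0\tROOT\t_\tSpaceAfter=No\n"
    "3\t.\t.\tPUNCT\t.\tPunctType=Peri\t2\tpunct\t_\t_\n"
)
```

With the packages installed, `parse_conllu(annotate("Dogs bark."))` would chain the two steps. For the DataFrame output mentioned above, the package's `doc._.conll_pd` attribute is likely the more direct route than parsing the string by hand.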

Used by 1 other package. No commits in the last 6 months. Available on PyPI.

Use this if you need to process text and extract detailed grammatical information in the CoNLL-U format for linguistic analysis, dataset creation, or further NLP tasks.

Not ideal if you only need high-level text summaries or general sentiment analysis, as its primary purpose is deep linguistic annotation.

linguistic-analysis computational-linguistics natural-language-processing text-annotation dependency-parsing
Stale 6m
Maintenance 0 / 25
Adoption 10 / 25
Maturity 25 / 25
Community 19 / 25


Stars: 81
Forks: 18
Language: Python
License: BSD-2-Clause
Last pushed: Jul 02, 2024
Commits (30d): 0
Dependencies: 1
Reverse dependents: 1

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/BramVanroy/spacy_conll"

Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000/day.
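The same request can be made from Python with the standard library. The sketch below rests on two assumptions that this page does not confirm: that the endpoint returns JSON, and that the URL path follows a category/owner/repository pattern (inferred from the single curl example). `quality_url` and `fetch_quality` are hypothetical helper names.

```python
import json
from urllib.request import urlopen

# Base path taken from the curl example; the path layout beyond it is an assumption.
API_BASE = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the quality-endpoint URL for one repository."""
    return f"{API_BASE}/{category}/{owner}/{repo}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch and decode the response, assuming the body is JSON."""
    with urlopen(quality_url(category, owner, repo), timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("nlp", "BramVanroy", "spacy_conll")` issues the same GET request as the curl command and counts against the same daily limit.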