texttechnologylab/UCE
The Unified Corpus Explorer (UCE) for UIMA-annotated Corpora.
The Unified Corpus Explorer (UCE) helps researchers and practitioners in fields like biodiversity research or AI-generated text detection to navigate and understand large collections of annotated text. It takes UIMA-annotated text corpora as input and provides an interactive web portal where users can search, visualize, and explore linguistic annotations and patterns. This tool is designed for anyone working with large volumes of text that have undergone Natural Language Processing (NLP) annotation.
Use this if you need a web-based, searchable interface to explore and share your UIMA-annotated text corpora and the insights derived from them.
Not ideal if your text data is not UIMA-annotated or if you primarily need to perform custom, programmatic analysis rather than interactive exploration.
Stars
7
Forks
17
Language
Java
License
AGPL-3.0
Category
Last pushed
Mar 06, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/texttechnologylab/UCE"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
Helsinki-NLP/OpusFilter
OpusFilter - Parallel corpus processing toolkit
natasha/corus
Links to Russian corpora + Python functions for loading and parsing
darija-open-dataset/dataset
darija <-> english dataset
omicsNLP/Auto-CORPus
Auto-CORPus pipeline developed by a University of Nottingham and Imperial College London...
SergeyShk/ruTS
Библиотека для извлечения статистик из текстов на русском языке.