Text2TCS/Term-Extraction-With-Language-Models
Extracting terms from text using XLM-R for token and sequence classification
This project helps linguistic researchers and terminology managers automatically identify key terms within a text, even across multiple languages. You input a document, and it outputs a list of relevant terms. It's designed for anyone working with specialized vocabulary in various fields, like technical writers, translators, or domain experts.
No commits in the last 6 months.
Use this if you need to quickly and accurately extract important domain-specific terms from large volumes of text in English, French, or Dutch.
Not ideal if you need to extract relationships between terms or perform a deeper semantic analysis beyond simple term identification.
Stars
15
Forks
1
Language
Jupyter Notebook
License
—
Category
Last pushed
Apr 18, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/Text2TCS/Term-Extraction-With-Language-Models"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
chakki-works/seqeval
A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)
Hironsan/anago
Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition, Part-of-Speech Tagging and so on.
jbesomi/texthero
Text preprocessing, representation and visualization from zero to hero.
hamelsmu/ktext
Utilities for preprocessing text for deep learning with Keras
asahi417/tner
Language model fine-tuning on NER with an easy interface and cross-domain evaluation. "T-NER: An...