honghanhh/terminology-extraction
Terminology extraction on ACTER using Transformer-based language models
This project helps specialists quickly identify and extract important technical terms from large text documents in English, French, and Dutch. You provide text in these languages, and it outputs a list of relevant terms. This is ideal for linguists, translators, technical writers, or anyone working with specialized content.
No commits in the last 6 months.
Use this if you need to automatically compile lists of key terminology from scientific papers, technical manuals, or other domain-specific texts.
Not ideal if you are looking to extract general keywords rather than domain-specific terminology, or if your documents are in languages other than English, French, or Dutch.
Stars: 8
Forks: 1
Language: Jupyter Notebook
License: —
Category:
Last pushed: Nov 19, 2022
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/honghanhh/terminology-extraction"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
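The same request can be made from a script. The following is a minimal Python sketch using only the standard library. The URL layout (`/quality/<category>/<owner>/<repo>`) is inferred from the single curl example above, and the helper names `build_quality_url` and `fetch_quality` are hypothetical. The response is assumed to be JSON, which the listing does not state, and no response fields are assumed.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example; everything after it is an assumed layout.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(owner, repo, category="nlp"):
    """Build the endpoint URL (hypothetical helper; "nlp" is taken from the example URL)."""
    # Percent-encode each path segment so unusual characters cannot break the path.
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(owner, repo, category="nlp", timeout=10.0):
    """Fetch the data for one repository, assuming the body is JSON (not confirmed)."""
    url = build_quality_url(owner, repo, category)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Only prints the URL; call fetch_quality(...) to perform the network request.
    print(build_quality_url("honghanhh", "terminology-extraction"))
```

Without a key, requests count against the 100/day limit noted above, so a script that polls many repositories should cache results rather than refetch them.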
Higher-rated alternatives
ziqizhang/jate
JATE - Just Automatic Term Extraction (in Python)
mcs07/ChemDataExtractor
Automatically extract chemical information from scientific documents
brucewlee/lftk
[BEA @ ACL 2023] General-purpose tool for linguistic features extraction; Tested on readability...
mmmaurer/elfen
A python package to efficiently extract linguistic features for text/NLP datasets
strangetom/ingredient-parser
A tool to parse recipe ingredients into structured data