Automatic Term Extraction NLP Tools

Tools for automatically identifying and extracting domain-specific terms, technical terminology, and named entities from unstructured text documents. The category does not include general named entity recognition (NER), keyword extraction for intent analysis, or fact extraction.

41 automatic term extraction tools are tracked. Six score above 50, which places them in the Established tier. The highest-rated is ziqizhang/jate, at 67/100 with 84 stars.

Get all 41 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=nlp&subcategory=automatic-term-extraction&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
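The same request can be made from a script. The following is a minimal Python sketch that rebuilds the query string from the curl command above and parses the JSON response. The helper names `build_url` and `fetch_projects` are hypothetical, and the shape of the returned JSON is not described here, so the sketch returns the parsed payload without assuming any field names. The `limit=20` default is copied from the command; whether a larger value returns all 41 projects is not stated and is left as an assumption to verify.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint and query parameters are taken verbatim from the curl command.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="nlp", subcategory="automatic-term-extraction", limit=20):
    """Return the request URL with the same parameters as the curl example."""
    query = urlencode({"domain": domain, "subcategory": subcategory, "limit": limit})
    return f"{BASE_URL}?{query}"


def fetch_projects(url, timeout=10):
    """Fetch the URL and return the decoded JSON payload as-is.

    The response schema is not documented on this page, so no keys are assumed.
    """
    with urlopen(url, timeout=timeout) as response:
        return json.load(response)


# Usage (performs a network request, counted against the daily limit):
# payload = fetch_projects(build_url())
```

Keeping URL construction separate from the network call makes it easy to check the request before spending one of the 100 keyless requests per day.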

| # | Tool | Description | Score | Tier |
|---|------|-------------|-------|------|
| 1 | ziqizhang/jate | JATE - Just Automatic Term Extraction (in Python) | 67 | Established |
| 2 | mcs07/ChemDataExtractor | Automatically extract chemical information from scientific documents | 59 | Established |
| 3 | brucewlee/lftk | [BEA @ ACL 2023] General-purpose tool for linguistic features extraction;... | 55 | Established |
| 4 | mmmaurer/elfen | A Python package to efficiently extract linguistic features for text/NLP datasets | 54 | Established |
| 5 | strangetom/ingredient-parser | A tool to parse recipe ingredients into structured data | 52 | Established |
| 6 | explosion/projects | 🪐 End-to-end NLP workflows from prototype to production | 51 | Established |
| 7 | swabhs/open-sesame | A frame-semantic parsing system based on a softmax-margin SegRNN. | 49 | Emerging |
| 8 | pryndor/Lixplore_cli | A powerful Unix-inspired command-line tool for searching scientific... | 47 | Emerging |
| 9 | kevinlu1248/pyate | PYthon Automated Term Extraction | 43 | Emerging |
| 10 | zjunlp/OntoProtein | [ICLR 2022] OntoProtein: Protein Pretraining With Gene Ontology Embedding | 42 | Emerging |
| 11 | gorgitko/molminer | Python library and command-line tool for extracting compounds from... | 42 | Emerging |
| 12 | bafgreat/fairmofsyncondition | A robust Python module for predicting the synthesis conditions of MOFs. It... | 41 | Emerging |
| 13 | brucewlee/lingfeat | [EMNLP 2021] LingFeat - A Comprehensive Linguistic Features Extraction... | 40 | Emerging |
| 14 | FACTSlab/glazing | Unified data models and interfaces for syntactic and semantic frame ontologies. | 38 | Emerging |
| 15 | aoldoni/tetre | TETRE: a Toolkit for Exploring Text for Relation Extraction | 36 | Emerging |
| 16 | robinvanschaik/flair-on-gcp | This repository adds examples on how to train Flair on Google Cloud Platform... | 36 | Emerging |
| 17 | ljvmiranda921/spacy-span-analyzer | Simple tool to analyze spans in your dataset. Implementation of Papay et... | 35 | Emerging |
| 18 | buaaliuming/Awesome-Resources-for-Scholarly-Big-Data | Tools, datasets, Corpus and Venue Challenge for scholarly big data: Pick up... | 34 | Emerging |
| 19 | CederGroupHub/text2chem | RegEx-based text parser that converts chemical terms and material entities... | 34 | Emerging |
| 20 | peter-grajcar/clause-extraction | Utility for clause extraction from complex sentences | 31 | Emerging |
| 21 | prit2596/NLP-Template-Extraction | Template Extraction from unstructured Wikipedia text using NLP techniques. | 30 | Emerging |
| 22 | dnaaun/openFraming | Tools for automatic frame discovery and labeling based on topic modeling and... | 30 | Emerging |
| 23 | hexuandeng/NewTerm | Implementation for our paper “NewTerm: Benchmarking Real-Time New Terms for... | 29 | Experimental |
| 24 | michaelmml/NLP-Information-Extraction | Automated PDF and text processing with Spacy and NLTK; information... | 27 | Experimental |
| 25 | seanox/seanox-ai-nlp | Modular NLP tools for domain-specific semantic matching and structured data | 27 | Experimental |
| 26 | ParvaShah/Template_Extraction_NLP | This project is about Template Extraction from a document using NLP Techniques | 27 | Experimental |
| 27 | theseekersutd/Research-Paper-Template-Extractor | Given repository extracts the templates from research papers using natural... | 25 | Experimental |
| 28 | k-kaundal/sah-kse | Semantic Adaptive Hash & Knowledge Seed Engine: compress knowledge into... | 24 | Experimental |
| 29 | Abhinand20/AUTO-ONTO | Tool to automatically extract keyphrases from text spanning across vast... | 23 | Experimental |
| 30 | hay/wiki-text-nlp | Extract 'Did you know?' facts from Wikipedia articles | 21 | Experimental |
| 31 | honghanhh/terminology-extraction | Terminology extraction on ACTER using Transformer-based language models | 20 | Experimental |
| 32 | RaziehZare/Speech-Processing-Ontology | A formal OWL ontology representing 79 core concepts in speech processing.... | 20 | Experimental |
| 33 | ispasic/FlexiTerm-Python | Repository for FlexiTerm: a software tool to automatically recognise... | 19 | Experimental |
| 34 | harshit158/paper-dots | Automatic insights extraction and annotation tool from research papers | 18 | Experimental |
| 35 | alipascal/extract-rdf-from-text | University project for extracting entity triples (subject-predicate-object)... | 11 | Experimental |
| 36 | jeredhiggins/KeyIntentNER-T | KeyIntentNER-T is a Keyword Intent, Named Entity Recognition (NER), & Google... | 11 | Experimental |
| 37 | scarandriy/NeuroMine | LLM-powered pipeline for mining and structuring neuroprotective compound... | 11 | Experimental |
| 38 | SoniaMola/SemanticsExtraction | The aim of the project is to extract from Natural Language relevant semantic... | 11 | Experimental |
| 39 | carinhopm/topic-extraction | Novel implementation for Topic Extraction | 10 | Experimental |
| 40 | nicolaCirillo/termdomain | A domain-aware automatic term extraction tool. | 10 | Experimental |
| 41 | Ethan-Liu-Ethan/MariATE_RANLP_2025 | Supplementary materials for the paper "MariATE: Automatic Term Extraction... | 10 | Experimental |
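
The tier labels in the listing are consistent with fixed score cutoffs: every score of 51 or higher is labelled Established, scores from 30 to 49 are labelled Emerging, and scores of 29 or lower are labelled Experimental. The Python sketch below encodes that mapping. The cutoffs are inferred from the listed scores rather than taken from any documented rule, and the function name `tier_for` is hypothetical. No listed tool scores exactly 50, so treating 50 as Emerging is an assumption based on the phrase "score above 50".

```python
def tier_for(score):
    """Map a 0-100 quality score to a tier label.

    Cutoffs are inferred from the listing, not from a documented rule:
    above 50 -> Established, 30 to 50 -> Emerging, below 30 -> Experimental.
    The handling of exactly 50 is an assumption; no listed tool has that score.
    """
    if score > 50:
        return "Established"
    if score >= 30:
        return "Emerging"
    return "Experimental"


# Spot checks against rows in the listing.
print(tier_for(67))  # ziqizhang/jate
print(tier_for(30))  # dnaaun/openFraming
print(tier_for(29))  # hexuandeng/NewTerm
```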