mmmaurer/elfen
A Python package to efficiently extract linguistic features for text/NLP datasets
This tool helps researchers, data scientists, and linguists analyze text datasets by efficiently extracting a wide range of linguistic features. You provide text data, often in a tabular format, and it returns computed metrics covering readability, lexical richness, psycholinguistic properties, and more. It suits anyone who needs to quantify the characteristics of text for further analysis.
Available on PyPI.
Use this if you need to transform raw text into measurable linguistic features for machine learning, statistical analysis, or linguistic research.
Not ideal if you're looking for a simple keyword extractor or a tool that only provides basic text statistics like word counts.
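To make the "text in, features out" idea concrete, here is a minimal sketch in plain Python. It does not use elfen or reproduce its API. The function names (`tokenize`, `extract_features`), the regex tokenizer, and the sample rows are all assumptions made for illustration, and the features shown are only simple stand-ins for the much wider set the package computes.

```python
import re


def tokenize(text: str) -> list[str]:
    """Naive lowercase word tokenizer (hypothetical stand-in for a real one)."""
    return re.findall(r"[a-z']+", text.lower())


def extract_features(text: str) -> dict[str, float]:
    """Compute a few illustrative surface and lexical-richness features."""
    tokens = tokenize(text)
    n_tokens = len(tokens)
    n_types = len(set(tokens))
    # Split on sentence-final punctuation and drop empty fragments.
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    return {
        "n_tokens": n_tokens,
        "n_types": n_types,
        # Type-token ratio: distinct words divided by total words.
        "type_token_ratio": n_types / n_tokens if n_tokens else 0.0,
        "avg_word_length": sum(map(len, tokens)) / n_tokens if n_tokens else 0.0,
        "avg_sentence_length": n_tokens / len(sentences) if sentences else 0.0,
    }


# Tabular input: one row per document, with the text in a "text" column.
rows = [
    {"id": 1, "text": "The cat sat on the mat. The dog barked."},
    {"id": 2, "text": "Quantifying text makes it comparable."},
]

# Append the computed feature columns to each row.
table = [{**row, **extract_features(row["text"])} for row in rows]

for row in table:
    print(row)
```

Each output row keeps its original columns and gains numeric feature columns, which is the shape typically needed for machine learning or statistical analysis.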
Stars: 28
Forks: 4
Language: Python
License: MIT
Category:
Last pushed: Mar 03, 2026
Commits (30d): 0
Dependencies: 11
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/mmmaurer/elfen"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
ziqizhang/jate
JATE - Just Automatic Term Extraction (in Python)
mcs07/ChemDataExtractor
Automatically extract chemical information from scientific documents
brucewlee/lftk
[BEA @ ACL 2023] General-purpose tool for linguistic features extraction; Tested on readability...
strangetom/ingredient-parser
A tool to parse recipe ingredients into structured data
explosion/projects
🪐 End-to-end NLP workflows from prototype to production