mmmaurer/elfen

A Python package to efficiently extract linguistic features for text/NLP datasets

54 / 100 (Established)

This tool helps researchers, data scientists, and linguists analyze text datasets by efficiently extracting a wide range of linguistic features. You input text data, often in a tabular format, and it outputs calculated metrics covering readability, lexical richness, psycholinguistic properties, and more. It suits anyone who needs to quantify the characteristics of text for further analysis.
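To make the input/output shape concrete, here is a minimal sketch of the general idea: tabular rows with a text column go in, and the same rows come back with numeric feature columns added. It uses only the standard library and does not use elfen's API. The function name `extract_features`, the column names, and the three toy features are assumptions chosen for illustration, and the tokenizer is a deliberately crude regex.

```python
import re


def extract_features(rows, text_column="text"):
    """Return a copy of each row with a few toy linguistic features added.

    Hypothetical helper for illustration only; this is NOT elfen's API.
    """
    out = []
    for row in rows:
        # Crude tokenizer: lowercase runs of letters and apostrophes.
        tokens = re.findall(r"[a-z']+", row[text_column].lower())
        n_tokens = len(tokens)
        n_types = len(set(tokens))
        features = {
            "n_tokens": n_tokens,
            # Type-token ratio, a simple lexical richness measure.
            "type_token_ratio": n_types / n_tokens if n_tokens else 0.0,
            # Mean token length in characters, a rough readability proxy.
            "avg_word_length": (
                sum(len(t) for t in tokens) / n_tokens if n_tokens else 0.0
            ),
        }
        out.append({**row, **features})
    return out
```

For example, `extract_features([{"id": 1, "text": "The cat saw the cat."}])` returns one row that keeps `id` and `text` and adds the three feature columns. A dedicated package would typically cover far more features and rely on a proper NLP tokenizer.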

Available on PyPI.

Use this if you need to transform raw text into measurable linguistic features for machine learning, statistical analysis, or linguistic research.

Not ideal if you're looking for a simple keyword extractor or a tool that only provides basic text statistics like word counts.

text-analysis linguistic-research natural-language-processing data-science readability-assessment
Maintenance 10 / 25
Adoption 7 / 25
Maturity 25 / 25
Community 12 / 25

Stars: 28
Forks: 4
Language: Python
License: MIT
Last pushed: Mar 03, 2026
Commits (30d): 0
Dependencies: 11

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/mmmaurer/elfen"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
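The same request can be made from Python with the standard library. This is a sketch under two assumptions the listing does not confirm: that the path follows a category/owner/repo pattern (inferred from the single URL above) and that the endpoint returns JSON. The helper names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category, owner, repo):
    """Build the endpoint URL (path layout inferred from the one known example)."""
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category, owner, repo, timeout=10.0):
    """Fetch and decode the response body, assuming it is JSON."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_quality("nlp", "mmmaurer", "elfen")` issues the same GET request as the curl command. With the keyless limit of 100 requests/day, it is worth caching the result rather than polling.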