mmmaurer/elfen

A Python package to efficiently extract linguistic features for text/NLP datasets


Established

This tool helps researchers, data scientists, and linguists analyze text datasets by efficiently extracting a wide range of linguistic features. You supply text data, typically in tabular form, and it returns metrics covering readability, lexical richness, psycholinguistic properties, and more. It suits anyone who needs to quantify the characteristics of text for further analysis.
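
To make the "text in, metrics out" idea concrete, here is a minimal sketch in plain Python. It does not use elfen or reproduce its API; the function name `lexical_features`, the column names, and the simple regex tokenizer are all assumptions chosen for illustration. It computes a few lexical-richness and length metrics for each row of a small tabular dataset.

```python
import re


def lexical_features(text: str) -> dict:
    """Compute a few simple surface features for one text (illustrative only)."""
    # Naive tokenizer (assumption): lowercase runs of letters and apostrophes.
    tokens = re.findall(r"[A-Za-z']+", text.lower())
    n_tokens = len(tokens)
    n_types = len(set(tokens))
    # Naive sentence splitter (assumption): split on ., ! and ?
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    return {
        "n_tokens": n_tokens,
        "n_types": n_types,
        # Type-token ratio: distinct tokens divided by total tokens.
        "type_token_ratio": n_types / n_tokens if n_tokens else 0.0,
        "avg_token_length": sum(map(len, tokens)) / n_tokens if n_tokens else 0.0,
        "avg_sentence_length": n_tokens / len(sentences) if sentences else 0.0,
    }


# Hypothetical tabular input: one record per text.
rows = [
    {"id": 1, "text": "The cat sat. The cat slept."},
    {"id": 2, "text": "Linguistic features quantify text."},
]

# Append the computed feature columns to each record.
features = [{**row, **lexical_features(row["text"])} for row in rows]

for record in features:
    print(record)
```

A dedicated package would typically replace the regex tokenizer with a proper NLP pipeline and cover far more feature families; the sketch only shows the shape of the transformation from a text column to numeric feature columns.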

Available on PyPI.

Use this if you need to transform raw text into measurable linguistic features for machine learning, statistical analysis, or linguistic research.

Not ideal if you're looking for a simple keyword extractor or a tool that only provides basic text statistics like word counts.

text-analysis linguistic-research natural-language-processing data-science readability-assessment

Maintenance 10 / 25

Adoption 7 / 25

Maturity 25 / 25

Community 12 / 25

Language

Python

License

MIT

Related tools

ziqizhang/jate

JATE - Just Automatic Term Extraction (in Python)

mcs07/ChemDataExtractor

Automatically extract chemical information from scientific documents

brucewlee/lftk

[BEA @ ACL 2023] General-purpose tool for linguistic features extraction; Tested on readability...

strangetom/ingredient-parser

A tool to parse recipe ingredients into structured data

explosion/projects

🪐 End-to-end NLP workflows from prototype to production
