brucewlee/lftk

[BEA @ ACL 2023] General-purpose tool for linguistic features extraction; Tested on readability assessment, essay scoring, fake news detection, hate speech detection, etc.

/ 100

Established

This tool helps researchers and analysts quickly extract over 200 linguistic features from text. You provide raw text documents, and it outputs specific measurements like readability scores, word difficulty, or noun counts. This is ideal for computational linguists, social scientists, or anyone analyzing text properties for studies or model building.

149 stars. Used by 1 other package. No commits in the last 6 months. Available on PyPI.

Use this if you need to rapidly calculate a wide array of linguistic metrics from text for research, analysis, or as inputs for machine learning models.

Not ideal if you're looking for deep learning embeddings or features beyond traditional, handcrafted linguistic measures.

linguistic-analysis text-readability computational-linguistics social-science-research natural-language-processing

Stale 6m

Maintenance 0 / 25

Adoption 11 / 25

Maturity 25 / 25

Community 19 / 25

How are scores calculated?

Stars

149

Forks

Language

Python

License

—

Compare

lftk and lingfeat

Related tools

ziqizhang/jate

JATE - Just Automatic Term Extraction (in Python)

mcs07/ChemDataExtractor

Automatically extract chemical information from scientific documents

mmmaurer/elfen

A python package to efficiently extract linguistic features for text/NLP datasets

strangetom/ingredient-parser

A tool to parse recipe ingredients into structured data

explosion/projects

🪐 End-to-end NLP workflows from prototype to production

Explore NLP Tools

All categories Trending NLP directory Insights