kk7nc/HDLTex

HDLTex: Hierarchical Deep Learning for Text Classification

58
/ 100
Established

This project helps classify large collections of documents into a hierarchy of categories, rather than a single flat list. You provide a dataset of documents with their associated hierarchical categories, and it outputs a model that can automatically sort new, unclassified documents into the correct nested categories. This is useful for researchers, librarians, or information managers dealing with extensive text archives.

278 stars. No commits in the last 6 months. Available on PyPI.

Use this if you need to automatically organize very large document collections into a complex, multi-level categorical structure where standard classification methods struggle.

Not ideal if your documents only need to be sorted into a small number of flat, non-hierarchical categories.

document-organization information-retrieval knowledge-management content-categorization academic-research
Stale 6m
Maintenance 0 / 25
Adoption 10 / 25
Maturity 25 / 25
Community 23 / 25

How are scores calculated?

Stars

278

Forks

66

Language

Python

License

MIT

Last pushed

Oct 10, 2024

Commits (30d)

0

Dependencies

8

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/kk7nc/HDLTex"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.