yuzhimanhua/HiGitClass

HiGitClass: Keyword-Driven Hierarchical Classification of GitHub Repositories (ICDM'19)

Score: 29 / 100 (Experimental)

This tool helps researchers and data scientists automatically categorize software projects by topic, organizing them into a clear, nested hierarchy. You provide repository information (like description, README text, and tags), and it outputs structured labels that classify the project into relevant scientific or technical domains. It's designed for those who need to understand and manage large collections of software projects, such as for bibliometric analysis or research trend tracking.

No commits in the last 6 months.

Use this if you need to systematically classify GitHub repositories into predefined, hierarchical categories like 'Bioinformatics > Genome Analysis' or 'Machine Learning > Image Generation' based on their textual content.

Not ideal if you're looking for a simple keyword search or a flat, non-hierarchical categorization of repositories.
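To make the input and output shape concrete, here is a toy sketch of top-down, keyword-based hierarchical labeling. It is not HiGitClass's actual algorithm: plain keyword counting stands in for the model, and the taxonomy, seed keywords, and function names (`TAXONOMY`, `classify`, `best_label`) are hypothetical, chosen only to mirror the example categories above.

```python
import re
from collections import Counter

# Hypothetical two-level taxonomy; every keyword set here is an illustrative assumption.
TAXONOMY = {
    "Bioinformatics": {
        "keywords": {"genome", "dna", "protein", "sequencing"},
        "children": {
            "Genome Analysis": {"genome", "variant", "alignment"},
            "Sequence Analysis": {"sequence", "fasta", "motif"},
        },
    },
    "Machine Learning": {
        "keywords": {"neural", "training", "model", "learning"},
        "children": {
            "Image Generation": {"gan", "image", "generation", "diffusion"},
            "Text Classification": {"text", "classification", "sentiment"},
        },
    },
}


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def best_label(counts, candidates):
    """Return the candidate whose keywords occur most often, or None if none match."""
    scores = {label: sum(counts[k] for k in kws) for label, kws in candidates.items()}
    label = max(scores, key=scores.get)
    return label if scores[label] > 0 else None


def classify(description, readme, tags):
    """Pick a top-level category first, then a child category beneath it."""
    counts = Counter(tokenize(" ".join([description, readme, *tags])))
    top = best_label(counts, {name: node["keywords"] for name, node in TAXONOMY.items()})
    if top is None:
        return []
    child = best_label(counts, TAXONOMY[top]["children"])
    return [top] if child is None else [top, child]


if __name__ == "__main__":
    path = classify(
        "A GAN for image generation",
        "Training a neural model to generate images",
        ["deep-learning"],
    )
    print(" > ".join(path))  # Machine Learning > Image Generation
```

Deciding the parent before the child keeps each decision local to one level of the hierarchy, which is the property that separates this kind of output from a flat list of tags.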

research-analysis software-discovery bibliometrics data-science-research topic-modeling
Stale 6m, No Package, No Dependents
Maintenance 0 / 25
Adoption 8 / 25
Maturity 16 / 25
Community 5 / 25

Stars: 60
Forks: 2
Language: Python
License: Apache-2.0
Last pushed: Apr 02, 2024
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/yuzhimanhua/HiGitClass"

Open to everyone: 100 requests/day without a key, or 1,000/day with a free key.
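The same request can be made from Python using only the standard library. This is a minimal sketch under two assumptions that the page does not confirm: that the endpoint returns JSON, and that the path generalizes as category/owner/repo (only the single URL above is documented). The helper names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category, owner, repo):
    """Build the endpoint URL; the category/owner/repo pattern is an assumption."""
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category, owner, repo, timeout=10):
    """Fetch the record and parse it, assuming the response body is JSON."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # Performs a network request, which counts against the daily limit.
    print(fetch_quality("nlp", "yuzhimanhua", "HiGitClass"))
```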