HappyHackingSpace/dit

HTML page, form and field type classifier using ML (LogReg + CRF)

40
/ 100
Emerging

This tool helps web researchers, analysts, and security professionals quickly understand the purpose of web pages and forms without manual inspection. You provide raw HTML content, and it identifies if it's a login page, a registration form, or a product page, and also labels individual fields like 'username' or 'password'. This allows for automated data collection or security assessments.

Use this if you need to automatically identify the type of web pages, classify forms (like login or search), and label individual form fields from raw HTML, especially for large-scale web analysis.

Not ideal if you need to interact with a web page as a browser would, or if your primary goal is to extract content that is not part of a form.

web-scraping security-auditing competitor-analysis data-extraction site-categorization
No Package No Dependents
Maintenance 10 / 25
Adoption 5 / 25
Maturity 11 / 25
Community 14 / 25

How are scores calculated?

Stars

12

Forks

3

Language

Go

License

MIT

Category

go-ml-bindings

Last pushed

Mar 09, 2026

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/HappyHackingSpace/dit"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.