HappyHackingSpace/dit
HTML page, form and field type classifier using ML (LogReg + CRF)
This tool helps web researchers, analysts, and security professionals understand the purpose of web pages and forms without manual inspection. Given raw HTML, it classifies the page (for example, as a login page, a registration form, or a product page) and labels individual form fields such as 'username' or 'password'. The results can feed automated data collection or security assessments.
Use this if you need to automatically identify the type of web pages, classify forms (like login or search), and label individual form fields from raw HTML, especially for large-scale web analysis.
Not ideal if you need to interact with a web page as a browser would, or if your primary goal is to extract content that is not part of a form.
Stars: 12
Forks: 3
Language: Go
License: MIT
Category:
Last pushed: Mar 09, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/HappyHackingSpace/dit"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
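The same request can be made from Go, the repository's own language. The sketch below is a minimal illustration, not an official client: `qualityURL` and `fetchQuality` are hypothetical helper names, the reading of the path as category/owner/repo is an assumption drawn from the single curl URL above, and the response body is returned as raw bytes because its schema is not documented here.

```go
package main

import (
	"fmt"
	"io"
	"net/http"
	"net/url"
	"time"
)

// baseURL is copied from the curl command above.
const baseURL = "https://pt-edge.onrender.com/api/v1/quality"

// qualityURL builds the endpoint for one repository.
// Assumption: the path is <category>/<owner>/<repo>, inferred from the curl example.
func qualityURL(category, owner, repo string) string {
	return fmt.Sprintf("%s/%s/%s/%s", baseURL,
		url.PathEscape(category), url.PathEscape(owner), url.PathEscape(repo))
}

// fetchQuality performs the GET and returns the raw response body.
// The body is left undecoded because the response format is not specified here.
func fetchQuality(client *http.Client, endpoint string) ([]byte, error) {
	resp, err := client.Get(endpoint)
	if err != nil {
		return nil, err
	}
	defer resp.Body.Close()
	if resp.StatusCode != http.StatusOK {
		return nil, fmt.Errorf("unexpected status: %s", resp.Status)
	}
	return io.ReadAll(resp.Body)
}

func main() {
	// A timeout keeps the call from hanging if the service is slow to respond.
	client := &http.Client{Timeout: 10 * time.Second}
	endpoint := qualityURL("ml-frameworks", "HappyHackingSpace", "dit")

	body, err := fetchQuality(client, endpoint)
	if err != nil {
		fmt.Println("request failed:", err)
		return
	}
	fmt.Println(string(body))
}
```

Keeping URL construction separate from the network call makes the path logic easy to check without spending any of the daily request quota.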
Higher-rated alternatives
SPFlow/SPFlow
Sum Product Flow: An Easy and Extensible Library for Sum-Product Networks
gomlx/gomlx
GoMLX: An Accelerated Machine Learning Framework For Go
montanaflynn/stats
A well tested and comprehensive Golang statistics library package with no dependencies.
mattn/go-tflite
Go binding for TensorFlow Lite
james-bowman/sparse
Sparse matrix formats for linear algebra supporting scientific and machine learning applications