Cerenaut/Preprocess-NIST-SD19

Preprocessing for NIST Special Dataset 19 (uppercase single-character handwritten characters A..Z). Converts to same formatting as Yann Lecun MNIST (handwritten numerical digits 0..9), and produces a set of PNG images.

37
/ 100
Emerging

This tool helps researchers and educators prepare image datasets for machine learning models that recognize handwritten characters. It takes raw image files of uppercase handwritten letters (A-Z) from the NIST Special Database 19 and transforms them into a standardized format, similar to the well-known MNIST dataset for handwritten digits. The output is a collection of PNG images, ready for use in combined alphanumeric recognition projects.

No commits in the last 6 months.

Use this if you need to standardize the appearance of handwritten uppercase character images (A-Z) to match the MNIST digit format for use in machine learning models.

Not ideal if you need to process handwritten lowercase letters, symbols, or other character sets, or if you require an automatically split training and test dataset.

handwriting-recognition character-recognition image-preprocessing machine-learning-datasets optical-character-recognition
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 5 / 25
Maturity 16 / 25
Community 16 / 25

How are scores calculated?

Stars

13

Forks

7

Language

Java

License

GPL-3.0

Last pushed

Jan 09, 2018

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/Cerenaut/Preprocess-NIST-SD19"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.