teddykoker/grokking

PyTorch implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"

29
/ 100
Experimental

This project helps machine learning researchers explore and understand the phenomenon of "Grokking" using small algorithmic datasets. It takes as input training parameters and outputs models that demonstrate generalization beyond overfitting, helping you reproduce the behaviors described in the original "Grokking" paper. Machine learning researchers and academics studying model generalization would find this useful.

No commits in the last 6 months.

Use this if you are a machine learning researcher interested in replicating and studying the 'Grokking' phenomenon, particularly with modular arithmetic datasets.

Not ideal if you are looking for a pre-trained model or a tool for applying machine learning to real-world, large-scale problems.

Machine Learning Research Model Generalization Deep Learning Theory Algorithmic Datasets Academic Research
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 7 / 25
Maturity 8 / 25
Community 14 / 25

How are scores calculated?

Stars

39

Forks

6

Language

Python

License

Last pushed

Dec 07, 2021

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/teddykoker/grokking"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.