maxencefaldor/nanoGPT-JAX

Repository nanoGPT from @karpathy accelerated with JAX/Flax! The simplest, fastest repository for training/finetuning medium-sized GPTs.

34 / 100 (Emerging)

This project helps machine learning engineers train or fine-tune medium-sized GPT models efficiently. You provide a text dataset, and it produces a trained GPT model that can generate new text similar to that dataset.

No commits in the last 6 months.

Use this if you are a machine learning engineer looking for a simple and fast way to train or fine-tune medium-sized GPT models using JAX/Flax.

Not ideal if you are looking for an out-of-the-box text generation tool without needing to engage in model training or fine-tuning.

Machine Learning Engineering, Natural Language Processing, Generative AI, Model Training, Deep Learning
Stale 6m, No Package, No Dependents
Maintenance 0 / 25
Adoption 5 / 25
Maturity 16 / 25
Community 13 / 25

Stars: 9
Forks: 2
Language: Jupyter Notebook
License: MIT
Last pushed: Jun 06, 2023
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/maxencefaldor/nanoGPT-JAX"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
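
The same request can be made from Python. The following is a minimal sketch using only the standard library. The helper names `build_quality_url` and `fetch_quality` are hypothetical, and the code assumes the endpoint returns a JSON body, which this page does not confirm. No response fields are assumed.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/llm-tools"


def build_quality_url(owner: str, repo: str) -> str:
    """Build the quality endpoint URL for an owner/repo pair (hypothetical helper)."""
    # Percent-encode each path segment so unusual characters cannot alter the path.
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record for a repository.

    Assumption: the response body is JSON. If it is not, json.loads raises
    a ValueError and the caller should inspect the raw response instead.
    """
    url = build_quality_url(owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Example call (requires network access and counts against the daily limit):
# data = fetch_quality("maxencefaldor", "nanoGPT-JAX")
```

Because anonymous access is limited to 100 requests/day, it is worth caching the returned record locally rather than calling `fetch_quality` repeatedly for the same repository.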