erogol/BlaGPT
Experimental playground for benchmarking language model (LM) architectures, layers, and tricks on smaller datasets. Designed for flexible experimentation and exploration.
This is a platform for deep learning researchers and practitioners to quickly compare different architectural choices, layers, and training tricks for language models. It allows you to input various model configurations and small datasets to see how different techniques impact performance metrics like loss and memory usage. It's ideal for those experimenting with new ideas in language model design.
102 stars.
Use this if you are a researcher or engineer looking to understand the practical impact of new language model techniques or compare existing ones on smaller datasets.
Not ideal if you are a general user looking for a pre-trained language model or a large-scale deployment solution.
Stars
102
Forks
13
Language
Python
License
—
Category
Last pushed
Jan 26, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/erogol/BlaGPT"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
tabularis-ai/be_great
A novel approach for synthesizing tabular data using pretrained large language models
EleutherAI/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron...
shibing624/textgen
TextGen: Implementation of Text Generation models, include LLaMA, BLOOM, GPT2, BART, T5, SongNet...
ai-forever/ru-gpts
Russian GPT3 models.
AdityaNG/kan-gpt
The PyTorch implementation of Generative Pre-trained Transformers (GPTs) using Kolmogorov-Arnold...