erogol/BlaGPT

Experimental playground for benchmarking language model (LM) architectures, layers, and tricks on smaller datasets. Designed for flexible experimentation and exploration.

/ 100

Emerging

This is a platform for deep learning researchers and practitioners to quickly compare different architectural choices, layers, and training tricks for language models. It allows you to input various model configurations and small datasets to see how different techniques impact performance metrics like loss and memory usage. It's ideal for those experimenting with new ideas in language model design.

102 stars.

Use this if you are a researcher or engineer looking to understand the practical impact of new language model techniques or compare existing ones on smaller datasets.

Not ideal if you are a general user looking for a pre-trained language model or a large-scale deployment solution.

language-model-research deep-learning-experimentation neural-network-architecture machine-learning-engineering natural-language-processing

No License No Package No Dependents

Maintenance 10 / 25

Adoption 9 / 25

Maturity 8 / 25

Community 14 / 25

How are scores calculated?

Stars

102

Forks

Language

Python

License

—

Higher-rated alternatives

tabularis-ai/be_great

A novel approach for synthesizing tabular data using pretrained large language models

EleutherAI/gpt-neox

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron...

shibing624/textgen

TextGen: Implementation of Text Generation models, include LLaMA, BLOOM, GPT2, BART, T5, SongNet...

ai-forever/ru-gpts

Russian GPT3 models.

AdityaNG/kan-gpt

The PyTorch implementation of Generative Pre-trained Transformers (GPTs) using Kolmogorov-Arnold...

Explore Transformer Models

All categories Trending Transformer directory Insights