GPT Multilingual Training LLM Tools

Tools for training GPT models on non-English languages and domain-specific datasets (poetry, regional languages). Does NOT include general English-language GPT implementations, architecture education, or inference-only tools.

There are 135 gpt multilingual training tools tracked. 1 score above 70 (verified tier). The highest-rated is Nixtla/nixtla at 71/100 with 3,792 stars. 1 of the top 10 are actively maintained.

Get all 135 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=llm-tools&subcategory=gpt-multilingual-training&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Tool Score Tier
1 Nixtla/nixtla

TimeGPT-1: production ready pre-trained Time Series Foundation Model for...

71
Verified
2 andrewdalpino/NoPE-GPT

A GPT-style small language model (SLM) with no positional embeddings (NoPE).

52
Established
3 sigdelsanjog/gptmed

pip install gptmed

52
Established
4 akanyaani/gpt-2-tensorflow2.0

OpenAI GPT2 pre-training and sequence prediction implementation in Tensorflow 2.0

49
Emerging
5 samkamau81/FinGPT_

FinGPT is an AI language model designed to understand and generate financial...

45
Emerging
6 VinAIResearch/PhoGPT

PhoGPT: Generative Pre-training for Vietnamese (2023)

44
Emerging
7 teddykoker/image-gpt

PyTorch Implementation of OpenAI's Image GPT

43
Emerging
8 LIYUESEN/druggpt

DrugGPT: A GPT-based Strategy for Designing Potential Ligands Targeting...

42
Emerging
9 alibaba/graph-gpt

Generative Pre-trained Graph Eulerian Transformer [ICML2025]

42
Emerging
10 milmor/GPT

Implementation of Generative Pretrained Transformer Model in Tensorflow / Keras

40
Emerging
11 aaron-wheeler/MarketGPT

MarketGPT: Developing a Pre-trained transformer (GPT) for Modeling Financial...

40
Emerging
12 mkt1412/GraspGPT_public

code implementation of GraspGPT and FoundationGrasp

38
Emerging
13 hunar4321/reweight-gpt

Reweight GPT - a simple neural network using transformer architecture for...

38
Emerging
14 AspirinCode/iupacGPT

IUPAC-based large-scale molecular pre-trained model for property prediction...

38
Emerging
15 abhaskumarsinha/Corpus2GPT

Corpus2GPT: A project enabling users to train their own GPT models on...

34
Emerging
16 duydvu/gpt-j-6B-vietnamese-news-api

Vietnamese GPT-J API service deployed with Docker & Helm chart

34
Emerging
17 Koziev/verslibre

Using transformers to generate Russian poetry

33
Emerging
18 BanglaGPT/bangla-gpt

Training code for BanglaGPT model

33
Emerging
19 brown-palm/AntGPT

Official code implemtation of paper AntGPT: Can Large Language Models Help...

32
Emerging
20 tufts-ml/G2PT

Graph generative pre-trained transformer

32
Emerging
21 MetaTrustLabs/GPTScan

Indexing three datasets for GPTScan

32
Emerging
22 PointsCoder/GPT-Driver

Learning to Drive with GPT

32
Emerging
23 Rishikesh-Jadhav/Video-Compression-and-Future-Prediction-Using-GPT

This repository presents a project focused on advanced video compression and...

31
Emerging
24 jmaczan/gpt

Generative Pre-trained Transformer in PyTorch from scratch

31
Emerging
25 creatorrr/cryptgpt

Pretrain a model on ciphered text so only you can use it

31
Emerging
26 VanekPetr/my-own-GPT

A simple PyTorch re-implementation of the OpenAI GPT (Generative Pretrained...

31
Emerging
27 fraserlove/gpt-alpha

GPT-α is a 124 million parameter decoder-only language model following the...

31
Emerging
28 MauroCE/DanteGPT

DanteGPT

31
Emerging
29 VachanVY/gpt.jax

Generative Pretrained Model (GPT) in JAX. A step by step guide to train LLMs...

31
Emerging
30 PetropoulakisPanagiotis/gpt-practice

GPT code - I completed the tutorial for building GPT components by Andrej...

31
Emerging
31 n4ze3m/footgpt

FootGPT is a GPT-based language model for football news

31
Emerging
32 berkerdemirel/GPT-from-scratch

Re-implementation of Andrej Karpathy's nanoGPT

30
Emerging
33 s4m-mo/tf-gpt

A TensorFlow implementation of GPT.

30
Emerging
34 xtreamsrl/build-your-own-gpt

Repository for the AMLD 2024 Workshop "At the cutting edge of Generative AI...

29
Experimental
35 nguyenphuminh/planckgpt

Train a GPT from scratch on your laptop

28
Experimental
36 ksupasate/GPTForAnything

This is a repository aimed at promoting open-source contributions for...

28
Experimental
37 gromdimon/beLLM

beLLM: GPT for belarusian language

28
Experimental
38 jbxamora/reversenanogpt

A minimal character-level language model using Transformer architecture in PyTorch

27
Experimental
39 xavierzheng/pxgpt

Phenotype eXplore GPT. Use multimodel LLM for structural plant phenotyping

26
Experimental
40 Laz4rz/GPT-2

Following Karpathy with GPT-2 implementation and training, writing lots of...

26
Experimental
41 mohd-faizy/GPT1-From-Scratch

This project implements GPT-1 using PyTorch, focusing on foundational...

25
Experimental
42 saforem2/wordplay

Playing with words

25
Experimental
43 ariannamethod/postgpt

GPT with metaweights: weights that don't actually exist

25
Experimental
44 jwchoi95/GPT_MLP

Official source codes for implementing "Accelerating materials language...

24
Experimental
45 antonio-f/GPT_from_scratch

Very simple implementation of GPT architecture using PyTorch and Jupyter.

24
Experimental
46 pguso/gpt-from-scratch

Implementation of a small GPT-style transformer from scratch in PyTorch....

24
Experimental
47 ashishsalunkhe/DickensSpeaks

Text Generation trained on the Short Stories of Charles Dickens using RNN,...

23
Experimental
48 abdullateefv/PeptideGPT

GPT powered plugin & fine tuned model for natural language interaction with...

23
Experimental
49 zhoucaiNi/poet-gpt-2

Generative LLM specifically trained to generate poems This LLM uses...

23
Experimental
50 betogaona7/gptpose

GPT pose image generator to condition SD models with ControlNet OpenPose

23
Experimental
51 aaditya29/GPT-2-124M-

Reproducing GPT-2 (124M) from Scratch in PyTorch

23
Experimental
52 fostiropoulos/ReGPT

Code for our work published at ICJAI 2023 Workshop on Knowledge-Based...

22
Experimental
53 DanielPuentee/gpt-from-zero

Create your own GPT model from scratch

22
Experimental
54 muhammad-fiaz/gpt

A simple implementation based on the "Attention is All You Need" paper,...

22
Experimental
55 mkashirin/cattode

Lil GPT and BPE built from scratch using PyTorch.

22
Experimental
56 Michaelgathara/GPT

FineWeb-EDU trained Billion+ Parameter Model

22
Experimental
57 joshyattridge/GPTkeras

Automates Keras model design using GPT - generates architectures, tunes...

22
Experimental
58 mcpeixoto/gpt

Implementation of a scalled down ChatGPT-like transformer pretraining using PyTorch

22
Experimental
59 LongpanZhou/Pat-GPT

This is a GPT model but brain rot rizzler... Implements standard GPT-2...

21
Experimental
60 amskit/in-naamgpt

Generate authentic-sounding Hindi names using a minimalist GPT built from...

21
Experimental
61 Uokoroafor/gpt_from_scratch

This is a PyTorch implementation of a smaller version of the GPT model

21
Experimental
62 kalvin807/sherlock

an attempt to generate code change from issue using LLM

20
Experimental
63 mytechnotalent/gpt_from_scratch

This notebook builds a complete GPT (Generative Pre-trained Transformer)...

20
Experimental
64 koayon/atp_star

PyTorch and NNsight implementation of AtP* (Kramar et al 2024, DeepMind)

19
Experimental
65 AlexGidiotis/gpt-light

The easiest repo for building GPT applications.

19
Experimental
66 HoangHao1009/hminiGPT

Pre-train GPT model by your txt

19
Experimental
67 Pacatro/gpoetry

A tiny GPT model to generate spanish poetry

18
Experimental
68 sszzz830/TensorFlow-in-GPT-4-advanced-data-analysis-mode

Install TensorFlow Lite on GPT-4 (advanced data analysis mode), or any other...

18
Experimental
69 purang2/GPT

OpenAI GPT, Generative Pre-Training

18
Experimental
70 sumo1/gpt-reproduction-SFT-RLHF

OpenAI...

18
Experimental
71 den1ksk/GPTClimat

GPT model for climate analysis

17
Experimental
72 billh0420/MathAssertGPT

Create a Generative Pretrained Transformer model for Metamath to generate...

17
Experimental
73 mjub/nlab-gpt

A small, custom GPT trained on nLab text, with an analysis of emergent...

17
Experimental
74 azimonti/gpt-playground

My playground for Generative Pre-trained Transformer (GPT) implementation

17
Experimental
75 B4S1C-Coder/GPT-2-from-scratch

GPT-2 Implementation using only PyTorch and Tiktoken

17
Experimental
76 tedoaba/GPT-from-scratch

GPT Model from scratch

17
Experimental
77 mytechnotalent/ToyGPT

ToyGPT, inspired by Andrej Karpathy’s GPT from scratch, creates a toy...

17
Experimental
78 UgurKap/gpt-implementation

This repository contains my personal implementation and experiments while...

17
Experimental
79 yogeshHax/Big_Dragon_Hatchling

BDH-Dragon: A custom dual-GPU Transformer model optimizing RoPE for...

15
Experimental
80 sigdelsanjog/code-llm

pip install gptgpt

14
Experimental
81 SCCSMARTCODE/gpt2-from-scratch

A fundamental implementation of the GPT-2 architecture from scratch,...

14
Experimental
82 derinworks/penr-oz-gpt-example

Implementation of an example GPT for understanding how next character is...

14
Experimental
83 krthr/poesia.pics-old

Convierte tus fotografías en poemas usando BLIP + GPT-4

14
Experimental
84 binoydipu/kobigpt

KobiGPT is a character-level GPT language model trained exclusively on the...

14
Experimental
85 seanwevans/pg_gpt2

gpt2 in postgres

14
Experimental
86 fraserlove/gpt-base

GPT-base is a basic decoder-only language model following the architecture...

14
Experimental
87 woodsj1206/Project-Gutenberg-GPT

This project trains a GPT model on English eBooks from Project Gutenberg.

14
Experimental
88 ravijo/TrumpGPT

A GPT model trained to mimic Donald Trump's style - just for fun

13
Experimental
89 BlairStanek/gpt-statutes

Probe how GPT-n performs on statutory reasoning

13
Experimental
90 alphatechlogics/FaseehGPT

FaseehGPT is an advanced pipeline for training a GPT-style language model...

13
Experimental
91 RisticDjordje/personalized-autocomplete-next-word-prediction-task

A personalized autocomplete (next word prediction) project using three...

13
Experimental
92 MrFishPL/gpt

I built this repo to prove to my granny that I can implement GPT.

13
Experimental
93 BlazeWild/GPT_FROM_SCRATCH

Minimal GPT implementation from scratch using PyTorch — trains a...

13
Experimental
94 onemriganka/GPT-0.5m

A 0.5 million parameter character-level Transformer model in PyTorch, base...

13
Experimental
95 suryanshgupta9933/Hindi-GPT

Hindi GPT is a transformer based language model trained on Hindi Oscar...

13
Experimental
96 KunkelAlexander/lets-play-with-gpts

Explore GPTs based on a series of Youtube videos.

13
Experimental
97 lewisnjue/gpt-2

gpt-2

13
Experimental
98 patrykniemczyk/gpt

A minimal from-scratch implementation of the GPT architecture with BPE...

13
Experimental
99 ravijo/GPT101

Getting started with GPT for language modeling

13
Experimental
100 LakshayD02/Natural_Language_Generation_AIModel_Python

An interactive application leveraging a pre-trained language model (GPT-2)...

12
Experimental
101 Manas02/ScaffoldGPT

Scaffold Generative Pretraining

12
Experimental
102 wansiqing1226/Younker_CISSP_GPT

A GPT model using fine-tuning techniques for the purpose of CISSP study.

12
Experimental
103 RISHIT7/GPT-From-Scratch

A repository that aims to model and train a GPT from scratch.

12
Experimental
104 zilaeric/othello-gpt-probing

Training and exploration of linear probes into Othello-GPT by Li et al. (2022)

12
Experimental
105 ES7/GPT-from-Scratch

In this repository, I have created the GPT architecture, provided the code...

12
Experimental
106 zmactep/pymol-gpt

Fast and dirty GPT injection into PyMOL

12
Experimental
107 hrithiksagar/Reproducing-GPT-2

Reproducing GPT-2 (124M) from scratch, following Mr. Karpathy's tutorial.

12
Experimental
108 abdussahid26/GPT-2-Model-from-Scratch-to-Generate-Text

Implementation of a GPT-2 model from scratch for text generation. This...

12
Experimental
109 iamNCJ/YuanGPT

GPT-like Large Language Model Pretrained on Inspur's Yuan Dataset

12
Experimental
110 LaurenceLungo/GPT-from-Scratch

PyTorch implementation of GPT from scratch

11
Experimental
111 satojkovic/gpt-tf-pytorch-jax

GPT from scratch (tensorflow / pytorch / jax)

11
Experimental
112 Mind-Interfaces/GPT-0

Generative Pre-trained Transformer

11
Experimental
113 willdphan/gpt-pdf

Using GPT to analyze PDFs

11
Experimental
114 Archistein/GPT-from-scratch

GPT: build and pre-training from scratch

11
Experimental
115 LuciAkirami/TRealestateGPT

A Custom GPT that can find the best places to live based on the user's queries

11
Experimental
116 huntergregory/gpt-country-facts

Educational UI: learn facts about countries using GPT

11
Experimental
117 ighoshsubho/QuestionAnswerGPT

Transformer based GPT finetuned on Standford Question Answer Dataset

11
Experimental
118 dirias/GPT-Privacy-Policies

This repo creates a small GPT system along with a web interface to generate...

11
Experimental
119 r-gould/gpt

An implementation of the GPT model in PyTorch.

11
Experimental
120 mateoservent/gpt-batching

Data preprocessing, processing, and collecting for batch jobs with OpenAI API

11
Experimental
121 MiguelCabreraVictoria/GPT

This project is an implementation of a GPT-based model trained on a dataset...

11
Experimental
122 rimo02/TwistGpt

GPT2 made from scratch and trained on a small text corpus

11
Experimental
123 auhide/beet-gpt

BeetGPT is a dialogue generation model trained on all 9 seasons of The Office.

11
Experimental
124 qbxlvnf11/GPT-series

Implementation of GPT models

11
Experimental
125 Jai0212/GPT-RNN-Poetry-Generator

A generative pre-trained transformer (GPT) using a recurrent neural network...

11
Experimental
126 gabrieltenerife/GptDesde0

Building a Gpt-type model

10
Experimental
127 nguyenhongson1902/gpt-from-scratch

I built a GPT model from scratch to generate text

10
Experimental
128 nickgnd/gpt-from-scratch-with-nx-and-axon

Implementation of "Let's build GPT: from scratch, in code, spelled out"...

10
Experimental
129 13X-Labs/13xlabs-gpt

GPT-2 is a natural language processing technology developed by OpenAI and...

10
Experimental
130 Dani1232312/gpt-api-evaluation

A data analasys application in a Jupyter Notebook that is able to determine...

10
Experimental
131 JayThibs/gpt-experiments

This repository contains various experiments and prototypes to get use to...

10
Experimental
132 vinhtran2611/KieuGPT

AI writes poetry

10
Experimental
133 xiahouzuoxin/zxlearn

Minimum implementation things (e.g. transformer, torchrec, chatgpt_webui...

10
Experimental
134 13X-Labs/gpt2-text-generation-xla

As the quality of large language models increases, so do our expectations of...

10
Experimental
135 inner-LMNt/Self-GPT

My own attempt at implementing Language Models like GPT from scratch.

10
Experimental

Comparisons in this category