All Transformer Models
7,795 models ranked by quality score · Page 35 of 78
| # | Model | Score | Tier |
|---|---|---|---|
| 3401 |
ZifanL/TSDS
Implementation of TSDS: Data Selection for Task-Specific Model Finetuning.... |
|
Experimental |
| 3402 |
changwoolee/BLAST
[NeurIPS 2024] BLAST: Block Level Adaptive Structured Matrix for Efficient... |
|
Experimental |
| 3403 |
kyegomez/MuonClip
This repository is an open source implementation of the MuonClip strategy... |
|
Experimental |
| 3404 |
AlexIoannides/llm-regression
Exploring the classical regression capabilities of LLMs. |
|
Experimental |
| 3405 |
m3hrdadfi/zabanshenas
Zabanshenas is a solution for identifying the most likely language of a... |
|
Experimental |
| 3406 |
kyegomez/MLXTransformer
Simple Implementation of a Transformer in the new framework MLX by Apple |
|
Experimental |
| 3407 |
ThaminduR/mt5-simplification
Scripts related to training and predicting Google's mt5 model |
|
Experimental |
| 3408 |
lizhaoliu-Lec/CG-VLM
This is the official repo for Contrastive Vision-Language Alignment Makes... |
|
Experimental |
| 3409 |
lechmazur/bazaar
The BAZAAR challenges LLMs to navigate the double-auction marketplace, where... |
|
Experimental |
| 3410 |
oooranz/Baby-CoThought
๐ผ Baby's CoThought: Leveraging LLMs for Enhanced Reasoning in Compact Models... |
|
Experimental |
| 3411 |
s-omranpour/Music-Generation
A toolkit for symbolic music generation in PyTorch (using transformers and rnn) |
|
Experimental |
| 3412 |
dimitreOliveira/hf_tf_serving_examples
Simple examples of serving HuggingFace models with TensorFlow Serving |
|
Experimental |
| 3413 |
kabachuha/nanoGPKANT
Testing KAN-based text generation GPT models |
|
Experimental |
| 3414 |
g1ibby/llm-deploy
Tool to manage ollama model on vast.ai |
|
Experimental |
| 3415 |
haormj/llama2.go
Inference Llama 2 in one file of pure go |
|
Experimental |
| 3416 |
fbaldassarri/llama-cpp-container
Docker image to deploy a llama-cpp container with conda-ready environments |
|
Experimental |
| 3417 |
KarthikSriramGit/H.E.I.M.D.A.L.L
H.E.I.M.D.A.L.L looks at fleet telemetry and gives you natural-language... |
|
Experimental |
| 3418 |
modelize-ai/LLM-Inference-Deployment-Tutorial
Tutorial for LLM developers about engine design, service deployment,... |
|
Experimental |
| 3419 |
Bhoomika2224/MinivLLM
๐ Implement a powerful vLLM inference engine with advanced attention... |
|
Experimental |
| 3420 |
GhTara/Dose_Prediction
A Cascade Transformer-based Model for 3D Dose Distribution Prediction in... |
|
Experimental |
| 3421 |
noah-hein/mazeGPT
AI model for making mazes that extends OpenAIs GPT2 model |
|
Experimental |
| 3422 |
wowsinfo/Convert-Migrate-LLM
Convert & Migrate from one technology to another ones using any LLM |
|
Experimental |
| 3423 |
OPTML-Group/Unlearn-Trace
Unlearning Isn't Invisible: Detecting Unlearning Traces in LLMs from Model Outputs |
|
Experimental |
| 3424 |
gao-g/prelude
Code for the paper "Aligning LLM Agents by Learning Latent Preference from... |
|
Experimental |
| 3425 |
TRISTAN-ORF/RiboTIE
Scripts and instructions to apply RiboTIE on Ribo-seq data |
|
Experimental |
| 3426 |
blazejdolicki/bert-sarcasm-detection
Sarcasm detection with BERT |
|
Experimental |
| 3427 |
SlytherinGe/RSTeller
Vision-Language Dataset for Remote Sensing |
|
Experimental |
| 3428 |
kozodoi/Text_Readability_Prediction
Predicting text reading complexity with transformers (top-9% Kaggle solution... |
|
Experimental |
| 3429 |
The-Martyr/Awesome-Modality-Priors-in-MLLMs
Latest Advances on Modality Priors in Multimodal Large Language Models |
|
Experimental |
| 3430 |
mohsenMahmoodzadeh/image-and-text-classifier
Deep learning models(CNN, LSTM, BERT) for image and text classification task... |
|
Experimental |
| 3431 |
sagorbrur/fillblank
Fill The Blank |
|
Experimental |
| 3432 |
Meaquadddd/DPO-Shift
DPO-Shift: Shifting the Distribution of Direct Preference Optimization |
|
Experimental |
| 3433 |
rivas-lab/Smiles2Dock
Smiles2Dock: an open large-scale multi-task dataset for ML-based molecular... |
|
Experimental |
| 3434 |
shubhamkaushal765/TransformerQEC
Utilizing Transformers to correct errors in quantum circuits. |
|
Experimental |
| 3435 |
haozheji/exact-optimization
ICML 2024 - Official Repository for EXO: Towards Efficient Exact... |
|
Experimental |
| 3436 |
leonjovanovic/keywords-extraction
Keyword extraction using Scake, KeyBERT, Fine-tuning Transformer BERT-like... |
|
Experimental |
| 3437 |
lpalbou/model-quantizer
Effortlessly quantize, benchmark, and publish Hugging Face models with... |
|
Experimental |
| 3438 |
Human-Centric-Machine-Learning/strategic-ttc
Code for "Test-Time Compute Games", 2026 |
|
Experimental |
| 3439 |
leuas/Vrdndi
A full-stack context-aware productivity-focused recommendation system |
|
Experimental |
| 3440 |
bayeslabs/maslibpy
MASLibPy : Lightweight library for multi-agent systems with LLM integration... |
|
Experimental |
| 3441 |
nikhil6041/OLI-and-Meme-Classification
Author's implementation of the paper... |
|
Experimental |
| 3442 |
LeonEricsson/llmcontext
:anger: Pressure testing the context window of open LLMs |
|
Experimental |
| 3443 |
xiuqhou/DAPE
[AAAI2026] Official implementation of the paper "DAPE: Harmonizing... |
|
Experimental |
| 3444 |
somosnlp/the-annotated-transformer
Traducciรณn al espaรฑol del notebook "The Annotated Transformer" de Harvard... |
|
Experimental |
| 3445 |
jlamprou/Infini-Attention
Efficient Infinite Context Transformers with Infini-attention Pytorch... |
|
Experimental |
| 3446 |
YangLing0818/SuperCorrect-llm
[ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought... |
|
Experimental |
| 3447 |
Praful932/llmsearch
Find better generation parameters for your LLM |
|
Experimental |
| 3448 |
MTxSouza/MediumArticleGenerator
A Language Model (LLM) trained to generate text similar to Medium articles. |
|
Experimental |
| 3449 |
feifeibear/Odysseus-Transformer
Odysseus: Playground of LLM Sequence Parallelism |
|
Experimental |
| 3450 |
joeljang/continual-knowledge-learning
[ICLR 2022] Towards Continual Knowledge Learning of Language Models |
|
Experimental |
| 3451 |
LFhase/CausalCOAT
[NeurIPS 2024] Discovery of the Hidden World with Large Language Models |
|
Experimental |
| 3452 |
MIMICLab/L-Verse
L-Verse: Bidirectional Generation Between Image and Text |
|
Experimental |
| 3453 |
kyegomez/Chai-1
An free and open source community implementation of Chai-1 in PyTorch |
|
Experimental |
| 3454 |
Marker-Inc-Korea/KO-Platypus
[KO-Platy๐ฅฎ] Korean-Open-platypus๋ฅผ ํ์ฉํ์ฌ llama-2-ko๋ฅผ fine-tuningํ KO-platypus model |
|
Experimental |
| 3455 |
yassenayoub/NEO
๐ Explore NEO, a groundbreaking native vision-language model designed to... |
|
Experimental |
| 3456 |
JuliusScheuerer/nlp-job-classifier
Text classification with fine-tuned DistilBERT โ FastAPI + Streamlit |
|
Experimental |
| 3457 |
MitulNakrani003/AI-Enhanced-IR-System
AI-enhanced search pipeline using hybrid retrieval + transformer models for... |
|
Experimental |
| 3458 |
ndoll1998/active-transformers
Active Learning for Transformer with focus on Sequence Tagging tasks |
|
Experimental |
| 3459 |
MysterionRise/transformers-nlp-suite
Enterprise NLP Platform - Production REST API with auth, rate limiting,... |
|
Experimental |
| 3460 |
algunion/UniLM.jl
UniLM.jl: Currently a Julia interface for OpenAI's (+Azure) language models,... |
|
Experimental |
| 3461 |
ArchitJ6/Llama2-FineTuning
๐ฆ Llama2-FineTuning: Fine-tune LLAMA 2 with Custom Datasets Using LoRA and... |
|
Experimental |
| 3462 |
Onco-Logic/Onco-Logic
Onco-Logic is a comprehensive, multi-modal decision support ecosystem... |
|
Experimental |
| 3463 |
Kuldeepmorya/LLM-TradeBot
๐ค Optimize your futures trading with LLM-TradeBot, an intelligent... |
|
Experimental |
| 3464 |
StarLight1212/LLM-and-Generative-Models-Community
AI Community Tutorial, including: LoRA/Qlora LLM fine-tuning, Training GPT-2... |
|
Experimental |
| 3465 |
RobinSmits/Dutch-LLMs
Various training, inference and validation code and results related to Open... |
|
Experimental |
| 3466 |
sammcj/llm-templates
My LLM Templates (Ollama Modelfiles & Tabby Templates + Presets) |
|
Experimental |
| 3467 |
csm9493/efficient-llm-unlearning
Towards Robust and Parameter-Efficient Knowledge Unlearning for LLMs (ICLR 2025) |
|
Experimental |
| 3468 |
Peiyang-Song/LLM-A-Not-B-Errors
Official repository for paper "In-Context Learning May Not Elicit... |
|
Experimental |
| 3469 |
twitter-research/lmsoc
Code for reproducing our paper: LMSOC: An Approach for Socially Sensitive Pretraining |
|
Experimental |
| 3470 |
hitz-zentroa/This-is-not-a-Dataset
We introduce a large semi-automatically generated dataset of ~400,000... |
|
Experimental |
| 3471 |
januverma/transformers-stuff
Codes, scripts, and notebooks on various aspects of transformer models. |
|
Experimental |
| 3472 |
thevasudevgupta/transformers-adapters
This repositary hosts my experiments for the project, I did with OffNote Labs. |
|
Experimental |
| 3473 |
hmohebbi/ValueZeroing
The official repo for the EACL 2023 paper "Quantifying Context Mixing in... |
|
Experimental |
| 3474 |
merekat/children-stories
OhanashiGPT is an application that generates personalized children's stories... |
|
Experimental |
| 3475 |
eigencore/Tlama_124M
Tlama (124M) is a language model based on LlaMa3 (127M) optimized by... |
|
Experimental |
| 3476 |
LefterisKyriazanos/market_research_assistant
An AI-based tool that automates market research survey generation and... |
|
Experimental |
| 3477 |
Lahdhirim/NLP-financial-question-answering-tool
Fine-tuning a text-to-text transformer model (T5) on a financial question... |
|
Experimental |
| 3478 |
Navy10021/KRLawGPT
KRLawGPT : Generative Pre-trained Transformer for producing Korean Legal Text |
|
Experimental |
| 3479 |
kamyarghajar/DistilledNeuralResponseRanker
Implementation of "Distilling Knowledge for Fast Retrieval-based Chat-bots"... |
|
Experimental |
| 3480 |
jkanalakis/finetuning-llama-model-for-text-generation-using-unsloth
Fine-tuning Llama 3.2 3B Instruct model for text generation using Unsloth AI |
|
Experimental |
| 3481 |
wearesulie/sulie
Access to Sulie foundation models for time-series forecasting ๐ |
|
Experimental |
| 3482 |
kyegomez/MultiModalCrossAttn
The open source implementation of the cross attention mechanism from the... |
|
Experimental |
| 3483 |
vlddshk/Transformer_translator
This project implements a neural machine translation system from French to... |
|
Experimental |
| 3484 |
pangatlo/RL-100
๐ค Implement advanced robotic manipulation techniques using real-world... |
|
Experimental |
| 3485 |
fatemehpesaran310/Text2Chart31
Official PyTorch implementation of "Text2Chart31: Instruction Tuning for... |
|
Experimental |
| 3486 |
kyegomez/AudioFlamingo
Implementation of the model "AudioFlamingo" from the paper: "Audio Flamingo:... |
|
Experimental |
| 3487 |
Trustworthy-ML-Lab/Describe-and-Dissect
[TMLR 25] An automated method for explaining complex neuron behaviors in... |
|
Experimental |
| 3488 |
vishvaRam/Fine-Tuning-Siglip2-Vit-Model
This repository offers tools and guidance for fine-tuning the Siglip2 Vision... |
|
Experimental |
| 3489 |
AndreaCossu/continual-pretraining-nlp-vision
Code to reproduce experiments from the paper "Continual Pre-Training... |
|
Experimental |
| 3490 |
Jagoul/BLEND
This repository contains the official implementation of BLEND, a novel... |
|
Experimental |
| 3491 |
Lucien2468/Ollama-TurboQuant-Integration
TurboQuant: Native 3-Bit Quantization for Ollama - Achieve 25-28% better... |
|
Experimental |
| 3492 |
csiro-robotics/FactoFormer
[IEEE T-GRS 2024] The official repository for Journal Article โFactoFormer:... |
|
Experimental |
| 3493 |
ivallesp/cFavorita
A project for solving demand forecast of a medium retailer using a simple... |
|
Experimental |
| 3494 |
SuchetSanjeev/EncryptedTrafficAttackClassifierLLMs
This cybersecurity classifier integrates a lightweight LLM with a Random... |
|
Experimental |
| 3495 |
zjunlp/Knowledge2Data
[TASLP 2025] Spatial Knowledge Graph-Guided Synthesis for Multimodal LLMs |
|
Experimental |
| 3496 |
rookiemann/vllm-windows-build
Native Windows build patches for vLLM v0.14.1 โ MSVC 2022 + CUDA 12.6, 26... |
|
Experimental |
| 3497 |
sauradip/fewshotQAT
[BMVC 2021]: Official PyTorch implementation of : "Few Shot Temporal Action... |
|
Experimental |
| 3498 |
TingjiaInFuture/pixrep
Let LLMs see your codebase just like you do. |
|
Experimental |
| 3499 |
hrithickcodes/transformer-tf
This repository contains the code for the paper "Attention Is All You Need"... |
|
Experimental |
| 3500 |
LinkScapeOfficial/Ollmao
Ollmao (OH-luh-MAO) is a native SwiftUI app that integrates with Ollama to... |
|
Experimental |