All Transformer Models

7,795 models ranked by quality score · Page 35 of 78

Showing 3401–3500 of 7,795
# Model Score Tier
3401 ZifanL/TSDS

Implementation of TSDS: Data Selection for Task-Specific Model Finetuning....

27
Experimental
3402 changwoolee/BLAST

[NeurIPS 2024] BLAST: Block Level Adaptive Structured Matrix for Efficient...

27
Experimental
3403 kyegomez/MuonClip

This repository is an open source implementation of the MuonClip strategy...

27
Experimental
3404 AlexIoannides/llm-regression

Exploring the classical regression capabilities of LLMs.

27
Experimental
3405 m3hrdadfi/zabanshenas

Zabanshenas is a solution for identifying the most likely language of a...

27
Experimental
3406 kyegomez/MLXTransformer

Simple Implementation of a Transformer in the new framework MLX by Apple

27
Experimental
3407 ThaminduR/mt5-simplification

Scripts related to training and predicting Google's mt5 model

27
Experimental
3408 lizhaoliu-Lec/CG-VLM

This is the official repo for Contrastive Vision-Language Alignment Makes...

27
Experimental
3409 lechmazur/bazaar

The BAZAAR challenges LLMs to navigate the double-auction marketplace, where...

27
Experimental
3410 oooranz/Baby-CoThought

๐Ÿผ Baby's CoThought: Leveraging LLMs for Enhanced Reasoning in Compact Models...

27
Experimental
3411 s-omranpour/Music-Generation

A toolkit for symbolic music generation in PyTorch (using transformers and rnn)

27
Experimental
3412 dimitreOliveira/hf_tf_serving_examples

Simple examples of serving HuggingFace models with TensorFlow Serving

27
Experimental
3413 kabachuha/nanoGPKANT

Testing KAN-based text generation GPT models

27
Experimental
3414 g1ibby/llm-deploy

Tool to manage ollama model on vast.ai

27
Experimental
3415 haormj/llama2.go

Inference Llama 2 in one file of pure go

27
Experimental
3416 fbaldassarri/llama-cpp-container

Docker image to deploy a llama-cpp container with conda-ready environments

27
Experimental
3417 KarthikSriramGit/H.E.I.M.D.A.L.L

H.E.I.M.D.A.L.L looks at fleet telemetry and gives you natural-language...

27
Experimental
3418 modelize-ai/LLM-Inference-Deployment-Tutorial

Tutorial for LLM developers about engine design, service deployment,...

27
Experimental
3419 Bhoomika2224/MinivLLM

๐Ÿš€ Implement a powerful vLLM inference engine with advanced attention...

27
Experimental
3420 GhTara/Dose_Prediction

A Cascade Transformer-based Model for 3D Dose Distribution Prediction in...

27
Experimental
3421 noah-hein/mazeGPT

AI model for making mazes that extends OpenAIs GPT2 model

27
Experimental
3422 wowsinfo/Convert-Migrate-LLM

Convert & Migrate from one technology to another ones using any LLM

27
Experimental
3423 OPTML-Group/Unlearn-Trace

Unlearning Isn't Invisible: Detecting Unlearning Traces in LLMs from Model Outputs

27
Experimental
3424 gao-g/prelude

Code for the paper "Aligning LLM Agents by Learning Latent Preference from...

27
Experimental
3425 TRISTAN-ORF/RiboTIE

Scripts and instructions to apply RiboTIE on Ribo-seq data

27
Experimental
3426 blazejdolicki/bert-sarcasm-detection

Sarcasm detection with BERT

27
Experimental
3427 SlytherinGe/RSTeller

Vision-Language Dataset for Remote Sensing

27
Experimental
3428 kozodoi/Text_Readability_Prediction

Predicting text reading complexity with transformers (top-9% Kaggle solution...

27
Experimental
3429 The-Martyr/Awesome-Modality-Priors-in-MLLMs

Latest Advances on Modality Priors in Multimodal Large Language Models

27
Experimental
3430 mohsenMahmoodzadeh/image-and-text-classifier

Deep learning models(CNN, LSTM, BERT) for image and text classification task...

27
Experimental
3431 sagorbrur/fillblank

Fill The Blank

27
Experimental
3432 Meaquadddd/DPO-Shift

DPO-Shift: Shifting the Distribution of Direct Preference Optimization

27
Experimental
3433 rivas-lab/Smiles2Dock

Smiles2Dock: an open large-scale multi-task dataset for ML-based molecular...

27
Experimental
3434 shubhamkaushal765/TransformerQEC

Utilizing Transformers to correct errors in quantum circuits.

27
Experimental
3435 haozheji/exact-optimization

ICML 2024 - Official Repository for EXO: Towards Efficient Exact...

27
Experimental
3436 leonjovanovic/keywords-extraction

Keyword extraction using Scake, KeyBERT, Fine-tuning Transformer BERT-like...

27
Experimental
3437 lpalbou/model-quantizer

Effortlessly quantize, benchmark, and publish Hugging Face models with...

27
Experimental
3438 Human-Centric-Machine-Learning/strategic-ttc

Code for "Test-Time Compute Games", 2026

27
Experimental
3439 leuas/Vrdndi

A full-stack context-aware productivity-focused recommendation system

27
Experimental
3440 bayeslabs/maslibpy

MASLibPy : Lightweight library for multi-agent systems with LLM integration...

27
Experimental
3441 nikhil6041/OLI-and-Meme-Classification

Author's implementation of the paper...

27
Experimental
3442 LeonEricsson/llmcontext

:anger: Pressure testing the context window of open LLMs

27
Experimental
3443 xiuqhou/DAPE

[AAAI2026] Official implementation of the paper "DAPE: Harmonizing...

27
Experimental
3444 somosnlp/the-annotated-transformer

Traducciรณn al espaรฑol del notebook "The Annotated Transformer" de Harvard...

27
Experimental
3445 jlamprou/Infini-Attention

Efficient Infinite Context Transformers with Infini-attention Pytorch...

27
Experimental
3446 YangLing0818/SuperCorrect-llm

[ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought...

27
Experimental
3447 Praful932/llmsearch

Find better generation parameters for your LLM

27
Experimental
3448 MTxSouza/MediumArticleGenerator

A Language Model (LLM) trained to generate text similar to Medium articles.

27
Experimental
3449 feifeibear/Odysseus-Transformer

Odysseus: Playground of LLM Sequence Parallelism

27
Experimental
3450 joeljang/continual-knowledge-learning

[ICLR 2022] Towards Continual Knowledge Learning of Language Models

27
Experimental
3451 LFhase/CausalCOAT

[NeurIPS 2024] Discovery of the Hidden World with Large Language Models

27
Experimental
3452 MIMICLab/L-Verse

L-Verse: Bidirectional Generation Between Image and Text

27
Experimental
3453 kyegomez/Chai-1

An free and open source community implementation of Chai-1 in PyTorch

27
Experimental
3454 Marker-Inc-Korea/KO-Platypus

[KO-Platy๐Ÿฅฎ] Korean-Open-platypus๋ฅผ ํ™œ์šฉํ•˜์—ฌ llama-2-ko๋ฅผ fine-tuningํ•œ KO-platypus model

27
Experimental
3455 yassenayoub/NEO

๐Ÿ” Explore NEO, a groundbreaking native vision-language model designed to...

27
Experimental
3456 JuliusScheuerer/nlp-job-classifier

Text classification with fine-tuned DistilBERT โ€” FastAPI + Streamlit

27
Experimental
3457 MitulNakrani003/AI-Enhanced-IR-System

AI-enhanced search pipeline using hybrid retrieval + transformer models for...

27
Experimental
3458 ndoll1998/active-transformers

Active Learning for Transformer with focus on Sequence Tagging tasks

27
Experimental
3459 MysterionRise/transformers-nlp-suite

Enterprise NLP Platform - Production REST API with auth, rate limiting,...

27
Experimental
3460 algunion/UniLM.jl

UniLM.jl: Currently a Julia interface for OpenAI's (+Azure) language models,...

27
Experimental
3461 ArchitJ6/Llama2-FineTuning

๐Ÿฆ™ Llama2-FineTuning: Fine-tune LLAMA 2 with Custom Datasets Using LoRA and...

27
Experimental
3462 Onco-Logic/Onco-Logic

Onco-Logic is a comprehensive, multi-modal decision support ecosystem...

27
Experimental
3463 Kuldeepmorya/LLM-TradeBot

๐Ÿค– Optimize your futures trading with LLM-TradeBot, an intelligent...

27
Experimental
3464 StarLight1212/LLM-and-Generative-Models-Community

AI Community Tutorial, including: LoRA/Qlora LLM fine-tuning, Training GPT-2...

27
Experimental
3465 RobinSmits/Dutch-LLMs

Various training, inference and validation code and results related to Open...

27
Experimental
3466 sammcj/llm-templates

My LLM Templates (Ollama Modelfiles & Tabby Templates + Presets)

27
Experimental
3467 csm9493/efficient-llm-unlearning

Towards Robust and Parameter-Efficient Knowledge Unlearning for LLMs (ICLR 2025)

27
Experimental
3468 Peiyang-Song/LLM-A-Not-B-Errors

Official repository for paper "In-Context Learning May Not Elicit...

27
Experimental
3469 twitter-research/lmsoc

Code for reproducing our paper: LMSOC: An Approach for Socially Sensitive Pretraining

27
Experimental
3470 hitz-zentroa/This-is-not-a-Dataset

We introduce a large semi-automatically generated dataset of ~400,000...

27
Experimental
3471 januverma/transformers-stuff

Codes, scripts, and notebooks on various aspects of transformer models.

27
Experimental
3472 thevasudevgupta/transformers-adapters

This repositary hosts my experiments for the project, I did with OffNote Labs.

27
Experimental
3473 hmohebbi/ValueZeroing

The official repo for the EACL 2023 paper "Quantifying Context Mixing in...

27
Experimental
3474 merekat/children-stories

OhanashiGPT is an application that generates personalized children's stories...

27
Experimental
3475 eigencore/Tlama_124M

Tlama (124M) is a language model based on LlaMa3 (127M) optimized by...

27
Experimental
3476 LefterisKyriazanos/market_research_assistant

An AI-based tool that automates market research survey generation and...

26
Experimental
3477 Lahdhirim/NLP-financial-question-answering-tool

Fine-tuning a text-to-text transformer model (T5) on a financial question...

26
Experimental
3478 Navy10021/KRLawGPT

KRLawGPT : Generative Pre-trained Transformer for producing Korean Legal Text

26
Experimental
3479 kamyarghajar/DistilledNeuralResponseRanker

Implementation of "Distilling Knowledge for Fast Retrieval-based Chat-bots"...

26
Experimental
3480 jkanalakis/finetuning-llama-model-for-text-generation-using-unsloth

Fine-tuning Llama 3.2 3B Instruct model for text generation using Unsloth AI

26
Experimental
3481 wearesulie/sulie

Access to Sulie foundation models for time-series forecasting ๐Ÿ“ˆ

26
Experimental
3482 kyegomez/MultiModalCrossAttn

The open source implementation of the cross attention mechanism from the...

26
Experimental
3483 vlddshk/Transformer_translator

This project implements a neural machine translation system from French to...

26
Experimental
3484 pangatlo/RL-100

๐Ÿค– Implement advanced robotic manipulation techniques using real-world...

26
Experimental
3485 fatemehpesaran310/Text2Chart31

Official PyTorch implementation of "Text2Chart31: Instruction Tuning for...

26
Experimental
3486 kyegomez/AudioFlamingo

Implementation of the model "AudioFlamingo" from the paper: "Audio Flamingo:...

26
Experimental
3487 Trustworthy-ML-Lab/Describe-and-Dissect

[TMLR 25] An automated method for explaining complex neuron behaviors in...

26
Experimental
3488 vishvaRam/Fine-Tuning-Siglip2-Vit-Model

This repository offers tools and guidance for fine-tuning the Siglip2 Vision...

26
Experimental
3489 AndreaCossu/continual-pretraining-nlp-vision

Code to reproduce experiments from the paper "Continual Pre-Training...

26
Experimental
3490 Jagoul/BLEND

This repository contains the official implementation of BLEND, a novel...

26
Experimental
3491 Lucien2468/Ollama-TurboQuant-Integration

TurboQuant: Native 3-Bit Quantization for Ollama - Achieve 25-28% better...

26
Experimental
3492 csiro-robotics/FactoFormer

[IEEE T-GRS 2024] The official repository for Journal Article โ€œFactoFormer:...

26
Experimental
3493 ivallesp/cFavorita

A project for solving demand forecast of a medium retailer using a simple...

26
Experimental
3494 SuchetSanjeev/EncryptedTrafficAttackClassifierLLMs

This cybersecurity classifier integrates a lightweight LLM with a Random...

26
Experimental
3495 zjunlp/Knowledge2Data

[TASLP 2025] Spatial Knowledge Graph-Guided Synthesis for Multimodal LLMs

26
Experimental
3496 rookiemann/vllm-windows-build

Native Windows build patches for vLLM v0.14.1 โ€” MSVC 2022 + CUDA 12.6, 26...

26
Experimental
3497 sauradip/fewshotQAT

[BMVC 2021]: Official PyTorch implementation of : "Few Shot Temporal Action...

26
Experimental
3498 TingjiaInFuture/pixrep

Let LLMs see your codebase just like you do.

26
Experimental
3499 hrithickcodes/transformer-tf

This repository contains the code for the paper "Attention Is All You Need"...

26
Experimental
3500 LinkScapeOfficial/Ollmao

Ollmao (OH-luh-MAO) is a native SwiftUI app that integrates with Ollama to...

26
Experimental
« Prev 1 2 3 33 34 35 36 37 76 77 78 Next »