All Transformer Models

7,795 models ranked by quality score · Page 23 of 78

Showing 2201–2300 of 7,795
# Model Score Tier
2201 michaelnny/QLoRA-LLM

A simple custom QLoRA implementation for fine-tuning a language model (LLM)...

34
Emerging
2202 johndpope/OmniTransfer-hack

OmniTransfer implementation for LTX-2 (work in progress)

34
Emerging
2203 liaoyuhua/LLM4TS

Large Language & Foundation Models for Time Series.

34
Emerging
2204 steinbergmedia/libmusictok

C++ Library for tokenizing MIDI files, designed to be compatible with the...

34
Emerging
2205 OneInterface/realtime-bakllava

llama.cpp with BakLLaVA model describes what does it see

34
Emerging
2206 zerovl/ZeroVL

[ECCV2022] Contrastive Vision-Language Pre-training with Limited Resources

34
Emerging
2207 Adversing/hf-model-checker

A tool to analyze HuggingFace models and determine their compatibility with...

34
Emerging
2208 jaygala24/fed-hate-speech

The official code repository for the paper titled "A Federated Approach for...

34
Emerging
2209 nanowell/Differential-Transformer-PyTorch

PyTorch implementation of the Differential-Transformer architecture for...

34
Emerging
2210 RLHFlow/Online-RLHF

A recipe for online RLHF and online iterative DPO.

34
Emerging
2211 google/curie

Code release for "CURIE: Evaluating LLMs On Multitask Scientific Long...

34
Emerging
2212 Sunona-AI-labs/sunona

Sunona: Next-generation voice AI infrastructure. Orchestrate intelligent,...

34
Emerging
2213 Kaleidophon/nlp-uncertainty-zoo

Model zoo for different kinds of uncertainty quantification methods used in...

34
Emerging
2214 CEC-Agent/CEC

Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for...

34
Emerging
2215 moritztng/fltr

Like grep but for natural language questions. Based on Mistral 7B or Mixtral 8x7B.

34
Emerging
2216 mcp-tool-shop-org/backpropagate

Headless LLM fine-tuning in 3 lines — smart defaults, VRAM-aware batch...

34
Emerging
2217 kkahatapitiya/LangRepo

Code for our ACL 2025 paper "Language Repository for Long Video Understanding"

34
Emerging
2218 suyash/mlt

Multilingual Neural Machine Translation using Transformers with Conditional...

34
Emerging
2219 hesamsheikh/llm-mechanics

Coding an LLM and its building blocks from scratch.

34
Emerging
2220 florist-notes/aicore_n

Artificial Intelligence > Machine Learning > Deep Learning

34
Emerging
2221 PKU-Alignment/beavertails

BeaverTails is a collection of datasets designed to facilitate research on...

34
Emerging
2222 starmpcc/CAMEL

Clinically Adapted Model Enhanced from LLaMA

34
Emerging
2223 Hamtech-ai/Persian-Image-Captioning

A Persian Image Captioning model based on Vision Encoder Decoder Models of...

34
Emerging
2224 18907305772/Explore-Instruct

EMNLP'2023: Explore-Instruct: Enhancing Domain-Specific Instruction Coverage...

34
Emerging
2225 hpdps-group/ElasticMM

ElasticMM: Elastic and Efficient MLLM Serving System

34
Emerging
2226 JessicaLopezEspejel/HazPi

HazPi is a modified Transformer(Vaswani et al., 2017) neural network...

34
Emerging
2227 GURPREETKAURJETHRA/Llama-3-ORPO-Fine-Tuning

Llama 3 ORPO Fine Tuning on A100 in Colab Pro.

34
Emerging
2228 holarissun/RewardModelingBeyondBradleyTerry

official implementation of ICLR'2025 paper: Rethinking Bradley-Terry Models...

34
Emerging
2229 egaoharu-kensei/flash-attention-triton

Cross-platform FlashAttention-2 Triton implementation for Turing+ GPUs with...

34
Emerging
2230 nestordemeure/stop_word

Huggingface transformers stopping criteria that halts the generation when a...

34
Emerging
2231 deep-div/PlotLLM

Data Visualization with LLM automatically analyzes data and generates...

34
Emerging
2232 StargazerX0/ScaleKV

[NeurIPS 2025] ScaleKV: Memory-Efficient Visual Autoregressive Modeling with...

34
Emerging
2233 DunnBC22/Vision_Audio_and_Multimodal_Projects

This repository includes all computer vision, audio, document AI, and...

34
Emerging
2234 Beomi/BitNet-Transformers

0️⃣1️⃣🤗 BitNet-Transformers: Huggingface Transformers Implementation of...

34
Emerging
2235 hhy-huang/GraphJudge

[EMNLP'25 main] This is the official repo for the paper, Can LLMs be Good...

34
Emerging
2236 asigalov61/Giant-Music-Transformer

[SOTA] [92% acc] 786M-8k-44L-32H multi-instrumental music transformer with...

34
Emerging
2237 CristiVlad25/ai-papers

Tracing the evolution of AI and large language models from early neural...

34
Emerging
2238 wang2226/Awesome-LLM-Decoding

📜 Paper list on decoding methods for LLMs and LVLMs

34
Emerging
2239 fboulnois/llm-leaderboard-csv

CSVs of the Huggingface and LMArena LLM leaderboards, along with the code to...

34
Emerging
2240 Gen-Verse/ReasonFlux

[NeurIPS 2025 Spotlight] LLM post-training suite — featuring ReasonFlux,...

34
Emerging
2241 llmapi-io/llmapi-cli

Command-line client and python development library for accessing LLM's...

34
Emerging
2242 SqueezeAILab/KVQuant

[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with...

34
Emerging
2243 bayartsogt-ya/albert-mongolian

ALBERT trained on Mongolian text corpus

34
Emerging
2244 kingabzpro/French-to-Fongbe-and-Ewe-MT

The objective of this challenge is to create a machine translation system...

34
Emerging
2245 tugot17/Discord-Language-Detection-Bot

Restrict the use of forbidden languages on your discord server with transformers

34
Emerging
2246 VITA-Group/Ms-PoE

"Found in the Middle: How Language Models Use Long Contexts Better via...

34
Emerging
2247 CASE-Lab-UMD/Router-Tuning-Mixture-of-Depths

The open-source Mixture of Depths code and the official implementation of...

34
Emerging
2248 bobazooba/xllm-demo

Demo project using XLLM

34
Emerging
2249 DAMO-NLP-SG/multilingual-safety-for-LLMs

[ICLR 2024]Data for "Multilingual Jailbreak Challenges in Large Language Models"

34
Emerging
2250 asahi417/lm-vocab-trimmer

Vocabulary Trimming (VT) is a model compression technique, which reduces a...

33
Emerging
2251 Scicrop/llm-vision-basics

Educational notebooks that demystify Large Language Models and Computer...

33
Emerging
2252 SuperBianC/scMulan

Repository for paper scMulan: a multitask generative pre-trained language...

33
Emerging
2253 JoelDeonDsouza/Zenpool_LLM

Zenpool is a compact, fine-tuned MLL (Mini Language Learner) model

33
Emerging
2254 lpalbou/AbstractLLM

A unified interface for Large Language Models with memory, reasoning, and...

33
Emerging
2255 rafalposwiata/depression-detection-lt-edi-2022

This repository contains the code of our winning solution for the Shared...

33
Emerging
2256 deepmancer/advanced-recommender-system

Advance information retrieval system that combines advanced indexing,...

33
Emerging
2257 AchiraNadeeshan/social-activity-job-matcher

PathFinder is a job recommendation web application that allows users to...

33
Emerging
2258 GURPREETKAURJETHRA/Multi-GPU-Fine-Training-LLMs

Multi GPU Fine Training LLMs using DeepSpeed and Accelerate.

33
Emerging
2259 YuanGongND/ltu

Code, Dataset, and Pretrained Models for Audio and Speech Large Language...

33
Emerging
2260 daskol/llama.py

Python bindings to llama.cpp

33
Emerging
2261 adithya-s-k/CompanionLLM

CompanionLLM - A framework to finetune LLMs to be your own sentient...

33
Emerging
2262 microsoft/MMLU-CF

A Contamination-free Multi-task Language Understanding Benchmark [Official, ACL 2025]

33
Emerging
2263 maciekt07/Lecture-Note-Generator-POC

📒 A proof-of-concept app that transcribes lecture recordings into text and...

33
Emerging
2264 CodingPlatelets/transformer_MM

Accelerator for LLM Based on Chisel3

33
Emerging
2265 davzoku/cria

An end-to-end LLM app prototype based on Llama 2

33
Emerging
2266 hasanisaeed/C-Transformer

Implementation of the core Transformer architecture in pure C

33
Emerging
2267 IIT-DM/BattleofLLMs

Benchmarks of LLMs with Conversational QA datasets.

33
Emerging
2268 SachinKalsi/annotated-research-papers

This repository is a comprehensive collection of research papers,...

33
Emerging
2269 isaacus-dev/emubert-creator

The training code behind EmuBert, the largest open-source masked language...

33
Emerging
2270 JonnoB/training_lms_with_synthetic_data

A repo for training Language models to correct errors in OCR text

33
Emerging
2271 zatevakhin/obsidian-local-llm

Obsidian Local LLM is a plugin for Obsidian that provides access to a...

33
Emerging
2272 GiorgiaAuroraAdorni/gansformer-reproducibility-challenge

Replication of the novel Generative Adversarial Transformer.

33
Emerging
2273 SertraFurr/DuckDuckAI

Python API Wrapper to interact with DuckDuckAI

33
Emerging
2274 XavierSpycy/hands-on-lora

Explore practical fine-tuning of LLMs with Hands-on Lora. Dive into examples...

33
Emerging
2275 krishnapriya-18/COVID-19-Tweet-Classification-using-Roberta-and-Bert-Simple-Transformers

Rank 1 / 216

33
Emerging
2276 martin-wey/CodeUltraFeedback

CodeUltraFeedback: aligning large language models to coding preferences (TOSEM 2025)

33
Emerging
2277 HKUDS/RecLM

[ACL2025] "RecLM: Recommendation Instruction Tuning"

33
Emerging
2278 ISNE11/CheatSheet-LLM

Run local Large Language Models (LLMs) offline using Ollama – interact with...

33
Emerging
2279 Srijan-D/LangChain-v0.2-HuggingFace-Llama3

This project integrates LangChain v0.2.6, HuggingFace Serverless Inference...

33
Emerging
2280 zake7749/Kyara

[Kaggle-2nd] Lightweight yet Effective Chinese LLM.

33
Emerging
2281 NotYuSheng/Multimodal-Large-Language-Model

Localized Multimodal Large Language Model (MLLM) integrated with Streamlit...

33
Emerging
2282 wangcongcong123/transection

Transection: Transformers for English to Chinese Translation

33
Emerging
2283 yingding/applyllm

A python package for applying LLM with LangChain and Hugging Face on local...

33
Emerging
2284 DoubleVII/lithft

Pretrain, finetune any LLMs from huggingface on your own data.

33
Emerging
2285 micahondiwa/applied-ai

Deep Learning for Computer Vision: A collection of 6 end-to-end applied AI...

33
Emerging
2286 caua1503/llm-tool-fusion

llm-tool-fusion é uma biblioteca Python que unifica e simplifica o uso de...

33
Emerging
2287 TheAnkurGoswami/Neural-Networks-from-Scratch

Implementation of different neural networks with back-propagation logic.

33
Emerging
2288 rabiloo/llm-finetuning

Sample for Fine-Tuning LLMs & VLMs

33
Emerging
2289 gabe00122/jaxrl

Partially Observable Multi-Agent RL with Transformers

33
Emerging
2290 lennartpollvogt/ollama-instructor

Python library for the instruction and reliable validation of structured...

33
Emerging
2291 black-roland/homeassistant-cloud-ru-ai

Cloud.ru Foundation Models — cloud-based AI assistants for Home Assistant

33
Emerging
2292 KRR-Oxford/LLMap-Prelim

A preliminary investigation for ontology alignment (OM) with large language...

33
Emerging
2293 levashi/reprobe

Phase-aware LLM activation steering and linear probing. A memory-efficient,...

33
Emerging
2294 gunnarnordqvist/opencode-context-filter

Transparent HTTP proxy that automatically filters repository context for...

33
Emerging
2295 yonahgraphics/openevalkit

Production-grade Python framework for evaluating LLM and agentic systems...

33
Emerging
2296 Naman-ntc/FastCode

Utilities for efficient fine-tuning, inference and evaluation of code...

33
Emerging
2297 dhpollack/huggingface_libtorch

Minimal example of using a traced huggingface transformers model with libtorch

33
Emerging
2298 sajjjadayobi/ParsBigBird

Persian Bert For Long-Range Sequences

33
Emerging
2299 shizhouxing/Robustness-Verification-for-Transformers

[ICLR 2020] Code for paper "Robustness Verification for Transformers"

33
Emerging
2300 aniass/Spam-detection

Spam detection in SMS messages with BERT model and Machine Learning algorithms

33
Emerging
« Prev 1 2 3 21 22 23 24 25 76 77 78 Next »