All Transformer Models

7,795 models ranked by quality score · Page 24 of 78

Showing 2301–2400 of 7,795
# Model Score Tier
2301 guoriyue/LangCommand

LangCommand is a local inference command-line tool that transforms natural...

33
Emerging
2302 AlgonetLabs/Cable

Context-aware Biases for Length Extrapolation

33
Emerging
2303 thinkall/featcopilot

Next-generation LLM-powered auto feature engineering framework

33
Emerging
2304 xinyanghuang7/Basic-Visual-Language-Model

Build a simple basic multimodal large model from scratch. 从零搭建一个简单的基础多模态大模型🤖

33
Emerging
2305 Anjum48/commonlitreadabilityprize

4th Place solution for the Kaggle CommonLit Readability Prize

33
Emerging
2306 Chunjiang-Intelligence/Credal-Transformer

论文「Credal Transformer: A Principled Approach for Quantifying and Mitigating...

33
Emerging
2307 cgjosephlee/ollama-save-load

Save and load ollama models just like operating docker images.

33
Emerging
2308 Kitsunp/Prueba-de-modelo-de-ByteLatentTransformer

Este es una prueba de concepto del paper mencionado de Meta junto a otros...

33
Emerging
2309 pat-jj/KG-FIT

[NeurIPS'24] Knowledge Graph Fine-Tuning using LLMs

33
Emerging
2310 LunjunZhang/ema-pg

Code for "EMA Policy Gradient: Taming Reinforcement Learning for LLMs with...

33
Emerging
2311 andreped/vit-explainer

🔥 Demonstrating Explainable AI with Vision Transformer in web app

33
Emerging
2312 rakibnsajib/MediBot-AI-Doctor-with-Vision-and-Voice

AI-powered medical assistant using LLaMA-3.2-11B-Vision, Whisper, and...

33
Emerging
2313 kmaurinjones/AllMeans

Automatic topic modelling using minimal external input and computational resources

33
Emerging
2314 VITA-Group/TAPE

[ICML'25] "Rethinking Addressing in Language Models via Contextualized...

33
Emerging
2315 parameterlab/apricot

Source code of "Calibrating Large Language Models Using Their Generations...

33
Emerging
2316 swainshashwat/Flock

Craft custom Language Model Models (LLMs) effortlessly using Flock. Build...

33
Emerging
2317 cocacola-lab/Awesome-Transformer-in-Transportation

Papers & resources linked to Transformer-based research mainly for...

33
Emerging
2318 siwei-li/NLP_summarization

Summarization of lecture video transcripts using BERT.

33
Emerging
2319 franckalbinet/iomeval

Streamline evaluation evidence mapping at scale with LLMs

33
Emerging
2320 martin-wey/cl-code-apis

Replication package of the paper "On the Usage of Continual Learning for...

33
Emerging
2321 haesleinhuepf/vlm-pictionary

Play pictionary with Vision Language Models!

33
Emerging
2322 InquestGeronimo/tllm

An LLM training library for instruction-tuning.

33
Emerging
2323 AlenVelocity/langchain-llama

Run LLAMA LLMs in Node with Langchain

33
Emerging
2324 nightdessert/Retrieval_Head

open-source code for paper: Retrieval Head Mechanistically Explains...

33
Emerging
2325 uiuctml/Localize-and-Stitch

Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic

33
Emerging
2326 markendo/downscaling_intelligence

Downscaling Intelligence: Exploring Perception and Reasoning Bottlenecks in...

33
Emerging
2327 yuecao0119/MMFuser

The official implementation of the paper "MMFuser: Multimodal Multi-Layer...

33
Emerging
2328 PromptMixerDev/prompt-mixer-ollama-connector

Ollama Connector

33
Emerging
2329 jianzhnie/LLMToolkit

LLMToolkit is a toolkit for NLP(Natural Language Processing) and LLM(Large...

33
Emerging
2330 hollobit/GenAI_LLM_timeline

ChatGPT, GenerativeAI and LLMs Timeline

33
Emerging
2331 GURPREETKAURJETHRA/LLaMA3-Quantization

LLaMA3-Quantization

33
Emerging
2332 sanjaradylov/moleculegen-ml

Generate novel molecules using neural language models

33
Emerging
2333 HariomJangra/project-lumen

A 128M parameter language model built from scratch for learning how large...

33
Emerging
2334 yang-ai-lab/OSF-Open-Sleep-FM

OSF: On Pre-training and Scaling of Sleep Foundation Models

33
Emerging
2335 actypedef/ARCQuant

Code for the paper "ARCQuant: Boosting NVFP4 Quantization with Augmented...

33
Emerging
2336 josStorer/llama.cpp-unicode-windows

llama.cpp with unicode (windows) support

33
Emerging
2337 AshishGautamX/K8s-LLM-Scheduler

An intelligent Kubernetes scheduler powered by Meta's Llama-3.3-70B model...

33
Emerging
2338 stchakwdev/kan_transformer

Baantu Research: Hybrid KAN-Transformer for investigating learnable...

33
Emerging
2339 yaojin17/Unlearning_LLM

[ACL 2024] Code and data for "Machine Unlearning of Pre-trained Large...

33
Emerging
2340 UCDvision/NOLA

Code for NOLA, an implementation of "nola: Compressing LoRA using Linear...

33
Emerging
2341 mkofinas/neural-graphs

Official source code for "Graph Neural Networks for Learning Equivariant...

33
Emerging
2342 horseee/LLaMA-Pruning

Structural Pruning for LLaMA

33
Emerging
2343 sail-sg/dice

Official implementation of Bootstrapping Language Models via DPO Implicit Rewards

33
Emerging
2344 Beomi/KcBERT-Finetune

KcBERT/KcELECTRA Fine Tune Benchmarks code (forked from...

33
Emerging
2345 tanulsingh/Humour.ai-Language-model-that-can-crack-Jokes

Language Model that makes you Laugh .

33
Emerging
2346 duyhominhnguyen/Exgra-Med

[NeurIPS 2025] ExGra-Med: Medical Multi-Modal LLM with Extended Context Alignment

33
Emerging
2347 tanishqgautam/Image-Captioning

Implemented 3 different architectures to tackle the Image Caption problem,...

33
Emerging
2348 psmarter/mini-infer

A high-performance LLM inference engine with PagedAttention |...

33
Emerging
2349 rkinas/reasoning_models_how_to

This repository serves as a collection of research notes and resources on...

33
Emerging
2350 microsoft/InteractiveTextGeneration

An implementation of the paper "Interactive Text Generation"

33
Emerging
2351 kyegomez/DifferentialTransformer

An open source community implementation of the model from "DIFFERENTIAL...

33
Emerging
2352 omron-sinicx/crystalformer

The official code respository for "Crystalformer: Infinitely Connected...

33
Emerging
2353 UCSB-NLP-Chang/ULD

Implementation of paper 'Reversing the Forget-Retain Objectives: An...

33
Emerging
2354 codewithdark-git/QuantLLM

QuantLLM is a Python library designed for developers, researchers, and teams...

33
Emerging
2355 Gapi505/Sparky-2

This is a discord bot running on llama cpp with the llama 3 model and image...

33
Emerging
2356 ananttripathi/Resume-Analyzer-MLOps

Resume Analyzer is an AI-powered MLOps platform that optimizes your resume...

33
Emerging
2357 bloomberg/minilmv2.bb

Our open source implementation of MiniLMv2...

33
Emerging
2358 smitkiri/news-qa

Reading comprehension based question-answering model for news articles.

33
Emerging
2359 Esmail-ibraheem/Tinyllamas-pytorch

Tinyllamas🦙 is an Extensible advanced language model framework, inspired by...

33
Emerging
2360 SAP-samples/btp-running-language-models

This repository contains different code examples around the topic of...

33
Emerging
2361 poloclub/tsr-convstem

High-Performance Transformers for Table Structure Recognition Need Early Convolutions

33
Emerging
2362 nicolay-r/Reasoning-for-Sentiment-Analysis-Framework

The official code for CoT / ZSL reasoning framework 🧠, utilized in paper:...

33
Emerging
2363 MLD3/steerability

An open-source evaluation framework for measuring LLM steerability.

33
Emerging
2364 andreped/INF1600-ai-workshop

🔥 Workshop in AI Deployment (INF-1600, UiT)

33
Emerging
2365 jseeio/gpt2-tfjs

GPT2 with Tensorflow.js

33
Emerging
2366 songxiaoshuai/progco

Official Implementation of "ProgCo: Program Helps Self-Correction of Large...

33
Emerging
2367 bipinKrishnan/ml-recipe-book

A book containing step by step instructions to train deep learning models...

33
Emerging
2368 ApplyU-ai/ColorBlindnessEval

ColorBlindnessEval: Can Vision Language Models Pass Color Blindness Tests?

33
Emerging
2369 Wang-ML-Lab/multimodal-needle-in-a-haystack

[NAACL 2025 Oral] Multimodal Needle in a Haystack (MMNeedle): Benchmarking...

33
Emerging
2370 richouzo/hate-speech-detection-survey

Trained Neural Networks (LSTM, HybridCNN/LSTM, PyramidCNN, Transformers,...

33
Emerging
2371 adapter-hub/efficient-task-transfer

Research code for "What to Pre-Train on? Efficient Intermediate Task...

33
Emerging
2372 UBC-MDS/fixml

LLM Tool for effective test evaluation of ML projects with curated...

33
Emerging
2373 GURPREETKAURJETHRA/LLMs-Evaluation

LLMs Evaluation

33
Emerging
2374 cosmoquester/transformers-tf-finetune

Scripts to finetune huggingface transformers models with Tensorflow 2

33
Emerging
2375 asigalov61/Lars-Ulrich-Transformer

[DEPRECIATED] [339M] [88% acc] Fast full-featured drums inpainting...

33
Emerging
2376 ROIM1998/APT

[ICML'24 Oral] APT: Adaptive Pruning and Tuning Pretrained Language Models...

33
Emerging
2377 Lanerra/reasoning-bank-slm

An experiment that applies Google Research's `ReasoningBank` technique to...

33
Emerging
2378 submarat/removing-layer-norm

Transformers Don’t Need LayerNorm at Inference Time

33
Emerging
2379 chrisjob1021/transformer-examples

A collection of educational toy implementations and examples of key...

33
Emerging
2380 anyscale/llm-router

Tutorial for building LLM router

33
Emerging
2381 zjunlp/LightThinker

[EMNLP 2025] LightThinker: Thinking Step-by-Step Compression

33
Emerging
2382 avsrma/LLM-based-AI-Assistant

A general purpose AI voice assistant built using GPT-4.

33
Emerging
2383 yotamnahum/DNA-Data-Storage

Single Read Reconstruction for DNA Data Storage Using Transformers (official...

33
Emerging
2384 declare-lab/TEAM

Our EMNLP 2022 paper on MCQA

33
Emerging
2385 xuanlinli17/large_vlm_distillation_ood

Distilling Large Vision-Language Model with Out-of-Distribution...

33
Emerging
2386 WooooDyy/BAPO

Codes for the paper "BAPO: Stabilizing Off-Policy Reinforcement Learning for...

33
Emerging
2387 ZigeW/data_management_LLM

Collection of training data management explorations for large language models

33
Emerging
2388 mcbal/deep-implicit-attention

Implementation of deep implicit attention in PyTorch

33
Emerging
2389 BIDS-Xu-Lab/Me-LLaMA

A novel medical large language model family with 13/70B parameters, which...

33
Emerging
2390 telekom/transformer-tools

Transformers Training Tools

33
Emerging
2391 YunzeMan/Lexicon3D

[NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D...

33
Emerging
2392 Nondzu/LlamaTor

LlamaTor: Decentralized AI model sharing via BitTorrent for efficient,...

33
Emerging
2393 crux82/u-deppllama

Dependency parsing with Large Language Models

33
Emerging
2394 monk1337/NanoPeft

The simplest repository & Neat implementation of different Lora methods for...

33
Emerging
2395 Vitgracer/DinoV3-Object-Tracking

Object tracking using the DINOv3 model.

33
Emerging
2396 elephantmipt/compressors

A small library with distillation, quantization and pruning pipelines

33
Emerging
2397 Marvin-VW/python-ollama-local

This Python script enables hands-free interaction with a local Llama2...

33
Emerging
2398 Orlando-CS/Awesome-VLA

✨✨latest advancements in VLA models(VIsion Language Action)

33
Emerging
2399 srsawant34/efficient_instruction_learning

Code base for the paper "Instruction Tuned Models are Quick Learners".

33
Emerging
2400 ES7/LLaMA-from-Scratch

In this repository, I have explained the working of the LLaMA Model,...

33
Emerging
« Prev 1 2 3 22 23 24 25 26 76 77 78 Next »