All Transformer Models
7,795 models ranked by quality score · Page 26 of 78
| # | Model | Score | Tier |
|---|---|---|---|
| 2501 | rti/gptvis: Understanding Transformers Using A Minimal Example | | Emerging |
| 2502 | EternityYW/BiasEval-LLM-MentalHealth: Unveiling and Mitigating Bias in Mental Health Analysis with Large Language Models | | Emerging |
| 2503 | kennethleungty/DeepSeek-R1-Ollama-Simple-Evals: Run and Evaluate DeepSeek-R1 Distilled Models Locally with Ollama and... | | Emerging |
| 2504 | m3hrdadfi/news-headline-generation: A Bert2Bert model which is able to generate headlines! | | Emerging |
| 2505 | MurtyShikhar/TreeProjections: Tool to measure tree-structuredness of the internal algorithm learnt by a... | | Emerging |
| 2506 | affjljoo3581/polyglot-jax-inference: A Jax/Flax implementation for Korean LLM inference on TPUs. | | Emerging |
| 2507 | BerkeliumLabs/Berkelium-labs: Your personal AI Lab, accessible everywhere! Explore, experiment, and... | | Emerging |
| 2508 | softsys4ai/differentiable-proving: Code and data for the paper "Pretrained Language Models are Symbolic... | | Emerging |
| 2509 | QwenLM/ParScale: Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling | | Emerging |
| 2510 | kyegomez/VLM-Mamba: We introduce VLM-Mamba, the first Vision-Language Model built entirely on... | | Emerging |
| 2511 | jose-compu/cerebras-coding-agent: A Cerebras AI LLM coding agent for the command line | | Emerging |
| 2512 | pleisto/yuren-13b: Yuren 13B is an information synthesis large language model that has been... | | Emerging |
| 2513 | rd-serendipity/ai-research-paper-explainer: AI-powered tool that transforms complex research papers into clear,... | | Emerging |
| 2514 | HyperMink/inferenceable: Scalable AI Inference Server for CPU and GPU with Node.js \| Utilizes... | | Emerging |
| 2515 | rajaswa/indic-syntax-evaluation: Vyākarana: A Colorless Green Benchmark for Syntactic Evaluation in Indic Languages | | Emerging |
| 2516 | taesiri/ArXivQA: WIP - Automated Question Answering for ArXiv Papers with Large Language... | | Emerging |
| 2517 | pyladiesams/llm-guardrails-jul2024: Dive into the world of LLM Guardrails using tools like NVIDIA’s NeMo... | | Emerging |
| 2518 | kanchengw/cnllm: A unified adapter library for Chinese LLMs that wraps the API output of mainstream Chinese LLMs in the OpenAI format, working seamlessly with openai, langchain, and most other Python libraries built around the OpenAI structure | | Emerging |
| 2519 | clip-italian/clip-italian: CLIP (Contrastive Language–Image Pre-training) for Italian | | Emerging |
| 2520 | namgyu-youn/PyTorch-Pruning: Benchmark and profile pruning research and open-source projects | | Emerging |
| 2521 | amazon-science/wqa-contextual-qa: Coala is a Python package for Contextual Answer Sentence Selection. | | Emerging |
| 2522 | lxe/llavavision: A simple "Be My Eyes" web app with a llama.cpp/llava backend | | Emerging |
| 2523 | xmindflow/SSCT: [ICCV 2023] Self-supervised Semantic Segmentation: Consistency over Transformation | | Emerging |
| 2524 | asigalov61/Google-Magenta-Piano-Transformer-Colab: [DEAD/NOT SUPPORTED ANYMORE] This is the only fully working and functioning... | | Emerging |
| 2525 | microsoft/encoder-decoder-slm: Efficient encoder-decoder architecture for small language models (≤1B... | | Emerging |
| 2526 | BoHuangLab/CELL-E_2: Multimodal encoder-only transformer model for image-based protein predictions | | Emerging |
| 2527 | PeterGriffinJin/Heterformer: Heterformer: Transformer-based Deep Node Representation Learning on... | | Emerging |
| 2528 | ksm26/Pretraining-LLMs: Master the essential steps of pretraining large language models (LLMs).... | | Emerging |
| 2529 | ZhengaoLi/DISP-LLM-Dimension-Independent-Structural-Pruning: An implementation of the DISP-LLM method from the NeurIPS 2024 paper:... | | Emerging |
| 2530 | HeegyuKim/language-model: A project for training Korean language models (Flax, Pytorch with Huggingface Accelerate) | | Emerging |
| 2531 | AspirinCode/AlphaPPImd: Exploring the conformational ensembles of protein-protein complexes with... | | Emerging |
| 2532 | gia-uh/cecilia: The Cuban Language Model | | Emerging |
| 2533 | AbhinaavRamesh/ollama-local-serve: Local LLM infrastructure for distributed AI applications. Serve... | | Emerging |
| 2534 | psychbruce/FMAT: 😷 The Fill-Mask Association Test (FMAT): Measuring Propositions in Natural Language. | | Emerging |
| 2535 | anyantudre/Florence-2-Vision-Language-Model: Florence-2 is a novel vision foundation model with a unified, prompt-based... | | Emerging |
| 2536 | Bruce-Lee-LY/cutlass_gemm: Multiple GEMM operators are constructed with cutlass to support LLM inference. | | Emerging |
| 2537 | The-Martyr/CausalMM: [ICLR 2025] Mitigating Modality Prior-Induced Hallucinations in Multimodal... | | Emerging |
| 2538 | AntonGuan/TimeOmni-1: [ICLR 2026] Official implementation of " 🦙 TimeOmni-1: Incentivizing Complex... | | Emerging |
| 2539 | tommasocerruti/detllm: Deterministic-mode checks for LLM inference: measure run/batch variance,... | | Emerging |
| 2540 | Simplifine-gamedev/Simplifine: 🚀 Easy, open-source LLM finetuning with one-line commands, seamless cloud... | | Emerging |
| 2541 | MaxwellYaoNi/PACE: [NeurIPS 2024 Spotlight] Official implementation for "PACE: marrying... | | Emerging |
| 2542 | mahsasheikh/DrugGen: DrugGen: Advancing Drug Discovery with Large Language Models and... | | Emerging |
| 2543 | rabilrbl/llamafile-builder: A simple GitHub Actions script that builds a llamafile and uploads it to Hugging Face | | Emerging |
| 2544 | zTgx/llmweb-rs: Webpage to structured data in Rust & LLM | | Emerging |
| 2545 | ybubnov/metalchat: Pure C++23 Llama inference for Apple Silicon chips | | Emerging |
| 2546 | voidism/Lookback-Lens: Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual... | | Emerging |
| 2547 | juzhengz/LoRI: [COLM 2025] LoRI: Reducing Cross-Task Interference in Multi-Task Low-Rank Adaptation | | Emerging |
| 2548 | ShengcaiLiao/TransMatcher: [NeurIPS 2021] TransMatcher: Deep Image Matching Through Transformers for... | | Emerging |
| 2549 | KasraAhmadi/PII-360: An open-source Chrome Extension that identifies Personally Identifiable... | | Emerging |
| 2550 | mddunlap924/PyTorch-LLM: Fine-tuning an LLM using a Generic Workflow and Best Practices with PyTorch | | Emerging |
| 2551 | guanwei49/DABL: DABL: Detecting Semantic Anomalies in Business Processes Using Large Language Models | | Emerging |
| 2552 | oxidized-transformers/oxidized-transformers: Modular Rust transformer/LLM library using Candle | | Emerging |
| 2553 | ShiZhengyan/InstructionModelling: [NeurIPS 2024 Main Track] Code for the paper titled "Instruction Tuning With... | | Emerging |
| 2554 | leondz/lm_risk_cards: Risks and targets for assessing LLMs & LLM vulnerabilities | | Emerging |
| 2555 | shunk031/allennlp-shiba-model: AllenNLP integration for Shiba: Japanese CANINE model | | Emerging |
| 2556 | Tebmer/Rereading-LLM-Reasoning: EMNLP 2024 "Re-reading improves reasoning in large language models". Simply... | | Emerging |
| 2557 | myscience/x-lstm: Pytorch implementation of the xLSTM model by Beck et al. (2024) | | Emerging |
| 2558 | MusadiqPasha/Turkish-Hate-Speech-Classification-Explanation: Classify, explain, and rewrite Turkish hate speech tweets using BERT, SHAP,... | | Emerging |
| 2559 | BFCmath/FinetuneAI_Learning: How to effectively finetune CV/LLM models (without local gpu) | | Emerging |
| 2560 | bayer-science-for-a-better-life/data2text-bioleaflets: Biomedical Data-to-Text Generation via Fine-Tuning Transformers | | Emerging |
| 2561 | xdevfaheem/Transformers: A Comprehensive Implementation of Transformers Architecture from Scratch | | Emerging |
| 2562 | samadon1/LLM-From-Scratch: Medical Language Model fine-tuned using pretraining, instruction tuning, and... | | Emerging |
| 2563 | kodejuice/ai-trade: A smart AI-powered trading assistant that uses large language models (LLMs)... | | Emerging |
| 2564 | prakash-aryan/debatebrawl-app: A sophisticated AI-powered debate platform that integrates Large Language... | | Emerging |
| 2565 | anas-zafar/LLM-Survey: The official GitHub page for the survey paper "A Survey on Large Language... | | Emerging |
| 2566 | yaodongC/awesome-instruction-dataset: A collection of open-source datasets to train instruction-following LLMs... | | Emerging |
| 2567 | RakePants/nerdless: Conversational AI Telegram bot based on a finetuned language model | | Emerging |
| 2568 | didier-durand/llms-in-clouds: Experiments with LLMs in clouds (powered by SGLang) | | Emerging |
| 2569 | systems-genomics-lab/deeptaxa: A deep learning framework for hierarchical taxonomy classification of 16S... | | Emerging |
| 2570 | ScottCampit/personalized-marketing-chatbot: Personalized marketing chatbot | | Emerging |
| 2571 | rezazad68/TMUnet: Contextual Attention Network: Transformer Meets U-Net | | Emerging |
| 2572 | azminewasi/Awesome-LLMs-ICLR-24: It is a comprehensive resource hub compiling all LLM papers accepted at the... | | Emerging |
| 2573 | yinzhangyue/SelfAware: Do Large Language Models Know What They Don’t Know? | | Emerging |
| 2574 | Buyun-Liang/SECA: [NeurIPS 2025] SECA: Semantically Equivalent and Coherent Attacks for... | | Emerging |
| 2575 | XunshanMan/MVGFormer: This is the official implementation of the work presented at CVPR 2024,... | | Emerging |
| 2576 | cmu-flame/FLAME-MoE: Official repository for FLAME-MoE: A Transparent End-to-End Research... | | Emerging |
| 2577 | smvorwerk/xlstm-cuda: Cuda implementation of Extended Long Short Term Memory (xLSTM) with C++ and... | | Emerging |
| 2578 | open-compass/ANAH: [ACL 2024] ANAH & [NeurIPS 2024] ANAH-v2 & [ICLR 2025] Mask-DPO | | Emerging |
| 2579 | synlp/R2-LLM: The official GitHub repository of the AAAI-2024 paper "Bootstrapping Large... | | Emerging |
| 2580 | HKUNLP/efficient-attention: [EVA ICLR'23; LARA ICML'22] Efficient attention mechanisms via control... | | Emerging |
| 2581 | Nota-NetsPresso/shortened-llm: Compressed LLMs for Efficient Text Generation [ICLR'24 Workshop] | | Emerging |
| 2582 | bernardoleite/fairytaleqa-translated: Code for paper "FairytaleQA Translated: Enabling Educational Question and... | | Emerging |
| 2583 | deep-symbolic-mathematics/llm-srbench: [ICML2025 Oral] LLM-SRBench: A New Benchmark for Scientific Equation... | | Emerging |
| 2584 | SafeAILab/RAIN: [ICLR'24] RAIN: Your Language Models Can Align Themselves without Finetuning | | Emerging |
| 2585 | AILab-CVC/M2PT: [CVPR 2024] Multimodal Pathway: Improve Transformers with Irrelevant Data... | | Emerging |
| 2586 | dsdanielpark/open-llm-datasets: Repository for organizing datasets and papers used in Open LLM. | | Emerging |
| 2587 | BorealisAI/flora-opt: This is the official repository for the paper "Flora: Low-Rank Adapters Are... | | Emerging |
| 2588 | zubair-irshad/NeRF-MAE: [ECCV 2024] Pytorch code for our ECCV'24 paper NeRF-MAE: Masked AutoEncoders... | | Emerging |
| 2589 | alvion427/PerroPastor: Run Llama based LLMs in Unity entirely in compute shaders with no dependencies | | Emerging |
| 2590 | ymoslem/Adaptive-MT-LLM: Adaptive Machine Translation with Large Language Models | | Emerging |
| 2591 | mlverse/mall: Run multiple LLM predictions against a data frame with R and Python | | Emerging |
| 2592 | BillChan226/HALC: [ICML 2024] Official implementation for "HALC: Object Hallucination... | | Emerging |
| 2593 | rasbt/faster-pytorch-blog: Outlining techniques for improving the training performance of your PyTorch... | | Emerging |
| 2594 | CJReinforce/PURE: Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is... | | Emerging |
| 2595 | TIGER-AI-Lab/TIGERScore: "TIGERScore: Towards Building Explainable Metric for All Text Generation... | | Emerging |
| 2596 | alexliap/greek_gpt: MoE Decoder Transformer implementation with MLX | | Emerging |
| 2597 | Niez-Gharbi/Youtube-Summariser: Summarize your YouTube videos with BART in a Streamlit app. | | Emerging |
| 2598 | xmartlabs/spoter-embeddings: Create embeddings from sign pose videos using Transformers | | Emerging |
| 2599 | fvliang/DART: Official Implementation of DART (DART: Diffusion-Inspired Speculative... | | Emerging |
| 2600 | AIRI-Institute/Probing_framework: Framework for probing tasks | | Emerging |