All Transformer Models
7,795 models ranked by quality score · Page 29 of 78
| # | Model | Score | Tier |
|---|---|---|---|
| 2801 | khiwniti/kaggle-llm-api<br>🤖 Comprehensive solution for running Ollama/vLLM API servers in Kaggle... | | Emerging |
| 2802 | OpenVanguard/remma-o1<br>Remma-O1: An open-source Language Model with 1.17B Params, built on pytorch... | | Emerging |
| 2803 | obss/turkish-question-generation<br>Automated question generation and question answering from Turkish texts... | | Emerging |
| 2804 | Trustworthy-ML-Lab/VLG-CBM<br>[NeurIPS 24] A new training and evaluation framework for learning... | | Emerging |
| 2805 | gsilvamartin/Backforge<br>Experimental Intelligent AI Backend Agent | | Emerging |
| 2806 | Sharukesh3/LLM-for-hydrogen-storage<br>The LARGE LANGUAGE MODEL FOR HYDROGEN STORAGE project uses advanced natural... | | Emerging |
| 2807 | EternityYW/TRAM-Benchmark<br>TRAM: Benchmarking Temporal Reasoning for Large Language Models (Findings of... | | Emerging |
| 2808 | bowen-upenn/llm_token_bias<br>[EMNLP 2024] A Peek into Token Bias: Large Language Models Are Not Yet... | | Emerging |
| 2809 | Kacper-W-Kozdon/promptflow_unify_integration<br>The tool package for Microsoft's Prompt flow and the VS Code extension | | Emerging |
| 2810 | hsisaberi/single-trait-electra<br>A complete ELECTRA-based framework for Big Five personality trait... | | Emerging |
| 2811 | gitabtion/ConvBert-PyTorch<br>🤗 An unofficial PyTorch implementation of ConvBert based on huggingface/transformers. | | Emerging |
| 2812 | yeyupiaoling/Chinese-LLM-Chat<br>A large language model fine-tuning project, including fine-tuning ChatGLM and LLama with QLora | | Emerging |
| 2813 | PlanTL-GOB-ES/lm-biomedical-clinical-es<br>Official source for Spanish pretrained biomedical and clinical language... | | Emerging |
| 2814 | Strifee/arabic2english<br>Arabic to English machine translation with Transformers and Pytorch | | Emerging |
| 2815 | aimagelab/Emuru-autoregressive-text-img<br>Official PyTorch implementation for "Zero-Shot Styled Text Image Generation,... | | Emerging |
| 2816 | voxel51/fiftyone-huggingface-plugins<br>Hugging Face Plugins for FiftyOne | | Emerging |
| 2817 | samestrin/llm-services-api<br>A FastAPI-powered REST API offering a comprehensive suite of natural... | | Emerging |
| 2818 | thefilesareinthecomputer/offline_file_translation<br>Text file language translation app that translates .txt, .csv, and .xlsx... | | Emerging |
| 2819 | kurnevsky/llama-cpp.el<br>A client for llama-cpp server | | Emerging |
| 2820 | Honee-W/U-SAM<br>Official repository for U-SAM (Interspeech 2025) | | Emerging |
| 2821 | snu-mllab/GuidedQuant<br>Official PyTorch implementation of "GuidedQuant: Large Language Model... | | Emerging |
| 2822 | 18907305772/KCA<br>EMNLP'2024: Knowledge Verification to Nip Hallucination in the Bud | | Emerging |
| 2823 | s-omranpour/MIDI-Transformer<br>Another implementation of the paper "Compound Word Transformer: Learning to... | | Emerging |
| 2824 | NikolasMarkou/fsm_llm<br>A Finite State Machine hybrid with Large Language Models | | Emerging |
| 2825 | danielsobrado/llm_notebooks<br>Concepts and examples on using and training LLMs | | Emerging |
| 2826 | MoFHeka/LLaMA-Megatron<br>A LLaMA1/LLaMA12 Megatron implementation. | | Emerging |
| 2827 | mscheong01/speculative_decoding.c<br>Minimal C implementation of speculative decoding based on llama2.c | | Emerging |
| 2828 | load1n9/chat<br>Leverage llama3.2 and other large language models to generate responses to... | | Emerging |
| 2829 | SJTU-DENG-Lab/LightningRL<br>LightningRL: Breaking the Accuracy–Parallelism Trade-off of Block-wise dLLMs... | | Emerging |
| 2830 | louisoutin/rat_crypto_trader<br>Relation-Aware Transformer for Portfolio Policy Learning using Binance provider | | Emerging |
| 2831 | bfilar/URLTran<br>PyTorch/HuggingFace Implementation of URLTran: Improving Phishing URL... | | Emerging |
| 2832 | kvignesh1420/cot-icl-lab<br>[ACL 2025] Official implementation of the "CoT-ICL Lab" framework | | Emerging |
| 2833 | asimsinan/LLM-Research<br>A collection of LLM related papers, thesis, tools, datasets, courses, open... | | Emerging |
| 2834 | TirendazAcademy/Llama3-Tutorials<br>Hands-on projects with Llama 3, Ollama, Streamlit | | Emerging |
| 2835 | kyegomez/MMCA<br>The open source community's implementation of the all-new Multi-Modal Causal... | | Emerging |
| 2836 | AnasMohammad4321/BERT-Pytorch<br>Comprehensive BERT model training and visualization, detailing pre-training,... | | Emerging |
| 2837 | jw-source/LlamaSim<br>Simulate human behavior with mass LLMs | | Emerging |
| 2838 | ndoll1998/AppliedTransformers<br>State-Of-The-Art Transformer Models | | Emerging |
| 2839 | Stamir36/CursusAI-ChatBot<br>Chatbot based on artificial intelligence (AI) for communication, image... | | Emerging |
| 2840 | liashchynskyi/ggufer<br>Convert & quantize HuggingFace models using llama.cpp on premises | | Emerging |
| 2841 | mbzuai-oryx/Video-LLaVA<br>PG-Video-LLaVA: Pixel Grounding in Large Multimodal Video Models | | Emerging |
| 2842 | kyegomez/JaxTransformer<br>This repository demonstrates how to build a Decoder-Only Transformer with... | | Emerging |
| 2843 | kyegomez/SelfExtend<br>Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend... | | Emerging |
| 2844 | finxlab/hgait<br>Official implementation of HGAIT: Heterogeneous Graph Attention with... | | Emerging |
| 2845 | twitter-research/multilingual-alignment-tpp<br>Code for reproducing the paper Improved Multilingual Language Model... | | Emerging |
| 2846 | dhakalnirajan/LLaMA-BitNet<br>LLaMA-BitNet is a repository dedicated to empowering users to train their... | | Emerging |
| 2847 | philogicae/gpt4all-telegram-bot<br>Simple Telegram bot using GPT4All | | Emerging |
| 2848 | hukenovs/slovo<br>Slovo: Russian Sign Language Dataset and Models | | Emerging |
| 2849 | llm-misinformation/llm-misinformation<br>The dataset and code for the ICLR 2024 paper "Can LLM-Generated... | | Emerging |
| 2850 | whucs21Mzy/Model-Phase-Transitions<br>Navigating Model Phase Transitions to Enable Extreme Lossless Compression: A... | | Emerging |
| 2851 | sdpkjc/SATQuest<br>🏞 A Verifier for Logical Reasoning Evaluation and Reinforcement Fine-Tuning of LLMs | | Emerging |
| 2852 | EchoSingh/GitHub_Profile_Picture<br>A code guide to generating your AI profile picture | | Emerging |
| 2853 | Riccorl/transformer-srl<br>Reimplementation of a BERT based model (Shi et al, 2019), currently the... | | Emerging |
| 2854 | imanslab/poc-uncensored-language-with-wizard-vicuna<br>Uncensored Language Model using FastAPI and Wizard Vicuna 30B (PoC) | | Emerging |
| 2855 | theosorus/GPT2-Hasktorch<br>GPT2 implementation in Haskell with the Hasktorch library, inspired by... | | Emerging |
| 2856 | rockyco/estFreqOffset<br>LLM-Assisted FPGA Design for Carrier Frequency Offset Estimation | | Emerging |
| 2857 | microsoft/AMOS<br>[ICLR 2022] Pretraining Text Encoders with Adversarial Mixture of Training... | | Emerging |
| 2858 | hexuandeng/DRPruning<br>Implementation for our paper “DRPruning: Efficient Large Language Model... | | Emerging |
| 2859 | leodeveloper/Pdf-Parse-LlamaParse<br>Using Llama Parse to read PDFs and convert them into Markdown or text | | Emerging |
| 2860 | voidful/nlp2go<br>🏃 Hosting NLP models in one line | | Emerging |
| 2861 | mfekadu/nimbus-transformer<br>It's like Nimbus but uses a transformer language model | | Emerging |
| 2862 | HYUNJS/STTM<br>[ICCV 2025] Multi-Granular Spatio-Temporal Token Merging for Training-Free... | | Emerging |
| 2863 | GeorgiosIoannouCoder/mindscanner<br>Deep learning models and fine-tuned transformers for detecting mental... | | Emerging |
| 2864 | ssbuild/llm_finetuning<br>Large language model finetuning: bloom, opt, gpt, gpt2... | | Emerging |
| 2865 | sandyresearch/chipmunk<br>🎬 3.7× faster video generation E2E 🖼️ 1.6× faster image generation E2E... | | Emerging |
| 2866 | hkproj/mistral-llm-notes<br>Notes on the Mistral AI model | | Emerging |
| 2867 | izmttk/ullm<br>Lightweight LLM inference engine inspired by nano-vllm, with radix-tree... | | Emerging |
| 2868 | msamprovalaki/Exploring-Multimodal-Large-Language-Models-for-Medical-Image-Captioning<br>This repository includes the code for my Master Thesis, which investigates... | | Emerging |
| 2869 | conditionWang/FLNK<br>Federated Learning with New Knowledge -- explore to incorporate various new... | | Emerging |
| 2870 | fangevo/KD-efficient-text-summarization<br>The project leverages a larger model, Qwen2.5-14B, to generate high-quality... | | Emerging |
| 2871 | Kuberwastaken/MiniLMs<br>A research project focused on studying and implementing minimalist language... | | Emerging |
| 2872 | ulab-uiuc/Time-R1<br>Time-R1: Framework and resources for endowing LLMs with comprehensive... | | Emerging |
| 2873 | tunib-ai/joker<br>AI model designed to test the effectiveness in handling external ethical attacks. | | Emerging |
| 2874 | mubingshen/MLC-SLM-Baseline<br>The project is associated with the recently-launched INTERSPEECH 2025... | | Emerging |
| 2875 | salesforce/factualNLG<br>Code for the arXiv paper: "LLMs as Factual Reasoners: Insights from Existing... | | Emerging |
| 2876 | OrigamiDream/CoRT<br>CoRT: Contrastive Rhetorical Tagging - KISTI 2022 AI/ML Competition | | Emerging |
| 2877 | IST-DASLab/Quartet-II<br>Quartet II Official Code | | Emerging |
| 2878 | ViLab-UCSD/LaGTran_ICML2024<br>Code and models for the ICML 2024 paper "Tell, Don't Show!: Language... | | Emerging |
| 2879 | reasoning-machines/CoCoGen<br>Language Models of Code are Few-Shot Commonsense Learners (EMNLP 2022) | | Emerging |
| 2880 | bobxwu/learning-from-rewards-llm-papers<br>A comprehensive collection of learning from rewards in the post-training and... | | Emerging |
| 2881 | EvilFreelancer/MoDA<br>A framework designed to enhance the performance and flexibility of large... | | Emerging |
| 2882 | avrtt/telegram-content-moderator<br>NLP/ViT-driven bot for detection & moderation of inappropriate content in... | | Emerging |
| 2883 | trekhleb/homemade-gpt-js<br>A minimal TensorFlow.js re-implementation of Karpathy's minGPT (Generative... | | Emerging |
| 2884 | CyberAgentAILab/japanese-nli-model<br>This repository provides the code for Japanese NLI model, a fine-tuned... | | Emerging |
| 2885 | aws-samples/multi-modal-examples-for-amazon-sagemaker<br>A workshop for collections of multi-modal LLM examples, samples, reference... | | Emerging |
| 2886 | yunkai1841/recipe-generation<br>NLP text generation task: generate recipes with a fine-tuned LLaMA model. | | Emerging |
| 2887 | AGI-Edgerunners/LLM-Continual-Learning-Papers<br>Must-read Papers on Large Language Model (LLM) Continual Learning | | Emerging |
| 2888 | calhounpaul/LLaMA-PEFT-LoRa-subreddit-chatbot-colab<br>Parameter Efficient Fine Tuning (PEFT) to create a chatbot from Facebook's... | | Emerging |
| 2889 | GAIR-NLP/abel<br>SOTA Math Opensource LLM | | Emerging |
| 2890 | NVlabs/HMAR<br>[CVPR 2025] HMAR: Efficient Hierarchical Masked Auto-Regressive Image Generation | | Emerging |
| 2891 | Beomi/megatronlm_dataset_autotokenizer<br>Megatron-LM/GPT-NeoX compatible Text Encoder with 🤗Transformers AutoTokenizer. | | Emerging |
| 2892 | piotrmaciejbednarski/pllum-cookbook<br>This repository contains example Jupyter notebooks demonstrating how to use... | | Emerging |
| 2893 | OSUPCVLab/MobileUNETR<br>Official Implementation of MobileUNETR: A Lightweight End-To-End Hybrid... | | Emerging |
| 2894 | neuro-symbolic-ai/explanation_based_ethical_reasoning<br>Code and data for Paper "Enhancing Ethical Explanations of Large Language... | | Emerging |
| 2895 | mickymultani/LLM-Architecture<br>Visualize some important concepts related to LLM architectures. | | Emerging |
| 2896 | yandricr/gpti-py<br>This package simplifies your interaction with various GPT models, removing... | | Emerging |
| 2897 | Infini-AI-Lab/TriForce<br>[COLM 2024] TriForce: Lossless Acceleration of Long Sequence Generation with... | | Emerging |
| 2898 | andreaceto/multimodal-crisis-classification<br>Multimodal Classification of Crisis-related social media contents. | | Emerging |
| 2899 | Shekswess/tiny-reasoning-language-model<br>Code repository dedicated to experimenting and research with tiny reasoning... | | Emerging |
| 2900 | Awni00/abstract_transformer<br>This is the project repo associated with the paper "Disentangling and... | | Emerging |