All Transformer Models
7,795 models ranked by quality score · Page 48 of 78
| # | Model | Score | Tier |
|---|---|---|---|
| 4701 |
kasia-kobalczyk/guess_llm
Implementation of the probing models presented in the ICLR 2026 paper... |
|
Experimental |
| 4702 |
landry-some/LLM-streaming
Efficient streaming inference for large language models (LLMs). |
|
Experimental |
| 4703 |
kyegomez/HeptapodLM
An Implementation of an Transformer model that generates tokens non-linearly... |
|
Experimental |
| 4704 |
gowtamyreddy/NLP
Text Generation using RNN, LSTM, and Transformer |
|
Experimental |
| 4705 |
liam8421/faster-llm
🚀 Accelerate LLM training with Fast-LLM, an open-source library for... |
|
Experimental |
| 4706 |
fattorib/ZeRO-transformer
Two implementations of ZeRO-1 optimizer sharding in JAX |
|
Experimental |
| 4707 |
onlychara553-debug/dgx-spark-inference-stack
🚀 Serve large language models efficiently at home with this Docker-based... |
|
Experimental |
| 4708 |
YousfiNahed/KoValPlus
🌍 Evaluate cultural and value alignment of LLMs with Korean responses using... |
|
Experimental |
| 4709 |
Exahia/llm-benchmark-fr
Benchmarks LLM sur tâches métier françaises — Mistral vs Llama vs Qwen vs DeepSeek |
|
Experimental |
| 4710 |
PKU-Alignment/llms-resist-alignment
[ACL2025 Best Paper] Language Models Resist Alignment |
|
Experimental |
| 4711 |
KhoiBui16/UIT_CS221_Basic_Natural_Language_Processing
The project focuses on classifying hallucinations in Vietnamese LLM outputs... |
|
Experimental |
| 4712 |
vkhamesi/proteins
🧬 Fine-Tuning Large Language and Protein Models on a single T4 GPU via... |
|
Experimental |
| 4713 |
aakasharya09/llm-leaderboard
📊 Compare LLM models effortlessly with our tool, showcasing performance... |
|
Experimental |
| 4714 |
eduardopini/Dresguardian
🛡️ Elevate your privacy with Dresguardian, a self-hosted Telegram bot that... |
|
Experimental |
| 4715 |
Kelvinkeoma/AI-Digital-Doppelganger
Build a personal AI Telegram bot that processes text, voice, and images with... |
|
Experimental |
| 4716 |
nluninja/drugsLLM
An intelligent conversational assistant study designed to provide accurate,... |
|
Experimental |
| 4717 |
cvssn/shade
ai pair programming in your terminal |
|
Experimental |
| 4718 |
LlamaFlowJs/LlamaFlowJs
LlamaFlow is a framework that has inbuilt agentic workflows,reiterative... |
|
Experimental |
| 4719 |
n1405732043/pi-token-burden
Analyze system prompt tokens to identify usage and manage token budgets... |
|
Experimental |
| 4720 |
M-e-r-c-u-r-y/pytorch-transformers
Collection of different types of transformers for learning purposes |
|
Experimental |
| 4721 |
malith153/token-forge
🔑 Build robust identity solutions with TokenForge, an enterprise-ready... |
|
Experimental |
| 4722 |
awpggexcutor-beep/T5-Refiner-DomainFocus
🌟 Enhance T5 model performance with domain-specific word masking for... |
|
Experimental |
| 4723 |
AIDajiangtang/LLM-from-scratch
从零开始学大模型Transformer、GPT2、BERT pre-training and fine-tuning from scratch |
|
Experimental |
| 4724 |
ozyurtf/attention-and-transformers
The purpose of this project is to understand how the Transformers work and... |
|
Experimental |
| 4725 |
CheongWoong/impact_of_cooccurrence
A repository for analyzing the impact of co-occurrence statistics on factual... |
|
Experimental |
| 4726 |
jasminwolf/ZakeyTeam-arabic-qa-system-arabert
🤖 Enhance Arabic NLP capabilities with this AI-powered question answering... |
|
Experimental |
| 4727 |
r-kovalch/omnigec-models
Reproducible QLoRA recipes and configs that fine‑tune Aya‑Expanse‑8B and... |
|
Experimental |
| 4728 |
ertosns/wiki-summary
wikipedia summarizer transformer |
|
Experimental |
| 4729 |
jstilb/timeseries-forecasting
Multi-variate time series forecasting: LSTM, Transformer, and statistical... |
|
Experimental |
| 4730 |
resetpaid/lumina
Perform passive domain reconnaissance using public data sources without... |
|
Experimental |
| 4731 |
gameofdimension/seven8wen
大语言模型高效微调 |
|
Experimental |
| 4732 |
khalidm31415/fastapi-transformers-zsl
Zero-shot learning text classification web app with FastAPI backend |
|
Experimental |
| 4733 |
lechmazur/deception
Benchmark evaluating LLMs on their ability to create and resist... |
|
Experimental |
| 4734 |
ENTITY107/rlmgw
🔄 Explore Recursive Language Models (RLMs) to enhance natural language... |
|
Experimental |
| 4735 |
KeepALifeUS/ml-attention-mechanisms
Flash Attention, RoPE, multi-head attention for temporal patterns |
|
Experimental |
| 4736 |
Skyline-9/Shotluck-Holmes
[ACM MMGR '24] 🔍 Shotluck Holmes: A family of small-scale LLVMs for... |
|
Experimental |
| 4737 |
mrconter1/PullRequestBenchmark
Evaluating LLMs performance in PR reviews as an indicator for their... |
|
Experimental |
| 4738 |
Gmail1995/llm-course
🧩 Explore LLM essentials, build advanced models, and develop applications... |
|
Experimental |
| 4739 |
lzyrapx/LLM-Grandmaster-Notes
🎓The path to LLM mastery is paved with broken embeddings and resurrected gradients. |
|
Experimental |
| 4740 |
ph-ausseil/llm-training-dataset-builder
Streamlines the creation of dataset to train a Large Language Model with... |
|
Experimental |
| 4741 |
bupt-ai-club/llm-compression-papers
papers of llm compression |
|
Experimental |
| 4742 |
adkwn1/question-answer-app
Question and Answer web applicaiton using fine-tuned and pre-trained T5... |
|
Experimental |
| 4743 |
villagecomputing/superpipe
Superpipe - optimized LLM pipelines for structured data |
|
Experimental |
| 4744 |
amazon-science/mezo_svrg
Code the ICML 2024 paper: "Variance-reduced Zeroth-Order Methods for... |
|
Experimental |
| 4745 |
CaterinaBi/health-communication-paper2
Bonan & Samo. January 2023. Paper on cross-linguistic bias in health-related... |
|
Experimental |
| 4746 |
MaxLSB/mini-paligemma2
Minimalist implementation of PaliGemma 2 & PaliGemma VLM from scratch |
|
Experimental |
| 4747 |
princeton-nlp/MultilingualAnalysis
Repository for the paper titled: "When is BERT Multilingual? Isolating... |
|
Experimental |
| 4748 |
declare-lab/flacuna
Flacuna was developed by fine-tuning Vicuna on Flan-mini, a comprehensive... |
|
Experimental |
| 4749 |
aratan/ApiCloudLLaMA
The idea is to make an api that everyone can consume in their GPT4-like... |
|
Experimental |
| 4750 |
francesco-s/document-claim-mapping
A tool using LLMs and few-shot learning for document-claim mapping and... |
|
Experimental |
| 4751 |
malvads/whatsapp-gpt-bot
WhatsApp GPT bot for doing weird stuff |
|
Experimental |
| 4752 |
sak96/rust_llama_app
Chat bot (llama) written in rust using Yew and Tauri. |
|
Experimental |
| 4753 |
PrivateDennis/InfinityGame
Craft infinit items with the help of AI based on idea of neil.fun |
|
Experimental |
| 4754 |
lionajuanabel/Fine-Dllm
LoRA fine-tuning pipeline for tool-calling chat LLMs with config-driven... |
|
Experimental |
| 4755 |
ahmedbesbes/audiolizr
A bentoML-powered API to transcribe audio and make sense of it |
|
Experimental |
| 4756 |
afondiel/LangChain-For-LLM-Application-Dev-DeepLearningAI
Crash course on LangChain for LLM Application Developement by DeepLearningAI |
|
Experimental |
| 4757 |
Bhattacharya-Lab/CASP15
CASP15 performance benchmarking of the state-of-the-art protein structure... |
|
Experimental |
| 4758 |
Microsatellites-and-Space-Microsystems/pose_estimation_domain_gap
Two methods for solving domain gap in satellite pose estimation in space... |
|
Experimental |
| 4759 |
Siesher/Generator_for_reasoning
🧠 Reasoning data generator for LLM training |
|
Experimental |
| 4760 |
AikyamLab/llm-memorization
Understanding the memorization property of Large Language Models using Model... |
|
Experimental |
| 4761 |
daolytica/Panther
A desktop application for multi-LLM brainstorming, debate, local model... |
|
Experimental |
| 4762 |
Muhammad-Hammad-59/Qwen05B-lora-qlora-finetuning-for-customer-support
Parameter-efficient fine-tuning (LoRA + QLoRA) of Qwen2.5-0.5B-Instruct for... |
|
Experimental |
| 4763 |
G-Art/matrix_steering_vector_research
Iterative Sparse Matrix Steering: Closed-Form Subspace Alignment for... |
|
Experimental |
| 4764 |
AIdventures/flora
Fine-tuning LLMs with LoRA |
|
Experimental |
| 4765 |
seehiong/micronaut-llama3
A high-performance Llama3 implementation using Micronaut and GraalVM Native Image |
|
Experimental |
| 4766 |
nlx-group/Shortcutted-Commonsense-Reasoning
Code for the article "Shortcutted Commonsense: Data Spuriousness in Deep... |
|
Experimental |
| 4767 |
tanishqmudaliar/Silver-Guard-AI-Model-Training
TRAI‑aware Indian SMS scam detector that fine‑tunes MobileBERT on real +... |
|
Experimental |
| 4768 |
pacifikus/itmo_ods_nlp_course
NLP course materials at ITMO |
|
Experimental |
| 4769 |
EastTower16/LLMDataDistill
distill large scale web page text |
|
Experimental |
| 4770 |
frost-beta/llama2-high-level-cpp
Inference Llama2 with High-Level C++. |
|
Experimental |
| 4771 |
Eleanor-H/MUSTARD
Code & data for ICLR 2024 spotlight paper: 🍯MUSTARD: Mastering Uniform... |
|
Experimental |
| 4772 |
AKSW/LLMDatasetGenerator
LLM based datatset generator for KGQA on user defined knowledge graphs |
|
Experimental |
| 4773 |
Kcrypto126/Multi-Ai-Chat-App
chatting app |
|
Experimental |
| 4774 |
PRITHIVSAKTHIUR/Vit-Mature-Content-Detection
Vit-Mature-Content-Detection is an image classification vision-language... |
|
Experimental |
| 4775 |
itsvaibhav01/Immune
[CVPR2025] Official Repository for IMMUNE: Improving Safety Against... |
|
Experimental |
| 4776 |
melove297/reddit-factuality-detection
🧐 Detect factual reliability in Reddit posts using machine learning with... |
|
Experimental |
| 4777 |
ayinedjimi/ComplianceBot
AI-Powered Compliance Assistant with Transformers and Gradio |
|
Experimental |
| 4778 |
ahs95/restaurant-idea-generator
Offline‑first app that generates restaurant names, 3‑item menus, and... |
|
Experimental |
| 4779 |
SemanticWave-Hoyeon/NavtexRecovery
AI-powered restoration system for damaged NAVTEX (NAVigational TEleX)... |
|
Experimental |
| 4780 |
vtrnnhlinh/Graph-of-Models
My proposed idea to create a graph of models, or a network of models... |
|
Experimental |
| 4781 |
AndreiRoibu/Transformer-Classifiers
This repository contains NLP classification models built with the Hugging... |
|
Experimental |
| 4782 |
svjack/docvqa-gen
Question Answering dataset generator of Document Visual in English and Chinese |
|
Experimental |
| 4783 |
hadrienbdc/bert-sentiment-analysis-pytorch
Fine-tuning Bert for sentiment analysis with pytorch |
|
Experimental |
| 4784 |
taeminlee/intent_classifier
Korean Intention classifier with pytorch lightning ⚡ |
|
Experimental |
| 4785 |
arpitpatelsitapur/ScholarLensAI
A FastAPI app for research paper recommendation and chat with those papers.... |
|
Experimental |
| 4786 |
zzbright1998/SentenceKV
Official implementation of "SentenceKV: Efficient LLM Inference via... |
|
Experimental |
| 4787 |
jaisenbe58r/NLP-Transformer_Translator
Implementación Transformers, adaptación del curso: "Procesamiento del... |
|
Experimental |
| 4788 |
abc1203/transformer-model
An implementation of the transformer deep learning model, based on the... |
|
Experimental |
| 4789 |
CatnipCoders/Lambda-Driver
Lambda-Driver optimizes a small pre-trained model for resource-constrained... |
|
Experimental |
| 4790 |
akshantchaudhary09/YouTube-Transcript-Summarizer
A chrome extension that can summarize the transcript of youtube videos. |
|
Experimental |
| 4791 |
nininau/awesome-llm-services
🔍 Discover 106+ open-source LLM services and tools for AI, ideal for local... |
|
Experimental |
| 4792 |
Clinical-Quality-Artifical-Intelligence/NurseSim-RL
AI-powered clinical triage simulation using Manchester Triage System (MTS).... |
|
Experimental |
| 4793 |
axonura/axonura-X1
The First AI Model Of Axonura |
|
Experimental |
| 4794 |
theanasuddin/Deep-Learning-Fundamentals
Python implementations of deep learning fundamentals, from multilayer... |
|
Experimental |
| 4795 |
ruslanmv/ollabridge
OllaBridge transforms your laptop or workstation into a production-grade,... |
|
Experimental |
| 4796 |
Torim98/regime-switching-daa
Systematischer Vergleich ökonometrischer Modelle und moderner... |
|
Experimental |
| 4797 |
llamaplushiesYT/HTML-Games
Just some random HTML games that you can play in school or any where |
|
Experimental |
| 4798 |
MathewJobey/linux-logsummary-justpy
Turn raw Linux logs into executive insights using **Drain3** and **Ollama**.... |
|
Experimental |
| 4799 |
RichardScottOZ/geoscience-transformers-for-predictive-mapping-of-critical-minerals
First pass paper implementation |
|
Experimental |
| 4800 |
1010code/github-models-tutorial
GitHub Models API 教學,免費試玩 GPT-4o、Llama、DeepSeek,Colab 範例程式 |
|
Experimental |