All Transformer Models

7,795 models ranked by quality score · Page 19 of 78

Showing 1801–1900 of 7,795

« Prev Next »

#	Model	Score	Tier	Category	Stars	Language
1801	niclasgriesshaber/llm_patent_pipeline LLMs for Historical Dataset Construction from Archival Image Scans	36	Emerging	llm-data-labeling	4	HTML
1802	fmueller/scribae CLI to turn Markdown notes into SEO briefs, drafts, metadata, and...	36	Emerging	ai-powered-business-analytics	1	Python
1803	nanxiang11/CodeLab_LLM 🌟 从LLaMA2开启大语言模型原理与实践教程	36	Emerging	llm-learning-resources	76	Python
1804	ykjaat6104/LLM-Cost-and-Token-Efficiency-Analysis A benchmark study analyzing cost and token efficiency across 14 LLMs from 5...	36	Emerging	llm-benchmark-leaderboards	4	Jupyter Notebook
1805	clint-kristopher-morris/llm-guided-evolution LLM Guided Evolution - The Automation of Models Advancing Models	36	Emerging	llm-finetuning-frameworks	16	Python
1806	jackaduma/ChatGLM-LoRA-RLHF-PyTorch A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer...	36	Emerging	rlhf-alignment-training	140	Python
1807	MaxiDonkey/DelphiGroqCloud The GroqCloud API wrapper for Delphi provides access to models from Meta,...	36	Emerging	llm-terminal-automation	20	Pascal
1808	jagilley/fact-checker Fact-checking LLM outputs with self-ask	36	Emerging	fact-checking-systems	307	Jupyter Notebook
1809	OSU-STARLAB/Simul-LLM [ACL 2024] An easily extensible framework for simultaneous, text-to-text...	36	Emerging	llm-scaling-architecture	18	Python
1810	PaddlePaddle/PALM a Fast, Flexible, Extensible and Easy-to-use NLP Large-scale Pretraining and...	36	Emerging	llm-training-experimentation	185	Python
1811	assembly-automation-hub/repo-governance ⚙️ Reusable GitHub repository governance kit: CI/CD workflows, CodeQL SAST,...	36	Emerging	—	2	Python
1812	zrr1999/emotion-recognition 多模态情绪识别方法研究（Multimodal Emotion Recognition）	36	Emerging	emotion-detection-transformers	25	Python
1813	JRC1995/BERT-Disaster-Classification-Capsule-Routing Exploration of BERT-BiLSTM models with Layer Aggregation (attention-based...	36	Emerging	disaster-tweet-classification	25	Python
1814	ariya/query-llm Query LLM with Chain-of-Tought	36	Emerging	llm-terminal-automation	14	JavaScript
1815	SimeonHristov99/DL_25-26 Practice sessions for the course "Introduction to deep learning" in the...	36	Emerging	ml-foundations-curricula	4	Jupyter Notebook
1816	martin-wey/peft-llm-code Replication package of the paper "Exploring Parameter-Efficient Fine-Tuning...	36	Emerging	llm-scaling-architecture	25	Python
1817	jaketae/alibi PyTorch implementation of Train Short, Test Long: Attention with Linear...	36	Emerging	transformer-architecture-tutorials	33	Python
1818	ziplab/HVT [ICCV 2021] Official implementation of "Scalable Vision Transformers with...	36	Emerging	vision-transformer-implementations	33	Python
1819	luciusssss/ZhuangBench [ACL'24 Findings] Teaching Large Language Models an Unseen Language on the Fly	36	Emerging	llm-scaling-architecture	25	Python
1820	antonyvigouret/Pay-Attention-to-MLPs My implementation of the gMLP model from the paper "Pay Attention to MLPs".	36	Emerging	transformer-architecture-tutorials	25	Python
1821	zhongkaifu/TensorSharp A C# inference engine for running large language models (LLMs) locally using...	36	Emerging	—	2	C++
1822	AristotelisPap/Question-Answering-with-BERT-and-Knowledge-Distillation Fine-tuned BERT on SQuAd 2.0 Dataset. Applied Knowledge Distillation (KD)...	36	Emerging	question-answering-systems	26	Jupyter Notebook
1823	aquadzn/deploy-transformers Easily deploy a state-of-the-art language model from HuggingFace's Transformers	36	Emerging	transformer-frameworks-wrappers	12	Python
1824	sigeisler/reinforce-attacks-llms REINFORCE Adversarial Attacks on Large Language Models: An Adaptive,...	36	Emerging	jailbreak-attacks-analysis	23	Python
1825	deep-symbolic-mathematics/Multimodal-Math-Pretraining [ICLR 2024 Spotlight] This is the official code for the paper "SNIP:...	36	Emerging	mathematical-reasoning-transformers	58	Python
1826	camenduru/alpaca-lora-colab Alpaca Lora	36	Emerging	llm-quantization-methods	25	Jupyter Notebook
1827	AutonomicPerfectionist/PipeInfer PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation	36	Emerging	llm-cuda-optimization	32	C++
1828	shoppollama/shoppollama Open Source Agentic Commerce Platform built on Ollama and Stripe — Run...	36	Emerging	interactive-ai-chat-uis	1	Elixir
1829	JHansiduYapa/Fine-Tuning-a-Small-Language-Model-for-Cypher-Query-Generation This project fine-tunes Unsloth's Gemma-3 4B IT (4-bit) model to translate...	36	Emerging	code-model-training	6	Jupyter Notebook
1830	sinanuozdemir/oreilly-bert-nlp This repository contains code for the O'Reilly Live Online Training for BERT	36	Emerging	model-evaluation-diagnostics	32	Jupyter Notebook
1831	NgJaBach/dark-kit Collect and share guidance + code snippets for running LM-related tasks.	36	Emerging	llm-fine-tuning	4	Python
1832	dvgodoy/LLM-visuals Over 60 figures and diagrams of LLMs, quantization, low-rank adapters...	36	Emerging	llm-learning-resources	23	—
1833	flowersteam/LLM-Culture Code for the "Cultural evolution in populations of Large Language Models" paper	36	Emerging	llm-agent-training-gyms	34	Python
1834	alantess/gtrxl-torch Gated Transformer Model for Computer Vision	36	Emerging	vision-language-models	25	Python
1835	thanhlecongg/Invalidator Invalidator: Automated Patch Correctness Assessment via Semantic and...	36	Emerging	vulnerability-detection-llm	6	Python
1836	gotzmann/booster Booster - open accelerator for LLM models. Better inference and debugging...	36	Emerging	llm-inference-engines	167	C++
1837	m-horky/sllm Tools using small Large Language Models	36	Emerging	llm-inference-engines	4	Python
1838	yongchao98/R1-Code-Interpreter R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and...	36	Emerging	llm-reasoning-research	31	Python
1839	RishabSA/Sketch2Graphviz Sketch2Graphviz allows you to convert sketches or images of graphs and...	36	Emerging	text-to-image-generation	7	Python
1840	crux82/BISS-2024 This repository hosts materials from the Bertinoro International Spring...	36	Emerging	nlp-learning-coursework	4	Jupyter Notebook
1841	abdur75648/V-Zen V-Zen: Efficient GUI Understanding and Precise Grounding With A Novel...	36	Emerging	llm-terminal-automation	9	—
1842	trzy/llava-cpp-server LLaVA server (llama.cpp).	36	Emerging	local-llm-deployment	183	C++
1843	sashazykov/json-repair-rb A simple Ruby gem designed to repair broken JSON strings	36	Emerging	local-llm-deployment	10	Ruby
1844	TIGER-AI-Lab/VisualWebInstruct The official repo for "VisualWebInstruct: Scaling up Multimodal Instruction...	36	Emerging	instruction-tuning-datasets	38	Python
1845	etaoxing/multigame-dt Implementation of Multi-Game Decision Transformers in PyTorch	36	Emerging	power-transformer-design	49	Python
1846	qingsongedu/Awesome-TimeSeries-SpatioTemporal-LM-LLM A professional list on Large (Language) Models and Foundation Models (LLM,...	36	Emerging	multimodal-vision-language-models	1,203	—
1847	phvv-me/frame-representation-hypothesis Official Repository for Frame Representation Hypothesis paper	36	Emerging	llm-interpretability-explainability	8	Jupyter Notebook
1848	sampathkethineedi/bert-topic-sentiment Topic Based Sentiment Detection using BERT	36	Emerging	review-sentiment-classification	9	Python
1849	dravenk/ollama-zig Ollama Zig library	36	Emerging	local-llm-deployment	35	Zig
1850	rickiepark/the-lm-book <대규모 언어 모델, 핵심만 빠르게!>(인사이트, 2025)의 코드 저장소	36	Emerging	llm-training-experimentation	5	Jupyter Notebook
1851	weiserlab/TinyLLM Bringing Language Models to the Most Resource Constrained Devices	36	Emerging	llm-frameworks-libraries	50	Python
1852	HenryNdubuaku/nanodl Build GPT, Gemma, LlaMa, Mixtral, Whisper, SWin, ViT and more in JAX.	36	Emerging	transformer-frameworks-wrappers	299	Python
1853	warner-benjamin/commented-transformers Highly commented implementations of Transformers in PyTorch	36	Emerging	transformer-architecture-tutorials	138	Python
1854	urban-mobility-generation/Language-Modeling-for-Urban-Mobility Language Modeling for Urban Mobility: A Data-Centric Review and Guidelines	36	Emerging	multimodal-vision-language-models	9	—
1855	saeeddhqan/tiny-transformer Tiny transformer models implemented in pytorch.	36	Emerging	transformer-architecture-tutorials	9	Python
1856	grigio/llm-eval-simple llm-eval-simple is a simple LLM evaluation framework with intermediate...	36	Emerging	evaluation-frameworks-metrics	59	Python
1857	yihongXU/TransCenter This is the official implementation of TransCenter (TPAMI). The code and...	36	Emerging	3d-vision-transformers	118	—
1858	nehalvaghasiya/interview-bot AI-powered virtual interview bot to simulate real interview practice.	36	Emerging	streamlit-llm-interfaces	9	Python
1859	markusaksli/ai-music A vanilla Trasformer Decoder music generation model trained on Final Fantasy...	36	Emerging	music-generation-transformers	14	Python
1860	seedatnabeel/CLLM Curated LLM (ICML 2024)	36	Emerging	llm-domain-datasets	14	Jupyter Notebook
1861	DAMO-NLP-SG/LLM-Zoo LLM Zoo collects information of various open- and close-sourced LLMs	36	Emerging	multilingual-llm-adaptation	271	—
1862	Alsace08/Chain-of-Embedding [ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding...	36	Emerging	llm-reasoning-research	95	Python
1863	SingleZombie/LLSA Official implementation of Log-linear Sparse Attention (LLSA).	36	Emerging	attention-mechanism-implementations	62	Python
1864	jaketae/vit-breast-cancer Transfer learning pretrained vision transformers for breast histopathology	36	Emerging	medical-image-diagnosis-transformers	14	Python
1865	laurab222/TSAD Benchmarking Unsupervised Strategies for Anomaly Detection in Multivariate...	36	Emerging	time-series-forecasting-transformers	6	Python
1866	Praveengovianalytics/falcon-evaluate Falcon Evaluate is an open-source Python library aims to revolutionise the...	36	Emerging	evaluation-frameworks-metrics	14	Python
1867	Azure/nlp-samples Japanese NLP sample codes	36	Emerging	model-evaluation-diagnostics	10	Shell
1868	0x7o/text2keywords Trained T5 and T5-large model for creating keywords from text	36	Emerging	t5-mt5-fine-tuning	73	Jupyter Notebook
1869	UCSC-REAL/DS2 [ICLR 2025] Official implementation of paper "Improving Data Efficiency via...	36	Emerging	graph-language-models	101	Python
1870	TIGER-AI-Lab/LongICLBench Code and Data for "Long-context LLMs Struggle with Long In-context Learning"...	36	Emerging	math-reasoning-datasets	112	Python
1871	MME-Benchmarks/Video-MME ✨✨[CVPR 2025] Video-MME: The First-Ever Comprehensive Evaluation Benchmark...	36	Emerging	multimodal-vision-language	732	—
1872	SculptAI/GIMKit Guided Infilling Modeling Toolkit	36	Emerging	llm-fine-tuning	2	Python
1873	abenechehab/dicl [ICLR 2025] Official implementation of DICL (Disentangled In-Context...	36	Emerging	rlhf-alignment-training	25	Jupyter Notebook
1874	IrohXu/Awesome-Multimodal-LLM-Autonomous-Driving [WACV 2024 Survey Paper] Multimodal Large Language Models for Autonomous Driving	36	Emerging	multimodal-vision-language-models	309	—
1875	ray-project/ray-llm RayLLM - LLMs on Ray (Archived). Read README for more info.	36	Emerging	llm-inference-serving	1,267	—
1876	styfeng/DataAug4NLP Collection of papers and resources for data augmentation for NLP.	36	Emerging	essay-scoring-grading	831	—
1877	monologg/HanBert-Transformers HanBert on 🤗 Huggingface Transformers 🤗	36	Emerging	korean-language-models	87	Python
1878	Aloereed/llama.cpp-server-ohos Llama.cpp server for OpenHarmony	36	Emerging	local-llm-deployment	9	C++
1879	suamin/T2NER T2NER: Transformers based Transfer Learning Framework for Named Entity...	36	Emerging	named-entity-recognition	11	Python
1880	hristijanpeshov/SHAP-Explainable-Lexicon-Model This project proposes a novel methodology to automatically learn financial...	36	Emerging	financial-sentiment-analysis	15	Jupyter Notebook
1881	bvanaken/visbert VisBERT: Demo web app for "How Does BERT Answer Questions?"	36	Emerging	transformer-interpretability-mechanistic	11	JavaScript
1882	gbaptista/ollama-ai A Ruby gem for interacting with Ollama's API that allows you to run open...	36	Emerging	interactive-ai-chat-uis	255	Ruby
1883	cakshat/AlloyBERT Introducing AlloyBERT: a transformer encoder-based model for predicting...	36	Emerging	bert-model-implementations	12	Python
1884	epfl-dlab/llm-latent-language Repo accompanying our paper "Do Llamas Work in English? On the Latent...	36	Emerging	llm-frameworks-libraries	80	Jupyter Notebook
1885	cosbidev/NAIM Official implementation for the paper ``Not Another Imputation Method: A...	36	Emerging	transformer-architecture-tutorials	11	Python
1886	JinjieNi/MegaDLMs GPU-optimized framework for training diffusion language models at any scale....	36	Emerging	diffusion-language-models	327	Python
1887	kyegomez/M2PT Implementation of M2PT in PyTorch from the paper: "Multimodal Pathway:...	36	Emerging	parameter-efficient-adapters	14	Python
1888	AaronFeng753/Ollama-Model-Dumper Export and Backup Ollama models into GGUF and ModelFile	36	Emerging	llm-quantization-methods	92	Python
1889	aryan-jadon/Regression-Loss-Functions-in-Time-Series-Forecasting-Tensorflow This repository contains the implementation of paper Temporal Fusion...	36	Emerging	time-series-forecasting-transformers	85	Python
1890	yaph/charla A terminal based chat application that works with AI language models.	36	Emerging	interactive-ai-chat-uis	12	Python
1891	abhilashreddys/Fake-News-Article Detecting fake news articles by analyzing patterns in writing.	36	Emerging	fake-news-detection	10	Jupyter Notebook
1892	hao-ai-lab/DistCA Efficient Long-context Language Model Training by Core Attention Disaggregation	36	Emerging	diffusion-language-models	93	Python
1893	GeorgeMichailidis/multi-task-mixed-freq Code repository for "Multi-Task Encoder-Dual-Decoder Modeling Framework on...	36	Emerging	time-series-forecasting-transformers	12	Python
1894	real-stanford/reflect [CoRL 2023] REFLECT: Summarizing Robot Experiences for Failure Explanation...	36	Emerging	llm-robot-planning	103	Jupyter Notebook
1895	BUAADreamer/Chinese-LLaVA-Med 中文医学多模态大模型 Large Chinese Language-and-Vision Assistant for BioMedicine	36	Emerging	vision-language-instruction-tuning	103	Python
1896	robinhad/kruk Ukrainian instruction-tuned language models and datasets	36	Emerging	multilingual-llm-adaptation	96	Jupyter Notebook
1897	ahans30/goldfish-loss [NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs	36	Emerging	mixup-augmentation-frameworks	97	Python
1898	asigalov61/Perceiver-Music-Transformer SOTA Google's Perceiver-AR Music Transformer Implementation and Model	36	Emerging	ai-music-generation	104	Python
1899	zhilizju/Awesome-instruction-tuning A curated list of awesome instruction tuning datasets, models, papers and...	36	Emerging	instruction-tuning-datasets	347	Python
1900	DFKI-NLP/thermostat Collection of NLP model explanations and accompanying analysis tools	36	Emerging	transformer-interpretability-mechanistic	144	Jsonnet

« Prev 1 2 3 … 17 18 19 20 21 … 76 77 78 Next »