All Transformer Models

7,795 models ranked by quality score · Page 20 of 78

Showing 1901–2000 of 7,795

« Prev Next »

#	Model	Score	Tier	Category	Stars	Language
1901	sh0416/llama-classification Text classification with Foundation Language Model LLaMA	36	Emerging	llama-model-implementations	113	Python
1902	declare-lab/LLM-PuzzleTest This repository is maintained to release dataset and models for multimodal...	36	Emerging	math-reasoning-datasets	113	Python
1903	jeffreysijuntan/lloco The official repo for "LLoCo: Learning Long Contexts Offline"	36	Emerging	llm-compression-optimization	118	Python
1904	R-D-BioTech-Alaska/Brain Brain is an innovative concept that combines Qelm with Nueron to harness the...	36	Emerging	quantum-nlp-processing	2	Python
1905	HenryHZY/Awesome-Multimodal-LLM Research Trends in LLM-guided Multimodal Learning.	36	Emerging	multimodal-vision-language-models	356	—
1906	vorobeevich/ml-snippets-classification The source code of "Machine learning code snippets semantic classification"...	36	Emerging	ml-foundations-curricula	12	Python
1907	pleisto/yuren-baichuan-7b 基于baichuan-7b的开源多模态大语言模型	36	Emerging	multilingual-llm-adaptation	72	Python
1908	surrey-nlp/PLOD-AbbreviationDetection This repository contains the PLOD Dataset for Abbreviation Detection...	36	Emerging	nlp-learning-coursework	12	Jupyter Notebook
1909	rese1f/aurora [ICLR 2025] AuroraCap: Efficient, Performant Video Detailed Captioning and a...	36	Emerging	image-captioning-transformers	139	Python
1910	TIGER-AI-Lab/MAmmoTH Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid...	36	Emerging	math-reasoning-datasets	383	Jupyter Notebook
1911	zhenyi4/ssa Official repository for "SSA: Sparse Sparse Attention by Aligning Full and...	36	Emerging	sparse-attention-optimization	10	Python
1912	ChuloAI/BrainChulo Harnessing the Memory Power of the Camelids	36	Emerging	multilingual-llm-adaptation	147	Python
1913	SeekingDream/DyCodeEval Official repository of the ICML2025 paper “Dynamic Benchmarking of Reasoning...	36	Emerging	math-reasoning-datasets	255	Python
1914	diogok/llama.cpp.zig A build.zig for llama.cpp	36	Emerging	local-llm-deployment	1	Zig
1915	RhinoDevel/mt_llm Pure C wrapper library to use llama.cpp with Linux and Windows as simple as...	36	Emerging	llm-docker-deployments	14	C++
1916	jaketae/param-share-transformer PyTorch implementation of Lessons on Parameter Sharing across Layers in Transformers	36	Emerging	machine-translation-transformers	26	Python
1917	jordandeklerk/SwinViT Modified Swin Transformer model in PyTorch on CIFAR-10 for image classification	36	Emerging	vit-image-classification	10	Python
1918	Am1n3e/active-learning-transformer A hands-on tutorial on how to use Active Learning with Transformer models.	36	Emerging	machine-translation-transformers	15	Jupyter Notebook
1919	frankaging/ReCOGS ReCOGS: How Incidental Details of a Logical Form Overshadow an Evaluation of...	36	Emerging	transformer-architecture-tutorials	10	Jupyter Notebook
1920	GiannakopoulosIlias/vision-transformer-network-for-mr-electrical-properties-tomography A 3D Vision Transformer-based neural network for reconstructing electrical...	36	Emerging	vision-transformer-implementations	9	Python
1921	SkyworkAI/MoE-plus-plus [ICLR 2025] MoE++: Accelerating Mixture-of-Experts Methods with...	36	Emerging	mixture-of-experts-llms	264	Python
1922	yinboc/trans-inr Transformers as Meta-Learners for Implicit Neural Representations, in ECCV 2022	36	Emerging	mixup-augmentation-frameworks	160	Python
1923	mrdbourke/mac-ml-speed-test A few quick scripts focused on testing TensorFlow/PyTorch/Llama 2 on macOS.	36	Emerging	apple-silicon-llm-inference	202	Jupyter Notebook
1924	avocardio/Zicklein Finetuning instruct-LLaMA on german datasets.	36	Emerging	lora-qlora-fine-tuning	33	Python
1925	alphasecio/llama-guard A web app for exploring content moderation with Llama Guard on Groq.	36	Emerging	llm-terminal-automation	1	Python
1926	m0dulo/InferSpore 🌱 A fully independent Large Language Model (LLM) inference engine, built...	36	Emerging	llm-inference-engines	32	Cuda
1927	lechmazur/writing This benchmark tests how well LLMs incorporate a set of 10 mandatory story...	36	Emerging	llm-benchmark-leaderboards	353	Batchfile
1928	general-preference/general-preference-model [ICML 2025] Beyond Bradley-Terry Models: A General Preference Model for...	36	Emerging	direct-preference-optimization	39	Python
1929	dreamingjudith/KoGPT2-personachat Fine-tuned KoGPT2 chatbot demo with translated PersonaChat (ongoing)	36	Emerging	gpt2-pretraining-fine-tuning	13	Jupyter Notebook
1930	Relaxed-System-Lab/Flash-Sparse-Attention 🚀🚀 Efficient implementations of Native Sparse Attention	36	Emerging	sparse-attention-optimization	983	Python
1931	bnosac/golgotha Contextualised Embeddings and Language Modelling using BERT and Friends using R	36	Emerging	bert-model-implementations	47	R
1932	dev-sufyaan/Nexlify Unified API platform for free access to enterprise-grade AI models from...	36	Emerging	local-llm-deployment	13	Python
1933	modal-labs/stopwatch A tool for benchmarking LLMs on Modal	36	Emerging	llm-evaluation-benchmarking	50	Python
1934	xuyang-liu16/GlobalCom2 [AAAI 2026] Global Compression Commander: Plug-and-Play Inference...	36	Emerging	llm-compression-optimization	39	Python
1935	AIFrameResearch/SPO Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL...	36	Emerging	rlhf-alignment-training	45	Python
1936	yifanzhang-pro/HLA Official Project Page for HLA: Higher-order Linear Attention...	36	Emerging	llm-knowledge-distillation	45	HTML
1937	FengheTan9/LLM4Seg [MICCAI 2025] Official code for "Pre-Trained LLM is a Semantic-Aware and...	36	Emerging	instruction-tuning-datasets	51	Python
1938	jqtangust/Robust-R1 🔥🔥🔥[AAAI 2026 Oral] Official Implementation of Robust-R1: Degradation-Aware...	36	Emerging	llm-reasoning-research	520	Python
1939	FSoft-AI4Code/CodeCapybara Open-source Self-Instruction Tuning Code LLM	36	Emerging	multilingual-llm-adaptation	172	Python
1940	moeru-ai/demodel 🚀🛸 Easily boost the speed of pulling your models and datasets from various...	36	Emerging	llm-inference-engines	10	Go
1941	NVlabs/RocketKV [ICML 2025] RocketKV: Accelerating Long-Context LLM Inference via Two-Stage...	36	Emerging	llm-quantization-methods	34	Python
1942	dobriban/Principles-of-AI-LLMs Materials for the course Principles of AI: LLMs at UPenn (Stat 9911, Spring...	36	Emerging	llm-training-experimentation	44	—
1943	BatsResearch/trove A Flexible Toolkit for Dense Retrieval	36	Emerging	power-transformer-design	44	Python
1944	twiecki/transpailer LLM-based, self-correcting transpiler, supports Jax, PyToch, Rust, PyMC, Stan	36	Emerging	rust-agent-frameworks	3	Python
1945	cankocagil/SwinDetr Integration of Swin Transformer to DETR for Robust Object Detection (DEMO)	36	Emerging	object-detection-transformers	30	Jupyter Notebook
1946	retarfi/language-pretraining Pre-training Language Models for Japanese	36	Emerging	bert-model-implementations	50	Python
1947	abcsys/libem Compound AI toolchain for fast and accurate entity matching, powered by LLMs.	36	Emerging	multilingual-llm-adaptation	26	Python
1948	devdhananjay14/multim 🔍 Experiment with neural networks for binary classification on multimodal...	36	Emerging	multimodal-fusion-transformers	1	Python
1949	harryjdavies/HeartGPT Interpretable Pre-Trained Transformers for Heart Time-Series Data	36	Emerging	academic-thesis-repositories	50	Python
1950	kaist-cvml/I-HallA-v1.0 [AAAI 2025] Official Implementation of I-HallA v1.0	36	Emerging	llm-hallucination-mitigation	13	Python
1951	shahrukhx01/siamese-nn-semantic-text-similarity A repository containing comprehensive Neural Networks based PyTorch...	36	Emerging	semantic-textual-similarity	53	Python
1952	kaistAI/Janus [NeurIPS 2024] Train LLMs with diverse system messages reflecting...	36	Emerging	rlhf-alignment-training	53	Python
1953	shahriargolchin/time-travel-in-llms The official repository for the paper entitled "Time Travel in LLMs: Tracing...	36	Emerging	llm-domain-datasets	12	Python
1954	chaitjo/gated-graph-transformers Transformers are Graph Neural Networks!	36	Emerging	graph-transformers	54	Python
1955	tlc4418/llm_optimization A repo for RLHF training and BoN over LLMs, with support for reward model ensembles.	36	Emerging	rlhf-alignment-training	47	Python
1956	Samyak-777/nomodel The world's most accurate LLM. It achieves 0% hallucination rate by...	36	Emerging	llm-finetuning-frameworks	3	Dockerfile
1957	gtausa197-svg/-Project-Nord-Spiking-Neural-Network-Language-Model The first pure SNN language model trained from scratch with a fully original...	36	Emerging	llm-framework-abstractions	35	Python
1958	msakarvadia/memorization Localizing Memorized Sequences in Language Models	36	Emerging	llm-interpretability-explainability	20	Jupyter Notebook
1959	kyegomez/PALI Democratization of "PaLI: A Jointly-Scaled Multilingual Language-Image Model"	36	Emerging	vision-language-models	94	Python
1960	abhisheknair10/llama3.cu Lightweight Llama 3 8B Inference Engine in CUDA C	36	Emerging	local-llm-deployment	54	Cuda
1961	HKUDS/SepLLM [ICML 2025] "SepLLM: Accelerate Large Language Models by Compressing One...	36	Emerging	diffusion-language-models	567	Python
1962	Mmorgan-ML/Neuromodulatory-Control-Networks Neuromodulatory Control Networks (NCNs), a novel LLM architectural...	36	Emerging	llm-provider-sdks	21	Python
1963	Anshita1Saxena/transformer_time_series_forecasting Transformers applied on Time Series Forecasting	35	Emerging	time-series-forecasting-transformers	8	Python
1964	Jathurshan0330/Cross-Modal-Transformer Official repository of cross-modal transformer for interpretable automatic...	35	Emerging	multimodal-fusion-transformers	75	Jupyter Notebook
1965	yashbonde/rasp Implementing RASP transformer programming language...	35	Emerging	browser-based-ml-inference	60	Python
1966	llcuda/llcuda CUDA 12-first backend inference for Unsloth on Kaggle — Optimized for small...	35	Emerging	llm-cuda-optimization	8	Jupyter Notebook
1967	vipulraheja/iterater Official implementation of the paper "IteraTeR: Understanding Iterative...	35	Emerging	llm-implementation-from-scratch	80	Python
1968	nikolaydubina/llama2.go LLaMA-2 in native Go	35	Emerging	local-llm-deployment	194	Go
1969	lfunderburk/automate-tech-post LLM application: fine tuned model to generate social media posts from...	35	Emerging	llm-training-experimentation	13	Jupyter Notebook
1970	andrewliao11/LongPerceptualThoughts [COLM'25] The official implementation of "LongPerceptualThoughts: Distilling...	35	Emerging	llm-reasoning-research	11	Python
1971	vbdi/divprune [CVPR 2025] DivPrune: Diversity-based Visual Token Pruning for Large...	35	Emerging	multimodal-vision-language	71	Python
1972	oshindutta/TVAprune [ICML 2024 Es-FoMo] - Efficient LLM Pruning with Global Token-Dependency...	35	Emerging	llm-pruning-compression	5	Python
1973	arshadshk/SAINT-pytorch SAINT PyTorch implementation	35	Emerging	transformer-architecture-tutorials	92	Python
1974	uncbiag/Awesome-Foundation-Models A curated list of foundation models for vision and language tasks	35	Emerging	multimodal-vision-language-models	1,149	—
1975	Cryolite/kanachan A Japanese (Riichi) Mahjong AI Framework	35	Emerging	ml-foundations-curricula	332	Python
1976	palonso/MAEST Pre-training, fine-tuning, and inference code with the MAEST models for...	35	Emerging	audio-classification-transformers	54	Python
1977	automorphic-ai/trex Enforce structured output from LLMs 100% of the time	35	Emerging	structured-output-enforcement	251	Python
1978	vietanhdev/llama-assistant-train Training Scripts for Llama Assistant: Your Local AI Assistant That Respects...	35	Emerging	llm-chatbot-applications	7	Jupyter Notebook
1979	TayeeChang/keras_transformers the implement of transformer family such as bert, alber, roberta, nezha, etc.	35	Emerging	bert-model-implementations	7	Python
1980	alibaba/easydist Automated Parallelization System and Infrastructure for Multiple Ecosystems	35	Emerging	llm-inference-engines	82	Python
1981	wxjiao/ParroT The ParroT framework to enhance and regulate the Translation Abilities...	35	Emerging	multilingual-llm-adaptation	176	Python
1982	Ankur3107/nlp_notebooks Tensorflow, Pytorch, Huggingface Transformer, Fastai, etc. tutorial Colab Notebooks.	35	Emerging	huggingface-learning-resources	78	Jupyter Notebook
1983	kyegomez/MambaDecoderBlock MambaDecoderBlock is a novel decoder architecture that replaces traditional...	35	Emerging	3d-vision-transformers	5	Python
1984	Airmomo/transformers-docs-zh 【持续更新中】完全中文版的 Transformers 学习笔记及演示示例，支持 Jupyter Notebook，主要内容来自 🤗 Hugging...	35	Emerging	huggingface-learning-resources	71	—
1985	EvanZhouDev/llm.pdf Run LLMs inside a PDF file.	35	Emerging	pdf-qa-systems	755	Python
1986	TideDra/VL-RLHF A RLHF Infrastructure for Vision-Language Models	35	Emerging	rlhf-alignment-training	198	Python
1987	OnlyTerp/turboquant First open-source implementation of Google TurboQuant (ICLR 2026) --...	35	Emerging	kv-cache-optimization	36	Python
1988	wangcongcong123/ttt A package for fine-tuning Transformers with TPUs, written in Tensorflow2.0+	35	Emerging	tokenizer-libraries	37	Python
1989	astrobleem/Simple-StableLM-Chat This is a very simple python app that you can use to get up and chatting...	35	Emerging	multi-provider-llm-interfaces	16	Python
1990	uakarsh/latr Implementation of LaTr: Layout-aware transformer for scene-text VQA,a novel...	35	Emerging	vision-transformer-optimization	56	Python
1991	tranquoctrinh/transformer This is a PyTorch implementation of the Transformer model in the paper...	35	Emerging	attention-mechanism-implementations	37	Python
1992	hoof-ai/hoof "Just hoof it!" - A spotlight like interface to Ollama	35	Emerging	local-llm-deployment	63	Rust
1993	ntt-dkiku/route-explainer The official implementation of "RouteExplainer: An Explanation Framework for...	35	Emerging	llm-interpretability-explainability	17	Python
1994	Mya-Mya/CBF-LLM "CBF-LLM: Safe Control for LLM Alignment"	35	Emerging	llm-evaluation-benchmarking	12	Python
1995	BaohaoLiao/RSD [ICML 2025] Reward-guided Speculative Decoding (RSD) for efficiency and...	35	Emerging	speculative-decoding-algorithms	56	Python
1996	sail-sg/Attention-Sink [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical...	35	Emerging	diffusion-language-models	159	Python
1997	BUAADreamer/SPN4CIR [ACM MM 2024] Improving Composed Image Retrieval via Contrastive Learning...	35	Emerging	clip-image-embeddings	39	Python
1998	OatmealLiu/FineR [ICLR'24] Democratizing Fine-grained Visual Recognition with Large Language Models	35	Emerging	llm-knowledge-distillation	190	Python
1999	frankluise5220/ComfyUI-Lorahelper A professional automation toolkit for ComfyUI to prepare LoRA training data...	35	Emerging	lora-qlora-fine-tuning	10	Python
2000	CogitoNTNU/course-on-large-language-models This is a course on how to to program with Large Language Models.	35	Emerging	generative-ai-learning	10	Jupyter Notebook

« Prev 1 2 3 … 18 19 20 21 22 … 76 77 78 Next »