All Transformer Models

7,795 models ranked by quality score · Page 23 of 78

Showing 2201–2300 of 7,795

#	Model	Score	Tier	Category	Stars	Language
2201	michaelnny/QLoRA-LLM A simple custom QLoRA implementation for fine-tuning a language model (LLM)...	34	Emerging	lora-qlora-fine-tuning	10	Python
2202	johndpope/OmniTransfer-hack OmniTransfer implementation for LTX-2 (work in progress)	34	Emerging	vision-transformer-optimization	7	Python
2203	liaoyuhua/LLM4TS Large Language & Foundation Models for Time Series.	34	Emerging	multimodal-vision-language-models	560	—
2204	steinbergmedia/libmusictok C++ Library for tokenizing MIDI files, designed to be compatible with the...	34	Emerging	music-generation-transformers	46	C++
2205	OneInterface/realtime-bakllava llama.cpp with the BakLLaVA model describing what it sees	34	Emerging	local-llm-deployment	379	Python
2206	zerovl/ZeroVL [ECCV2022] Contrastive Vision-Language Pre-training with Limited Resources	34	Emerging	vision-language-models	46	Python
2207	Adversing/hf-model-checker A tool to analyze HuggingFace models and determine their compatibility with...	34	Emerging	ml-benchmarking-frameworks	10	Python
2208	jaygala24/fed-hate-speech The official code repository for the paper titled "A Federated Approach for...	34	Emerging	hate-speech-detection	10	Python
2209	nanowell/Differential-Transformer-PyTorch PyTorch implementation of the Differential-Transformer architecture for...	34	Emerging	transformer-architecture-education	86	Python
2210	RLHFlow/Online-RLHF A recipe for online RLHF and online iterative DPO.	34	Emerging	rlhf-alignment-training	543	Python
2211	google/curie Code release for "CURIE: Evaluating LLMs On Multitask Scientific Long...	34	Emerging	math-reasoning-datasets	29	Jupyter Notebook
2212	Sunona-AI-labs/sunona Sunona: Next-generation voice AI infrastructure. Orchestrate intelligent,...	34	Emerging	conversational-chatbot-applications	15	Python
2213	Kaleidophon/nlp-uncertainty-zoo Model zoo for different kinds of uncertainty quantification methods used in...	34	Emerging	power-transformer-design	55	Python
2214	CEC-Agent/CEC Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for...	34	Emerging	power-transformer-design	31	Python
2215	moritztng/fltr Like grep but for natural language questions. Based on Mistral 7B or Mixtral 8x7B.	34	Emerging	local-llm-deployment	387	Rust
2216	mcp-tool-shop-org/backpropagate Headless LLM fine-tuning in 3 lines — smart defaults, VRAM-aware batch...	34	Emerging	llm-finetuning-frameworks	1	Python
2217	kkahatapitiya/LangRepo Code for our ACL 2025 paper "Language Repository for Long Video Understanding"	34	Emerging	multimodal-vision-language	36	Python
2218	suyash/mlt Multilingual Neural Machine Translation using Transformers with Conditional...	34	Emerging	neural-machine-translation	18	Jupyter Notebook
2219	hesamsheikh/llm-mechanics Coding an LLM and its building blocks from scratch.	34	Emerging	llm-implementation-tutorials	116	Jupyter Notebook
2220	florist-notes/aicore_n Artificial Intelligence > Machine Learning > Deep Learning	34	Emerging	ml-foundations-curricula	5	Python
2221	PKU-Alignment/beavertails BeaverTails is a collection of datasets designed to facilitate research on...	34	Emerging	rlhf-alignment-training	176	Makefile
2222	starmpcc/CAMEL Clinically Adapted Model Enhanced from LLaMA	34	Emerging	multilingual-llm-adaptation	89	Python
2223	Hamtech-ai/Persian-Image-Captioning A Persian Image Captioning model based on Vision Encoder Decoder Models of...	34	Emerging	image-captioning-transformers	20	Jupyter Notebook
2224	18907305772/Explore-Instruct EMNLP'2023: Explore-Instruct: Enhancing Domain-Specific Instruction Coverage...	34	Emerging	instruction-tuning-datasets	5	Python
2225	hpdps-group/ElasticMM ElasticMM: Elastic and Efficient MLLM Serving System	34	Emerging	llm-inference-serving	41	Python
2226	JessicaLopezEspejel/HazPi HazPi is a modified Transformer (Vaswani et al., 2017) neural network...	34	Emerging	text-summarization-transformers	3	Python
2227	GURPREETKAURJETHRA/Llama-3-ORPO-Fine-Tuning Llama 3 ORPO Fine Tuning on A100 in Colab Pro.	34	Emerging	llm-fine-tuning	4	Jupyter Notebook
2228	holarissun/RewardModelingBeyondBradleyTerry official implementation of ICLR'2025 paper: Rethinking Bradley-Terry Models...	34	Emerging	rlhf-alignment-training	71	Python
2229	egaoharu-kensei/flash-attention-triton Cross-platform FlashAttention-2 Triton implementation for Turing+ GPUs with...	34	Emerging	sparse-attention-optimization	21	Python
2230	nestordemeure/stop_word Huggingface transformers stopping criteria that halts the generation when a...	34	Emerging	huggingface-learning-resources	9	Python
2231	deep-div/PlotLLM Data Visualization with LLM automatically analyzes data and generates...	34	Emerging	llm-data-visualization	7	Jupyter Notebook
2232	StargazerX0/ScaleKV [NeurIPS 2025] ScaleKV: Memory-Efficient Visual Autoregressive Modeling with...	34	Emerging	llm-quantization-methods	50	Python
2233	DunnBC22/Vision_Audio_and_Multimodal_Projects This repository includes all computer vision, audio, document AI, and...	34	Emerging	multimodal-fusion-transformers	51	Jupyter Notebook
2234	Beomi/BitNet-Transformers 0️⃣1️⃣🤗 BitNet-Transformers: Huggingface Transformers Implementation of...	34	Emerging	llm-quantization-methods	313	Python
2235	hhy-huang/GraphJudge [EMNLP'25 main] This is the official repo for the paper, Can LLMs be Good...	34	Emerging	llm-knowledge-graph-generation	27	Python
2236	asigalov61/Giant-Music-Transformer [SOTA] [92% acc] 786M-8k-44L-32H multi-instrumental music transformer with...	34	Emerging	music-similarity-embeddings	87	Python
2237	CristiVlad25/ai-papers Tracing the evolution of AI and large language models from early neural...	34	Emerging	llm-training-experimentation	10	—
2238	wang2226/Awesome-LLM-Decoding 📜 Paper list on decoding methods for LLMs and LVLMs	34	Emerging	llm-research-curation	70	—
2239	fboulnois/llm-leaderboard-csv CSVs of the Huggingface and LMArena LLM leaderboards, along with the code to...	34	Emerging	llm-benchmark-leaderboards	30	Python
2240	Gen-Verse/ReasonFlux [NeurIPS 2025 Spotlight] LLM post-training suite — featuring ReasonFlux,...	34	Emerging	code-model-training	524	Python
2241	llmapi-io/llmapi-cli Command-line client and python development library for accessing LLM's...	34	Emerging	llm-terminal-automation	9	Python
2242	SqueezeAILab/KVQuant [NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with...	34	Emerging	llm-quantization-methods	406	Python
2243	bayartsogt-ya/albert-mongolian ALBERT trained on Mongolian text corpus	34	Emerging	korean-language-models	18	Jupyter Notebook
2244	kingabzpro/French-to-Fongbe-and-Ewe-MT The objective of this challenge is to create a machine translation system...	34	Emerging	neural-machine-translation	9	Jupyter Notebook
2245	tugot17/Discord-Language-Detection-Bot Restrict the use of forbidden languages on your discord server with transformers	34	Emerging	messaging-platform-chatbots	3	Python
2246	VITA-Group/Ms-PoE "Found in the Middle: How Language Models Use Long Contexts Better via...	34	Emerging	diffusion-language-models	31	Python
2247	CASE-Lab-UMD/Router-Tuning-Mixture-of-Depths The open-source Mixture of Depths code and the official implementation of...	34	Emerging	mixture-of-experts-llms	28	Python
2248	bobazooba/xllm-demo Demo project using XLLM	34	Emerging	conversational-chatbot-applications	10	Python
2249	DAMO-NLP-SG/multilingual-safety-for-LLMs [ICLR 2024] Data for "Multilingual Jailbreak Challenges in Large Language Models"	34	Emerging	jailbreak-attacks-analysis	101	—
2250	asahi417/lm-vocab-trimmer Vocabulary Trimming (VT) is a model compression technique, which reduces a...	33	Emerging	llm-compression-optimization	63	Python
2251	Scicrop/llm-vision-basics Educational notebooks that demystify Large Language Models and Computer...	33	Emerging	defect-detection-quality-forensics	18	Jupyter Notebook
2252	SuperBianC/scMulan Repository for paper scMulan: a multitask generative pre-trained language...	33	Emerging	llm-learning-resources	62	Jupyter Notebook
2253	JoelDeonDsouza/Zenpool_LLM Zenpool is a compact, fine-tuned MLL (Mini Language Learner) model	33	Emerging	llm-implementation-tutorials	5	Jupyter Notebook
2254	lpalbou/AbstractLLM A unified interface for Large Language Models with memory, reasoning, and...	33	Emerging	llm-evaluation-benchmarking	2	Python
2255	rafalposwiata/depression-detection-lt-edi-2022 This repository contains the code of our winning solution for the Shared...	33	Emerging	emotion-detection-transformers	27	Python
2256	deepmancer/advanced-recommender-system Advanced information retrieval system that combines advanced indexing,...	33	Emerging	recommendation-systems-transformers	11	Jupyter Notebook
2257	AchiraNadeeshan/social-activity-job-matcher PathFinder is a job recommendation web application that allows users to...	33	Emerging	resume-job-matching	1	JavaScript
2258	GURPREETKAURJETHRA/Multi-GPU-Fine-Training-LLMs Multi GPU Fine Training LLMs using DeepSpeed and Accelerate.	33	Emerging	llm-implementation-tutorials	2	Jupyter Notebook
2259	YuanGongND/ltu Code, Dataset, and Pretrained Models for Audio and Speech Large Language...	33	Emerging	llm-scaling-architecture	472	Python
2260	daskol/llama.py Python bindings to llama.cpp	33	Emerging	local-llm-deployment	27	C
2261	adithya-s-k/CompanionLLM CompanionLLM - A framework to finetune LLMs to be your own sentient...	33	Emerging	lora-qlora-fine-tuning	50	Jupyter Notebook
2262	microsoft/MMLU-CF A Contamination-free Multi-task Language Understanding Benchmark [Official, ACL 2025]	33	Emerging	llm-interpretability-explainability	123	—
2263	maciekt07/Lecture-Note-Generator-POC 📒 A proof-of-concept app that transcribes lecture recordings into text and...	33	Emerging	text-to-speech-tts	4	TypeScript
2264	CodingPlatelets/transformer_MM Accelerator for LLM Based on Chisel3	33	Emerging	llm-cuda-optimization	12	Scala
2265	davzoku/cria An end-to-end LLM app prototype based on Llama 2	33	Emerging	interactive-ai-chat-uis	6	TypeScript
2266	hasanisaeed/C-Transformer Implementation of the core Transformer architecture in pure C	33	Emerging	transformer-architecture-tutorials	8	C
2267	IIT-DM/BattleofLLMs Benchmarks of LLMs with Conversational QA datasets.	33	Emerging	llm-evaluation-benchmarking	6	Python
2268	SachinKalsi/annotated-research-papers This repository is a comprehensive collection of research papers,...	33	Emerging	nlp-learning-coursework	6	—
2269	isaacus-dev/emubert-creator The training code behind EmuBert, the largest open-source masked language...	33	Emerging	bert-model-implementations	3	Python
2270	JonnoB/training_lms_with_synthetic_data A repo for training Language models to correct errors in OCR text	33	Emerging	llm-evaluation-benchmarking	2	Python
2271	zatevakhin/obsidian-local-llm Obsidian Local LLM is a plugin for Obsidian that provides access to a...	33	Emerging	local-llm-deployment	135	TypeScript
2272	GiorgiaAuroraAdorni/gansformer-reproducibility-challenge Replication of the novel Generative Adversarial Transformer.	33	Emerging	multimodal-fusion-transformers	3	Dockerfile
2273	SertraFurr/DuckDuckAI Python API Wrapper to interact with DuckDuckAI	33	Emerging	interactive-ai-chat-uis	7	Python
2274	XavierSpycy/hands-on-lora Explore practical fine-tuning of LLMs with Hands-on Lora. Dive into examples...	33	Emerging	llm-fine-tuning	8	—
2275	krishnapriya-18/COVID-19-Tweet-Classification-using-Roberta-and-Bert-Simple-Transformers Rank 1 / 216	33	Emerging	text-classification-transformers	28	Jupyter Notebook
2276	martin-wey/CodeUltraFeedback CodeUltraFeedback: aligning large language models to coding preferences (TOSEM 2025)	33	Emerging	math-reasoning-datasets	73	Python
2277	HKUDS/RecLM [ACL2025] "RecLM: Recommendation Instruction Tuning"	33	Emerging	llm-recommendation-systems	109	Python
2278	ISNE11/CheatSheet-LLM Run local Large Language Models (LLMs) offline using Ollama – interact with...	33	Emerging	nlp-fundamentals-tutorials	1	Python
2279	Srijan-D/LangChain-v0.2-HuggingFace-Llama3 This project integrates LangChain v0.2.6, HuggingFace Serverless Inference...	33	Emerging	prompt-engineering-security	5	Python
2280	zake7749/Kyara [Kaggle-2nd] Lightweight yet Effective Chinese LLM.	33	Emerging	llm-frameworks-libraries	53	Jupyter Notebook
2281	NotYuSheng/Multimodal-Large-Language-Model Localized Multimodal Large Language Model (MLLM) integrated with Streamlit...	33	Emerging	multimodal-vision-language-models	5	Python
2282	wangcongcong123/transection Transection: Transformers for English to Chinese Translation	33	Emerging	neural-machine-translation	6	Python
2283	yingding/applyllm A python package for applying LLM with LangChain and Hugging Face on local...	33	Emerging	llm-inference-engines	2	Jupyter Notebook
2284	DoubleVII/lithft Pretrain, finetune any LLMs from huggingface on your own data.	33	Emerging	llm-fine-tuning	4	Python
2285	micahondiwa/applied-ai Deep Learning for Computer Vision: A collection of 6 end-to-end applied AI...	33	Emerging	ml-foundations-curricula	5	Jupyter Notebook
2286	caua1503/llm-tool-fusion llm-tool-fusion is a Python library that unifies and simplifies the use of...	33	Emerging	llm-function-calling	3	Python
2287	TheAnkurGoswami/Neural-Networks-from-Scratch Implementation of different neural networks with back-propagation logic.	33	Emerging	ml-foundations-curricula	3	Python
2288	rabiloo/llm-finetuning Sample for Fine-Tuning LLMs & VLMs	33	Emerging	lora-qlora-fine-tuning	2	Python
2289	gabe00122/jaxrl Partially Observable Multi-Agent RL with Transformers	33	Emerging	transformer-frameworks-wrappers	17	Python
2290	lennartpollvogt/ollama-instructor Python library for the instruction and reliable validation of structured...	33	Emerging	llm-docker-deployments	77	Python
2291	black-roland/homeassistant-cloud-ru-ai Cloud.ru Foundation Models — cloud-based AI assistants for Home Assistant	33	Emerging	conversational-chatbot-applications	10	Python
2292	KRR-Oxford/LLMap-Prelim A preliminary investigation for ontology alignment (OM) with large language...	33	Emerging	llm-domain-datasets	5	Python
2293	levashi/reprobe Phase-aware LLM activation steering and linear probing. A memory-efficient,...	33	Emerging	mathematical-reasoning-transformers	2	Python
2294	gunnarnordqvist/opencode-context-filter Transparent HTTP proxy that automatically filters repository context for...	33	Emerging	llm-inference-engines	2	Python
2295	yonahgraphics/openevalkit Production-grade Python framework for evaluating LLM and agentic systems...	33	Emerging	llm-evaluation-platforms	3	Python
2296	Naman-ntc/FastCode Utilities for efficient fine-tuning, inference and evaluation of code...	33	Emerging	transformer-training-optimization	21	Python
2297	dhpollack/huggingface_libtorch Minimal example of using a traced huggingface transformers model with libtorch	33	Emerging	machine-translation-transformers	35	C++
2298	sajjjadayobi/ParsBigBird Persian Bert For Long-Range Sequences	33	Emerging	korean-language-models	63	Jupyter Notebook
2299	shizhouxing/Robustness-Verification-for-Transformers [ICLR 2020] Code for paper "Robustness Verification for Transformers"	33	Emerging	graph-transformers	27	Python
2300	aniass/Spam-detection Spam detection in SMS messages with BERT model and Machine Learning algorithms	33	Emerging	spam-detection-transformers	22	Jupyter Notebook
