All Transformer Models

7,795 models ranked by quality score · Page 24 of 78

Showing 2301–2400 of 7,795

« Prev Next »

#	Model	Score	Tier	Category	Stars	Language
2301	guoriyue/LangCommand LangCommand is a local inference command-line tool that transforms natural...	33	Emerging	llm-terminal-automation	118	C++
2302	AlgonetLabs/Cable Context-aware Biases for Length Extrapolation	33	Emerging	diffusion-language-models	22	Python
2303	thinkall/featcopilot Next-generation LLM-powered auto feature engineering framework	33	Emerging	data-pipeline-frameworks	3	Python
2304	xinyanghuang7/Basic-Visual-Language-Model Build a simple basic multimodal large model from scratch. 从零搭建一个简单的基础多模态大模型🤖	33	Emerging	multimodal-vision-language	47	Python
2305	Anjum48/commonlitreadabilityprize 4th Place solution for the Kaggle CommonLit Readability Prize	33	Emerging	essay-scoring-grading	38	Jupyter Notebook
2306	Chunjiang-Intelligence/Credal-Transformer 论文「Credal Transformer: A Principled Approach for Quantifying and Mitigating...	33	Emerging	transformer-implementation-education	12	Python
2307	cgjosephlee/ollama-save-load Save and load ollama models just like operating docker images.	33	Emerging	local-llm-deployment	26	Python
2308	Kitsunp/Prueba-de-modelo-de-ByteLatentTransformer Este es una prueba de concepto del paper mencionado de Meta junto a otros...	33	Emerging	llm-scaling-architecture	8	Python
2309	pat-jj/KG-FIT [NeurIPS'24] Knowledge Graph Fine-Tuning using LLMs	33	Emerging	llm-knowledge-graph-generation	130	Python
2310	LunjunZhang/ema-pg Code for "EMA Policy Gradient: Taming Reinforcement Learning for LLMs with...	33	Emerging	rlhf-alignment-training	8	Python
2311	andreped/vit-explainer 🔥 Demonstrating Explainable AI with Vision Transformer in web app	33	Emerging	transformer-interpretability-mechanistic	3	Python
2312	rakibnsajib/MediBot-AI-Doctor-with-Vision-and-Voice AI-powered medical assistant using LLaMA-3.2-11B-Vision, Whisper, and...	33	Emerging	therapeutic-chatbot-applications	8	Python
2313	kmaurinjones/AllMeans Automatic topic modelling using minimal external input and computational resources	33	Emerging	text-clustering-topic-modeling	2	Python
2314	VITA-Group/TAPE [ICML'25] "Rethinking Addressing in Language Models via Contextualized...	33	Emerging	diffusion-language-models	14	Python
2315	parameterlab/apricot Source code of "Calibrating Large Language Models Using Their Generations...	33	Emerging	llm-interpretability-explainability	22	Jupyter Notebook
2316	swainshashwat/Flock Craft custom Language Model Models (LLMs) effortlessly using Flock. Build...	33	Emerging	llm-benchmark-leaderboards	4	Jupyter Notebook
2317	cocacola-lab/Awesome-Transformer-in-Transportation Papers & resources linked to Transformer-based research mainly for...	33	Emerging	multimodal-vision-language-models	6	—
2318	siwei-li/NLP_summarization Summarization of lecture video transcripts using BERT.	33	Emerging	text-summarization-transformers	3	Jupyter Notebook
2319	franckalbinet/iomeval Streamline evaluation evidence mapping at scale with LLMs	33	Emerging	evaluation-frameworks-metrics	1	Jupyter Notebook
2320	martin-wey/cl-code-apis Replication package of the paper "On the Usage of Continual Learning for...	33	Emerging	code-model-training	5	Python
2321	haesleinhuepf/vlm-pictionary Play pictionary with Vision Language Models!	33	Emerging	multimodal-vision-language	6	Jupyter Notebook
2322	InquestGeronimo/tllm An LLM training library for instruction-tuning.	33	Emerging	llm-frameworks-libraries	26	Python
2323	AlenVelocity/langchain-llama Run LLAMA LLMs in Node with Langchain	33	Emerging	local-llm-deployment	39	TypeScript
2324	nightdessert/Retrieval_Head open-source code for paper: Retrieval Head Mechanistically Explains...	33	Emerging	llm-bias-evaluation	236	Python
2325	uiuctml/Localize-and-Stitch Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic	33	Emerging	diffusion-language-models	32	Python
2326	markendo/downscaling_intelligence Downscaling Intelligence: Exploring Perception and Reasoning Bottlenecks in...	33	Emerging	defect-detection-quality-forensics	25	Python
2327	yuecao0119/MMFuser The official implementation of the paper "MMFuser: Multimodal Multi-Layer...	33	Emerging	multimodal-vision-language	64	Python
2328	PromptMixerDev/prompt-mixer-ollama-connector Ollama Connector	33	Emerging	interactive-ai-chat-uis	3	JavaScript
2329	jianzhnie/LLMToolkit LLMToolkit is a toolkit for NLP(Natural Language Processing) and LLM(Large...	33	Emerging	llm-fine-tuning	6	Python
2330	hollobit/GenAI_LLM_timeline ChatGPT, GenerativeAI and LLMs Timeline	33	Emerging	prompt-engineering-security	956	—
2331	GURPREETKAURJETHRA/LLaMA3-Quantization LLaMA3-Quantization	33	Emerging	llm-quantization-techniques	3	Python
2332	sanjaradylov/moleculegen-ml Generate novel molecules using neural language models	33	Emerging	molecular-generation-transformers	5	Python
2333	HariomJangra/project-lumen A 128M parameter language model built from scratch for learning how large...	33	Emerging	llm-frameworks-libraries	8	Jupyter Notebook
2334	yang-ai-lab/OSF-Open-Sleep-FM OSF: On Pre-training and Scaling of Sleep Foundation Models	33	Emerging	llm-knowledge-distillation	10	Jupyter Notebook
2335	actypedef/ARCQuant Code for the paper "ARCQuant: Boosting NVFP4 Quantization with Augmented...	33	Emerging	llm-quantization-techniques	18	Cuda
2336	josStorer/llama.cpp-unicode-windows llama.cpp with unicode (windows) support	33	Emerging	interactive-ai-chat-uis	54	C
2337	AshishGautamX/K8s-LLM-Scheduler An intelligent Kubernetes scheduler powered by Meta's Llama-3.3-70B model...	33	Emerging	llm-inference-engines	2	Python
2338	stchakwdev/kan_transformer Baantu Research: Hybrid KAN-Transformer for investigating learnable...	33	Emerging	kolmogorov-arnold-networks	6	Python
2339	yaojin17/Unlearning_LLM [ACL 2024] Code and data for "Machine Unlearning of Pre-trained Large...	33	Emerging	rlhf-alignment-training	66	Python
2340	UCDvision/NOLA Code for NOLA, an implementation of "nola: Compressing LoRA using Linear...	33	Emerging	lora-qlora-fine-tuning	57	Python
2341	mkofinas/neural-graphs Official source code for "Graph Neural Networks for Learning Equivariant...	33	Emerging	graph-transformers	82	Python
2342	horseee/LLaMA-Pruning Structural Pruning for LLaMA	33	Emerging	llm-pruning-compression	54	Python
2343	sail-sg/dice Official implementation of Bootstrapping Language Models via DPO Implicit Rewards	33	Emerging	direct-preference-optimization	47	Python
2344	Beomi/KcBERT-Finetune KcBERT/KcELECTRA Fine Tune Benchmarks code (forked from...	33	Emerging	bert-model-implementations	47	Python
2345	tanulsingh/Humour.ai-Language-model-that-can-crack-Jokes Language Model that makes you Laugh .	33	Emerging	gpt2-pretraining-fine-tuning	41	Python
2346	duyhominhnguyen/Exgra-Med [NeurIPS 2025] ExGra-Med: Medical Multi-Modal LLM with Extended Context Alignment	33	Emerging	clinical-llm-tools	41	Python
2347	tanishqgautam/Image-Captioning Implemented 3 different architectures to tackle the Image Caption problem,...	33	Emerging	image-captioning-transformers	40	Jupyter Notebook
2348	psmarter/mini-infer A high-performance LLM inference engine with PagedAttention \|...	33	Emerging	llm-inference-engines	61	Python
2349	rkinas/reasoning_models_how_to This repository serves as a collection of research notes and resources on...	33	Emerging	llm-reasoning-research	132	Python
2350	microsoft/InteractiveTextGeneration An implementation of the paper "Interactive Text Generation"	33	Emerging	creative-text-generation	4	Python
2351	kyegomez/DifferentialTransformer An open source community implementation of the model from "DIFFERENTIAL...	33	Emerging	power-transformer-design	39	Python
2352	omron-sinicx/crystalformer The official code respository for "Crystalformer: Infinitely Connected...	33	Emerging	graph-transformers	27	Python
2353	UCSB-NLP-Chang/ULD Implementation of paper 'Reversing the Forget-Retain Objectives: An...	33	Emerging	llm-implementation-from-scratch	26	Python
2354	codewithdark-git/QuantLLM QuantLLM is a Python library designed for developers, researchers, and teams...	33	Emerging	llm-quantization-methods	13	Python
2355	Gapi505/Sparky-2 This is a discord bot running on llama cpp with the llama 3 model and image...	33	Emerging	messaging-platform-chatbots	5	Python
2356	ananttripathi/Resume-Analyzer-MLOps Resume Analyzer is an AI-powered MLOps platform that optimizes your resume...	33	Emerging	resume-job-matching	6	Python
2357	bloomberg/minilmv2.bb Our open source implementation of MiniLMv2...	33	Emerging	llm-implementation-from-scratch	61	Python
2358	smitkiri/news-qa Reading comprehension based question-answering model for news articles.	33	Emerging	question-answering-systems	11	Jupyter Notebook
2359	Esmail-ibraheem/Tinyllamas-pytorch Tinyllamas🦙 is an Extensible advanced language model framework, inspired by...	33	Emerging	llama-model-implementations	6	Python
2360	SAP-samples/btp-running-language-models This repository contains different code examples around the topic of...	33	Emerging	llm-learning-resources	2	Jupyter Notebook
2361	poloclub/tsr-convstem High-Performance Transformers for Table Structure Recognition Need Early Convolutions	33	Emerging	academic-thesis-repositories	45	Python
2362	nicolay-r/Reasoning-for-Sentiment-Analysis-Framework The official code for CoT / ZSL reasoning framework 🧠, utilized in paper:...	33	Emerging	chain-of-thought-reasoning	4	Python
2363	MLD3/steerability An open-source evaluation framework for measuring LLM steerability.	33	Emerging	llm-bias-evaluation	4	Jupyter Notebook
2364	andreped/INF1600-ai-workshop 🔥 Workshop in AI Deployment (INF-1600, UiT)	33	Emerging	ml-foundations-curricula	1	Python
2365	jseeio/gpt2-tfjs GPT2 with Tensorflow.js	33	Emerging	gpt2-pretraining-fine-tuning	4	JavaScript
2366	songxiaoshuai/progco Official Implementation of "ProgCo: Program Helps Self-Correction of Large...	33	Emerging	llm-interpretability-explainability	5	Python
2367	bipinKrishnan/ml-recipe-book A book containing step by step instructions to train deep learning models...	33	Emerging	ml-foundations-curricula	37	HTML
2368	ApplyU-ai/ColorBlindnessEval ColorBlindnessEval: Can Vision Language Models Pass Color Blindness Tests?	33	Emerging	domain-specific-benchmarks	4	—
2369	Wang-ML-Lab/multimodal-needle-in-a-haystack [NAACL 2025 Oral] Multimodal Needle in a Haystack (MMNeedle): Benchmarking...	33	Emerging	multimodal-vision-language	54	Python
2370	richouzo/hate-speech-detection-survey Trained Neural Networks (LSTM, HybridCNN/LSTM, PyramidCNN, Transformers,...	33	Emerging	hate-speech-detection	21	Jupyter Notebook
2371	adapter-hub/efficient-task-transfer Research code for "What to Pre-Train on? Efficient Intermediate Task...	33	Emerging	parameter-efficient-adapters	37	Python
2372	UBC-MDS/fixml LLM Tool for effective test evaluation of ML projects with curated...	33	Emerging	llm-comparison-evaluation	4	Python
2373	GURPREETKAURJETHRA/LLMs-Evaluation LLMs Evaluation	33	Emerging	evaluation-frameworks-metrics	3	Jupyter Notebook
2374	cosmoquester/transformers-tf-finetune Scripts to finetune huggingface transformers models with Tensorflow 2	33	Emerging	transformer-frameworks-wrappers	8	Python
2375	asigalov61/Lars-Ulrich-Transformer [DEPRECIATED] [339M] [88% acc] Fast full-featured drums inpainting...	33	Emerging	ai-music-generation	8	Python
2376	ROIM1998/APT [ICML'24 Oral] APT: Adaptive Pruning and Tuning Pretrained Language Models...	33	Emerging	llm-knowledge-distillation	47	Python
2377	Lanerra/reasoning-bank-slm An experiment that applies Google Research's `ReasoningBank` technique to...	33	Emerging	llm-reasoning-research	99	Python
2378	submarat/removing-layer-norm Transformers Don’t Need LayerNorm at Inference Time	33	Emerging	transformer-architecture-education	3	Python
2379	chrisjob1021/transformer-examples A collection of educational toy implementations and examples of key...	33	Emerging	transformer-architecture-education	3	Jupyter Notebook
2380	anyscale/llm-router Tutorial for building LLM router	33	Emerging	llm-request-routing	246	Python
2381	zjunlp/LightThinker [EMNLP 2025] LightThinker: Thinking Step-by-Step Compression	33	Emerging	neural-data-compression	134	Python
2382	avsrma/LLM-based-AI-Assistant A general purpose AI voice assistant built using GPT-4.	33	Emerging	conversational-chatbot-applications	33	Python
2383	yotamnahum/DNA-Data-Storage Single Read Reconstruction for DNA Data Storage Using Transformers (official...	33	Emerging	protein-transformers-ml	5	Python
2384	declare-lab/TEAM Our EMNLP 2022 paper on MCQA	33	Emerging	question-answering-systems	23	Python
2385	xuanlinli17/large_vlm_distillation_ood Distilling Large Vision-Language Model with Out-of-Distribution...	33	Emerging	domain-adaptation-frameworks	61	Python
2386	WooooDyy/BAPO Codes for the paper "BAPO: Stabilizing Off-Policy Reinforcement Learning for...	33	Emerging	rlhf-alignment-training	91	Python
2387	ZigeW/data_management_LLM Collection of training data management explorations for large language models	33	Emerging	llm-scaling-architecture	337	—
2388	mcbal/deep-implicit-attention Implementation of deep implicit attention in PyTorch	33	Emerging	transformer-architecture-tutorials	65	Python
2389	BIDS-Xu-Lab/Me-LLaMA A novel medical large language model family with 13/70B parameters, which...	33	Emerging	multilingual-llm-adaptation	167	Python
2390	telekom/transformer-tools Transformers Training Tools	33	Emerging	transformer-architecture-tutorials	6	Python
2391	YunzeMan/Lexicon3D [NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D...	33	Emerging	multimodal-vision-language	100	Python
2392	Nondzu/LlamaTor LlamaTor: Decentralized AI model sharing via BitTorrent for efficient,...	33	Emerging	llama-model-implementations	58	Python
2393	crux82/u-deppllama Dependency parsing with Large Language Models	33	Emerging	llm-training-experimentation	5	Python
2394	monk1337/NanoPeft The simplest repository & Neat implementation of different Lora methods for...	33	Emerging	lora-qlora-fine-tuning	7	Jupyter Notebook
2395	Vitgracer/DinoV3-Object-Tracking Object tracking using the DINOv3 model.	33	Emerging	object-detection-transformers	5	Python
2396	elephantmipt/compressors A small library with distillation, quantization and pruning pipelines	33	Emerging	llm-quantization-methods	26	Python
2397	Marvin-VW/python-ollama-local This Python script enables hands-free interaction with a local Llama2...	33	Emerging	ollama-chat-interfaces	3	Python
2398	Orlando-CS/Awesome-VLA ✨✨latest advancements in VLA models(VIsion Language Action)	33	Emerging	multimodal-vision-language-models	109	—
2399	srsawant34/efficient_instruction_learning Code base for the paper "Instruction Tuned Models are Quick Learners".	33	Emerging	lora-qlora-fine-tuning	5	Python
2400	ES7/LLaMA-from-Scratch In this repository, I have explained the working of the LLaMA Model,...	33	Emerging	llama-model-implementations	5	Python

« Prev 1 2 3 … 22 23 24 25 26 … 76 77 78 Next »