All Transformer Models

7,795 models ranked by quality score · Page 17 of 78

Showing 1601–1700 of 7,795

« Prev Next »

#	Model	Score	Tier	Category	Stars	Language
1601	WisconsinAIVision/ViP-LLaVA [CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary...	38	Emerging	vision-language-instruction-tuning	336	Python
1602	EagleW/Scientific-Inspiration-Machines-Optimized-for-Novelty Official implementation of the ACL 2024: Scientific Inspiration Machines...	38	Emerging	llm-learning-resources	94	Python
1603	HOLYKEYZ/model-unfetter The production engine for directional ablation. Unalign / remove models...	38	Emerging	llm-compression-optimization	19	Python
1604	NisaarAgharia/Indian-LawyerGPT Fine-Tuning Falcon-7B, LLAMA 2 with QLoRA to create an advanced AI model...	38	Emerging	lora-qlora-fine-tuning	93	Jupyter Notebook
1605	Yxxxb/VoCo-LLaMA [CVPR'2025] VoCo-LLaMA: This repo is the official implementation of...	38	Emerging	vision-language-instruction-tuning	203	Python
1606	rasbt/pytorch-memory-optim This code repository contains the code used for my "Optimizing Memory Usage...	38	Emerging	llm-implementation-tutorials	92	Python
1607	RishabSA/interp-refusal-tokens We study whether categorical refusal tokens enable controllable and...	38	Emerging	rlhf-alignment-training	7	Python
1608	Jagatmohan46/tiny-recursive-model 🚀 Implement the Tiny Recursive Model (TRM) for improved performance in...	38	Emerging	transformer-training-optimization	1	Python
1609	ParCIS/Chimera Chimera: bidirectional pipeline parallelism for efficiently training...	38	Emerging	transformer-training-optimization	70	Python
1610	hscspring/llama.np Inference Llama/Llama2/Llama3 Modes in NumPy	38	Emerging	llama-model-implementations	21	Python
1611	BarCodeReader/SelfReformer [TMM-2023] Official implementation of "Towards Complete and Detail-Preserved...	38	Emerging	object-detection-transformers	73	Python
1612	The-Swarm-Corporation/Hyena-Y A PyTorch implementation of the Hyena-Y model, a convolution-based...	38	Emerging	transformer-architecture-tutorials	11	Python
1613	gnai-creator/aletheion-llm-v2 Decoder-only LLM with integrated epistemic tomography. Knows what it doesn't know.	38	Emerging	llm-bias-evaluation	2	Python
1614	readytensor/rt-llm-eng-cert-week3 Week 3 of LLM Engineering Certification: Learn to fine-tune large language...	38	Emerging	llm-fine-tuning	1	Jupyter Notebook
1615	ictnlp/BayLing “百聆”是一个基于LLaMA的语言对齐增强的英语/中文大语言模型，具有优越的英语/中文能力，在多语言和通用任务等多项测试中取得ChatGPT...	38	Emerging	multilingual-llm-adaptation	318	Python
1616	ChanMeng666/interactive-story-generator 【Join our constellation of stargazers!⭐️】An interactive AI-powered story...	38	Emerging	prompt-engineering-security	11	Python
1617	dropbox/grallama-panel GraLLAMA panel for LLAMA data	38	Emerging	interactive-ai-chat-uis	16	JavaScript
1618	matlab-deep-learning/transformer-networks-for-time-series-prediction Deep Learning in Quantitative Finance: Transformer Networks for Time Series...	38	Emerging	time-series-forecasting-transformers	61	MATLAB
1619	sshh12/llm_optimize LLM Optimize is a proof-of-concept library for doing LLM (large language...	38	Emerging	llm-scaling-architecture	61	Python
1620	thruthseeker/LionLock_FDE_OSS Open source fatigue detection engine for large language models with trust overlay	38	Emerging	llm-inference-engines	3	Python
1621	VITA-Group/LiGO [ICLR 2023] "Learning to Grow Pretrained Models for Efficient Transformer...	38	Emerging	llm-pruning-compression	92	Python
1622	mtuann/llm-updated-papers Papers related to Large Language Models in all top venues	38	Emerging	llm-research-curation	11	—
1623	ariannamethod/doe DoE Janus Architecture: Democracy of Experts	38	Emerging	llm-quantization-methods	4	C
1624	flozi00/atra An open source NLP as a service project focused on providing state of the...	37	Emerging	ml-api-deployment	20	Jupyter Notebook
1625	ximinng/LLM4SVG [CVPR 2025] Official implementation for "Empowering LLMs to Understand and...	37	Emerging	multimodal-vision-language	617	Python
1626	GT-RIPL/robo-vln Pytorch code for ICRA'21 paper: "Hierarchical Cross-Modal Agent for Robotics...	37	Emerging	multimodal-fusion-transformers	88	Python
1627	awinml/llama-cpp-python-bindings Run fast LLM Inference using Llama.cpp in Python	37	Emerging	llm-docker-deployments	19	Jupyter Notebook
1628	litus-ai/classy classy is a simple-to-use library for building high-performance Machine...	37	Emerging	text-classification-transformers	87	Python
1629	K024/llm-sharp Language models in C#	37	Emerging	local-llm-deployment	50	C#
1630	coderonion/awesome-llm-and-aigc 🚀🚀🚀A collection of some awesome public projects about Large Language...	37	Emerging	llm-training-experimentation	804	—
1631	voidism/DoLa Official implementation for the paper "DoLa: Decoding by Contrasting Layers...	37	Emerging	llm-hallucination-mitigation	544	Python
1632	declare-lab/exemplary-empathy This repository contains the source codes of the paper -- Exemplars-guided...	37	Emerging	emotion-detection-transformers	24	Python
1633	ImKeTT/AdaVAE [Preprint] AdaVAE: Exploring Adaptive GPT-2s in VAEs for Language Modeling...	37	Emerging	variational-autoencoders-nlp	37	Python
1634	TIGER-AI-Lab/VL-Rethinker The official code of "VL-Rethinker: Incentivizing Self-Reflection of...	37	Emerging	llm-reasoning-research	184	Python
1635	HKUNLP/icl-ceil [ICML 2023] Code for our paper “Compositional Exemplars for In-context Learning”.	37	Emerging	rlhf-alignment-training	103	Python
1636	mohyunho/NAS_transformer Evolutionary Neural Architecture Search on Transformers for RUL Prediction	37	Emerging	transformer-architecture-tutorials	50	Python
1637	DAMO-NLP-SG/LLM-Multilingual-Knowledge-Boundaries [ACL 2025] Analyzing LLMs' Multilingual Knowledge Boundary Cognition Across...	37	Emerging	math-reasoning-datasets	18	Jupyter Notebook
1638	casinca/LLM-quest Verbose implementations of LLMs architectures, techniques and research...	37	Emerging	llm-finetuning-frameworks	12	Python
1639	CognitiveAISystems/RATE [ICLR 2026] Official implementation of Recurrent Action Transformer with...	37	Emerging	paper-implementation-collections	18	Python
1640	kyegomez/MGQA The open source implementation of the multi grouped query attention by the...	37	Emerging	vision-language-models	15	Python
1641	joelbarmettlerUZH/ConceptFormer Towards Finding the Essence of Everything in Large Language Models	37	Emerging	llm-implementation-from-scratch	13	Python
1642	iil-postech/semantic-attention Official implementation of "Attention-aware semantic communications for...	37	Emerging	transformer-architecture-tutorials	13	Jupyter Notebook
1643	pagraf/Seabed-Net Quick start guide for Seabed-Net	37	Emerging	vision-transformer-classification	8	Python
1644	Shanghai-Digital-Brain-Laboratory/BDM-DB1 A large-scale multi-modal pre-trained model	37	Emerging	multimodal-fusion-transformers	134	Python
1645	justADeni/intel-npu-llm A simple Python script for running LLMs on Intel's Neural Processing Units (NPUs)	37	Emerging	llm-inference-serving	35	Python
1646	snapllm/snapllm 🔥 🔥 Alternative to Ollama 🔥 🔥 multi-model <1ms LLM switching	37	Emerging	llm-inference-serving	32	C++
1647	StupidTrees/SplitLLM Split Learning Simulation Framework for LLMs	37	Emerging	llm-scaling-architecture	38	Python
1648	nlpkeg/Know-MRI This is an official code for the [ACL 2025 Demo] paper: Know-MRI: A...	37	Emerging	llm-interpretability-explainability	14	Jupyter Notebook
1649	aws-samples/fine-tuning-llm-with-domain-knowledge This repo walks you through how to use transfer learning to fine tune a LLM...	37	Emerging	llm-fine-tuning	42	Jupyter Notebook
1650	jhcho99/CoFormer [CVPR'22] Official PyTorch Implementation of "Collaborative Transformers for...	37	Emerging	3d-vision-transformers	50	Python
1651	chensyCN/llm4ea_official [NeurIPS‘24] LLM4EA: Entity Alignment with Noisy Annotations from Large...	37	Emerging	llm-knowledge-editing	61	Python
1652	tommyip/mamba2-minimal Minimal Mamba-2 implementation in PyTorch	37	Emerging	diffusion-language-models	243	Python
1653	whunextgen/LLMindCraft Shaping Language Models with Cognitive Insights	37	Emerging	llm-benchmark-leaderboards	15	Python
1654	varunshenoy/super-json-mode Low latency JSON generation using LLMs ⚡️	37	Emerging	structured-output-enforcement	398	Jupyter Notebook
1655	all-things-vits/code-samples Holds code for our CVPR'23 tutorial: All Things ViTs: Understanding and...	37	Emerging	vit-image-classification	197	Jupyter Notebook
1656	readme-generator/alreadyme-ai-serving Serving large language model with transformers	37	Emerging	creative-text-generation	13	Python
1657	saltudelft/codefill Contains the code and data for our #ICSE2022 paper titled as "CodeFill:...	37	Emerging	creative-text-generation	15	Jupyter Notebook
1658	Agora-Lab-AI/Atom a suite of finetuned LLMs for atomically precise function calling 🧪	37	Emerging	local-llm-deployment	17	Python
1659	ccmdi/geobench GeoGuessr benchmark for language models	37	Emerging	llm-benchmark-leaderboards	51	Python
1660	sandseb123/local-lora-cookbook Fine-tune a local LLM on your own app's data in 15 minutes. Runs entirely...	37	Emerging	llm-fine-tuning-optimization	13	Python
1661	canjiali/PARADE code and data to faciliate BERT/ELECTRA for document ranking. Details refer...	37	Emerging	semantic-search-retrieval	96	Python
1662	VachanVY/Transfusion.torch PyTorch Implementation of Transfusion: Predict the Next Token and Diffuse...	37	Emerging	3d-vision-transformers	28	Python
1663	jqwangai/Medical-LLM A Repository of Medical Large Language Models	37	Emerging	clinical-llm-tools	4	—
1664	yangjianxin1/Firefly Firefly:...	37	Emerging	multilingual-llm-adaptation	6,644	Python
1665	knagrecha/saturn Saturn accelerates the training of large-scale deep learning models with a...	37	Emerging	llm-inference-engines	24	Python
1666	gentaiscool/miners MINERS ⛏️: The semantic retrieval benchmark for evaluating multilingual...	37	Emerging	semantic-search-retrieval	14	Python
1667	nishad/llm-workshop-notebooks Getting Started with Local LLMs - Workshop Notebooks	37	Emerging	llm-learning-resources	3	Jupyter Notebook
1668	iiis-ai/cumulative-reasoning [TMLR] Cumulative Reasoning With Large Language Models...	37	Emerging	llm-reasoning-research	308	Python
1669	RLado/STB-VMM STB-VMM: Swin Transformer Based Video Motion Magnification (official repository)	37	Emerging	vision-transformer-implementations	50	Python
1670	xmindflow/MS-Former [MIDL 2023] MS-Former: Multi-Scale Self-Guided Transformer for Medical Image...	37	Emerging	medical-image-segmentation-transformers	16	Jupyter Notebook
1671	p-nordmann/eqx-llama LLaMA implementation with Jax and Equinox	37	Emerging	llama-model-implementations	4	Jupyter Notebook
1672	OSU-NLP-Group/AmpleGCG AmpleGCG: Learning a Universal and Transferable Generator of Adversarial...	37	Emerging	adversarial-nlp-robustness	85	Python
1673	Harish25/StudyScreeningLanguageModel Core LLM for M.A.R.S. (Model Assisted Review System). Utilizes fine-tuned...	37	Emerging	multilingual-llm-adaptation	1	Jupyter Notebook
1674	xiangking/prompt_uie_torch 基于PaddleNLP开源的抽取式UIE进行医学命名实体识别（torch实现）	37	Emerging	transformer-frameworks-wrappers	44	Python
1675	dirmacs/lancor A Rust client library for llama.cpp's OpenAI-compatible API server	37	Emerging	local-llm-deployment	2	Rust
1676	fangyuan-ksgk/Mini-LLaVA A minimal implementation of LLaVA-style VLM with interleaved image & text &...	37	Emerging	multimodal-vision-language	98	Python
1677	WANGXinyiLinda/concept-based-demonstration-selection Offical code of the paper Large Language Models Are Implicitly Topic Models:...	37	Emerging	llm-scaling-architecture	75	Python
1678	locuslab/massive-activations Code accompanying the paper "Massive Activations in Large Language Models"	37	Emerging	llm-scaling-architecture	197	Python
1679	SeungyounShin/Llama2-Code-Interpreter Make Llama2 use Code Execution, Debug, Save Code, Reuse it, Access to Internet	37	Emerging	local-llm-deployment	685	Python
1680	adalkiran/llama-nuts-and-bolts A holistic way of understanding how Llama and its components run in...	37	Emerging	local-llm-deployment	317	Go
1681	extreme-bert/extreme-bert ExtremeBERT is a toolkit that accelerates the pretraining of customized...	37	Emerging	bert-model-frameworks	268	Python
1682	AlphaPav/mem-kk-logic On Memorization of Large Language Models in Logical Reasoning	37	Emerging	math-reasoning-datasets	76	Python
1683	user1342/Tomato LLM steganography with minimum-entropy coupling - Hiding encrypted messages...	37	Emerging	llm-frameworks-libraries	94	Python
1684	zjohn77/lightning-mlflow-hf Use QLoRA to tune LLM in PyTorch-Lightning w/ Huggingface + MLflow	37	Emerging	llm-fine-tuning	65	Python
1685	muhammad-fiaz/finetune-web-ui Finetune Web UI is a user-interface for training and deploying pre-trained models.	37	Emerging	lora-qlora-fine-tuning	12	Python
1686	YoannDev90/AlphaLLM An AI Discord Bot generating text and images, advanced features, full...	37	Emerging	messaging-platform-chatbots	12	Python
1687	Love-Asuka/Etude-LLM "Etude"一词源自法语，原意为"研习曲"或"练习曲"，在音乐领域特指为提高演奏技巧而创作的短小精悍的乐曲。在本项目中，"Etude...	37	Emerging	llm-finetuning-frameworks	5	Python
1688	alexa/ramen A software for transferring pre-trained English models to foreign languages	37	Emerging	bert-model-implementations	19	Python
1689	torchspec-project/TorchSpec A PyTorch native library for training speculative decoding models	37	Emerging	speculative-decoding-algorithms	32	Python
1690	LucknowAI/Lucknow-LLM Collecting data for Building Lucknow's first LLM	37	Emerging	llm-learning-resources	21	Jupyter Notebook
1691	kyegomez/AudioMamba Implementation of the paper: "Audio Mamba: Bidirectional State Space Model...	37	Emerging	3d-vision-transformers	14	Shell
1692	potamides/uniformers Token-free Language Modeling with ByGPT5 & Friends!	37	Emerging	gpt2-pretraining-fine-tuning	12	Python
1693	gyunggyung/LFM2-KoEn-Tuning Fine-tuning LFM2-1.2B for Korean-English bidirectional translation....	37	Emerging	gpt2-language-models	7	Jupyter Notebook
1694	nrimsky/LM-exp LLM experiments done during SERI MATS - focusing on activation steering /...	37	Emerging	llm-training-experimentation	103	Jupyter Notebook
1695	promptslab/LLMtuner FineTune LLMs in few lines of code (Text2Text, Text2Speech, Speech2Text)	37	Emerging	llm-fine-tuning	247	Python
1696	cahlen/conversation-dataset-generator Craft conversational datasets (JSONL format with rich metadata) using LLMs....	37	Emerging	llm-training-experimentation	12	Python
1697	mhw32/prototransformer-public PyTorch implementation for "ProtoTransformer: A Meta-Learning Approach to...	37	Emerging	transformer-architecture-tutorials	16	Python
1698	HomebrewML/HomebrewNLP-torch A case study of efficient training of large language models using commodity hardware.	37	Emerging	gpt-multilingual-training	68	Python
1699	mantasu/cs224n Solutions for CS224n (2022)	37	Emerging	nlp-learning-coursework	72	Python
1700	lliai/D2MoE D^2-MoE: Delta Decompression for MoE-based LLMs Compression	37	Emerging	mixture-of-experts-llms	74	Python

« Prev 1 2 3 … 15 16 17 18 19 … 76 77 78 Next »