All Transformer Models

7,795 models ranked by quality score · Page 28 of 78

Showing 2701–2800 of 7,795

« Prev Next »

#	Model	Score	Tier	Category	Stars	Language
2701	aigc-apps/PertEval [NeurIPS '24 Spotlight] PertEval: Unveiling Real Knowledge Capacity of LLMs...	31	Emerging	llm-bias-evaluation	14	Jupyter Notebook
2702	DomHudson/bert-in-production A collection of resources on using BERT (https://arxiv.org/abs/1810.04805 )...	31	Emerging	bert-model-implementations	96	—
2703	discountry/forever-chat chatgpt with forever memory!	31	Emerging	multi-provider-llm-interfaces	3	TypeScript
2704	titanml/takeoff-community TitanML Takeoff Server is an optimization, compression and deployment...	31	Emerging	llm-inference-engines	114	—
2705	Agora-Lab-AI/HydraNet HydraNet is a state-of-the-art transformer architecture that combines...	31	Emerging	transformer-architecture-tutorials	9	Shell
2706	yangjianxin1/LongQLoRA LongQLoRA: Extent Context Length of LLMs Efficiently	31	Emerging	llm-fine-tuning	168	Python
2707	zzz47zzz/codebase-for-incremental-learning-with-llm [ACL2024] A Codebase for Incremental Learning with Large Language Models;...	31	Emerging	llm-scaling-architecture	60	Python
2708	elijahnzeli1/CausalTorch CausalTorch is a PyTorch library for building generative models with...	31	Emerging	mathematical-reasoning-transformers	5	Python
2709	ryoungj/ObsScaling [NeurIPS'24 Spotlight] Observational Scaling Laws	31	Emerging	llm-scaling-architecture	60	Jupyter Notebook
2710	ant-louis/belgpt2 🇧🇪 BelGPT-2: the 1st GPT model pretrained in French.	31	Emerging	gpt2-pretraining-fine-tuning	34	Python
2711	Yifan-Song793/ETO Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents...	31	Emerging	llm-robot-planning	159	Python
2712	euclaise/SlimTrainer Full finetuning of large language models without large memory requirements	31	Emerging	llm-benchmark-leaderboards	94	Python
2713	hao-ai-lab/d3LLM d3LLM: Ultra-Fast Diffusion LLM 🚀	31	Emerging	diffusion-language-models	105	Python
2714	Agora-Lab-AI/OmniByteGPT An implementation of an all-new foundation model architecture that trains on...	31	Emerging	gpt2-pretraining-fine-tuning	9	Python
2715	w1bb/ATE A server application that provides the user answers to trivia-like questions.	31	Emerging	question-answering-systems	3	Python
2716	Shaurya-Sethi/transqlate End-to-end natural language to SQL system: schema-aware model fine-tuning,...	31	Emerging	text-to-sql-rag	22	Python
2717	ChaitanyaK77/Optimal-Detection-of-Diabetic-Retinopathy-Severity-Using-Attention-Based-CNN-and-Vision-Transformers This repository contains the implementation of a hybrid model combining...	31	Emerging	medical-image-diagnosis-transformers	4	Jupyter Notebook
2718	Iteranya/AktivaAI Local LLM Discord Bot	31	Emerging	messaging-platform-chatbots	18	Python
2719	JunyiYe/FaultyMathProblem From Blind Solvers to Logical Thinkers: Benchmarking LLMs’ Logical Integrity...	31	Emerging	llm-reasoning-research	4	—
2720	NiuTrans/Introduction-to-Transformers An introduction to basic concepts of Transformers and key techniques of...	31	Emerging	transformer-architecture-tutorials	51	—
2721	abdur75648/MedicalGPT Medical Report Generation And VQA (Adapting XrayGPT to Any Modality)	31	Emerging	medical-image-diagnosis-transformers	14	Python
2722	bpevangelista/vfastml Inference and Training Engine for LLMs, Image2Image and Other Models	31	Emerging	llm-inference-engines	3	Python
2723	py-lama/weblama A web-based Markdown editor with syntax highlighting, Mermaid diagram...	31	Emerging	interactive-ai-chat-uis	2	JavaScript
2724	jiaowoguanren0615/DLinear This is a warehouse for DLinear-Pytorch-model, can be used to train your...	31	Emerging	time-series-forecasting-transformers	3	Python
2725	EternityYW/RUPBench RUPBench: Benchmarking Reasoning Under Perturbations for Robustness...	31	Emerging	domain-specific-benchmarks	4	Jupyter Notebook
2726	serp-ai/LLaMA-8bit-LoRA Repository for Chat LLaMA - training a LoRA for the LLaMA (1 or 2) models on...	31	Emerging	llm-fine-tuning	150	Python
2727	VirtualRoyalty/gan-plus-nlp Generative adversarial approach to most popular NLP tasks	31	Emerging	model-evaluation-diagnostics	4	Jupyter Notebook
2728	FudanDISC/ReForm-Eval An benchmark for evaluating the capabilities of large vision-language models (LVLMs)	31	Emerging	safety-robustness-evaluation	46	Python
2729	KevinLee1110/dynamic-batching The official repo for the paper "Optimizing LLM Inference Throughput via...	31	Emerging	llm-inference-engines	17	—
2730	ApocryphalEditor/SRM-mapping-framework A framework for mapping the internal geometry of transformer representations...	31	Emerging	transformer-interpretability-mechanistic	2	Python
2731	LMOS-IO/ALMoAPI ALMoAPI, Agentic Language Model API, is a fork of tabbyAPI, designed to...	31	Emerging	multi-agent-orchestration	4	Python
2732	danieloquelis/natural-language-git Offline LLM-powered Git CLI tool. NLGit interprets your natural language...	31	Emerging	llm-terminal-automation	3	TypeScript
2733	NeurAI-Lab/MT-SfMLearner Official code for 'Transformers in Unsupervised Structure-from-Motion' and...	31	Emerging	3d-vision-transformers	14	Python
2734	shikiw/Modality-Integration-Rate [ICCV 2025] The official code of the paper "Deciphering Cross-Modal...	31	Emerging	vision-language-instruction-tuning	111	Python
2735	ImplicitLayer/agents_nlp Agents for solving NLP problems	31	Emerging	ml-foundations-curricula	2	Python
2736	AJAkil/LLMalMorph This repository contain the tool LLMalMorph, a semi automated tool that...	31	Emerging	llm-fine-tuning-frameworks	6	Python
2737	kyegomez/MobileVLM Implementation of the LDP module block in PyTorch and Zeta from the paper:...	31	Emerging	vision-language-models	15	Python
2738	Roboflow-Universe/finetune-RF-DETR Modular CLI pipeline for fine‑tuning RF‑DETR object detection models on...	31	Emerging	object-detection-transformers	32	Python
2739	LikithMeruvu/Gemma2B_Finetuning_Medium This Repo contains How to Finetune Google's New Gemma LLm model using your...	31	Emerging	gemma-model-fine-tuning	4	Jupyter Notebook
2740	JarvisPei/FuseGPT The implementation for the paper, FuseGPT: Learnable Layers Fusion of...	31	Emerging	gpt2-pretraining-fine-tuning	4	Python
2741	graphcore-research/jax-scalify JAX Scalify: end-to-end scaled arithmetics	31	Emerging	llm-fine-tuning	18	Python
2742	XCollab/HuggingFace This repository provides an overview of Hugging Face's Transformers library,...	31	Emerging	nlp-learning-resources	3	Jupyter Notebook
2743	MartinaHutter/yaskawa-voice-commands NLP for yaskawa robot	31	Emerging	huggingface-learning-resources	3	Python
2744	Pranav-here/agentic-ai-chatbot This project is a modular AI chatbot framework that allows dynamic...	31	Emerging	multi-agent-orchestration	2	Python
2745	surrey-nlp/LLM4MT_eval This repository is for our paper "What do large language model need for...	31	Emerging	math-reasoning-datasets	4	Python
2746	FranxYao/FlanT5-CoT-Specialization Implementation of ICML 23 Paper: Specializing Smaller Language Models...	31	Emerging	chain-of-thought-reasoning	132	Jupyter Notebook
2747	smpanaro/coreml-llm-cli CLI to demonstrate running a large language model (LLM) on Apple Neural Engine.	31	Emerging	llm-quantization-methods	124	Swift
2748	xmindflow/deformableLKA [WACV 2024] Beyond Self-Attention: Deformable Large Kernel Attention for...	31	Emerging	medical-image-segmentation-transformers	259	Python
2749	nrl-ai/CustomChar Your customized AI assistant - Personal assistants on any hardware! With...	31	Emerging	conversational-chatbot-applications	121	C++
2750	yihong1120/Llama2-Telegram-Bot Integration of the advanced llama2 AI model with Telegram to provide...	31	Emerging	llm-chatbot-interfaces	14	Python
2751	Arman176001/Oxidize ⚙️ Oxidize: A Python-to-Rust code translator to boost performance, safety,...	31	Emerging	browser-based-ml-inference	2	Python
2752	techthoughts2/pwshBedrock pwshBedrock is a PowerShell module designed to simplify interaction with...	31	Emerging	multi-agent-orchestration	7	PowerShell
2753	marqinhos/MedicalLiverSegmentationToolKit Medical Toolkit for Liver Volume Segmentation	31	Emerging	medical-image-segmentation-transformers	2	Python
2754	jaabmar/cp_fuse Implementation for the paper "Copyright-Protected Language Generation via...	31	Emerging	diffusion-model-frameworks	7	Python
2755	QwenLM/PolyMath [NeurIPS 2025 D&B Track] Evaluation Code Repo for Paper "PolyMath:...	31	Emerging	math-reasoning-datasets	42	Python
2756	OpenMOSS/LongLLaDA [AAAI26] LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs	31	Emerging	diffusion-language-models	53	Python
2757	NiuTrans/Vision-LLM-Alignment This repository contains the code for SFT, RLHF, and DPO, designed for...	31	Emerging	rlhf-alignment-training	118	Python
2758	kyegomez/primus A multimodal foundation model for humanoid robotics that integrates multiple...	31	Emerging	multimodal-fusion-transformers	3	—
2759	mrcabbage972/simple-toolformer A Python implementation of Toolformer using Huggingface Transformers	31	Emerging	attention-mechanism-implementations	14	Python
2760	AGI-Edgerunners/LLM-Optimizers-Papers Must-read Papers on Large Language Model (LLM) as Optimizers and Automatic...	31	Emerging	llm-research-curation	252	—
2761	jwergieluk/revllm RevLLM -- Reverse Engineering Tools for Large Language Models	31	Emerging	llm-interpretability-explainability	18	Python
2762	harleyszhang/llm_counts llm theoretical performance analysis tools and support params, flops, memory...	31	Emerging	llm-inference-engines	115	Python
2763	pdaicode/awesome-LLMs-finetuning Collection of resources for finetuning Large Language Models (LLMs).	31	Emerging	llm-knowledge-distillation	113	—
2764	dinhquy-nguyen-1704/ZaloAI2023-Elementary-Math-Solving Baseline achieving 0.8 accuracy on the private test set in the ZaloAI...	31	Emerging	llm-scaling-architecture	24	Python
2765	Joyce94/LLM-RLHF-Tuning LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)	31	Emerging	lora-qlora-fine-tuning	453	Python
2766	kryptomrx/tonl-mcp-bridge Reduce LLM token costs by 30-60% with TONL format. TypeScript library & CLI...	31	Emerging	llm-learning-resources	3	TypeScript
2767	RahulSChand/llama2.c-for-dummies Step by step explanation/tutorial of llama2.c	31	Emerging	local-llm-deployment	225	C
2768	cutec-chris/matrix-llm-bot An Bot wich can use most of Large Language Models	31	Emerging	telegram-ai-assistants	7	Python
2769	hem9984/Dataset-label This will allow you to choose your labels, and then label every image in a...	31	Emerging	blip-image-captioning	3	Python
2770	iboing/CorDA CorDA: Context-Oriented Decomposition Adaptation of Large Language Models...	31	Emerging	llm-knowledge-distillation	55	Python
2771	apollosoldier/Advanced-Classifier The Advanced Classification Model is a deep learning-based approach for...	31	Emerging	vision-transformer-classification	3	Python
2772	ynes99/BraTS_Segmentation Segmentation of brain tumors (Glioma) in MRIs using Meta's model SAM...	31	Emerging	medical-image-segmentation-transformers	3	Jupyter Notebook
2773	yinizhilian/ICLR2025-Papers-with-Code 历年ICLR论文和开源项目合集，包含ICLR2021、ICLR2022、ICLR2023、ICLR2024、ICLR2025.	31	Emerging	llm-research-curation	562	—
2774	dmis-lab/Outlier-Safe-Pre-Training [ACL 2025] Outlier-Safe Pre-Training for Robust 4-Bit Quantization of Large...	31	Emerging	llm-compression-optimization	35	Python
2775	rajatrayaraddi/rul-prediction-bilstm-cnn A BiLSTM-CNN hybrid model with attention for predicting remaining useful life (RUL)	31	Emerging	ml-foundations-curricula	6	Jupyter Notebook
2776	waltonfuture/Diff-eRank [NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models	31	Emerging	evaluation-frameworks-metrics	57	Python
2777	MoleculeTransformers/moleculenet-smiles-bert-mixup Training pre-trained BERT language model on molecular SMILES from the...	31	Emerging	molecular-generation-transformers	3	Python
2778	bhanuprathap2000/sign-language-recognition This repo contains the code for sign-language-recognition as part of our...	31	Emerging	3d-vision-transformers	3	Jupyter Notebook
2779	cosmic-heart/Benetech-Chart-Derendering Benetech Kaggle Competition Work. Fine Tuning Matcha (Multi Modal...	31	Emerging	ml-foundations-curricula	3	Jupyter Notebook
2780	garyb9/pytorch-transformers Transformers architecture code playground repository in python using PyTorch.	31	Emerging	transformer-architecture-tutorials	3	Python
2781	sitammeur/qwen2.5-web Qwen2.5 Instruct, large language model, operates within web browsers via 🤗...	31	Emerging	browser-based-ml-inference	2	JavaScript
2782	telekom/llm_evaluation_results LLM evaluation results	31	Emerging	evaluation-frameworks-metrics	4	Jupyter Notebook
2783	shhossain/BanglaTranslationKit BanglaTranslationKit is a open-source translation package for offline...	31	Emerging	indic-language-translation	3	Python
2784	Nikshaan/llm-from-scratch Implementation of build a LLM from scratch by Sebastian Raschka.	31	Emerging	llm-implementation-tutorials	15	Python
2785	fatemafaria142/Large-Language-Models-Over-Transformer-Models-for-Bangla-NLI This research examines the performance of Large Language Models (GPT-3.5...	31	Emerging	llm-scaling-architecture	3	Jupyter Notebook
2786	D-Roberts/transformers-retrieval-ranking-nli-ECIR2021 Multilingual retrieval, ranking and natural language inference with...	31	Emerging	semantic-search-retrieval	3	Python
2787	upunaprosk/quantized-lm-confidence Code for NAACL paper When Quantization Affects Confidence of Large Language Models?	31	Emerging	llm-quantization-techniques	3	Jupyter Notebook
2788	ilias-ant/toxic-spans-detection An attempt at SemEval 2021 Task 5: Toxic Spans Detection.	31	Emerging	hate-speech-detection	4	Jupyter Notebook
2789	matteomedioli/BERT-KG Enriching Language Models Representations via Knowledge Graphs Regularisation	31	Emerging	model-evaluation-diagnostics	3	Python
2790	Nickil21/weakly-supervised-parsing Official Code for our Findings of ACL 2022 paper: Co-training an...	31	Emerging	model-evaluation-diagnostics	4	Python
2791	toriving/haafor-challenge-2020 The project for HAAFOR CHALLENGE 2020	31	Emerging	semantic-textual-similarity	3	Python
2792	stevezheng23/fewshot_nlp_pt Few-shot NLP in PyTorch	31	Emerging	model-evaluation-diagnostics	4	Python
2793	HLTCHKUST/VG-GPLMs The code repository for EMNLP 2021 paper "Vision Guided Generative...	31	Emerging	vision-language-models	57	Python
2794	mlane/llm-getting-started Practical, beginner-friendly LLM projects using Python, LangChain, and...	31	Emerging	langchain-prompt-templates	1	Python
2795	mtanghu/LEAP LEAP: Linear Explainable Attention in Parallel for causal language modeling...	31	Emerging	transformer-architecture-tutorials	4	Jupyter Notebook
2796	ParadoxZW/LLaVA-UHD-Better A bug-free and improved implementation of LLaVA-UHD, based on the code from...	31	Emerging	multimodal-vision-language	35	Python
2797	cambridgeltl/sail-bli Self-Augmented In-Context Learning for Unsupervised Word Translation (ACL...	31	Emerging	llm-fine-tuning-frameworks	4	Python
2798	seonghyeonye/Flipped-Learning [ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models...	31	Emerging	rlhf-alignment-training	117	Python
2799	UIC-Liu-Lab/ContinualLM An Extensible Continual Learning Framework Focused on Language Models (LMs)	31	Emerging	prompt-engineering-optimization	293	Python
2800	Skyline-9/Visionary-Vids Multi-modal transformer approach for natural language query based joint...	31	Emerging	vision-language-models	17	Jupyter Notebook

« Prev 1 2 3 … 26 27 28 29 30 … 76 77 78 Next »