All Transformer Models

7,795 models ranked by quality score · Page 18 of 78

Showing 1701–1800 of 7,795

« Prev Next »

#	Model	Score	Tier	Category	Stars	Language
1701	princeton-nlp/LLMBar [ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following	37	Emerging	evaluation-frameworks-metrics	137	Python
1702	mu-cai/matryoshka-mm Matryoshka Multimodal Models	37	Emerging	vision-language-instruction-tuning	122	Python
1703	zRzRzRzRzRzRzR/lm-fly 大模型推理框架加速，让 LLM 飞起来	37	Emerging	llm-inference-engines	24	Python
1704	FlatlinerDOA/PerceptivePyro Run and train Transformer based Large Language Models (LLMS) natively in...	37	Emerging	local-llm-deployment	24	C#
1705	rdenadai/BR-BERTo Transformer model for Portuguese language (Brazil pt_BR)	37	Emerging	bert-model-implementations	16	Python
1706	pdfosborne/elsciRL The core repository of the elsciRL framework.	37	Emerging	llm-scaling-architecture	18	Python
1707	AlexanderVNikitin/kernel-language-entropy Code for Fine-grained Uncertainty Quantification for LLMs from Semantic...	37	Emerging	llm-reasoning-research	36	Python
1708	UCSB-NLP-Chang/SemanticSmooth Implementation of paper 'Defending Large Language Models against Jailbreak...	37	Emerging	jailbreak-attacks-analysis	23	Python
1709	ariannamethod/chuck.optimizer Adam is blind. Chuck sees. Lee 4ever.	37	Emerging	llm-benchmark-leaderboards	4	C
1710	mkuchnik/relm ReLM is a Regular Expression engine for Language Models	37	Emerging	llm-scaling-architecture	107	Python
1711	ai-glimpse/toyllm ToyLLM: Learning LLM from Scratch	37	Emerging	llm-implementation-tutorials	25	Python
1712	DebeshJha/TransNetR Official implementation of TransNetR: Transformer-based Residual Network for...	37	Emerging	medical-image-segmentation-transformers	24	Python
1713	surrey-nlp/NLP-2025 Labs for COM3029/COMM061 at University of Surrey	37	Emerging	nlp-learning-coursework	3	Jupyter Notebook
1714	TIGER-AI-Lab/StructLM Code and data for "StructLM: Towards Building Generalist Models for...	37	Emerging	math-reasoning-datasets	76	Python
1715	horus-ai-labs/DistillFlow Library for model distillation	37	Emerging	llm-knowledge-distillation	165	Python
1716	chziakas/redeval A library for red-teaming LLM applications with LLMs.	37	Emerging	evaluation-frameworks-metrics	29	Python
1717	shinomakoi/AI-Messenger A QT GUI for large language models	37	Emerging	interactive-ai-chat-uis	40	Python
1718	Bruce-Lee-LY/flash_attention_inference Performance of the C++ interface of flash attention and flash attention v2...	37	Emerging	sparse-attention-optimization	43	C++
1719	Ethyros-AI/ModelCypher ModelCypher - Decipher the high dimensional geometry of LLMs. An open source...	37	Emerging	llm-finetuning-frameworks	19	Python
1720	CVI-SZU/Linly Chinese-LLaMA 1&2、Chinese-Falcon 基础模型；ChatFlow中文对话模型；中文OpenLLaMA模型；NLP预训练/指令微调数据集	37	Emerging	multilingual-llm-adaptation	3,056	Python
1721	osainz59/t5-encoder A extension of Transformers library to include T5ForSequenceClassification class.	37	Emerging	t5-mt5-fine-tuning	40	Python
1722	avnlp/llm-blender LLM-Blender: Ensembling framework that maximizes LLM performance via...	37	Emerging	llm-fine-tuning-optimization	36	Python
1723	olaflaitinen/llm-proteomics-hallucination Systematic evaluation of hallucination risks in Large Language Models...	37	Emerging	llm-finetuning-frameworks	9	Python
1724	sandylaker/ib-edl Calibrating LLMs with Information-Theoretic Evidential Deep Learning (ICLR 2025)	37	Emerging	llm-bias-evaluation	17	Python
1725	westlake-repl/NRPStransformer A Transformer-Based Predictor for Nonribosomal Peptide Synthetases (NRPS)...	37	Emerging	peptide-property-prediction	9	Python
1726	monologg/KoELECTRA-Pipeline Transformers Pipeline with KoELECTRA	37	Emerging	korean-language-models	40	Python
1727	openmedlab/PULSE PULSE: Pretrained and Unified Language Service Engine	37	Emerging	llm-fine-tuning	494	Python
1728	DebarshiChanda/Amazon-ML-Challenge2021 Scripts and Approach for Amazon ML Challenge	37	Emerging	semantic-textual-similarity	91	Jupyter Notebook
1729	LMLK-seal/HuggingGGUF Hugging Face Model downloader and GGUF Converter.	37	Emerging	llm-quantization-methods	13	Python
1730	benitomartin/food-images-finetuning Fine-tuning of LiquidAI LFM2-VL vision-language models on food image...	37	Emerging	lora-qlora-fine-tuning	7	Jupyter Notebook
1731	zd11024/NaviLLM [CVPR 2024] The code for paper 'Towards Learning a Generalist Model for...	37	Emerging	multimodal-vision-language	229	Python
1732	CoderLSF/fast-llama Runs LLaMA with Extremely HIGH speed	37	Emerging	llm-inference-engines	95	C++
1733	Grenzlinie/MgBERT_LLM_Classification_for_Materials_Science Source code and result for Paper 'A Prompt-Engineered Large Language Model,...	37	Emerging	bert-model-frameworks	9	HTML
1734	arrmansa/Gpt-Neo-Limited-Vram-Cuda A notebook that runs GPT-Neo with low vram (6 gb) and cuda acceleration by...	37	Emerging	gpt2-pretraining-fine-tuning	14	Jupyter Notebook
1735	toriving/text-classification-transformers Easy text classification for everyone : Bert based models via Huggingface...	37	Emerging	korean-language-models	39	Python
1736	joslefaure/HERMES [ICCV'25] HERMES: temporal-coHERent long-forM understanding with Episodes...	37	Emerging	multimodal-vision-language	38	Python
1737	FuxiaoLiu/LRV-Instruction [ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust...	37	Emerging	llm-evaluation-frameworks	297	Python
1738	MalihehIzadi/SoftwareTagRecommender A tag recommender based on SOTA machine learning algorithms to automatically...	37	Emerging	recommendation-systems-transformers	20	Jupyter Notebook
1739	gpustack/gguf-packer-go Deliver LLMs of GGUF format via Dockerfile.	37	Emerging	llm-quantization-methods	15	Go
1740	losttech/Torch.MinGPT A C# implementation of GPT	37	Emerging	gpt2-pretraining-fine-tuning	20	C#
1741	microsoft/AdaMix This is the implementation of the paper AdaMix: Mixture-of-Adaptations for...	37	Emerging	knowledge-distillation-compression	138	Python
1742	kyegomez/VortexFusion Transformers + Mambas + LSTMS All in One Model	37	Emerging	multimodal-fusion-transformers	14	Python
1743	amazon-science/transformers-data-augmentation Code associated with the "Data Augmentation using Pre-trained Transformer...	37	Emerging	transformer-architecture-education	51	Python
1744	YassWorks/Tuna Python library that makes fine-tuning transformer-based models easier and faster.	37	Emerging	lora-qlora-fine-tuning	5	Python
1745	LlamaFamily/Llama-Chinese Llama中文社区，实时汇总最新Llama学习资料，构建最好的中文Llama大模型开源生态，完全开源可商用	37	Emerging	multilingual-llm-adaptation	14,737	Python
1746	SALT-NLP/LLaVAR Code/Data for the paper: "LLaVAR: Enhanced Visual Instruction Tuning for...	37	Emerging	multimodal-vision-language	269	Python
1747	JinjieNi/MixEval The official evaluation suite and dynamic data release for MixEval.	37	Emerging	evaluation-frameworks-metrics	255	Python
1748	juyongjiang/CodeUp CodeUp: A Multilingual Code Generation Llama-X Model with...	37	Emerging	code-model-training	127	Python
1749	leap-laboratories/PIZZA An attribution library for LLMs	37	Emerging	llm-interpretability-explainability	46	Python
1750	jrobine/twm Transformer-based World Models	37	Emerging	world-models-frameworks	89	Python
1751	conceptofmind/t5-pytorch Implementation of Exploring the Limits of Transfer Learning with a Unified...	37	Emerging	t5-mt5-fine-tuning	53	Python
1752	kyegomez/MM1 PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from...	37	Emerging	vision-language-models	26	Python
1753	lukechilds/humanscript A truly natural scripting language	37	Emerging	llm-terminal-automation	236	Shell
1754	rbitr/llm.f90 LLM inference in Fortran	37	Emerging	llm-inference-engines	64	Fortran
1755	liupras/Practical-local-LLM-programming Programming with local large language model.	37	Emerging	langchain-integration-patterns	24	Python
1756	kyegomez/MC-ViT Implementation of the model: "(MC-ViT)" from the paper: "Memory...	37	Emerging	vit-image-classification	27	Python
1757	ShinoharaHare/LLM-Training A distributed training framework for large language models powered by Lightning.	37	Emerging	llm-inference-engines	24	Python
1758	princeton-pli/AdaptMI [COLM 2025] Adaptive Skill-based In-context Math Instruction for Small...	37	Emerging	math-reasoning-datasets	9	Python
1759	kyegomez/MegaVIT The open source implementation of the model from "Scaling Vision...	37	Emerging	vision-transformer-optimization	32	Python
1760	Thrasher-Software/sigil A local-first LLM development studio. Build, test, and customize inference...	37	Emerging	local-llm-deployment	17	CSS
1761	earthai-tech/fusionlab-learn fusionlab-learn: Igniting Next-Gen Temporal Fusion Architectures	37	Emerging	time-series-forecasting-transformers	2	Python
1762	sytelus/nanuGPT Simple, reliable and well tested training code for quick experiments with...	37	Emerging	gpt2-pretraining-fine-tuning	13	Python
1763	iMoonLab/LLM4Hypergraph The source code of ICLR 2025 "Beyond Graphs: Can Large Language Models...	37	Emerging	graph-language-models	38	Python
1764	Rishit-dagli/GLU An easy-to-use library for GLU (Gated Linear Units) and GLU variants in TensorFlow.	37	Emerging	llm-quantization-methods	20	Python
1765	readme-generator/alreadyme-ai-research Generate README.md with GPT-3 few-shot learning	37	Emerging	gpt2-pretraining-fine-tuning	27	Python
1766	StarRing2022/ChatGPTX-Uni 实现一种多Lora权值集成切换+Zero-Finetune零微调增强的跨模型技术方案，LLM-Base+LLM-X+Alpaca，初期，LLM-Base为...	37	Emerging	multilingual-llm-adaptation	116	Python
1767	teelinsan/camoscio Camoscio: An Italian instruction-tuned language model based on LLaMA	37	Emerging	multilingual-llm-adaptation	126	Jupyter Notebook
1768	invergent-ai/surogate Insanely fast LLM pre-training and fine-tuning for modern NVIDIA GPUs....	37	Emerging	llm-inference-engines	114	C++
1769	tirtharajdash/LMLFStar Generating target-specific novel lead molecules using an LLM	37	Emerging	protein-design-llms	4	HTML
1770	ksm26/Finetuning-Large-Language-Models Unlock the potential of finetuning Large Language Models (LLMs). Learn from...	37	Emerging	llm-fine-tuning	68	Jupyter Notebook
1771	IDSIA/fpainter Official repository for the paper "Images as Weight Matrices: Sequential...	37	Emerging	mathematical-reasoning-transformers	12	Python
1772	luohongyin/LangCode LangCode - Improving alignment and reasoning of large language models (LLMs)...	37	Emerging	llm-scaling-architecture	49	Python
1773	ivonajdenkoska/tulip [ICLR 2025] Official code repository for "TULIP: Token-length Upgraded CLIP"	37	Emerging	text-to-image-generation	33	Python
1774	huggingface/large_language_model_training_playbook An open collection of implementation tips, tricks and resources for training...	37	Emerging	llm-frameworks-libraries	497	Python
1775	desaixie/zeroverse Official code for NeurIPS 2024 paper LRM-Zero: Training Large Reconstruction...	37	Emerging	3d-vision-transformers	153	Python
1776	poteminr/instruct-ner Instruct LLMs for flat and nested NER. Fine-tuning Llama and Mistral models...	37	Emerging	lora-qlora-fine-tuning	89	Python
1777	hyintell/awesome-refreshing-llms EMNLP'23 survey: a curation of awesome papers and resources on refreshing...	37	Emerging	llm-learning-resources	136	—
1778	ariya/gamal Research tool leveraging LLM for answers	37	Emerging	llm-terminal-automation	58	JavaScript
1779	Troyanovsky/llama-vision-image-tagger Use Llama3.2 Vision for tagging and searching images on your local machine.	37	Emerging	llm-terminal-automation	92	HTML
1780	zjunlp/Mol-Instructions [ICLR 2024] Mol-Instructions: A Large-Scale Biomolecular Instruction Dataset...	37	Emerging	rlhf-alignment-training	294	Python
1781	JonSnow1807/Medical-Prescription-OCR OCR system for handwritten medical prescriptions using Donut transformer and...	37	Emerging	ocr-document-extraction	9	Jupyter Notebook
1782	VityaVitalich/STASC [ICLR 2025 SSI-FM] Self-Taught Self-Correction for Small Language Models	37	Emerging	llm-scaling-architecture	11	Jupyter Notebook
1783	poloclub/Fine-tuning-LLMs Finetune Llama 2 on Colab for free on your own data: step-by-step tutorial	37	Emerging	llm-fine-tuning	74	Jupyter Notebook
1784	pier-maker92/bachsformer A Bach music generator with Artificial Intelligence. This model is made by a...	37	Emerging	music-generation-transformers	44	Python
1785	jellydn/gpt4all-cli By utilizing GPT4All-CLI, developers can effortlessly tap into the power of...	37	Emerging	multi-provider-llm-interfaces	37	TypeScript
1786	MurrellGroup/InvariantPointAttention.jl Julia implementation of AlphaFold 2's Invariant Point Attention	37	Emerging	attention-mechanism-implementations	6	Julia
1787	partarstu/transformers-in-java Experimental project for AI and NLP based on Transformer Architecture	37	Emerging	transformer-frameworks-wrappers	16	Java
1788	rendezqueue/rendezllama CLI for llama.cpp with various commands to guide, edit, and regenerate...	37	Emerging	llm-terminal-automation	12	C++
1789	otvam/pyscalexfmr Optimization and Scaling of Medium-Frequency Transformers	37	Emerging	power-transformer-design	8	Python
1790	openshieldai/openshield OpenShield is a new generation security layer for AI models	37	Emerging	local-llm-deployment	84	Go
1791	alexeykarnachev/full_stack_transformer Pytorch library for end-to-end transformer models training, inference and serving	37	Emerging	transformer-architecture-tutorials	70	Python
1792	andrewkchan/yalm Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O	37	Emerging	llm-inference-engines	557	C++
1793	babycommando/machinascript-for-robots Build LLM-powered robots in your garage with MachinaScript For Robots!	37	Emerging	llm-terminal-automation	195	Python
1794	DeepLangAI/LingoWhale-8B LingoWhale-8B: Open Bilingual LLMs \| 开源双语预训练大模型	37	Emerging	llm-frameworks-libraries	147	Python
1795	guxm2021/ALT_SpeechBrain [ISMIR 2022] Transfer Learning of wav2vec 2.0 for Automatic Lyric Transcription	36	Emerging	wav2vec2-speech-recognition	49	Python
1796	SuyogKamble/simpleVLM building a simple VLM. Implementing LlaMA-SmolLM2 from scratch + SigLip2...	36	Emerging	vision-language-models	7	Jupyter Notebook
1797	c0sogi/llama-api An OpenAI-like LLaMA inference API	36	Emerging	local-llm-deployment	113	Python
1798	neosantara-xyz/glm-ocr-inference Fast and lightweight GLM-OCR inference on Modal with an OpenAI-compatible...	36	Emerging	ocr-document-extraction	3	Python
1799	ASSERT-KTH/agentic-evals-lab Framework for training and evaluating LLMs with reinforcement learning in...	36	Emerging	multi-agent-orchestration	4	Python
1800	Selozhd/FNet-tensorflow Tensorflow Implementation of "FNet: Mixing Tokens with Fourier Transforms."	36	Emerging	transformer-architecture-tutorials	22	Python

« Prev 1 2 3 … 16 17 18 19 20 … 76 77 78 Next »