All Transformer Models

7,795 models ranked by quality score · Page 18 of 78

Showing 1701–1800 of 7,795
# Model Score Tier
1701 princeton-nlp/LLMBar

[ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following

37
Emerging
1702 mu-cai/matryoshka-mm

Matryoshka Multimodal Models

37
Emerging
1703 zRzRzRzRzRzRzR/lm-fly

大模型推理框架加速,让 LLM 飞起来

37
Emerging
1704 FlatlinerDOA/PerceptivePyro

Run and train Transformer based Large Language Models (LLMS) natively in...

37
Emerging
1705 rdenadai/BR-BERTo

Transformer model for Portuguese language (Brazil pt_BR)

37
Emerging
1706 pdfosborne/elsciRL

The core repository of the elsciRL framework.

37
Emerging
1707 AlexanderVNikitin/kernel-language-entropy

Code for Fine-grained Uncertainty Quantification for LLMs from Semantic...

37
Emerging
1708 UCSB-NLP-Chang/SemanticSmooth

Implementation of paper 'Defending Large Language Models against Jailbreak...

37
Emerging
1709 ariannamethod/chuck.optimizer

Adam is blind. Chuck sees. Lee 4ever.

37
Emerging
1710 mkuchnik/relm

ReLM is a Regular Expression engine for Language Models

37
Emerging
1711 ai-glimpse/toyllm

ToyLLM: Learning LLM from Scratch

37
Emerging
1712 DebeshJha/TransNetR

Official implementation of TransNetR: Transformer-based Residual Network for...

37
Emerging
1713 surrey-nlp/NLP-2025

Labs for COM3029/COMM061 at University of Surrey

37
Emerging
1714 TIGER-AI-Lab/StructLM

Code and data for "StructLM: Towards Building Generalist Models for...

37
Emerging
1715 horus-ai-labs/DistillFlow

Library for model distillation

37
Emerging
1716 chziakas/redeval

A library for red-teaming LLM applications with LLMs.

37
Emerging
1717 shinomakoi/AI-Messenger

A QT GUI for large language models

37
Emerging
1718 Bruce-Lee-LY/flash_attention_inference

Performance of the C++ interface of flash attention and flash attention v2...

37
Emerging
1719 Ethyros-AI/ModelCypher

ModelCypher - Decipher the high dimensional geometry of LLMs. An open source...

37
Emerging
1720 CVI-SZU/Linly

Chinese-LLaMA 1&2、Chinese-Falcon 基础模型;ChatFlow中文对话模型;中文OpenLLaMA模型;NLP预训练/指令微调数据集

37
Emerging
1721 osainz59/t5-encoder

A extension of Transformers library to include T5ForSequenceClassification class.

37
Emerging
1722 avnlp/llm-blender

LLM-Blender: Ensembling framework that maximizes LLM performance via...

37
Emerging
1723 olaflaitinen/llm-proteomics-hallucination

Systematic evaluation of hallucination risks in Large Language Models...

37
Emerging
1724 sandylaker/ib-edl

Calibrating LLMs with Information-Theoretic Evidential Deep Learning (ICLR 2025)

37
Emerging
1725 westlake-repl/NRPStransformer

A Transformer-Based Predictor for Nonribosomal Peptide Synthetases (NRPS)...

37
Emerging
1726 monologg/KoELECTRA-Pipeline

Transformers Pipeline with KoELECTRA

37
Emerging
1727 openmedlab/PULSE

PULSE: Pretrained and Unified Language Service Engine

37
Emerging
1728 DebarshiChanda/Amazon-ML-Challenge2021

Scripts and Approach for Amazon ML Challenge

37
Emerging
1729 LMLK-seal/HuggingGGUF

Hugging Face Model downloader and GGUF Converter.

37
Emerging
1730 benitomartin/food-images-finetuning

Fine-tuning of LiquidAI LFM2-VL vision-language models on food image...

37
Emerging
1731 zd11024/NaviLLM

[CVPR 2024] The code for paper 'Towards Learning a Generalist Model for...

37
Emerging
1732 CoderLSF/fast-llama

Runs LLaMA with Extremely HIGH speed

37
Emerging
1733 Grenzlinie/MgBERT_LLM_Classification_for_Materials_Science

Source code and result for Paper 'A Prompt-Engineered Large Language Model,...

37
Emerging
1734 arrmansa/Gpt-Neo-Limited-Vram-Cuda

A notebook that runs GPT-Neo with low vram (6 gb) and cuda acceleration by...

37
Emerging
1735 toriving/text-classification-transformers

Easy text classification for everyone : Bert based models via Huggingface...

37
Emerging
1736 joslefaure/HERMES

[ICCV'25] HERMES: temporal-coHERent long-forM understanding with Episodes...

37
Emerging
1737 FuxiaoLiu/LRV-Instruction

[ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust...

37
Emerging
1738 MalihehIzadi/SoftwareTagRecommender

A tag recommender based on SOTA machine learning algorithms to automatically...

37
Emerging
1739 gpustack/gguf-packer-go

Deliver LLMs of GGUF format via Dockerfile.

37
Emerging
1740 losttech/Torch.MinGPT

A C# implementation of GPT

37
Emerging
1741 microsoft/AdaMix

This is the implementation of the paper AdaMix: Mixture-of-Adaptations for...

37
Emerging
1742 kyegomez/VortexFusion

Transformers + Mambas + LSTMS All in One Model

37
Emerging
1743 amazon-science/transformers-data-augmentation

Code associated with the "Data Augmentation using Pre-trained Transformer...

37
Emerging
1744 YassWorks/Tuna

Python library that makes fine-tuning transformer-based models easier and faster.

37
Emerging
1745 LlamaFamily/Llama-Chinese

Llama中文社区,实时汇总最新Llama学习资料,构建最好的中文Llama大模型开源生态,完全开源可商用

37
Emerging
1746 SALT-NLP/LLaVAR

Code/Data for the paper: "LLaVAR: Enhanced Visual Instruction Tuning for...

37
Emerging
1747 JinjieNi/MixEval

The official evaluation suite and dynamic data release for MixEval.

37
Emerging
1748 juyongjiang/CodeUp

CodeUp: A Multilingual Code Generation Llama-X Model with...

37
Emerging
1749 leap-laboratories/PIZZA

An attribution library for LLMs

37
Emerging
1750 jrobine/twm

Transformer-based World Models

37
Emerging
1751 conceptofmind/t5-pytorch

Implementation of Exploring the Limits of Transfer Learning with a Unified...

37
Emerging
1752 kyegomez/MM1

PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from...

37
Emerging
1753 lukechilds/humanscript

A truly natural scripting language

37
Emerging
1754 rbitr/llm.f90

LLM inference in Fortran

37
Emerging
1755 liupras/Practical-local-LLM-programming

Programming with local large language model.

37
Emerging
1756 kyegomez/MC-ViT

Implementation of the model: "(MC-ViT)" from the paper: "Memory...

37
Emerging
1757 ShinoharaHare/LLM-Training

A distributed training framework for large language models powered by Lightning.

37
Emerging
1758 princeton-pli/AdaptMI

[COLM 2025] Adaptive Skill-based In-context Math Instruction for Small...

37
Emerging
1759 kyegomez/MegaVIT

The open source implementation of the model from "Scaling Vision...

37
Emerging
1760 Thrasher-Software/sigil

A local-first LLM development studio. Build, test, and customize inference...

37
Emerging
1761 earthai-tech/fusionlab-learn

fusionlab-learn: Igniting Next-Gen Temporal Fusion Architectures

37
Emerging
1762 sytelus/nanuGPT

Simple, reliable and well tested training code for quick experiments with...

37
Emerging
1763 iMoonLab/LLM4Hypergraph

The source code of ICLR 2025 "Beyond Graphs: Can Large Language Models...

37
Emerging
1764 Rishit-dagli/GLU

An easy-to-use library for GLU (Gated Linear Units) and GLU variants in TensorFlow.

37
Emerging
1765 readme-generator/alreadyme-ai-research

Generate README.md with GPT-3 few-shot learning

37
Emerging
1766 StarRing2022/ChatGPTX-Uni

实现一种多Lora权值集成切换+Zero-Finetune零微调增强的跨模型技术方案,LLM-Base+LLM-X+Alpaca,初期,LLM-Base为...

37
Emerging
1767 teelinsan/camoscio

Camoscio: An Italian instruction-tuned language model based on LLaMA

37
Emerging
1768 invergent-ai/surogate

Insanely fast LLM pre-training and fine-tuning for modern NVIDIA GPUs....

37
Emerging
1769 tirtharajdash/LMLFStar

Generating target-specific novel lead molecules using an LLM

37
Emerging
1770 ksm26/Finetuning-Large-Language-Models

Unlock the potential of finetuning Large Language Models (LLMs). Learn from...

37
Emerging
1771 IDSIA/fpainter

Official repository for the paper "Images as Weight Matrices: Sequential...

37
Emerging
1772 luohongyin/LangCode

LangCode - Improving alignment and reasoning of large language models (LLMs)...

37
Emerging
1773 ivonajdenkoska/tulip

[ICLR 2025] Official code repository for "TULIP: Token-length Upgraded CLIP"

37
Emerging
1774 huggingface/large_language_model_training_playbook

An open collection of implementation tips, tricks and resources for training...

37
Emerging
1775 desaixie/zeroverse

Official code for NeurIPS 2024 paper LRM-Zero: Training Large Reconstruction...

37
Emerging
1776 poteminr/instruct-ner

Instruct LLMs for flat and nested NER. Fine-tuning Llama and Mistral models...

37
Emerging
1777 hyintell/awesome-refreshing-llms

EMNLP'23 survey: a curation of awesome papers and resources on refreshing...

37
Emerging
1778 ariya/gamal

Research tool leveraging LLM for answers

37
Emerging
1779 Troyanovsky/llama-vision-image-tagger

Use Llama3.2 Vision for tagging and searching images on your local machine.

37
Emerging
1780 zjunlp/Mol-Instructions

[ICLR 2024] Mol-Instructions: A Large-Scale Biomolecular Instruction Dataset...

37
Emerging
1781 JonSnow1807/Medical-Prescription-OCR

OCR system for handwritten medical prescriptions using Donut transformer and...

37
Emerging
1782 VityaVitalich/STASC

[ICLR 2025 SSI-FM] Self-Taught Self-Correction for Small Language Models

37
Emerging
1783 poloclub/Fine-tuning-LLMs

Finetune Llama 2 on Colab for free on your own data: step-by-step tutorial

37
Emerging
1784 pier-maker92/bachsformer

A Bach music generator with Artificial Intelligence. This model is made by a...

37
Emerging
1785 jellydn/gpt4all-cli

By utilizing GPT4All-CLI, developers can effortlessly tap into the power of...

37
Emerging
1786 MurrellGroup/InvariantPointAttention.jl

Julia implementation of AlphaFold 2's Invariant Point Attention

37
Emerging
1787 partarstu/transformers-in-java

Experimental project for AI and NLP based on Transformer Architecture

37
Emerging
1788 rendezqueue/rendezllama

CLI for llama.cpp with various commands to guide, edit, and regenerate...

37
Emerging
1789 otvam/pyscalexfmr

Optimization and Scaling of Medium-Frequency Transformers

37
Emerging
1790 openshieldai/openshield

OpenShield is a new generation security layer for AI models

37
Emerging
1791 alexeykarnachev/full_stack_transformer

Pytorch library for end-to-end transformer models training, inference and serving

37
Emerging
1792 andrewkchan/yalm

Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O

37
Emerging
1793 babycommando/machinascript-for-robots

Build LLM-powered robots in your garage with MachinaScript For Robots!

37
Emerging
1794 DeepLangAI/LingoWhale-8B

LingoWhale-8B: Open Bilingual LLMs | 开源双语预训练大模型

37
Emerging
1795 guxm2021/ALT_SpeechBrain

[ISMIR 2022] Transfer Learning of wav2vec 2.0 for Automatic Lyric Transcription

36
Emerging
1796 SuyogKamble/simpleVLM

building a simple VLM. Implementing LlaMA-SmolLM2 from scratch + SigLip2...

36
Emerging
1797 c0sogi/llama-api

An OpenAI-like LLaMA inference API

36
Emerging
1798 neosantara-xyz/glm-ocr-inference

Fast and lightweight GLM-OCR inference on Modal with an OpenAI-compatible...

36
Emerging
1799 ASSERT-KTH/agentic-evals-lab

Framework for training and evaluating LLMs with reinforcement learning in...

36
Emerging
1800 Selozhd/FNet-tensorflow

Tensorflow Implementation of "FNet: Mixing Tokens with Fourier Transforms."

36
Emerging
« Prev 1 2 3 16 17 18 19 20 76 77 78 Next »