All Transformer Models

7,795 models ranked by quality score · Page 28 of 78

Showing 2701–2800 of 7,795
# Model Score Tier
2701 aigc-apps/PertEval

[NeurIPS '24 Spotlight] PertEval: Unveiling Real Knowledge Capacity of LLMs...

31
Emerging
2702 DomHudson/bert-in-production

A collection of resources on using BERT (https://arxiv.org/abs/1810.04805 )...

31
Emerging
2703 discountry/forever-chat

chatgpt with forever memory!

31
Emerging
2704 titanml/takeoff-community

TitanML Takeoff Server is an optimization, compression and deployment...

31
Emerging
2705 Agora-Lab-AI/HydraNet

HydraNet is a state-of-the-art transformer architecture that combines...

31
Emerging
2706 yangjianxin1/LongQLoRA

LongQLoRA: Extent Context Length of LLMs Efficiently

31
Emerging
2707 zzz47zzz/codebase-for-incremental-learning-with-llm

[ACL2024] A Codebase for Incremental Learning with Large Language Models;...

31
Emerging
2708 elijahnzeli1/CausalTorch

CausalTorch is a PyTorch library for building generative models with...

31
Emerging
2709 ryoungj/ObsScaling

[NeurIPS'24 Spotlight] Observational Scaling Laws

31
Emerging
2710 ant-louis/belgpt2

🇧🇪 BelGPT-2: the 1st GPT model pretrained in French.

31
Emerging
2711 Yifan-Song793/ETO

Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents...

31
Emerging
2712 euclaise/SlimTrainer

Full finetuning of large language models without large memory requirements

31
Emerging
2713 hao-ai-lab/d3LLM

d3LLM: Ultra-Fast Diffusion LLM 🚀

31
Emerging
2714 Agora-Lab-AI/OmniByteGPT

An implementation of an all-new foundation model architecture that trains on...

31
Emerging
2715 w1bb/ATE

A server application that provides the user answers to trivia-like questions.

31
Emerging
2716 Shaurya-Sethi/transqlate

End-to-end natural language to SQL system: schema-aware model fine-tuning,...

31
Emerging
2717 ChaitanyaK77/Optimal-Detection-of-Diabetic-Retinopathy-Severity-Using-Attention-Based-CNN-and-Vision-Transformers

This repository contains the implementation of a hybrid model combining...

31
Emerging
2718 Iteranya/AktivaAI

Local LLM Discord Bot

31
Emerging
2719 JunyiYe/FaultyMathProblem

From Blind Solvers to Logical Thinkers: Benchmarking LLMs’ Logical Integrity...

31
Emerging
2720 NiuTrans/Introduction-to-Transformers

An introduction to basic concepts of Transformers and key techniques of...

31
Emerging
2721 abdur75648/MedicalGPT

Medical Report Generation And VQA (Adapting XrayGPT to Any Modality)

31
Emerging
2722 bpevangelista/vfastml

Inference and Training Engine for LLMs, Image2Image and Other Models

31
Emerging
2723 py-lama/weblama

A web-based Markdown editor with syntax highlighting, Mermaid diagram...

31
Emerging
2724 jiaowoguanren0615/DLinear

This is a warehouse for DLinear-Pytorch-model, can be used to train your...

31
Emerging
2725 EternityYW/RUPBench

RUPBench: Benchmarking Reasoning Under Perturbations for Robustness...

31
Emerging
2726 serp-ai/LLaMA-8bit-LoRA

Repository for Chat LLaMA - training a LoRA for the LLaMA (1 or 2) models on...

31
Emerging
2727 VirtualRoyalty/gan-plus-nlp

Generative adversarial approach to most popular NLP tasks

31
Emerging
2728 FudanDISC/ReForm-Eval

An benchmark for evaluating the capabilities of large vision-language models (LVLMs)

31
Emerging
2729 KevinLee1110/dynamic-batching

The official repo for the paper "Optimizing LLM Inference Throughput via...

31
Emerging
2730 ApocryphalEditor/SRM-mapping-framework

A framework for mapping the internal geometry of transformer representations...

31
Emerging
2731 LMOS-IO/ALMoAPI

ALMoAPI, Agentic Language Model API, is a fork of tabbyAPI, designed to...

31
Emerging
2732 danieloquelis/natural-language-git

Offline LLM-powered Git CLI tool. NLGit interprets your natural language...

31
Emerging
2733 NeurAI-Lab/MT-SfMLearner

Official code for 'Transformers in Unsupervised Structure-from-Motion' and...

31
Emerging
2734 shikiw/Modality-Integration-Rate

[ICCV 2025] The official code of the paper "Deciphering Cross-Modal...

31
Emerging
2735 ImplicitLayer/agents_nlp

Agents for solving NLP problems

31
Emerging
2736 AJAkil/LLMalMorph

This repository contain the tool LLMalMorph, a semi automated tool that...

31
Emerging
2737 kyegomez/MobileVLM

Implementation of the LDP module block in PyTorch and Zeta from the paper:...

31
Emerging
2738 Roboflow-Universe/finetune-RF-DETR

Modular CLI pipeline for fine‑tuning RF‑DETR object detection models on...

31
Emerging
2739 LikithMeruvu/Gemma2B_Finetuning_Medium

This Repo contains How to Finetune Google's New Gemma LLm model using your...

31
Emerging
2740 JarvisPei/FuseGPT

The implementation for the paper, FuseGPT: Learnable Layers Fusion of...

31
Emerging
2741 graphcore-research/jax-scalify

JAX Scalify: end-to-end scaled arithmetics

31
Emerging
2742 XCollab/HuggingFace

This repository provides an overview of Hugging Face's Transformers library,...

31
Emerging
2743 MartinaHutter/yaskawa-voice-commands

NLP for yaskawa robot

31
Emerging
2744 Pranav-here/agentic-ai-chatbot

This project is a modular AI chatbot framework that allows dynamic...

31
Emerging
2745 surrey-nlp/LLM4MT_eval

This repository is for our paper "What do large language model need for...

31
Emerging
2746 FranxYao/FlanT5-CoT-Specialization

Implementation of ICML 23 Paper: Specializing Smaller Language Models...

31
Emerging
2747 smpanaro/coreml-llm-cli

CLI to demonstrate running a large language model (LLM) on Apple Neural Engine.

31
Emerging
2748 xmindflow/deformableLKA

[WACV 2024] Beyond Self-Attention: Deformable Large Kernel Attention for...

31
Emerging
2749 nrl-ai/CustomChar

Your customized AI assistant - Personal assistants on any hardware! With...

31
Emerging
2750 yihong1120/Llama2-Telegram-Bot

Integration of the advanced llama2 AI model with Telegram to provide...

31
Emerging
2751 Arman176001/Oxidize

⚙️ Oxidize: A Python-to-Rust code translator to boost performance, safety,...

31
Emerging
2752 techthoughts2/pwshBedrock

pwshBedrock is a PowerShell module designed to simplify interaction with...

31
Emerging
2753 marqinhos/MedicalLiverSegmentationToolKit

Medical Toolkit for Liver Volume Segmentation

31
Emerging
2754 jaabmar/cp_fuse

Implementation for the paper "Copyright-Protected Language Generation via...

31
Emerging
2755 QwenLM/PolyMath

[NeurIPS 2025 D&B Track] Evaluation Code Repo for Paper "PolyMath:...

31
Emerging
2756 OpenMOSS/LongLLaDA

[AAAI26] LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs

31
Emerging
2757 NiuTrans/Vision-LLM-Alignment

This repository contains the code for SFT, RLHF, and DPO, designed for...

31
Emerging
2758 kyegomez/primus

A multimodal foundation model for humanoid robotics that integrates multiple...

31
Emerging
2759 mrcabbage972/simple-toolformer

A Python implementation of Toolformer using Huggingface Transformers

31
Emerging
2760 AGI-Edgerunners/LLM-Optimizers-Papers

Must-read Papers on Large Language Model (LLM) as Optimizers and Automatic...

31
Emerging
2761 jwergieluk/revllm

RevLLM -- Reverse Engineering Tools for Large Language Models

31
Emerging
2762 harleyszhang/llm_counts

llm theoretical performance analysis tools and support params, flops, memory...

31
Emerging
2763 pdaicode/awesome-LLMs-finetuning

Collection of resources for finetuning Large Language Models (LLMs).

31
Emerging
2764 dinhquy-nguyen-1704/ZaloAI2023-Elementary-Math-Solving

Baseline achieving 0.8 accuracy on the private test set in the ZaloAI...

31
Emerging
2765 Joyce94/LLM-RLHF-Tuning

LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)

31
Emerging
2766 kryptomrx/tonl-mcp-bridge

Reduce LLM token costs by 30-60% with TONL format. TypeScript library & CLI...

31
Emerging
2767 RahulSChand/llama2.c-for-dummies

Step by step explanation/tutorial of llama2.c

31
Emerging
2768 cutec-chris/matrix-llm-bot

An Bot wich can use most of Large Language Models

31
Emerging
2769 hem9984/Dataset-label

This will allow you to choose your labels, and then label every image in a...

31
Emerging
2770 iboing/CorDA

CorDA: Context-Oriented Decomposition Adaptation of Large Language Models...

31
Emerging
2771 apollosoldier/Advanced-Classifier

The Advanced Classification Model is a deep learning-based approach for...

31
Emerging
2772 ynes99/BraTS_Segmentation

Segmentation of brain tumors (Glioma) in MRIs using Meta's model SAM...

31
Emerging
2773 yinizhilian/ICLR2025-Papers-with-Code

历年ICLR论文和开源项目合集,包含ICLR2021、ICLR2022、ICLR2023、ICLR2024、ICLR2025.

31
Emerging
2774 dmis-lab/Outlier-Safe-Pre-Training

[ACL 2025] Outlier-Safe Pre-Training for Robust 4-Bit Quantization of Large...

31
Emerging
2775 rajatrayaraddi/rul-prediction-bilstm-cnn

A BiLSTM-CNN hybrid model with attention for predicting remaining useful life (RUL)

31
Emerging
2776 waltonfuture/Diff-eRank

[NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models

31
Emerging
2777 MoleculeTransformers/moleculenet-smiles-bert-mixup

Training pre-trained BERT language model on molecular SMILES from the...

31
Emerging
2778 bhanuprathap2000/sign-language-recognition

This repo contains the code for sign-language-recognition as part of our...

31
Emerging
2779 cosmic-heart/Benetech-Chart-Derendering

Benetech Kaggle Competition Work. Fine Tuning Matcha (Multi Modal...

31
Emerging
2780 garyb9/pytorch-transformers

Transformers architecture code playground repository in python using PyTorch.

31
Emerging
2781 sitammeur/qwen2.5-web

Qwen2.5 Instruct, large language model, operates within web browsers via 🤗...

31
Emerging
2782 telekom/llm_evaluation_results

LLM evaluation results

31
Emerging
2783 shhossain/BanglaTranslationKit

BanglaTranslationKit is a open-source translation package for offline...

31
Emerging
2784 Nikshaan/llm-from-scratch

Implementation of build a LLM from scratch by Sebastian Raschka.

31
Emerging
2785 fatemafaria142/Large-Language-Models-Over-Transformer-Models-for-Bangla-NLI

This research examines the performance of Large Language Models (GPT-3.5...

31
Emerging
2786 D-Roberts/transformers-retrieval-ranking-nli-ECIR2021

Multilingual retrieval, ranking and natural language inference with...

31
Emerging
2787 upunaprosk/quantized-lm-confidence

Code for NAACL paper When Quantization Affects Confidence of Large Language Models?

31
Emerging
2788 ilias-ant/toxic-spans-detection

An attempt at SemEval 2021 Task 5: Toxic Spans Detection.

31
Emerging
2789 matteomedioli/BERT-KG

Enriching Language Models Representations via Knowledge Graphs Regularisation

31
Emerging
2790 Nickil21/weakly-supervised-parsing

Official Code for our Findings of ACL 2022 paper: Co-training an...

31
Emerging
2791 toriving/haafor-challenge-2020

The project for HAAFOR CHALLENGE 2020

31
Emerging
2792 stevezheng23/fewshot_nlp_pt

Few-shot NLP in PyTorch

31
Emerging
2793 HLTCHKUST/VG-GPLMs

The code repository for EMNLP 2021 paper "Vision Guided Generative...

31
Emerging
2794 mlane/llm-getting-started

Practical, beginner-friendly LLM projects using Python, LangChain, and...

31
Emerging
2795 mtanghu/LEAP

LEAP: Linear Explainable Attention in Parallel for causal language modeling...

31
Emerging
2796 ParadoxZW/LLaVA-UHD-Better

A bug-free and improved implementation of LLaVA-UHD, based on the code from...

31
Emerging
2797 cambridgeltl/sail-bli

Self-Augmented In-Context Learning for Unsupervised Word Translation (ACL...

31
Emerging
2798 seonghyeonye/Flipped-Learning

[ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models...

31
Emerging
2799 UIC-Liu-Lab/ContinualLM

An Extensible Continual Learning Framework Focused on Language Models (LMs)

31
Emerging
2800 Skyline-9/Visionary-Vids

Multi-modal transformer approach for natural language query based joint...

31
Emerging
« Prev 1 2 3 26 27 28 29 30 76 77 78 Next »