All Transformer Models

7,795 models ranked by quality score · Page 25 of 78

Showing 2401–2500 of 7,795
# Model Score Tier
2401 deterministic-algorithms-lab/NLP-Journey

This repository provides a selection of very basic and minimal notebooks for...

33
Emerging
2402 SORRY-Bench/sorry-bench

Benchmark evaluation code for "SORRY-Bench: Systematically Evaluating Large...

33
Emerging
2403 augustwester/transformer-xl

A lightweight PyTorch implementation of the Transformer-XL architecture...

33
Emerging
2404 benct/kotlin-cheat-sheet

:star: Kotlin <3 Cheat Sheet, Collection Extension Functions and General Examples

33
Emerging
2405 akanyaani/miniLLAMA

A simplified LLAMA implementation for training and inference tasks.

33
Emerging
2406 AndrewBoessen/PerfectRep

PerfectRep is a 3D pose estimation model tailored specifically for...

33
Emerging
2407 UBC-NLP/marbert

UBC ARBERT and MARBERT Deep Bidirectional Transformers for Arabic

33
Emerging
2408 allenai/x-lxmert

PyTorch code for EMNLP 2020 paper "X-LXMERT: Paint, Caption and Answer...

33
Emerging
2409 YJiangcm/LTE

[ACL 2024] Learning to Edit: Aligning LLMs with Knowledge Editing

33
Emerging
2410 maifeeulasad/LocalLLaMA

📚 LocalLLaMA Archive — Community-powered static archive for r/LocalLLaMA

33
Emerging
2411 Uralstech/vid-orca

Deploy LLaMA-2 Chat on Google Cloud.

33
Emerging
2412 detsutut/ama-bot

A modern and lightweight NLP interface for Question-Answering systems and...

33
Emerging
2413 kurakurai/Luth

Luth is a state-of-the-art series of fine-tuned LLMs for French

33
Emerging
2414 asaddi/YALLM-LlamaVision

A set of nodes for basic Llama 3.2 Vision support in ComfyUI

33
Emerging
2415 loretoparisi/bert_text_classifier

Text Classification with BERT

33
Emerging
2416 datawhalechina/unlock-hf

解锁HuggingFace生态的百般用法

32
Emerging
2417 SalesforceAIResearch/Elastic-Reasoning

Make reasoning models scalable

32
Emerging
2418 CogitatorTech/zigformer

An educational transformer-based LLM in pure Zig

32
Emerging
2419 peacelwh/VT-FSL

[NeurIPS 2025] VT-FSL: Bridging Vision and Text with LLMs for Few-Shot Learning

32
Emerging
2420 luffycodes/Tutorbot-Spock

An Education Tutoring Chatbot based on Learning Science Principles powered...

32
Emerging
2421 liuyang-ict/SAP-DETR

[CVPR 2023] Official implementation of "SAP-DETR: Bridging the Gap between...

32
Emerging
2422 SapienzaNLP/ita-bench

A collection of Italian benchmarks for LLM evaluation

32
Emerging
2423 fajri91/sum_liputan6

The first large-scale summarization corpus for the Indonesian language. AACL 2020.

32
Emerging
2424 nsi319/Finetune-Transformers

Abstractive text summarization by fine-tuning seq2seq models.

32
Emerging
2425 remixer-dec/botality-ii

telegram bot for self-hosted local inference of stable diffusion,...

32
Emerging
2426 waltonfuture/InstructionGPT-4

InstructionGPT-4

32
Emerging
2427 LorenzoAgnolucci/BERT_for_ABSA

In this work (Targeted) Aspect-Based Sentiment Analysis task is converted to...

32
Emerging
2428 sayakpaul/deploy-hf-tf-vision-models

This repository shows various ways of deploying a vision model (TensorFlow)...

32
Emerging
2429 seongminp/transformers-into-vaes

Code for "Finetuning Pretrained Transformers into Variational Autoencoders"

32
Emerging
2430 FareedKhan-dev/Understanding-Transformers-Step-by-Step-math-example

Understanding Large Language Transformer Architecture like a child

32
Emerging
2431 babycommando/neuralgraffiti

Live-bending a foundation model’s output at neural network level.

32
Emerging
2432 andreiramani/jadi4llamacpp

Just another drop in for llama.cpp

32
Emerging
2433 ejurasek00/Hashing_LLM_Debiasing

Repository consisting of the files used in the experiments + brief...

32
Emerging
2434 shamspias/Transformers-and-Large-Language-Models-From-Basics-to-Frontier-Research

Dive into the transformative world of NLP with this guide on Transformers....

32
Emerging
2435 ramalamadingdong/onnx-rubikpi

ONNX LLM runtime on RUBIK-Pi with Gemma 1B and Llama 3.2 1B

32
Emerging
2436 apple/ml-interspeech2022-phi_rtn

Repository accompanying the Interspeech 2022 publication titled...

32
Emerging
2437 templetwo/PhaseGPT

Kuramoto Phase-Coupled Oscillator Attention in Transformers

32
Emerging
2438 codyjk/ChessGPT

♟️ A transformer that plays chess 🤖

32
Emerging
2439 tmcarmichael/fabricai-inference-server

A hackable, modular, containerized inference server for deploying large...

32
Emerging
2440 skjp/spout

Workspace Repo for Synergistic Plugins Optimizing Usability of Transformers(Spout)

32
Emerging
2441 BothBosu/Synthetic-Data-for-Scam-Detection-Leveraging-LLMs-to-Train-Deep-Learning-Models

This repository contains the source code and synthetic datasets used in the...

32
Emerging
2442 kiyoshisasano/llm-failure-atlas

A graph-based failure modeling and deterministic detection system for LLM...

32
Emerging
2443 StarxSky/ANE-GPT-New

New ANE GPT

32
Emerging
2444 BrightBlueCheese/transformers_and_chemistry

The Role of Model Architecture and Scale in Predicting Molecular Properties:...

32
Emerging
2445 zzteam-rccup-2024/aurora-echo

We propose a new feedback system, named Aurora Echo} which provides...

32
Emerging
2446 chris-santiago/met

Reproducing the MET framework with PyTorch

32
Emerging
2447 SapienzaNLP/MaTESe

MaTESe: Machine Translation Evaluation as a Sequence Tagging Problem

32
Emerging
2448 codewithdark-git/llama-3-Hackathon

LLaMA Genius is an AI-powered research assistant designed to help users...

32
Emerging
2449 winstxnhdw/llm-api

A fast CPU-based API for Qwen 2.5 using CTranslate2, hosted on Hugging Face Spaces.

32
Emerging
2450 Bengal1/Simple-Transformer

An introductory guide and practical showcase of the Transformer model.

32
Emerging
2451 hqhq1025/ai-course-notes

📚 220+ 份 AI/LLM 公开课中文讲义 PDF | Stanford CS336·CS224R·CS25·CS231N | Berkeley...

32
Emerging
2452 DrejcPesjak/scaling-monosemanticity-llama

Reproducing Scaling Monosemanticity: Extracting Interpretable Features from...

32
Emerging
2453 NotYuSheng/DialogSmith

Fine-tune an LLM on your Telegram chats to replicate your writing style...

32
Emerging
2454 VisioSphereAI/labelvim

This is a python based standalone image annotation tool designed for tasks...

32
Emerging
2455 tsinghua-fib-lab/UniST

Official implementation for "UniST: A Prompt-Empowered Universal Model for...

32
Emerging
2456 EdvardOlsen/Horoscope_generator

This is a horoscope generating code

32
Emerging
2457 liux2/Langchain-LLM-Config

Langchain LLM config adapters

32
Emerging
2458 ppijbb/NaturalLanguageProcessing

natural language processing notebooks

32
Emerging
2459 CYFARE/PDXTRACT

Extract From PDF's Using Ollama Local LLM

32
Emerging
2460 KCLabMTU/LMCrot

Protein Language Model (pLM) Powered Protein Crotonylation (Kcr) Modified...

32
Emerging
2461 TamSiuhin/P2P

source code for "Instant Personalized Large Language Model Adaptation via...

32
Emerging
2462 jrazi/llm-assignments-fall-2023

My assignment submissions for the Fall 2023 Large Language Models course at...

32
Emerging
2463 MichiganNLP/Scalable-VLM-Probing

Probe Vision-Language Models

32
Emerging
2464 ES7/Introduction-to-LLMs

In this repository I have explained the application of Large Language Models...

32
Emerging
2465 cgxjdzz/FeatureForge-LLM

FeatureForge LLM is a Python package that leverages large language models...

32
Emerging
2466 zealscott/AutoProfiler

Source code for Automated Profile Inference with Language Model Agents

32
Emerging
2467 mrkorzun/Multi-AI-Telegram-Bot

Multi-model Telegram bot (aiogram v3) with OpenRouter model picker (Llama,...

32
Emerging
2468 yangjianxin1/Firefly-LLaMA2-Chinese

Firefly中文LLaMA-2大模型,支持增量预训练Baichuan2、Llama2、Llama、Falcon、Qwen、Baichuan、Intern...

32
Emerging
2469 mary-lev/llm-ocr

LLM-powered OCR evaluation and correction package that supports multiple...

32
Emerging
2470 KhoiDOO/vitvqganvae

Benchmark for Evaluating Data Reconstruction using Vector Quantization

32
Emerging
2471 fualsan/TransformerFromScratch

PyTorch Implementation of Transformer Deep Learning Model

32
Emerging
2472 askblocks/askblocks-core

LLM API backend for Askblocks Q&A widget system.

32
Emerging
2473 alphasecio/groq

A Streamlit chatbot with memory for running open-source text models on Groq.

32
Emerging
2474 Arezkiiiii/mini_llm

🚀 Build and understand a Large Language Model from scratch using PyTorch...

32
Emerging
2475 huangjia2019/llm-in-action

LLM Examples

32
Emerging
2476 OpenMLRL/LLM_Collab_Code_Generation

LLM Collaboration for Code Generation

32
Emerging
2477 Alba-Intelligence/GraphMERT.jl

Julia implementation of the GraphMERT algorithm (Arxiv 2510.09580)

32
Emerging
2478 EKarton/English-French-Translator

A web application that translate sentences between English and French

32
Emerging
2479 robert-mcdermott/ollama-batch-cluster

Large Scale Batch Processing with Ollama

32
Emerging
2480 wangxiao5791509/MultiModal_BigModels_Survey

[MIR-2023-Survey] A continuously updated paper list for multi-modal...

32
Emerging
2481 Flagro/OmniModKit

Multimodal LLM toolkit

32
Emerging
2482 pmady/llmops

🚀 The Ultimate Curated List of LLMOps Tools, Frameworks, and Resources - A...

32
Emerging
2483 jpwahle/emnlp23-paraphrase-types

The official implementation of the EMNLP 2023 paper "Paraphrase Types for...

32
Emerging
2484 anshumansinha3301/Article-on-Large-Language-Models

This article provides an in-depth examination of LLMs, explaining...

32
Emerging
2485 Abhishek6353/AllMiniLML6V2-coreml

CoreML conversion of all-MiniLM-L6-v2 with a full SwiftUI demo, tokenizer...

32
Emerging
2486 Shivanshu-Gupta/in-context-learning

Easy in-context learning experiemnts with variety of datasets, LLMs, and...

32
Emerging
2487 yueliu1999/FlipAttack

[ICML 2025] An official source code for paper "FlipAttack: Jailbreak LLMs...

32
Emerging
2488 paulalesius/llmath

Large Language Math - The Mathematics of LLM Foundational Models - For Beginners

32
Emerging
2489 dengls24/LLM-para

Analyze LLM inference: FLOPs, memory, Roofline model. Supports GQA, MoE,...

32
Emerging
2490 yao8839836/kg-llm

Exploring large language models for knowledge graph completion. ICASSP 2025

32
Emerging
2491 RJain12/choformer

Cho codon optimization WIP

32
Emerging
2492 ola-krutrim/Chitrarth

Chitrarth: Bridging Vision and Language for a Billion People

32
Emerging
2493 mikecvet/beam

LLM Beam Search Example Implementation

32
Emerging
2494 CharlesYuan02/eve-bot

A Discord bot I created in Python. Her name is Eve.

32
Emerging
2495 Living-with-machines/genre-classification

Jupyter book showing how to build an ML powered book genre classifier

32
Emerging
2496 kazemihabib/Mitigating-Reasoning-LLM-Social-Bias

A novel approach to mitigating social bias in Large Language Models through...

32
Emerging
2497 tasketh/tasketh

tasketh is a simple discord bot that lets moderators assign, and users claim tasks.

32
Emerging
2498 pittisl/GreenTrainer

Code for paper "Towards Green AI in Fine-tuning Large Language Models via...

32
Emerging
2499 gsarti/lcl23-xnlm-lab

Materials for the Lab "Explaining Neural Language Models from Internal...

32
Emerging
2500 kyegomez/Qwen-VL

My personal implementation of the model from "Qwen-VL: A Frontier Large...

32
Emerging
« Prev 1 2 3 23 24 25 26 27 76 77 78 Next »