All Transformer Models

7,795 models ranked by quality score · Page 27 of 78

Showing 2601–2700 of 7,795
# Model Score Tier
2601 arshadshk/Last_Query_Transformer_RNN-PyTorch

Implementation of the paper "Last Query Transformer RNN for knowledge...

32
Emerging
2602 HiThink-Research/BizFinBench

A Business-Driven Real-World Financial Benchmark for Evaluating LLMs

32
Emerging
2603 katha-ai/EmoTx-CVPR2023

[CVPR 2023] Official code repository for "How you feelin'? Learning Emotions...

32
Emerging
2604 Mmorgan-ML/Phase-Slip-Sampler

Phase-Slip is a stochastic intervention architecture that operates on the...

32
Emerging
2605 varchasvee108/vision-transformer-maze-agent

Vision Transformer agent that learns to navigate mazes while visualizing...

31
Emerging
2606 asiff00/Bangla-Llama

Fine-tuned Llama 3 models for context-based question answering in the Bengali language.

31
Emerging
2607 ai-art-dev99/llm-from-scratch

Build a Large Language Model From Scratch

31
Emerging
2608 xiaoachen98/Open-LLaVA-NeXT

An open-source implementation for training LLaVA-NeXT.

31
Emerging
2609 catherinesyeh/story-viz

Reimagining storyline visualizations with LLMs (VIS 2025)

31
Emerging
2610 prateekralhan/Deep-Question-Answering-System

A deep learning-based Q&A system built using the RoBERTa model from Hugging Face...

31
Emerging
2611 laclouis5/uform-coreml-converters

CLI for converting UForm models to CoreML.

31
Emerging
2612 conceptofmind/PaLM-flax

Implementation of the SOTA Transformer architecture from PaLM - Scaling...

31
Emerging
2613 patricia-pereira/cd-erc

Code for the paper: Context-Dependent Embedding Utterance Representations...

31
Emerging
2614 john-osborne-j/quantized-clinicalbert

This repository contains a 4-bit quantized ClinicalBERT model for disease...

31
Emerging
2615 Katashynskyi/Voice_assistant_UA_EN

No api-keys | local | llama3.1 For language studying and live translation

31
Emerging
2616 maxxxzdn/erwin

Erwin: A Tree-based Hierarchical Transformer for Large-scale Physical...

31
Emerging
2617 ropensci/pangoling

An R package for estimating the log-probabilities of words in a given...

31
Emerging
2618 NC0DER/GreekT5

A series of Greek News Summarization Sequence-to-Sequence Models built with...

31
Emerging
2619 ASK-03/Reverse-Chain

Implementation of paper - Reverse Chain: A Generic-Rule for LLMs to Master...

31
Emerging
2620 vcanchik/robotmem

Robot memory

31
Emerging
2621 asiff00/Bengali-Sentence-Error-Correction

Fine-tune mBART-50 for Bengali Sentence Error Correction

31
Emerging
2622 dsdanielpark/hf-transllm

LLMtranslator translates and generates text in multiple languages.

31
Emerging
2623 RaptorMai/MLLM-CompBench

[NeurIPS'25] MLLM-CompBench evaluates the comparative reasoning of MLLMs...

31
Emerging
2624 PRITHIVSAKTHIUR/Nvidia-Cosmos-Reason1-Demo

Physical AI models understand physical common sense and generate appropriate...

31
Emerging
2625 Merterm/Modeling-Intensification-for-SLG

Public repo for the paper: "Modeling Intensification for Sign Language...

31
Emerging
2626 SCRN-VRC/Language-Translation-with-Fragment-Shaders

EN to JP and JP to EN with transformer models

31
Emerging
2627 Qwen-Applications/CLIPO

CLIPO: Contrastive Learning in Policy Optimization Generalizes RLVR

31
Emerging
2628 Curated-Awesome-Lists/Awesome-Llama3

A curated, awesome list of resources, tools, and projects for the AI Large...

31
Emerging
2629 bobazooba/shurale

Conversation AI model for open domain dialogs

31
Emerging
2630 ryokamoi/llm-self-correction-papers

List of papers on Self-Correction of LLMs.

31
Emerging
2631 KhaledSharif/robot-transformers

Train and evaluate an Action Chunking Transformer (ACT) to perform...

31
Emerging
2632 curtisgray/wingman

Wingman is the fastest and easiest way to run Llama models on your PC or Mac.

31
Emerging
2633 ItzDerock/llama-playground

A simple to use and powerful web-interface to mess around with Meta's LLaMA LLM.

31
Emerging
2634 avatsaev/av-local-llm-api

Easily run a local REST API with a custom LLM, running locally or...

31
Emerging
2635 baldoarbol/BodyShapeGPT

Fine-tuned LLMs generate accurate 3D human avatars from textual descriptions...

31
Emerging
2636 akshat0123/GPT-1

PyTorch implementation of GPT-1

31
Emerging
2637 azzeddineCH/flash-nanoGPT

JAX/Flax rewrite of @karpathy 🐐 NanoGPT using some of the common JAX...

31
Emerging
2638 longyuewangdcu/Chinese-Llama-2

Improve Llama-2's proficiency in comprehension, generation, and translation...

31
Emerging
2639 tongnie/ImputeFormer

[KDD 2024] "ImputeFormer: Low Rankness-Induced Transformers for...

31
Emerging
2640 bentoml/transformers-nlp-service

Online Inference API for NLP Transformer models - summarization, text...

31
Emerging
2641 JinXins/Awesome-Token-Merge-for-MLLMs

A paper list about Token Merge, Reduce, Resample, Drop for MLLMs.

31
Emerging
2642 ntropy-network/enrichment_models

This repository benchmarks the Ntropy API against different Large Language Models...

31
Emerging
2643 AhmetZamanis/DeepLearningEnergyForecasting

Time series forecasting on an hourly energy dataset, with LSTM & Transformer...

31
Emerging
2644 codeastra2/llm-feat

Automated feature engineering using Large Language Models (LLMs) for tabular data

31
Emerging
2645 naity/finetune-esm

Scalable Protein Language Model Finetuning with Distributed Learning and...

31
Emerging
2646 vmarinowski/infini-attention

An unofficial PyTorch implementation of 'Efficient Infinite Context...

31
Emerging
2647 ImplicitLayer/multiagent_environments

Environments for NLP multiagent tasks

31
Emerging
2648 liziniu/policy_optimization

Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)

31
Emerging
2649 JiauZhang/nnm

Neural Network Models

31
Emerging
2650 Relaxed-System-Lab/HexGen

[ICML 2024] Serving LLMs on heterogeneous decentralized clusters.

31
Emerging
2651 Koziev/LM-pretrain

Char-level language model pretraining code and scripts

31
Emerging
2652 Utshav-paudel/LLM-Zero-to-Hero

This repo contains the resources, projects and documentation of mine while...

31
Emerging
2653 prajjwal1/generalize_lm_nli

Code for the EMNLP 2021 workshop paper "Generalization in NLI: Ways...

31
Emerging
2654 crscardellino/argumentation-mining-transformers

Argumentation Mining Transformers Module (AMTM) implementation.

31
Emerging
2655 Basel-anaya/LoreWeaver

LoreWeaver is a Novel Generation Multimodal LLM based on Mistral 7B LLM

31
Emerging
2656 yuchen0515/2022-Competition-CUDAOutOfMemory

Our team placed 6th out of 119 teams in E.SUN AI Open Competition Summer...

31
Emerging
2657 lazy-guy/chess-llama

Tiny Llama model trained to play chess

31
Emerging
2658 yyDing1/GNER

[ACL 2024 Findings] Code implementation of Paper "Rethinking Negative...

31
Emerging
2659 misko/spf

Signal Processing Fun (in the sun)

31
Emerging
2660 j-webtek/Local-LLM_FineTune

Finetune Your Local LLM

31
Emerging
2661 muna-ai/muna-predictors

Interesting Python functions compiled to run anywhere with Muna.

31
Emerging
2662 jshuadvd/LongRoPE

Implementation of the LongRoPE: Extending LLM Context Window Beyond 2...

31
Emerging
2663 jordddan/Pruning-LLMs

The framework to prune LLMs to any size and any config.

31
Emerging
2664 makllama/makllama

MaK(Mac+Kubernetes)llama - Running and orchestrating large language models...

31
Emerging
2665 SciCrunch/bio_electra

Bio-Electra - Small and efficient discriminatively pre-trained language...

31
Emerging
2666 Giyanellow/llama-chatbot-with-ui

This project provides a comprehensive template for self-hosting a Large...

31
Emerging
2667 Aradhye2002/selective-peft-toolkit

Official implementation of the paper "Step-by-Step Unmasking for...

31
Emerging
2668 shinomakoi/magi_llm_gui

A Qt GUI for large language models

31
Emerging
2669 wassemgtk/llm.scala

Extensible implementation of a Language Model (LLM) training framework in Scala.

31
Emerging
2670 koudounasalkis/CLUES

This repo contains the code for "A Contrastive Learning Approach to Mitigate...

31
Emerging
2671 raymin0223/fast_robust_early_exit

Fast and Robust Early-Exiting Framework for Autoregressive Language Models...

31
Emerging
2672 tripathiarpan20/self-improvement-4all

Private self-improvement coaching with open-source LLMs

31
Emerging
2673 tenghuilee/ScalingCapFusedVisionLM

number of tokens <=> performance to a vision language model

31
Emerging
2674 swapUniba/LaikaLLM

A hub for training and evaluating LLMs, following the multitask paradigm, in...

31
Emerging
2675 xmed-lab/TAM

[ICCV25 Oral] Token Activation Map to Visually Explain Multimodal LLMs

31
Emerging
2676 cui-shaobo/defeasibility-in-causality

exploring the defeasibility inside causality

31
Emerging
2677 qiqiApink/MotionGPT

The official PyTorch implementation of the paper "MotionGPT: Finetuned LLMs...

31
Emerging
2678 just-ctrlC-ctrlV/Mechanical-Assistant

Imagine a world where your mechanical tasks are streamlined and optimized by...

31
Emerging
2679 alan-turing-institute/prompto

An open source library for asynchronous querying of LLM endpoints

31
Emerging
2680 ai4sd/multiscale-byte-lm

A hierarchical LM that scales to training on context windows of +5M tokens

31
Emerging
2681 cleopatra-itn/claim_detection

Code for tasks in the paper "Check_square at CheckThat! 2020: Claim...

31
Emerging
2682 kyegomez/Open-NAMM

An open source implementation of the paper: "AN EVOLVED UNIVERSAL TRANSFORMER MEMORY"

31
Emerging
2683 VidhyaVarshanyJS/EnsembleX

EnsembleX utilizes the Knapsack algorithm to optimize Large Language Model...

31
Emerging
2684 ziansu/codeart

Official repo for FSE'24 paper "CodeArt: Better Code Models by Attention...

31
Emerging
2685 lrusso/llama3pure

Three inference engines for Llama 3: pure C for desktop systems, pure...

31
Emerging
2686 IParraMartin/An-Explanation-Is-All-You-Need

The original transformer implementation from scratch. It contains...

31
Emerging
2687 nlp-uoregon/Okapi

Okapi: Instruction-tuned Large Language Models in Multiple Languages with...

31
Emerging
2688 hplt-project/monolingual-multilingual-instruction-tuning

Monolingual or Multilingual Instruction Tuning: Which Makes a Better Alpaca

31
Emerging
2689 codefuse-ai/GALLa

[ACL 2025] Graph Aligned Large Language Models for Improved Source Code Understanding

31
Emerging
2690 Orion-AI-Lab/televit

Teleconnection-driven vision transformers for improved long-term forecasting

31
Emerging
2691 HenryCai11/LLM-Self-Control

The official repo of paper "Self-Control of LLM Behaviors by Compressing...

31
Emerging
2692 M4TH1EU/llama-assist

Manage your smart home in Home Assistant with local LLMs running with llama.cpp

31
Emerging
2693 AshutoshDongare/softskill-NER

Fine-tuning a 🤗 transformer model for a soft-skill NER task

31
Emerging
2694 camelop/NLP-Robustness

OOD Generalization and Detection (ACL 2020)

31
Emerging
2695 zeroxt32/Forex-Expert-Advisor-Python

Forex Bot Agents Using Machine Learning Implementations. Custom Forex Environments

31
Emerging
2696 nghiempt/llm-analysis-privacy-policy

Unveiling Discrepancies in Android App Data Safety Declarations and Privacy...

31
Emerging
2697 vipulraheja/coedit

Official implementation of the paper "CoEdIT: Text Editing by Task-Specific...

31
Emerging
2698 yfedoseev/llmkit

Production-grade LLM client - Rust, Python, TypeScript. 100+ providers,...

31
Emerging
2699 ivanovitchm/PPGEEC2318

Repository for EEC2318, a graduate course on PPgEEC about Machine Learning

31
Emerging
2700 TamSiuhin/LLM-UM-Reading

A list of large language models for user modeling (LLM-UM) papers, based on...

31
Emerging