All Transformer Models

7,795 models ranked by quality score · Page 26 of 78

Showing 2501–2600 of 7,795
# Model Score Tier
2501 rti/gptvis

Understanding Transformers Using A Minimal Example

32
Emerging
2502 EternityYW/BiasEval-LLM-MentalHealth

Unveiling and Mitigating Bias in Mental Health Analysis with Large Language Models

32
Emerging
2503 kennethleungty/DeepSeek-R1-Ollama-Simple-Evals

Run and Evaluate DeepSeek-R1 Distilled Models Locally with Ollama and...

32
Emerging
2504 m3hrdadfi/news-headline-generation

A Bert2Bert model which able to generate headlines!

32
Emerging
2505 MurtyShikhar/TreeProjections

Tool to measure tree-structuredness of the internal algorithm learnt by a...

32
Emerging
2506 affjljoo3581/polyglot-jax-inference

TPU에서 한국어용 LLM 추론을 위한 Jax/Flax 구현체입니다.

32
Emerging
2507 BerkeliumLabs/Berkelium-labs

Your personal AI Lab, accessible everywhere! Explore, experiment, and...

32
Emerging
2508 softsys4ai/differentiable-proving

Code and data for the paper "Pretrained Language Models are Symbolic...

32
Emerging
2509 QwenLM/ParScale

Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling

32
Emerging
2510 kyegomez/VLM-Mamba

We introduce VLM-Mamba, the first Vision-Language Model built entirely on...

32
Emerging
2511 jose-compu/cerebras-coding-agent

A Cerebras AI LLM coding agent for the command line

32
Emerging
2512 pleisto/yuren-13b

Yuren 13B is an information synthesis large language model that has been...

32
Emerging
2513 rd-serendipity/ai-research-paper-explainer

AI-powered tool that transforms complex research papers into clear,...

32
Emerging
2514 HyperMink/inferenceable

Scalable AI Inference Server for CPU and GPU with Node.js | Utilizes...

32
Emerging
2515 rajaswa/indic-syntax-evaluation

Vyākarana: A Colorless Green Benchmark for Syntactic Evaluation in Indic Languages

32
Emerging
2516 taesiri/ArXivQA

WIP - Automated Question Answering for ArXiv Papers with Large Language...

32
Emerging
2517 pyladiesams/llm-guardrails-jul2024

Dive into the world of LLM Guardrails using tools like NVIDIA’s NeMo...

32
Emerging
2518 kanchengw/cnllm

统一的中文大模型适配库,将主流中国大模型 API 输出封装为 OpenAI 格式,无缝协作openai、langchain等大多数openai结构适配的python库

32
Emerging
2519 clip-italian/clip-italian

CLIP (Contrastive Language–Image Pre-training) for Italian

32
Emerging
2520 namgyu-youn/PyTorch-Pruning

Benchmark and profile pruning researches and open-sources

32
Emerging
2521 amazon-science/wqa-contextual-qa

Coala is a python package for Contextual Answer Sentence Selection.

32
Emerging
2522 lxe/llavavision

A simple "Be My Eyes" web app with a llama.cpp/llava backend

32
Emerging
2523 xmindflow/SSCT

[ICCV 2023] Self-supervised Semantic Segmentation: Consistency over Transformation

32
Emerging
2524 asigalov61/Google-Magenta-Piano-Transformer-Colab

[DEAD/NOT SUPPORTED ANYMORE] This is the only fully working and functioning...

32
Emerging
2525 microsoft/encoder-decoder-slm

Efficient encoder-decoder architecture for small language models (≤1B...

32
Emerging
2526 BoHuangLab/CELL-E_2

Multimodal encoder-only transformer model for image-based protein predictions

32
Emerging
2527 PeterGriffinJin/Heterformer

Heterformer: Transformer-based Deep Node Representation Learning on...

32
Emerging
2528 ksm26/Pretraining-LLMs

Master the essential steps of pretraining large language models (LLMs)....

32
Emerging
2529 ZhengaoLi/DISP-LLM-Dimension-Independent-Structural-Pruning

An implementation of the DISP-LLM method from the NeurIPS 2024 paper:...

32
Emerging
2530 HeegyuKim/language-model

한국어 언어 모델 학습을 위한 프로젝트(Flax, Pytorch with Huggingface Accelerate)

32
Emerging
2531 AspirinCode/AlphaPPImd

Exploring the conformational ensembles of protein-protein complexes with...

32
Emerging
2532 gia-uh/cecilia

The Cuban Language Model

32
Emerging
2533 AbhinaavRamesh/ollama-local-serve

Local LLM infrastructure for distributed AI applications. Serve...

32
Emerging
2534 psychbruce/FMAT

😷 The Fill-Mask Association Test (FMAT): Measuring Propositions in Natural Language.

32
Emerging
2535 anyantudre/Florence-2-Vision-Language-Model

Florence-2 is a novel vision foundation model with a unified, prompt-based...

32
Emerging
2536 Bruce-Lee-LY/cutlass_gemm

Multiple GEMM operators are constructed with cutlass to support LLM inference.

32
Emerging
2537 The-Martyr/CausalMM

[ICLR 2025] Mitigating Modality Prior-Induced Hallucinations in Multimodal...

32
Emerging
2538 AntonGuan/TimeOmni-1

[ICLR 2026] Official implementation of " 🦙 TimeOmni-1: Incentivizing Complex...

32
Emerging
2539 tommasocerruti/detllm

Deterministic-mode checks for LLM inference: measure run/batch variance,...

32
Emerging
2540 Simplifine-gamedev/Simplifine

🚀 Easy, open-source LLM finetuning with one-line commands, seamless cloud...

32
Emerging
2541 MaxwellYaoNi/PACE

[NeurIPS 2024 Spotlight] Official implementation for "PACE: marrying...

32
Emerging
2542 mahsasheikh/DrugGen

DrugGen: Advancing Drug Discovery with Large Language Models and...

32
Emerging
2543 rabilrbl/llamafile-builder

A simple github actions script to build a llamafile and uploads to huggingface

32
Emerging
2544 zTgx/llmweb-rs

Webpage to structured data in Rust & LLM

32
Emerging
2545 ybubnov/metalchat

Pure C++23 Llama inference for Apple Silicon chips

32
Emerging
2546 voidism/Lookback-Lens

Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual...

32
Emerging
2547 juzhengz/LoRI

[COLM 2025] LoRI: Reducing Cross-Task Interference in Multi-Task Low-Rank Adaptation

32
Emerging
2548 ShengcaiLiao/TransMatcher

[NeurIPS 2021] TransMatcher: Deep Image Matching Through Transformers for...

32
Emerging
2549 KasraAhmadi/PII-360

An open-source Chrome Extension that identifies Personally Identifiable...

32
Emerging
2550 mddunlap924/PyTorch-LLM

Fine-tuning an LLM using a Generic Workflow and Best Practices with PyTorch

32
Emerging
2551 guanwei49/DABL

DABL: Detecting Semantic Anomalies in Business Processes Using Large Language Models

32
Emerging
2552 oxidized-transformers/oxidized-transformers

Modular Rust transformer/LLM library using Candle

32
Emerging
2553 ShiZhengyan/InstructionModelling

[NeurIPS 2024 Main Track] Code for the paper titled "Instruction Tuning With...

32
Emerging
2554 leondz/lm_risk_cards

Risks and targets for assessing LLMs & LLM vulnerabilities

32
Emerging
2555 shunk031/allennlp-shiba-model

AllenNLP integration for Shiba: Japanese CANINE model

32
Emerging
2556 Tebmer/Rereading-LLM-Reasoning

EMNLP 2024 "Re-reading improves reasoning in large language models". Simply...

32
Emerging
2557 myscience/x-lstm

Pytorch implementation of the xLSTM model by Beck et al. (2024)

32
Emerging
2558 MusadiqPasha/Turkish-Hate-Speech-Classification-Explanation

Classify, explain, and rewrite Turkish hate speech tweets using BERT, SHAP,...

32
Emerging
2559 BFCmath/FinetuneAI_Learning

How to effectively finetune CV/LLM models (without local gpu)

32
Emerging
2560 bayer-science-for-a-better-life/data2text-bioleaflets

Biomedical Data-to-Text Generation via Fine-Tuning Transformers

32
Emerging
2561 xdevfaheem/Transformers

A Comprehensive Implementation of Transformers Architecture from Scratch

32
Emerging
2562 samadon1/LLM-From-Scratch

Medical Language Model fine-tuned using pretraining, instruction tuning, and...

32
Emerging
2563 kodejuice/ai-trade

A smart AI-powered trading assistant that uses large language models (LLMs)...

32
Emerging
2564 prakash-aryan/debatebrawl-app

A sophisticated AI-powered debate platform that integrates Large Language...

32
Emerging
2565 anas-zafar/LLM-Survey

The official GitHub page for the survey paper "A Survey on Large Language...

32
Emerging
2566 yaodongC/awesome-instruction-dataset

A collection of open-source dataset to train instruction-following LLMs...

32
Emerging
2567 RakePants/nerdless

Conversational AI Telegram bot based on a finetuned language model

32
Emerging
2568 didier-durand/llms-in-clouds

Experiments with LLMs in clouds (powered by SGLang)

32
Emerging
2569 systems-genomics-lab/deeptaxa

A deep learning framework for hierarchical taxonomy classification of 16S...

32
Emerging
2570 ScottCampit/personalized-marketing-chatbot

personalized marketing chatbot

32
Emerging
2571 rezazad68/TMUnet

Contextual Attention Network: Transformer Meets U-Net

32
Emerging
2572 azminewasi/Awesome-LLMs-ICLR-24

It is a comprehensive resource hub compiling all LLM papers accepted at the...

32
Emerging
2573 yinzhangyue/SelfAware

Do Large Language Models Know What They Don’t Know?

32
Emerging
2574 Buyun-Liang/SECA

[NeurIPS 2025] SECA: Semantically Equivalent and Coherent Attacks for...

32
Emerging
2575 XunshanMan/MVGFormer

This is the official implementation of the work presented at CVPR 2024,...

32
Emerging
2576 cmu-flame/FLAME-MoE

Official repository for FLAME-MoE: A Transparent End-to-End Research...

32
Emerging
2577 smvorwerk/xlstm-cuda

Cuda implementation of Extended Long Short Term Memory (xLSTM) with C++ and...

32
Emerging
2578 open-compass/ANAH

[ACL 2024] ANAH & [NeurIPS 2024] ANAH-v2 & [ICLR 2025] Mask-DPO

32
Emerging
2579 synlp/R2-LLM

The official GitHub repository of the AAAI-2024 paper "Bootstrapping Large...

32
Emerging
2580 HKUNLP/efficient-attention

[EVA ICLR'23; LARA ICML'22] Efficient attention mechanisms via control...

32
Emerging
2581 Nota-NetsPresso/shortened-llm

Compressed LLMs for Efficient Text Generation [ICLR'24 Workshop]

32
Emerging
2582 bernardoleite/fairytaleqa-translated

Code for paper "FairytaleQA Translated: Enabling Educational Question and...

32
Emerging
2583 deep-symbolic-mathematics/llm-srbench

[ICML2025 Oral] LLM-SRBench: A New Benchmark for Scientific Equation...

32
Emerging
2584 SafeAILab/RAIN

[ICLR'24] RAIN: Your Language Models Can Align Themselves without Finetuning

32
Emerging
2585 AILab-CVC/M2PT

[CVPR 2024] Multimodal Pathway: Improve Transformers with Irrelevant Data...

32
Emerging
2586 dsdanielpark/open-llm-datasets

Repository for organizing datasets and papers used in Open LLM.

32
Emerging
2587 BorealisAI/flora-opt

This is the official repository for the paper "Flora: Low-Rank Adapters Are...

32
Emerging
2588 zubair-irshad/NeRF-MAE

[ECCV 2024] Pytorch code for our ECCV'24 paper NeRF-MAE: Masked AutoEncoders...

32
Emerging
2589 alvion427/PerroPastor

Run Llama based LLMs in Unity entirely in compute shaders with no dependencies

32
Emerging
2590 ymoslem/Adaptive-MT-LLM

Adaptive Machine Translation with Large Language Models

32
Emerging
2591 mlverse/mall

Run multiple LLM predictions against a data frame with R and Python

32
Emerging
2592 BillChan226/HALC

[ICML 2024] Official implementation for "HALC: Object Hallucination...

32
Emerging
2593 rasbt/faster-pytorch-blog

Outlining techniques for improving the training performance of your PyTorch...

32
Emerging
2594 CJReinforce/PURE

Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is...

32
Emerging
2595 TIGER-AI-Lab/TIGERScore

"TIGERScore: Towards Building Explainable Metric for All Text Generation...

32
Emerging
2596 alexliap/greek_gpt

MoE Decoder Transformer implementation with MLX

32
Emerging
2597 Niez-Gharbi/Youtube-Summariser

Summarize your youtube videos with BART on streamlit app.

32
Emerging
2598 xmartlabs/spoter-embeddings

Create embeddings from sign pose videos using Transformers

32
Emerging
2599 fvliang/DART

Official Implementation of DART (DART: Diffusion-Inspired Speculative...

32
Emerging
2600 AIRI-Institute/Probing_framework

Framework for probing tasks

32
Emerging
« Prev 1 2 3 24 25 26 27 28 76 77 78 Next »