All Transformer Models

7,795 models ranked by quality score · Page 29 of 78

Showing 2801–2900 of 7,795
# Model Score Tier
2801 khiwniti/kaggle-llm-api

🤖 Comprehensive solution for running Ollama/vLLM API servers in Kaggle...

31
Emerging
2802 OpenVanguard/remma-o1

Remma-O1: An open-source Language Model with 1.17B Params, built on pytorch...

31
Emerging
2803 obss/turkish-question-generation

Automated question generation and question answering from Turkish texts...

31
Emerging
2804 Trustworthy-ML-Lab/VLG-CBM

[NeurIPS 24] A new training and evaluation framework for learning...

31
Emerging
2805 gsilvamartin/Backforge

Experimental Intelligent AI Backend Agent

31
Emerging
2806 Sharukesh3/LLM-for-hydrogen-storage

The LARGE LANGUAGE MODEL FOR HYDROGEN STORAGE project uses advanced natural...

31
Emerging
2807 EternityYW/TRAM-Benchmark

TRAM: Benchmarking Temporal Reasoning for Large Language Models (Findings of...

30
Emerging
2808 bowen-upenn/llm_token_bias

[EMNLP 2024] A Peek into Token Bias: Large Language Models Are Not Yet...

30
Emerging
2809 Kacper-W-Kozdon/promptflow_unify_integration

The tool package for Microsoft's Prompt flow and the VS Code extension

30
Emerging
2810 hsisaberi/single-trait-electra

A complete ELECTRA-based framework for Big Five personality trait...

30
Emerging
2811 gitabtion/ConvBert-PyTorch

🤗An unofficial PyTorch implementation of ConvBert based on huggingface/transformers.

30
Emerging
2812 yeyupiaoling/Chinese-LLM-Chat

大语言模型微调的项目,包含了使用QLora微调ChatGLM和LLama

30
Emerging
2813 PlanTL-GOB-ES/lm-biomedical-clinical-es

Official source for Spanish pretrained biomedical and clinical language...

30
Emerging
2814 Strifee/arabic2english

Arabic to English machine translation with Transformers and Pytorch

30
Emerging
2815 aimagelab/Emuru-autoregressive-text-img

Official PyTorch implementation for "Zero-Shot Styled Text Image Generation,...

30
Emerging
2816 voxel51/fiftyone-huggingface-plugins

Hugging Face Plugins for FiftyOne

30
Emerging
2817 samestrin/llm-services-api

A FastAPI-powered REST API offering a comprehensive suite of natural...

30
Emerging
2818 thefilesareinthecomputer/offline_file_translation

Text file language translation app that translates .txt, .csv, and .xlsx...

30
Emerging
2819 kurnevsky/llama-cpp.el

A client for llama-cpp server

30
Emerging
2820 Honee-W/U-SAM

Official repository for U-SAM (Interspeech 2025)

30
Emerging
2821 snu-mllab/GuidedQuant

Official PyTorch implementation of "GuidedQuant: Large Language Model...

30
Emerging
2822 18907305772/KCA

EMNLP'2024: Knowledge Verification to Nip Hallucination in the Bud

30
Emerging
2823 s-omranpour/MIDI-Transformer

Another implementation of the paper "Compound Word Transformer: Learning to...

30
Emerging
2824 NikolasMarkou/fsm_llm

A Finite State Machine hybrid with Large Language Models

30
Emerging
2825 danielsobrado/llm_notebooks

Concepts and examples on using and training LLMs

30
Emerging
2826 MoFHeka/LLaMA-Megatron

A LLaMA1/LLaMA12 Megatron implement.

30
Emerging
2827 mscheong01/speculative_decoding.c

minimal C implementation of speculative decoding based on llama2.c

30
Emerging
2828 load1n9/chat

leverage llama3.2 and other large language models to generate responses to...

30
Emerging
2829 SJTU-DENG-Lab/LightningRL

LightningRL: Breaking the Accuracy–Parallelism Trade-off of Block-wise dLLMs...

30
Emerging
2830 louisoutin/rat_crypto_trader

Relation-Aware Transformer for Portfolio Policy Learning using Binance provider

30
Emerging
2831 bfilar/URLTran

PyTorch/HuggingFace Implementation of URLTran: Improving Phishing URL...

30
Emerging
2832 kvignesh1420/cot-icl-lab

[ACL 2025] Official implementation of the "CoT-ICL Lab" framework

30
Emerging
2833 asimsinan/LLM-Research

A collection of LLM related papers, thesis, tools, datasets, courses, open...

30
Emerging
2834 TirendazAcademy/Llama3-Tutorials

Hands-on projects with Llama 3, Ollama, Streamlit

30
Emerging
2835 kyegomez/MMCA

The open source community's implementation of the all-new Multi-Modal Causal...

30
Emerging
2836 AnasMohammad4321/BERT-Pytorch

Comprehensive BERT model training and visualization, detailing pre-training,...

30
Emerging
2837 jw-source/LlamaSim

Simulate human behavior with mass LLMs

30
Emerging
2838 ndoll1998/AppliedTransformers

State-Of-The-Art Transformer Models

30
Emerging
2839 Stamir36/CursusAI-ChatBot

Chatbot based on artificial intelligence (AI) for communication, image...

30
Emerging
2840 liashchynskyi/ggufer

Convert & quantize HuggingFace models using llama.cpp on premises

30
Emerging
2841 mbzuai-oryx/Video-LLaVA

PG-Video-LLaVA: Pixel Grounding in Large Multimodal Video Models

30
Emerging
2842 kyegomez/JaxTransformer

This repository demonstrates how to build a Decoder-Only Transformer with...

30
Emerging
2843 kyegomez/SelfExtend

Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend...

30
Emerging
2844 finxlab/hgait

Official implementation of HGAIT: Heterogeneous Graph Attention with...

30
Emerging
2845 twitter-research/multilingual-alignment-tpp

Code for reproducing the paper Improved Multilingual Language Model...

30
Emerging
2846 dhakalnirajan/LLaMA-BitNet

LLaMA-BitNet is a repository dedicated to empowering users to train their...

30
Emerging
2847 philogicae/gpt4all-telegram-bot

Simple Telegram bot using GPT4All

30
Emerging
2848 hukenovs/slovo

Slovo: Russian Sign Language Dataset and Models

30
Emerging
2849 llm-misinformation/llm-misinformation

The dataset and code for the ICLR 2024 paper "Can LLM-Generated...

30
Emerging
2850 whucs21Mzy/Model-Phase-Transitions

Navigating Model Phase Transitions to Enable Extreme Lossless Compression: A...

30
Emerging
2851 sdpkjc/SATQuest

🏞 A Verifier for Logical Reasoning Evaluation and Reinforcement Fine-Tuning of LLMs

30
Emerging
2852 EchoSingh/GitHub_Profile_Picture

A guide code to generate your ai profile picture

30
Emerging
2853 Riccorl/transformer-srl

Reimplementation of a BERT based model (Shi et al, 2019), currently the...

30
Emerging
2854 imanslab/poc-uncensored-language-with-wizard-vicuna

Uncensored Language Model using FastAPI and Wizard Vicuna 30B (PoC)

30
Emerging
2855 theosorus/GPT2-Hasktorch

GPT2 implementation in Haskell with the Hasktorch library, inspired by...

30
Emerging
2856 rockyco/estFreqOffset

LLM-Assisted FPGA Design for Carrier Frequency Offset Estimation

30
Emerging
2857 microsoft/AMOS

[ICLR 2022] Pretraining Text Encoders with Adversarial Mixture of Training...

30
Emerging
2858 hexuandeng/DRPruning

Implementation for our paper “DRPruning: Efficient Large Language Model...

30
Emerging
2859 leodeveloper/Pdf-Parse-LlamaParse

using Llama Parse to read pdf and convert into mark down or text

30
Emerging
2860 voidful/nlp2go

🏃 hosting nlp models in one line

30
Emerging
2861 mfekadu/nimbus-transformer

it's like Nimbus but uses a transformer language model

30
Emerging
2862 HYUNJS/STTM

[ICCV 2025] Multi-Granular Spatio-Temporal Token Merging for Training-Free...

30
Emerging
2863 GeorgiosIoannouCoder/mindscanner

Deep learning models and fine-tuned transformers for detecting mental...

30
Emerging
2864 ssbuild/llm_finetuning

Large language Model fintuning bloom , opt , gpt, gpt2...

30
Emerging
2865 sandyresearch/chipmunk

🎬 3.7× faster video generation E2E 🖼️ 1.6× faster image generation E2E...

30
Emerging
2866 hkproj/mistral-llm-notes

Notes on the Mistral AI model

30
Emerging
2867 izmttk/ullm

Lightweight LLM inference engine inspired by nano-vllm, with radix-tree...

30
Emerging
2868 msamprovalaki/Exploring-Multimodal-Large-Language-Models-for-Medical-Image-Captioning

This repository includes the code for my Master Thesis, which investigates...

30
Emerging
2869 conditionWang/FLNK

Federated Learning with New Knowledge -- explore to incorporate various new...

30
Emerging
2870 fangevo/KD-efficient-text-summarization

The project leverages a larger model, Qwen2.5-14B, to generate high-quality...

30
Emerging
2871 Kuberwastaken/MiniLMs

A research project focused on studying and implementing minimalist language...

30
Emerging
2872 ulab-uiuc/Time-R1

Time-R1: Framework and resources for endowing LLMs with comprehensive...

30
Emerging
2873 tunib-ai/joker

AI model designed to test the effectiveness in handling external ethical attacks.

30
Emerging
2874 mubingshen/MLC-SLM-Baseline

The project is associated with the recently-launched INTERSPEECH 2025...

30
Emerging
2875 salesforce/factualNLG

Code for the arXiv paper: "LLMs as Factual Reasoners: Insights from Existing...

30
Emerging
2876 OrigamiDream/CoRT

CoRT: Contrastive Rhetorical Tagging - KISTI 2022 AI/ML Competition

30
Emerging
2877 IST-DASLab/Quartet-II

Quartet II Official Code

30
Emerging
2878 ViLab-UCSD/LaGTran_ICML2024

Code and models for the ICML 2024 paper "Tell, Don`t Show!: Language...

30
Emerging
2879 reasoning-machines/CoCoGen

Language Models of Code are Few-Shot Commonsense Learners (EMNLP 2022)

30
Emerging
2880 bobxwu/learning-from-rewards-llm-papers

A comrephensive collection of learning from rewards in the post-training and...

30
Emerging
2881 EvilFreelancer/MoDA

Is a framework designed to enhance the performance and flexibility of large...

30
Emerging
2882 avrtt/telegram-content-moderator

NLP/ViT-driven bot for detection & moredation of inappropriate content in...

30
Emerging
2883 trekhleb/homemade-gpt-js

A minimal TensorFlow.js re-implementation of Karpathy's minGPT (Generative...

30
Emerging
2884 CyberAgentAILab/japanese-nli-model

This repository provides the code for Japanese NLI model, a fine-tuned...

30
Emerging
2885 aws-samples/multi-modal-examples-for-amazon-sagemaker

A workshop for collections of multi-modal LLM examples, samples, reference...

30
Emerging
2886 yunkai1841/recipe-generation

NLP Text generation task. Generate recipe by fine tuned LLaMA model.

30
Emerging
2887 AGI-Edgerunners/LLM-Continual-Learning-Papers

Must-read Papers on Large Language Model (LLM) Continual Learning

30
Emerging
2888 calhounpaul/LLaMA-PEFT-LoRa-subreddit-chatbot-colab

Parameter Efficient Fine Tuning (PEFT) to create a chatbot from Facebook's...

30
Emerging
2889 GAIR-NLP/abel

SOTA Math Opensource LLM

30
Emerging
2890 NVlabs/HMAR

[CVPR 2025] HMAR: Efficient Hierarchical Masked Auto-Regressive Image Generation

30
Emerging
2891 Beomi/megatronlm_dataset_autotokenizer

Megatron-LM/GPT-NeoX compatible Text Encoder with 🤗Transformers AutoTokenizer.

30
Emerging
2892 piotrmaciejbednarski/pllum-cookbook

This repository contains example Jupyter notebooks demonstrating how to use...

30
Emerging
2893 OSUPCVLab/MobileUNETR

Official Implementation of MobileUNETR: A Lightweight End-To-End Hybrid...

30
Emerging
2894 neuro-symbolic-ai/explanation_based_ethical_reasoning

Code and data for Paper "Enhancing Ethical Explanations of Large Language...

30
Emerging
2895 mickymultani/LLM-Architecture

Visualize some important concepts related to LLM architectures.

30
Emerging
2896 yandricr/gpti-py

This package simplifies your interaction with various GPT models, removing...

30
Emerging
2897 Infini-AI-Lab/TriForce

[COLM 2024] TriForce: Lossless Acceleration of Long Sequence Generation with...

30
Emerging
2898 andreaceto/multimodal-crisis-classification

Multimodal Classification of Crisis-related social media contents.

30
Emerging
2899 Shekswess/tiny-reasoning-language-model

Code repository dedicated to experimenting and research with tiny reasoning...

30
Emerging
2900 Awni00/abstract_transformer

This is the project repo associated with the paper "Disentangling and...

30
Emerging
« Prev 1 2 3 27 28 29 30 31 76 77 78 Next »