All Transformer Models
7,795 models ranked by quality score · Page 30 of 78
| # | Model | Score | Tier |
|---|---|---|---|
| 2901 |
sodascience/social_science_inferences_with_llms
Addressing LLM-related measurement error in social science modeling research. |
|
Emerging |
| 2902 |
RohitMacherla3/wikiHow_text_summarization_llms
The project aims to utilize pre-trained Large Language Models (LLMs) for... |
|
Emerging |
| 2903 |
fcakyon/gpt2-shakespeare
A tutorial on GPT2 language model training with texts from Shakespeare |
|
Emerging |
| 2904 |
DrHB/rna-stanford
Transformer + GAT for RNA chemical reactivity prediction| Stanford Ribonanza |
|
Emerging |
| 2905 |
lordtt13/transformers-experiments
All my experiments with the various transformers and various transformer... |
|
Emerging |
| 2906 |
mihirchhiber/LLM-ABM-StockSim
LLM-DRIVEN AGENT STOCK MARKET SIMULATION: Built an agent-based simulation... |
|
Emerging |
| 2907 |
aaaastark/Pretrain_Finetune_Transformers_Pytorch
Pre-Training and Fine-Tuning transformer models using PyTorch and the... |
|
Emerging |
| 2908 |
seanbenhur/resusable_text_classification_template
A complete reusable pipeline for text classification using different... |
|
Emerging |
| 2909 |
JawherKl/deep-dive-into-llm
Deep Dive into Large Language Models (LLMs) – A comprehensive study of Large... |
|
Emerging |
| 2910 |
jaketae/tupe
PyTorch implementation of Rethinking Positional Encoding in Language Pre-training |
|
Emerging |
| 2911 |
neerajtiwari360/understand_LLM
A comprehensive guide and tools for running large language models (LLMs) on... |
|
Emerging |
| 2912 |
kmkrofficial/LiteGPT
LiteGPT: A 124M Small Language Model (SLM) pre-trained on FineWeb and... |
|
Emerging |
| 2913 |
Followb1ind1y/Medical-LLM-Fine-tuning
Fine-tunes LLaMA-3-8B on PubMedQA with QLoRA, optimized via DeepSpeed and... |
|
Emerging |
| 2914 |
ksm26/Reinforcement-Learning-from-Human-Feedback
Embark on the "Reinforcement Learning from Human Feedback" course and align... |
|
Emerging |
| 2915 |
naderabdelghany/project-rev
A proof-of-concept audio-interactive personalized chatbot based on Ted... |
|
Emerging |
| 2916 |
phonism/llm4cp
Large Language Model for Competitive Programming |
|
Emerging |
| 2917 |
BenGJ10/Complete-Machine-Learning-Notes
A complete collection of handwritten notes and learning resources for... |
|
Emerging |
| 2918 |
autobotasia/vitone
Tự động thêm dấu tiếng việt dùng Transformer model |
|
Emerging |
| 2919 |
Mahesh3394/clinical_text_classification
Text classification with fine tuned LLM model. Bert model fine tuned on... |
|
Emerging |
| 2920 |
antonalth/cs2-transformer-agent
Training a Transformer to play Counter Strike |
|
Emerging |
| 2921 |
nawnoes/pytorch-gpt-x
An implementation of an autoregressive language model using an improved... |
|
Emerging |
| 2922 |
GregorKobsik/ImageTransformer
This notebook shows a basic implementation of a transformer (decoder)... |
|
Emerging |
| 2923 |
semaj87/llm-post-generator
Using LLMs & the SERP API to retrieve information on a given topic, which is... |
|
Emerging |
| 2924 |
Ajax0564/VyomAI
VyomAI: state-of-the-art NLP LLM Vision MultiModel transformers ... |
|
Emerging |
| 2925 |
Orion-zhen/transAPI
OpenAI compatible API purely based on Transformers |
|
Emerging |
| 2926 |
fshnkarimi/Fine-tuning-an-LLM-using-LoRA
📚 Text Classification with LoRA (Low-Rank Adaptation) of Language Models -... |
|
Emerging |
| 2927 |
HySonLab/LANTERN
LANTERN: Leveraging Large Language Models And Transformer For Enhanced... |
|
Emerging |
| 2928 |
davide-coccomini/TimeSformer-Video-Classification
The notebook explains the various steps to obtain the results of... |
|
Emerging |
| 2929 |
kardSIM/Trading_RL_agent_with_transformers
An RL agent that can trade using Deep Q-Network (DQN) and a decoder-only... |
|
Emerging |
| 2930 |
Fisseha-Estifanos/LLM-API
A repository to demonstrate some of the concepts behind large language... |
|
Emerging |
| 2931 |
LazerLambda/Promptzl
Turn LLMs into zero-shot PyTorch classifiers! |
|
Emerging |
| 2932 |
gmontamat/poor-mans-transformers
Implement Transformers (and Deep Learning) from scratch in NumPy |
|
Emerging |
| 2933 |
visresearch/LLaVA-STF
The official implementation of "Learning Compact Vision Tokens for Efficient... |
|
Emerging |
| 2934 |
ToddThomson/Mila
Achilles Mila Deep Neural Network library provides a comprehensive API to... |
|
Emerging |
| 2935 |
gersteinlab/Struc-Bench
[NAACL 2024] Struc-Bench: Are Large Language Models Good at Generating... |
|
Emerging |
| 2936 |
SinclairCoder/Instruction-Tuning-Papers
Reading list of Instruction-tuning. A trend starts from Natrural-Instruction... |
|
Emerging |
| 2937 |
ma2za/torch-adapters
Small Library of PyTorch Adaptation modules |
|
Emerging |
| 2938 |
ambideXtrous9/GRPO-and-SFT-Finetune-Qwen3-using-Unsloth-Reasoning-and-Non-Reasoning-Dataset
GRPO and SFT Finetune Qwen3 using Unsloth : Reasoning and Non-Reasoning Dataset |
|
Emerging |
| 2939 |
yeasy/llm_internals
深入剖析大语言模型架构、原理到训练部署 | How LLM works, including Design, Architecture and... |
|
Emerging |
| 2940 |
sukanyabag/Finetuning-Qwen2-7B-VQA-on-Radiology-Scans
This repository is doing the finetuning of the Qwen2 7B VLM for performing... |
|
Emerging |
| 2941 |
amazon-science/isometric-slt
Isometric Spoken Language Translation - Isometric SLT. |
|
Emerging |
| 2942 |
VectorInstitute/VLDBench
VLDBench: A large-scale benchmark for evaluating Vision-Language Models... |
|
Emerging |
| 2943 |
amazon-science/THRONE
Code release for THRONE, a CVPR 2024 paper on measuring object... |
|
Emerging |
| 2944 |
DianaDorobantu/legal-llm
Develop a Romanian legal domain Large Language Model (LLM) using pre-trained... |
|
Emerging |
| 2945 |
Riccorl/llama-trainer
Llama Trainer Utility |
|
Emerging |
| 2946 |
RAravindDS/CharLLMs
Implementing easy to use "Character Level Language Models" 🕺🏽 |
|
Emerging |
| 2947 |
cool-japan/trustformers
High-performance, memory-safe Rust implementation of Hugging Face... |
|
Emerging |
| 2948 |
StringNLPLAB/MGS
Repository for the paper "Advancing General-Purpose Reasoning Models with... |
|
Emerging |
| 2949 |
tahaabbas/dictator
Dictator – Supercharge Cursor Chat with voice-to-text, custom AI prompts,... |
|
Emerging |
| 2950 |
CharlieBrown-v1/KALM
[NeurIPS'24] KALM: Knowledgeable Agents by Offline Reinforcement Learning... |
|
Emerging |
| 2951 |
sitammeur/gliner-litserve
Leverage ModernGLiNER's capabilities using LitServe. |
|
Emerging |
| 2952 |
MehnaazAsad/NLP_summarization_bart
NLP summarization task with the Bart LLM |
|
Emerging |
| 2953 |
Subconscious-ai/sublime
🧠Behavior Change as a Service🌞 |
|
Emerging |
| 2954 |
GaryYufei/AlignLLMHumanSurvey
Aligning Large Language Models with Human: A Survey |
|
Emerging |
| 2955 |
yinzhangyue/EoT
Exchange-of-Thought: Enhancing Large Language Model Capabilities through... |
|
Emerging |
| 2956 |
NellyW8/VeriReason
This is the Github Repo for the paper: VeriReason: Reinforcement Learning... |
|
Emerging |
| 2957 |
farhan0167/BankAIAgent
A tool to convert bank statements into Excel files |
|
Emerging |
| 2958 |
mrseanryan/gpt-local
Local GPT (llama 2 or dolly or gpt etc.) via Python - using ctransforers project |
|
Emerging |
| 2959 |
skpig/MPSC
[ACL 2024] Enhancing Large Language Models in Coding Through... |
|
Emerging |
| 2960 |
lucky-verma/SaastIE
Document understanding system using Donut transformer architecture |
|
Emerging |
| 2961 |
renan-siqueira/image-to-text-tool
This tool processes images and generates textual descriptions using advanced... |
|
Emerging |
| 2962 |
rochitasundar/Generative-AI-with-Large-Language-Models
This repository contains the lab work for Coursera course on "Generative AI... |
|
Emerging |
| 2963 |
Mechres/text-summarize
Flask-based API that provides a user-friendly interface to summarize text in... |
|
Emerging |
| 2964 |
franciellevargas/MOL
Multilingual Offensive Lexicon consists of the first contextual lexicon for... |
|
Emerging |
| 2965 |
zjukg/KnowPAT
[Paper][ACL 2024 Findings] Knowledgeable Preference Alignment for LLMs in... |
|
Emerging |
| 2966 |
kyegomez/Simba
A simpler Pytorch + Zeta Implementation of the paper: "SiMBA: Simplified... |
|
Emerging |
| 2967 |
TwistingTwists/json_partial
json_parser for LLM outputs -> it fixes the malformed json and produces correct json |
|
Emerging |
| 2968 |
relign-ai/relign
post train language models on multi-step reasoning with reinforcement learning |
|
Emerging |
| 2969 |
Jacksonlark/open-mllms
open llm for multimodal |
|
Emerging |
| 2970 |
iqbal-sk/Detecting-Persuasion-Techniques-in-Memes
Hierarchical, multilingual, multimodal detection of persuasion techniques in... |
|
Emerging |
| 2971 |
centre-for-humanities-computing/stormtrooper
Zero/few shot learning components for scikit-learn pipelines with LLMs and... |
|
Emerging |
| 2972 |
honghanhh/semeval8
L3i++ at SemEval2024-task8: Multidomain, Multimodel and Multilingual... |
|
Emerging |
| 2973 |
chandar-lab/CAIRO
We explain why fairness metrics don't correlate and propose CAIRO to make... |
|
Emerging |
| 2974 |
unisa-hpc/llm.sycl
The sycl version of llm.c (for the final project of HPC course 2024, UNISA) |
|
Emerging |
| 2975 |
do-me/qdrant-frontend
A universal Qdrant table frontend based on transformers.js |
|
Emerging |
| 2976 |
murphyhoucn/llm-dev
LLM Dev |
|
Emerging |
| 2977 |
Zhang-Yihao/Adversarial-Representation-Engineering
Official implementation repository for the paper Towards General Conceptual... |
|
Emerging |
| 2978 |
0xJakuzya/sentiment-analysis-tg-news
Sentiment analysis tool for Telegram news: scraping with Telethon, text... |
|
Emerging |
| 2979 |
rishabkr/Attention-Is-All-You-Need-Explained-PyTorch
A paper implementation and tutorial from scratch combining various great... |
|
Emerging |
| 2980 |
neeleshbhalla/transformers_for_time_series_forecasting
Inferencing 'PatchTST' and 'Informer' to harness the power of transformers... |
|
Emerging |
| 2981 |
icon-lab/HST
Official implementation of Hierarchical Spectrogram Transformers (HST) |
|
Emerging |
| 2982 |
SCZwangxiao/RTQ-MM2023
ACM Multimedia 2023 (Oral) - RTQ: Rethinking Video-language Understanding... |
|
Emerging |
| 2983 |
DarshanAdiga/idiom-principle-on-magpie-corpus
Idiom Principle on MAGPIE dataset |
|
Emerging |
| 2984 |
seanpm2001/DALL-E_LLaMA
🤖️🦙️🧠️ DALL-E LLaMA is a combination of DALL-E and LLaMA (Large Language... |
|
Emerging |
| 2985 |
RiccardoSpolaor/Question-Answering
Question answering through pre-trained transformer-based models from Hugging Face. |
|
Emerging |
| 2986 |
Someshog/greenwashing-detection-app
An AI-powered Streamlit web app to detect greenwashing in sustainability... |
|
Emerging |
| 2987 |
avrtt/QASATIK
LLM-based Q&A on preloaded docs, raw data, Wikipedia articles and scraped... |
|
Emerging |
| 2988 |
D1ffic00lt/ai-pastproof
PastProof AI – ML core for automated fact-checking: ingests raw text, finds... |
|
Emerging |
| 2989 |
th789/mbr-for-nmt
Characterizing the performance of minimum Bayes risk (MBR) decoding for... |
|
Emerging |
| 2990 |
seanpm2001/DALL-E_LLaMA_Docs
🤖️🦙️🧠️📖️ The official documentation source repository for DALL-E LLaMA, a... |
|
Emerging |
| 2991 |
pranavsinghps1/CASS
Official PyTorch implementation of CASS, from the following paper: CASS:... |
|
Emerging |
| 2992 |
francoislanc/midistral
LLM finetuned for generating symbolic music |
|
Experimental |
| 2993 |
datasig-ac-uk/nlpsig
Package for constructing paths of embeddings obtained from transformers. |
|
Experimental |
| 2994 |
cliang1453/SAGE
No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for... |
|
Experimental |
| 2995 |
zchuz/TimeBench
The repository for ACL 2024 paper "TimeBench: A Comprehensive Evaluation of... |
|
Experimental |
| 2996 |
XingLuxi/Cal-FLOPs-for-PLM
Calculating FLOPs of Pre-trained Models in NLP |
|
Experimental |
| 2997 |
rubencart/LIIR-TextGraphs-14
Code for KU Leuven LIIR lab's submission to the TextGraphs-14 shared task on... |
|
Experimental |
| 2998 |
LoserCheems/WonderfulMatrices
Wonderful Matrices to Build Small Language Models |
|
Experimental |
| 2999 |
LlamaGenAI/LlamaGen
AI Comic Factory - Generate Comics with AI, 🦙 Llama for Scalable Anime... |
|
Experimental |
| 3000 |
andresC98/TSF_Transformers_TFM
Repository containing my Master Thesis for the M.Sc. Big Data Analytics,... |
|
Experimental |