All Transformer Models
7,795 models ranked by quality score · Page 51 of 78
| # | Model | Score | Tier |
|---|---|---|---|
| 5001 |
neoheartbeats/neoheartbeats-kernel
An architecture for LLMs' continual-learning and long-term memories |
|
Experimental |
| 5002 |
SumitM0432/XLM-RoBERTa-for-Textual-Entailment
A multilingual model XLM- RoBERTa for the textual entailment of sequence... |
|
Experimental |
| 5003 |
maris205/llama-gene
A General-purpose Gene Task Large Language Model Based on Instruction Fine-tuning |
|
Experimental |
| 5004 |
JLX0/llm-automl
Automate machine learning tasks at the code level with LLMs and autoML |... |
|
Experimental |
| 5005 |
matin-ghorbani/Video-Classification-Transformers
Implement a video classification using transformers |
|
Experimental |
| 5006 |
yzhhome/QA
智能问答项目实现 |
|
Experimental |
| 5007 |
kyegomez/ChronoFormer
A production-grade implementation of a memory-efficient transformer... |
|
Experimental |
| 5008 |
elphinkuo/llamaqt.c
Clean C language version of quantizing llama2 model and running quantized... |
|
Experimental |
| 5009 |
e-caste/masters-thesis
My Master's thesis: "Automatic Video Lecture Summarization with Injection of... |
|
Experimental |
| 5010 |
givkashi/Awesome-unet-like-transformers
Awesome UNet with Transformer |
|
Experimental |
| 5011 |
sonoisa/qiita-title-generation
Qiitaの記事本文を与えるとタイトルを自動生成してくれる深層学習モデルの推論処理 |
|
Experimental |
| 5012 |
ictnlp/FastLongSpeech
FastLongSpeech is a novel framework designed to extend the capabilities of... |
|
Experimental |
| 5013 |
henrikalbihn/gliclass-as-a-service
GLiClass model in a FastAPI microservice. |
|
Experimental |
| 5014 |
fracapuano/brainformer
A transformer-based approach to predicting MEG readings from EEG sensory... |
|
Experimental |
| 5015 |
fuyu-quant/IBLM
Repository of a new learning method called inductive bias learning with LLM. |
|
Experimental |
| 5016 |
robertoschiavone/transformer-q-network
My Master's Thesis. |
|
Experimental |
| 5017 |
mukhal/icl-ensembling
[Me-FoMo ICLR 2023 - Oral] Exploring Demonstration Ensembling for In-context Learning |
|
Experimental |
| 5018 |
ahmedgh970/convnext-charm
Official Tensorflow implementation of ConvNeXt-ChARM: ConvNeXt-based... |
|
Experimental |
| 5019 |
Krozmoz/llm-stock-market-predictor
📈 Predict market trends using a language model that reads stock charts as... |
|
Experimental |
| 5020 |
bikhanal/vision-transformer
Implementation of Vision Transformer (ViT) from scratch for image classification. |
|
Experimental |
| 5021 |
saizk/GlioScan
IDH Classification for Gliomas using CNN and Transformers. |
|
Experimental |
| 5022 |
tph-kds/vqa-llm
A Based Large Language Model (LLM) for VQA based on a custom model applying... |
|
Experimental |
| 5023 |
jiannanya/llm_structured
Parse messy LLM output into trustworthy, validated structured data — with... |
|
Experimental |
| 5024 |
MChatzakis/ChatMGL
ChatMGL: A Large Language Model Fine-tuned for Data Science Questions. |
|
Experimental |
| 5025 |
Sarah111-AHM/ZakeyTeam-arabic-qa-system-arabert
an AI powered Arabic Question Answering system built by fine tuning the... |
|
Experimental |
| 5026 |
tinysouth/litellmphp
PHP implementation of LiteLLM and LiteLLM-proxy. |
|
Experimental |
| 5027 |
fattorib/tritonformer
Trainable transformer with fwd+bwd ops in Triton, matching the performance... |
|
Experimental |
| 5028 |
hikmatazimzade/azerbaijani-tokenizer
High-Performance Azerbaijani Tokenizers (30% fewer tokens, 40% faster than... |
|
Experimental |
| 5029 |
yahskapar/LLMs-and-Probabilistic-Reasoning
Data and software artifacts for the EMNLP 2024 (Main) paper "What Are the... |
|
Experimental |
| 5030 |
yulang/phrasal-composition-in-transformers
This repo contains datasets and code for Assessing Phrasal Representation... |
|
Experimental |
| 5031 |
chaithanyasai18/LLMs-finetuning
This repository consists of python scripts for LLM finetuning (SFT, LoRA,... |
|
Experimental |
| 5032 |
X-rayLaser/DistributedLLM
Run LLM inference by spliting models into parts and hosting each part on a... |
|
Experimental |
| 5033 |
unaidedelf8777/faster-outlines
A Lazy, high throughput and blazing fast structured text generation backend. |
|
Experimental |
| 5034 |
RoyZry98/T-REX-Pytorch
[Arxiv 2025] Official code for T-REX: Mixture-of-Rank-One-Experts with... |
|
Experimental |
| 5035 |
ekunnii/adversarial-feedback-chatbot
EMNLP 2020 finding paper "Learning Improvised Chatbots from Adversarial... |
|
Experimental |
| 5036 |
pramodkoujalagi/SmolLM2-360M-Instruct-Text-2-JSON
A fine-tuned version of SmolLM2-360M-Instruct-bnb-4bit specialized for... |
|
Experimental |
| 5037 |
raaasin/Whispurr
A python based assistant that replies to your WhatsApp text on your behalf,... |
|
Experimental |
| 5038 |
Martin-qyma/TRM
From Faithfulness to Correctness: Generative Reward Models that Think Critically |
|
Experimental |
| 5039 |
sky24h/Training-Free_Zero-Shot_Semantic_Segmentation_with_LLM_Refinement
This repository contains official implementation of the paper "Training-Free... |
|
Experimental |
| 5040 |
jihadkhawaja/Llama.Grammar
GBNF converter for llama.cpp Grammar directly from C# types |
|
Experimental |
| 5041 |
tbogdala/woolycore
The core wrapper around llama.cpp in C to provide an easy surface to build... |
|
Experimental |
| 5042 |
amazon-science/mada_optimizer_search
Code the ICML 2024 paper: "MADA: Meta-Adaptive Optimizers through... |
|
Experimental |
| 5043 |
NathanLeroux-git/OnlineTransformerWithSpikingNeurons
This code is the implementation of the Spiking Online Transformer of the... |
|
Experimental |
| 5044 |
stoyan-stoyanov/transformers-calculator
Transformer Calculator - Estimate training time for transformer models. |
|
Experimental |
| 5045 |
CyberMaryVer/llm-notebooks
All the tutorials related to LLM |
|
Experimental |
| 5046 |
KillovSky/Isis
O Projeto Ísis é um plugin opcional em Python para o Projeto Íris,... |
|
Experimental |
| 5047 |
rokbenko/arctic-meet
ArcticMeet is an AI meeting assistant using Streamlit for the GUI and the... |
|
Experimental |
| 5048 |
arkodeepsen/helix
Professional training stack for 100M parameter language models optimized for... |
|
Experimental |
| 5049 |
MelKorSA/iwb151-fouette-bytes
A microservice that combines Meta-LLaMA AI with financial news analysis to... |
|
Experimental |
| 5050 |
getflexai/flex_ai
simplifies fine-tuning and inference for 60+ open-source LLMs through a single API |
|
Experimental |
| 5051 |
eniompw/llama-cpp-gpu
Load larger models by offloading model layers to both GPU and CPU |
|
Experimental |
| 5052 |
k-randl/self-explaining_llms
Official implementation of the papers "Evaluating the Reliability of... |
|
Experimental |
| 5053 |
atomlayer/llamachan
llamachan is a project that realises the idea of a dead internet for an imageboard |
|
Experimental |
| 5054 |
qubasehq/qudata
A comprehensive LLM data processing system designed to transform raw... |
|
Experimental |
| 5055 |
excitedplus1s/chatLLaMa
llama.cpp Desktop Client Demo |
|
Experimental |
| 5056 |
kikirizki/miniChatbot
The minimum implementation of chatbot using popular LLM model rewrite from... |
|
Experimental |
| 5057 |
claw1200/llama-cord
Discord App for Interacting with local Ollama Models. Multiple Agents Supported! |
|
Experimental |
| 5058 |
spongedsc/pathways
Pathways: multi-modal AI/ML models on discord |
|
Experimental |
| 5059 |
dwisiswant0/prepare-commit-msg-ai
Prepare Git Commit Message with AI: Write commit message based on code... |
|
Experimental |
| 5060 |
Kritik-helpingai/VORTEX
VortexGPT provides free access to text and image generation models. |
|
Experimental |
| 5061 |
231sm/Eval_Multi-Step_Reasoning
Comprehensive Evaluation On Answer Calibration For Multi-Step Reasoning |
|
Experimental |
| 5062 |
yandricr/gpti-php
This package simplifies your interaction with various GPT models, removing... |
|
Experimental |
| 5063 |
AlbertoMC126/ChronoSHAP_Transformers_LTSF-Linear_robustness
Code to study Transformers and LTSF-Linear models robustness and performance |
|
Experimental |
| 5064 |
chenxingqiang/FedCL-LLM
Implementation of FedCL-LLM: A Federated Continual Learning Framework... |
|
Experimental |
| 5065 |
briesearch/token-masks
Masked language model with Positional & One-Hot encoding - built using Aurora |
|
Experimental |
| 5066 |
NakerTheFirst/Sentiment-analysis
Analyse social media sentiment of OpenAI using LinkedIn data with NLP and... |
|
Experimental |
| 5067 |
priyam-hub/LLM-Fine-Tuning-Pipeline
A comprehensive pipeline for Different Fine-Tuning Methods for Large... |
|
Experimental |
| 5068 |
poojaharihar03/customer-AI-support
AI Chatbot designed to help assist users in any interview prep. Supports... |
|
Experimental |
| 5069 |
dhia7an/agent-sdk
🤖 Build transparent, message-first agents with efficient tool calls,... |
|
Experimental |
| 5070 |
arnhazra/arcstack
This application is an AI model marketplace that simplifies access to... |
|
Experimental |
| 5071 |
enggpt-it/Corso-LangChain
Questo corso offre un percorso completo per padroneggiare LangChain, il... |
|
Experimental |
| 5072 |
mohammadreza-mohammadi94/Transformers-Hub
A collection of projects and experiments using Hugging Face's Transformers... |
|
Experimental |
| 5073 |
erenisci/natural-language-processing
This repository covers a journey from basic to advanced NLP models, with a... |
|
Experimental |
| 5074 |
kevinbdsouza/GraphTransHiC
A Graph Transformer that creates hierarchal representations of HiC. |
|
Experimental |
| 5075 |
maximkm/DLA_ASR_HW
ASR pytorch project |
|
Experimental |
| 5076 |
jolual2747/nlp-question-answering-with-hugginggface-transformers
NLP question answering fine tuning Hugging Face's transformers |
|
Experimental |
| 5077 |
viktor-shcherb/vive_la_ner
The default way to fine-tune BERT is wrong. Here is why |
|
Experimental |
| 5078 |
balnarendrasapa/faq-llm
This is course project for DSCI 6004 deals with fine-tuning a pretrained... |
|
Experimental |
| 5079 |
tristandb8/PyTorch-PaliGemma-2
PyTorch implementation of PaliGemma 2 |
|
Experimental |
| 5080 |
osainz59/XLREMed
Code for the Cross-Lingual Transfer Learning for Medical Relation Extraction |
|
Experimental |
| 5081 |
Prajwalsrinvas/nimble_LLM_web_scraping_challenge
Web scraping + LLMs |
|
Experimental |
| 5082 |
Pavansomisetty21/Qwen2-Vision-Finetuning-Unsloth---Maths-OCR-Formulae-Extraction-
we finetune unsloth llama model to extract mathematical fomulas in the... |
|
Experimental |
| 5083 |
dejwi/iBuild
iBuild is a desktop app that uses local AI models to generate Minecraft... |
|
Experimental |
| 5084 |
Ate329/SentiMusic
A text-to-audio application that turns words and sentiments into melodies. |
|
Experimental |
| 5085 |
themaximalist/ModelDeployer
API Proxy for AI models, rate limiting, management and more! |
|
Experimental |
| 5086 |
kyegomez/MultiQuerySuperpositionAttention
Multi-Query Attention with Sub-linear Masking, Superposition, and Entanglement |
|
Experimental |
| 5087 |
minuva/fast-nlp-text-toxicity
Fast text toxicity classification model |
|
Experimental |
| 5088 |
Nathan-Nesbitt/CodeSummary
A REST API for NLP |
|
Experimental |
| 5089 |
Chubek/will-sh3-b33
Will you ever find love? |
|
Experimental |
| 5090 |
nlx-group/Commonsense-Reasoning-Neuro-only-vs-Neuro-Symbolic-Methods
Code for the article "Commonsense Reasoning: how do Neuro-only and hybrid... |
|
Experimental |
| 5091 |
pelagecha/typ
Associative Memory Augmentation for Long-Context Retrieval in Transformers |
|
Experimental |
| 5092 |
mltraore/CompSegNet
CompSegNet: An enhanced U-shaped architecture for nuclei segmentation in H&E... |
|
Experimental |
| 5093 |
dedely/XAI4EO
Towards Explainable AI4EO: an explainable DL approach for crop type mapping... |
|
Experimental |
| 5094 |
rolandogdp/twitter-sent-analysis
Twitter sentiment analysis project |
|
Experimental |
| 5095 |
linhaowei1/Fine-tuning-Scaling-Law
🌹[ICML 2024] Selecting Large Language Model to Fine-tune via Rectified Scaling Law |
|
Experimental |
| 5096 |
Pavansomisetty21/Visual-Question-Answering-Pixtral_Vision_Finetuning_Unsloth
In this we finetune Pixtral-12B-2409 model using unsloth for visual Question... |
|
Experimental |
| 5097 |
NicolasSournac/Open-Book-Question-Answering
Comparative study of large language models in the field of open-book QA,... |
|
Experimental |
| 5098 |
xwang297/metamate-dataset
MetaMate: Large Language Model to the Rescue of Automated Data Extraction... |
|
Experimental |
| 5099 |
useentropy/llmkit
LLM Kit - Python Large Language Model Kit for generating data of your choice |
|
Experimental |
| 5100 |
nicholaswilven/pegasus-tpu-trainer
Transformer encoder-decoder (PEGASUS) pretraining and finetuning using... |
|
Experimental |