All Transformer Models

7,795 models ranked by quality score · Page 51 of 78

Showing 5001–5100 of 7,795
# Model Score Tier
5001 neoheartbeats/neoheartbeats-kernel

An architecture for LLMs' continual-learning and long-term memories

20
Experimental
5002 SumitM0432/XLM-RoBERTa-for-Textual-Entailment

A multilingual model XLM- RoBERTa for the textual entailment of sequence...

20
Experimental
5003 maris205/llama-gene

A General-purpose Gene Task Large Language Model Based on Instruction Fine-tuning

20
Experimental
5004 JLX0/llm-automl

Automate machine learning tasks at the code level with LLMs and autoML |...

20
Experimental
5005 matin-ghorbani/Video-Classification-Transformers

Implement a video classification using transformers

20
Experimental
5006 yzhhome/QA

智能问答项目实现

20
Experimental
5007 kyegomez/ChronoFormer

A production-grade implementation of a memory-efficient transformer...

20
Experimental
5008 elphinkuo/llamaqt.c

Clean C language version of quantizing llama2 model and running quantized...

20
Experimental
5009 e-caste/masters-thesis

My Master's thesis: "Automatic Video Lecture Summarization with Injection of...

20
Experimental
5010 givkashi/Awesome-unet-like-transformers

Awesome UNet with Transformer

20
Experimental
5011 sonoisa/qiita-title-generation

Qiitaの記事本文を与えるとタイトルを自動生成してくれる深層学習モデルの推論処理

20
Experimental
5012 ictnlp/FastLongSpeech

FastLongSpeech is a novel framework designed to extend the capabilities of...

20
Experimental
5013 henrikalbihn/gliclass-as-a-service

GLiClass model in a FastAPI microservice.

20
Experimental
5014 fracapuano/brainformer

A transformer-based approach to predicting MEG readings from EEG sensory...

20
Experimental
5015 fuyu-quant/IBLM

Repository of a new learning method called inductive bias learning with LLM.

20
Experimental
5016 robertoschiavone/transformer-q-network

My Master's Thesis.

20
Experimental
5017 mukhal/icl-ensembling

[Me-FoMo ICLR 2023 - Oral] Exploring Demonstration Ensembling for In-context Learning

20
Experimental
5018 ahmedgh970/convnext-charm

Official Tensorflow implementation of ConvNeXt-ChARM: ConvNeXt-based...

20
Experimental
5019 Krozmoz/llm-stock-market-predictor

📈 Predict market trends using a language model that reads stock charts as...

20
Experimental
5020 bikhanal/vision-transformer

Implementation of Vision Transformer (ViT) from scratch for image classification.

20
Experimental
5021 saizk/GlioScan

IDH Classification for Gliomas using CNN and Transformers.

20
Experimental
5022 tph-kds/vqa-llm

A Based Large Language Model (LLM) for VQA based on a custom model applying...

20
Experimental
5023 jiannanya/llm_structured

Parse messy LLM output into trustworthy, validated structured data — with...

20
Experimental
5024 MChatzakis/ChatMGL

ChatMGL: A Large Language Model Fine-tuned for Data Science Questions.

20
Experimental
5025 Sarah111-AHM/ZakeyTeam-arabic-qa-system-arabert

an AI powered Arabic Question Answering system built by fine tuning the...

20
Experimental
5026 tinysouth/litellmphp

PHP implementation of LiteLLM and LiteLLM-proxy.

20
Experimental
5027 fattorib/tritonformer

Trainable transformer with fwd+bwd ops in Triton, matching the performance...

20
Experimental
5028 hikmatazimzade/azerbaijani-tokenizer

High-Performance Azerbaijani Tokenizers (30% fewer tokens, 40% faster than...

20
Experimental
5029 yahskapar/LLMs-and-Probabilistic-Reasoning

Data and software artifacts for the EMNLP 2024 (Main) paper "What Are the...

20
Experimental
5030 yulang/phrasal-composition-in-transformers

This repo contains datasets and code for Assessing Phrasal Representation...

20
Experimental
5031 chaithanyasai18/LLMs-finetuning

This repository consists of python scripts for LLM finetuning (SFT, LoRA,...

20
Experimental
5032 X-rayLaser/DistributedLLM

Run LLM inference by spliting models into parts and hosting each part on a...

20
Experimental
5033 unaidedelf8777/faster-outlines

A Lazy, high throughput and blazing fast structured text generation backend.

20
Experimental
5034 RoyZry98/T-REX-Pytorch

[Arxiv 2025] Official code for T-REX: Mixture-of-Rank-One-Experts with...

20
Experimental
5035 ekunnii/adversarial-feedback-chatbot

EMNLP 2020 finding paper "Learning Improvised Chatbots from Adversarial...

20
Experimental
5036 pramodkoujalagi/SmolLM2-360M-Instruct-Text-2-JSON

A fine-tuned version of SmolLM2-360M-Instruct-bnb-4bit specialized for...

20
Experimental
5037 raaasin/Whispurr

A python based assistant that replies to your WhatsApp text on your behalf,...

20
Experimental
5038 Martin-qyma/TRM

From Faithfulness to Correctness: Generative Reward Models that Think Critically

20
Experimental
5039 sky24h/Training-Free_Zero-Shot_Semantic_Segmentation_with_LLM_Refinement

This repository contains official implementation of the paper "Training-Free...

20
Experimental
5040 jihadkhawaja/Llama.Grammar

GBNF converter for llama.cpp Grammar directly from C# types

20
Experimental
5041 tbogdala/woolycore

The core wrapper around llama.cpp in C to provide an easy surface to build...

20
Experimental
5042 amazon-science/mada_optimizer_search

Code the ICML 2024 paper: "MADA: Meta-Adaptive Optimizers through...

20
Experimental
5043 NathanLeroux-git/OnlineTransformerWithSpikingNeurons

This code is the implementation of the Spiking Online Transformer of the...

20
Experimental
5044 stoyan-stoyanov/transformers-calculator

Transformer Calculator - Estimate training time for transformer models.

20
Experimental
5045 CyberMaryVer/llm-notebooks

All the tutorials related to LLM

19
Experimental
5046 KillovSky/Isis

O Projeto Ísis é um plugin opcional em Python para o Projeto Íris,...

19
Experimental
5047 rokbenko/arctic-meet

ArcticMeet is an AI meeting assistant using Streamlit for the GUI and the...

19
Experimental
5048 arkodeepsen/helix

Professional training stack for 100M parameter language models optimized for...

19
Experimental
5049 MelKorSA/iwb151-fouette-bytes

A microservice that combines Meta-LLaMA AI with financial news analysis to...

19
Experimental
5050 getflexai/flex_ai

simplifies fine-tuning and inference for 60+ open-source LLMs through a single API

19
Experimental
5051 eniompw/llama-cpp-gpu

Load larger models by offloading model layers to both GPU and CPU

19
Experimental
5052 k-randl/self-explaining_llms

Official implementation of the papers "Evaluating the Reliability of...

19
Experimental
5053 atomlayer/llamachan

llamachan is a project that realises the idea of a dead internet for an imageboard

19
Experimental
5054 qubasehq/qudata

A comprehensive LLM data processing system designed to transform raw...

19
Experimental
5055 excitedplus1s/chatLLaMa

llama.cpp Desktop Client Demo

19
Experimental
5056 kikirizki/miniChatbot

The minimum implementation of chatbot using popular LLM model rewrite from...

19
Experimental
5057 claw1200/llama-cord

Discord App for Interacting with local Ollama Models. Multiple Agents Supported!

19
Experimental
5058 spongedsc/pathways

Pathways: multi-modal AI/ML models on discord

19
Experimental
5059 dwisiswant0/prepare-commit-msg-ai

Prepare Git Commit Message with AI: Write commit message based on code...

19
Experimental
5060 Kritik-helpingai/VORTEX

VortexGPT provides free access to text and image generation models.

19
Experimental
5061 231sm/Eval_Multi-Step_Reasoning

Comprehensive Evaluation On Answer Calibration For Multi-Step Reasoning

19
Experimental
5062 yandricr/gpti-php

This package simplifies your interaction with various GPT models, removing...

19
Experimental
5063 AlbertoMC126/ChronoSHAP_Transformers_LTSF-Linear_robustness

Code to study Transformers and LTSF-Linear models robustness and performance

19
Experimental
5064 chenxingqiang/FedCL-LLM

Implementation of FedCL-LLM: A Federated Continual Learning Framework...

19
Experimental
5065 briesearch/token-masks

Masked language model with Positional & One-Hot encoding - built using Aurora

19
Experimental
5066 NakerTheFirst/Sentiment-analysis

Analyse social media sentiment of OpenAI using LinkedIn data with NLP and...

19
Experimental
5067 priyam-hub/LLM-Fine-Tuning-Pipeline

A comprehensive pipeline for Different Fine-Tuning Methods for Large...

19
Experimental
5068 poojaharihar03/customer-AI-support

AI Chatbot designed to help assist users in any interview prep. Supports...

19
Experimental
5069 dhia7an/agent-sdk

🤖 Build transparent, message-first agents with efficient tool calls,...

19
Experimental
5070 arnhazra/arcstack

This application is an AI model marketplace that simplifies access to...

19
Experimental
5071 enggpt-it/Corso-LangChain

Questo corso offre un percorso completo per padroneggiare LangChain, il...

19
Experimental
5072 mohammadreza-mohammadi94/Transformers-Hub

A collection of projects and experiments using Hugging Face's Transformers...

19
Experimental
5073 erenisci/natural-language-processing

This repository covers a journey from basic to advanced NLP models, with a...

19
Experimental
5074 kevinbdsouza/GraphTransHiC

A Graph Transformer that creates hierarchal representations of HiC.

19
Experimental
5075 maximkm/DLA_ASR_HW

ASR pytorch project

19
Experimental
5076 jolual2747/nlp-question-answering-with-hugginggface-transformers

NLP question answering fine tuning Hugging Face's transformers

19
Experimental
5077 viktor-shcherb/vive_la_ner

The default way to fine-tune BERT is wrong. Here is why

19
Experimental
5078 balnarendrasapa/faq-llm

This is course project for DSCI 6004 deals with fine-tuning a pretrained...

19
Experimental
5079 tristandb8/PyTorch-PaliGemma-2

PyTorch implementation of PaliGemma 2

19
Experimental
5080 osainz59/XLREMed

Code for the Cross-Lingual Transfer Learning for Medical Relation Extraction

19
Experimental
5081 Prajwalsrinvas/nimble_LLM_web_scraping_challenge

Web scraping + LLMs

19
Experimental
5082 Pavansomisetty21/Qwen2-Vision-Finetuning-Unsloth---Maths-OCR-Formulae-Extraction-

we finetune unsloth llama model to extract mathematical fomulas in the...

19
Experimental
5083 dejwi/iBuild

iBuild is a desktop app that uses local AI models to generate Minecraft...

19
Experimental
5084 Ate329/SentiMusic

A text-to-audio application that turns words and sentiments into melodies.

19
Experimental
5085 themaximalist/ModelDeployer

API Proxy for AI models, rate limiting, management and more!

19
Experimental
5086 kyegomez/MultiQuerySuperpositionAttention

Multi-Query Attention with Sub-linear Masking, Superposition, and Entanglement

19
Experimental
5087 minuva/fast-nlp-text-toxicity

Fast text toxicity classification model

19
Experimental
5088 Nathan-Nesbitt/CodeSummary

A REST API for NLP

19
Experimental
5089 Chubek/will-sh3-b33

Will you ever find love?

19
Experimental
5090 nlx-group/Commonsense-Reasoning-Neuro-only-vs-Neuro-Symbolic-Methods

Code for the article "Commonsense Reasoning: how do Neuro-only and hybrid...

19
Experimental
5091 pelagecha/typ

Associative Memory Augmentation for Long-Context Retrieval in Transformers

19
Experimental
5092 mltraore/CompSegNet

CompSegNet: An enhanced U-shaped architecture for nuclei segmentation in H&E...

19
Experimental
5093 dedely/XAI4EO

Towards Explainable AI4EO: an explainable DL approach for crop type mapping...

19
Experimental
5094 rolandogdp/twitter-sent-analysis

Twitter sentiment analysis project

19
Experimental
5095 linhaowei1/Fine-tuning-Scaling-Law

🌹[ICML 2024] Selecting Large Language Model to Fine-tune via Rectified Scaling Law

19
Experimental
5096 Pavansomisetty21/Visual-Question-Answering-Pixtral_Vision_Finetuning_Unsloth

In this we finetune Pixtral-12B-2409 model using unsloth for visual Question...

19
Experimental
5097 NicolasSournac/Open-Book-Question-Answering

Comparative study of large language models in the field of open-book QA,...

19
Experimental
5098 xwang297/metamate-dataset

MetaMate: Large Language Model to the Rescue of Automated Data Extraction...

19
Experimental
5099 useentropy/llmkit

LLM Kit - Python Large Language Model Kit for generating data of your choice

19
Experimental
5100 nicholaswilven/pegasus-tpu-trainer

Transformer encoder-decoder (PEGASUS) pretraining and finetuning using...

19
Experimental
« Prev 1 2 3 49 50 51 52 53 76 77 78 Next »