All Transformer Models

7,795 models ranked by quality score · Page 53 of 78

Showing 5201–5300 of 7,795
# Model Score Tier
5201 svjack/CodeActAgent-Gradio

UnOfficial Gradio Repo for ICML 2024 paper "Executable Code Actions Elicit...

19
Experimental
5202 mtszkw/fast-torch

Comparing PyTorch, JIT and ONNX for inference with Transformers

19
Experimental
5203 maxboels/Surgical-Phase-Recognition

Collated a list of useful open-access work related to surgical phase...

19
Experimental
5204 Mxbonn/ltmp

Code for Learned Thresholds Token Merging and Pruning for Vision...

19
Experimental
5205 mrdiamonddirt/local-llama-chrome-extension

A chrome extention for quering a local llm model using llama-cpp-python,...

19
Experimental
5206 KomeijiForce/CoTAM

Official Implementation of the ACL2024 Findings paper "Controllable Data...

19
Experimental
5207 zhaochen0110/LMLM

Code and data for "Improving Temporal Generalization of Pre-trained Language...

19
Experimental
5208 CodeName-Detective/Prompt-to-Song-Generation-using-Large-Language-Models

This project uses LLMs to generate music from text by understanding prompts,...

19
Experimental
5209 GregorKobsik/Octree-Transformer

Octree Transformer: Autoregressive 3D Shape Generation on Hierarchically...

19
Experimental
5210 MajidRajabiVardanjani/haji-api

Haji API | وب سرویس حاجی

19
Experimental
5211 sanyalsunny111/Early_Weight_Avg

[COLM 2024] Early Weight Averaging meets High Learning Rates for LLM Pre-training

19
Experimental
5212 moskomule/simple_transformers

Simple transformer implementations that I can understand

19
Experimental
5213 JaydenTeoh/beyond-next-token-prediction

Curated collection of research on the limitations of next-token prediction...

19
Experimental
5214 omron-sinicx/transformer4sr

[NeurIPS 2023 AI4Science] "A Transformer Model for Symbolic Regression...

19
Experimental
5215 claws-lab/projection-in-MLLMs

Code and data for ACL 2024 paper on 'Cross-Modal Projection in Multimodal...

19
Experimental
5216 NJX-njx/microgpt

🔬 The most atomic GPT-2 implementation in 265 lines of pure Python & CUDA. A...

19
Experimental
5217 LiguseCorp/rrrLLM-generator

Rare Refusal-Reply LLM generator. Aimed at generating a new model based on...

19
Experimental
5218 c4dt/pitfalls_in_fine_tuning_llms

Jupyter notebooks for the LLM fine-tuning pitfalls hands-on workshop

19
Experimental
5219 Damarcreative/secure-upload

Remove adult content in discord channels better with Artificial Intelligence.

19
Experimental
5220 PaletiKrishnasai/Dialogue-Summarizer

Dialogue Summarization application hosted using AWS and CICD deployment with...

19
Experimental
5221 wyu-du/Self-Training-Dialogue-Generation

This repository contains the data and code for the paper "Self-training with...

19
Experimental
5222 Miguell-J/Google-Competition-Gemma-2

Fine-tuning of Gemma 2 model in Google Competition using a dataset of...

19
Experimental
5223 neuronalin/gpt-from-scratch-pytorch

A decoder-only GPT-style Transformer built from scratch with PyTorch —...

19
Experimental
5224 Jshulgach/Grounded-SAM-2-Stream

Track anything in streaming with Grounding DINO, SAM 2, and LLM

19
Experimental
5225 ozzyonfire/bird-id

Bird classification model running locally on the web using Transformers.js

19
Experimental
5226 thesofakillers/CLAfICLe

Official repository for the paper "CLAfICLe: Cross-Lingual Adaptation for...

19
Experimental
5227 Merterm/COSMic

Public repo for the paper: "COSMic: A Coherence-Aware Generation Metric for...

19
Experimental
5228 corentin-ryr/CLIP-mixer

Implementation of CLIP using a Mixer architecture

19
Experimental
5229 VijayPrakashReddy-k/CLIP-PACL

Contrastive Language - Image Pre-training (CLIP) and Patch Aligned...

19
Experimental
5230 junayed-hasan/spontaneous-smile-recognition

A deep learning framework for distinguishing spontaneous from posed smiles...

19
Experimental
5231 puneetkakkar/road-damage-detection

An application for automatic road damage assessment using semantic...

19
Experimental
5232 supersjgk/Transformers

Playing with Transformers and LLM

19
Experimental
5233 sunjana2199/AI-Book-Wizard

Course Project for ADL : AI Book Wizard

19
Experimental
5234 yassine-rd/deep-learning-course

This repository contains my personal notes and Jupyter notebooks on Deep...

19
Experimental
5235 rizavelioglu/ml4prom

[IDEAL'22] The implementation of the paper: "Explainable Artificial...

19
Experimental
5236 kyegomez/Modeling-Economic-Systems-as-Neural-Networks

This paper presents a groundbreaking framework that models economic systems...

19
Experimental
5237 marian-nmt/pymarian-webapp

Pymarian Webapp

19
Experimental
5238 couchbaselabs/neural-translation-example

An example showcasing the real time neural translation of data stored in...

19
Experimental
5239 jha-lab/edgetran

[TMC'23] EdgeTran: Device-Aware Co-Search of Transformers and Mobile Platforms

19
Experimental
5240 hiyouga/Toxic_Detection

BUAA SCSE Autumn 2021 Machine Learning Group Homework

19
Experimental
5241 iamlxb3/UMAMGT

Code for the publication of LREC'22

19
Experimental
5242 HannaAbiAkl/PSYCHIC

The official repository for the PSYCHIC model

19
Experimental
5243 TRISTAN-ORF/RiboTIE_article

Scripts run to produce the RiboTIE paper

19
Experimental
5244 awadalaa/transact

An unofficial implementation of "TransAct: Transformer-based Realtime User...

19
Experimental
5245 SergioArnaud/attention-is-all-you-need

Implementation of a transformer following the Attention Is All You Need paper

19
Experimental
5246 agasheaditya/handson-transformers

End-to-end implementation of Transformers using PyTorch from scratch

19
Experimental
5247 VinkuraAI/AXEN-M

AXEN-M (Attention eXtended Efficient Network - Model) is a powerful...

19
Experimental
5248 minuva/fast-nlp-text-emotion

Fast emotion classification model

19
Experimental
5249 yangoz94/ml-depression-detection

A web application that uses a ML/NLP model to detect depression based on...

19
Experimental
5250 teticio/inBERTolate

Hit your word count by using BERT to pad out your essays!

19
Experimental
5251 falox/hf-samples

How to Run Open Source AI Models from Hugging Face

19
Experimental
5252 dmamakas2000/nllp-2022

In this repository we generally experiment with the creation of...

19
Experimental
5253 AlexandreGazagnes/CentraleSupElec-NLP-Public-Resources

Centrale-NLP-Public-Ressources : This repository is about the NLP class 2023/2024

19
Experimental
5254 bluexdev/YouTubeSentimentAnalyzer

Herramienta de análisis de sentimientos para comentarios de YouTube...

19
Experimental
5255 wildangunawan/Hotel-Review

Streamlit implementation of hotel review sentiment analysis using fine-tuned...

19
Experimental
5256 patrickcurl/sanechain

Filling in the missing gaps with langchain, and creating OO wrappers to...

19
Experimental
5257 adithyab94/Attiri

Attiri: An Instruction-following LLaMa and Alpaca model for Tamil

19
Experimental
5258 lufixSch/auto_llama

Supercharge your local LLM

19
Experimental
5259 sathishkumar67/GPT-2-IMDB-Sentiment-Fine-Tuning-with-PPO

Implemented the Proximal Policy Optimization (PPO) algorithm to fine-tune a...

19
Experimental
5260 rxian/domain-alignment

Code for importance-weighted domain alignment, and the paper “Cross-Lingual...

19
Experimental
5261 JinHanLei/LLM-Stream-Service

Streaming API and Web page for Large Language Models (Llama3) based on...

19
Experimental
5262 TimeSurgeLabs/promptproxy

Call many AIs from a single API.

19
Experimental
5263 gyanaranjans/llma-rust

A simple webapp to showcase the ability to write a simple chatbot webapp...

19
Experimental
5264 iakashpaul/Ghudsavar

Ghudsavar (Horse rider) - Is a quick llama.cpp server for CPU only runtimes

19
Experimental
5265 Shengwei-Peng/Classical-Chinese-Translation

A project for bidirectional translation between Classical Chinese and modern...

19
Experimental
5266 lucaslingle/e-lra

Streamlined variant of Long-Range Arena with pinned dependencies, automated...

19
Experimental
5267 Logisx/LLMath-QLoRA

🧮 End-to-end LLM instruction finetuning based on PEFT & QLoRA to solve math problems.

19
Experimental
5268 arham-kk/llama2-qlora-sft

This model is a fine-tuned model based on the...

19
Experimental
5269 sanskaryo/Click-Clinic-One-Stop-GenAI-Health-App

Click Clinic is a one stop Health and Nutrition Gen ai Solution

19
Experimental
5270 jwt2706/Grype

Growth, one byte at a time. Second place winner for 'best-use of AI' @...

19
Experimental
5271 crux82/advances-in-ai-2024

Materials used during the Lecture about LLMs held in the Summer School...

19
Experimental
5272 OpenM3D/M3DBench

[ECCV 2024] M3DBench introduces a comprehensive 3D instruction-following...

19
Experimental
5273 UIC-Liu-Lab/CPT

[EMNLP 2022] Continual Training of Language Models for Few-Shot Learning

19
Experimental
5274 sanand0/llmexcel

An Excel =LLM() function that talks to OpenAI models

19
Experimental
5275 ai4ce/LLM4VPR

Can multimodal LLM help visual place recognition?

19
Experimental
5276 zhaochen0110/Cotempqa

Code and data for "Living in the Moment: Can Large Language Models Grasp...

19
Experimental
5277 ogkalu2/Human-parity-on-machine-translations

Bilingual (or Multilingual) Large Language models and In-context Learning-...

19
Experimental
5278 LiorSinai/TransformersLite.jl

A lightweight package for the transformer deep learning architecture in Julia

19
Experimental
5279 BaohaoLiao/mefts

[NeurIPS 2023] Make Your Pre-trained Model Reversible: From Parameter to...

19
Experimental
5280 declare-lab/resta

Restore safety in fine-tuned language models through task arithmetic

19
Experimental
5281 manufactai/finetuning-cookbook

A collection of practical examples and tutorials for fine-tuning large...

19
Experimental
5282 azygadlo/LLM-catalog

Majority of the Large Language Models summarized in a table. From the...

19
Experimental
5283 wyt2000/InverseCoder

[AAAI 2025] The official code of the paper "InverseCoder: Unleashing the...

19
Experimental
5284 natharmatron/MediSight.AI

💻🔒 A local-first full-stack app to analyze medical PDFs with an AI model...

19
Experimental
5285 ZBox1005/CoT-UQ

[arXiv 2025] "CoT-UQ: Improving Response-wise Uncertainty Quantification in...

19
Experimental
5286 AdamG012/moe-paper-models

A sumary of MoE experimental setups across a number of different papers.

19
Experimental
5287 tyhobbs/FinRL_Deep_Reinforcement_Learning

A progressive DRL stock trading system built on FinRL, benchmarking four...

19
Experimental
5288 d-f/llm-summarization

LoRA supervised fine-tuning, RLHF (PPO) and RAG with llama-3-8B on the TLDR...

19
Experimental
5289 EvilFreelancer/benchmarking-llms

Comprehensive benchmarks and evaluations of Large Language Models (LLMs)...

19
Experimental
5290 ilmedova/ml-fun

Machine Learning and AI projects for fun

19
Experimental
5291 MasihMoafi/Financial-Market-Analysis

Financial market analysis using time-series models, clustering algorithms,...

19
Experimental
5292 PRITHIVSAKTHIUR/GALLO-3XL

High Quality Image Generation Model - Powered with NVIDIA A100

19
Experimental
5293 di37/LLM-Load-Unload-Ollama

This is a simple demonstration to show how to keep an LLM loaded for...

19
Experimental
5294 shreydan/shakespeareGPT

understanding language modeling by training a small GPT on Shakespeare plays.

19
Experimental
5295 ns408/local-ai-setup

Run modern AI models on older laptops - optimized for 2nd-gen Intel hardware

19
Experimental
5296 GeorgeVern/lmcor

Code for the EACL 2024 paper: "Small Language Models Improve Giants by...

19
Experimental
5297 notsopreety/llama

A simple python script for terminal which allows to interact with LLaMa 3.2 90B.

19
Experimental
5298 ash-01xor/Imgcap

A CLI to generate captions for images

19
Experimental
5299 Raxephion/loRA-Epoch-Analyser

A Python script to analyze images generated at different epochs of LoRA...

19
Experimental
5300 guilherme-hermano/AVALIACAO-DE-MODELOS-QUESTION-ANSWERING

Avaliação comparativa entre TinyRoBERTa e BERT Base em Question Answering — CTEIA/UFC

19
Experimental
« Prev 1 2 3 51 52 53 54 55 76 77 78 Next »