All Transformer Models

7,795 models ranked by quality score · Page 48 of 78

Showing 4701–4800 of 7,795
# Model Score Tier
4701 kasia-kobalczyk/guess_llm

Implementation of the probing models presented in the ICLR 2026 paper...

21
Experimental
4702 landry-some/LLM-streaming

Efficient streaming inference for large language models (LLMs).

21
Experimental
4703 kyegomez/HeptapodLM

An implementation of a Transformer model that generates tokens non-linearly...

21
Experimental
4704 gowtamyreddy/NLP

Text Generation using RNN, LSTM, and Transformer

21
Experimental
4705 liam8421/faster-llm

🚀 Accelerate LLM training with Fast-LLM, an open-source library for...

21
Experimental
4706 fattorib/ZeRO-transformer

Two implementations of ZeRO-1 optimizer sharding in JAX

21
Experimental
4707 onlychara553-debug/dgx-spark-inference-stack

🚀 Serve large language models efficiently at home with this Docker-based...

21
Experimental
4708 YousfiNahed/KoValPlus

🌍 Evaluate cultural and value alignment of LLMs with Korean responses using...

21
Experimental
4709 Exahia/llm-benchmark-fr

LLM benchmarks on French business tasks: Mistral vs Llama vs Qwen vs DeepSeek

21
Experimental
4710 PKU-Alignment/llms-resist-alignment

[ACL2025 Best Paper] Language Models Resist Alignment

21
Experimental
4711 KhoiBui16/UIT_CS221_Basic_Natural_Language_Processing

The project focuses on classifying hallucinations in Vietnamese LLM outputs...

21
Experimental
4712 vkhamesi/proteins

🧬 Fine-Tuning Large Language and Protein Models on a single T4 GPU via...

21
Experimental
4713 aakasharya09/llm-leaderboard

📊 Compare LLM models effortlessly with our tool, showcasing performance...

21
Experimental
4714 eduardopini/Dresguardian

🛡️ Elevate your privacy with Dresguardian, a self-hosted Telegram bot that...

21
Experimental
4715 Kelvinkeoma/AI-Digital-Doppelganger

Build a personal AI Telegram bot that processes text, voice, and images with...

21
Experimental
4716 nluninja/drugsLLM

An intelligent conversational assistant study designed to provide accurate,...

21
Experimental
4717 cvssn/shade

AI pair programming in your terminal

21
Experimental
4718 LlamaFlowJs/LlamaFlowJs

LlamaFlow is a framework that has built-in agentic workflows, reiterative...

21
Experimental
4719 n1405732043/pi-token-burden

Analyze system prompt tokens to identify usage and manage token budgets...

21
Experimental
4720 M-e-r-c-u-r-y/pytorch-transformers

Collection of different types of transformers for learning purposes

21
Experimental
4721 malith153/token-forge

🔑 Build robust identity solutions with TokenForge, an enterprise-ready...

21
Experimental
4722 awpggexcutor-beep/T5-Refiner-DomainFocus

🌟 Enhance T5 model performance with domain-specific word masking for...

21
Experimental
4723 AIDajiangtang/LLM-from-scratch

Learn large models from the ground up: Transformer, GPT2, and BERT pre-training and fine-tuning from scratch

21
Experimental
4724 ozyurtf/attention-and-transformers

The purpose of this project is to understand how the Transformers work and...

21
Experimental
4725 CheongWoong/impact_of_cooccurrence

A repository for analyzing the impact of co-occurrence statistics on factual...

21
Experimental
4726 jasminwolf/ZakeyTeam-arabic-qa-system-arabert

🤖 Enhance Arabic NLP capabilities with this AI-powered question answering...

21
Experimental
4727 r-kovalch/omnigec-models

Reproducible QLoRA recipes and configs that fine‑tune Aya‑Expanse‑8B and...

21
Experimental
4728 ertosns/wiki-summary

Wikipedia summarizer transformer

21
Experimental
4729 jstilb/timeseries-forecasting

Multi-variate time series forecasting: LSTM, Transformer, and statistical...

21
Experimental
4730 resetpaid/lumina

Perform passive domain reconnaissance using public data sources without...

21
Experimental
4731 gameofdimension/seven8wen

Efficient fine-tuning of large language models

21
Experimental
4732 khalidm31415/fastapi-transformers-zsl

Zero-shot learning text classification web app with FastAPI backend

21
Experimental
4733 lechmazur/deception

Benchmark evaluating LLMs on their ability to create and resist...

21
Experimental
4734 ENTITY107/rlmgw

🔄 Explore Recursive Language Models (RLMs) to enhance natural language...

21
Experimental
4735 KeepALifeUS/ml-attention-mechanisms

Flash Attention, RoPE, multi-head attention for temporal patterns

21
Experimental
4736 Skyline-9/Shotluck-Holmes

[ACM MMGR '24] 🔍 Shotluck Holmes: A family of small-scale LLVMs for...

21
Experimental
4737 mrconter1/PullRequestBenchmark

Evaluating LLMs' performance in PR reviews as an indicator for their...

21
Experimental
4738 Gmail1995/llm-course

🧩 Explore LLM essentials, build advanced models, and develop applications...

21
Experimental
4739 lzyrapx/LLM-Grandmaster-Notes

🎓The path to LLM mastery is paved with broken embeddings and resurrected gradients.

21
Experimental
4740 ph-ausseil/llm-training-dataset-builder

Streamlines the creation of datasets to train a Large Language Model with...

21
Experimental
4741 bupt-ai-club/llm-compression-papers

Papers on LLM compression

21
Experimental
4742 adkwn1/question-answer-app

Question and Answer web application using fine-tuned and pre-trained T5...

21
Experimental
4743 villagecomputing/superpipe

Superpipe - optimized LLM pipelines for structured data

21
Experimental
4744 amazon-science/mezo_svrg

Code for the ICML 2024 paper: "Variance-reduced Zeroth-Order Methods for...

21
Experimental
4745 CaterinaBi/health-communication-paper2

Bonan & Samo. January 2023. Paper on cross-linguistic bias in health-related...

21
Experimental
4746 MaxLSB/mini-paligemma2

Minimalist implementation of PaliGemma 2 & PaliGemma VLM from scratch

21
Experimental
4747 princeton-nlp/MultilingualAnalysis

Repository for the paper titled: "When is BERT Multilingual? Isolating...

21
Experimental
4748 declare-lab/flacuna

Flacuna was developed by fine-tuning Vicuna on Flan-mini, a comprehensive...

21
Experimental
4749 aratan/ApiCloudLLaMA

The idea is to make an API that everyone can consume in their GPT4-like...

21
Experimental
4750 francesco-s/document-claim-mapping

A tool using LLMs and few-shot learning for document-claim mapping and...

21
Experimental
4751 malvads/whatsapp-gpt-bot

WhatsApp GPT bot for doing weird stuff

21
Experimental
4752 sak96/rust_llama_app

Chatbot (llama) written in Rust using Yew and Tauri.

21
Experimental
4753 PrivateDennis/InfinityGame

Craft infinite items with the help of AI, based on the idea of neil.fun

21
Experimental
4754 lionajuanabel/Fine-Dllm

LoRA fine-tuning pipeline for tool-calling chat LLMs with config-driven...

21
Experimental
4755 ahmedbesbes/audiolizr

A BentoML-powered API to transcribe audio and make sense of it

21
Experimental
4756 afondiel/LangChain-For-LLM-Application-Dev-DeepLearningAI

Crash course on LangChain for LLM Application Development by DeepLearningAI

21
Experimental
4757 Bhattacharya-Lab/CASP15

CASP15 performance benchmarking of the state-of-the-art protein structure...

21
Experimental
4758 Microsatellites-and-Space-Microsystems/pose_estimation_domain_gap

Two methods for solving domain gap in satellite pose estimation in space...

21
Experimental
4759 Siesher/Generator_for_reasoning

🧠 Reasoning data generator for LLM training

21
Experimental
4760 AikyamLab/llm-memorization

Understanding the memorization property of Large Language Models using Model...

21
Experimental
4761 daolytica/Panther

A desktop application for multi-LLM brainstorming, debate, local model...

21
Experimental
4762 Muhammad-Hammad-59/Qwen05B-lora-qlora-finetuning-for-customer-support

Parameter-efficient fine-tuning (LoRA + QLoRA) of Qwen2.5-0.5B-Instruct for...

21
Experimental
4763 G-Art/matrix_steering_vector_research

Iterative Sparse Matrix Steering: Closed-Form Subspace Alignment for...

21
Experimental
4764 AIdventures/flora

Fine-tuning LLMs with LoRA

21
Experimental
4765 seehiong/micronaut-llama3

A high-performance Llama3 implementation using Micronaut and GraalVM Native Image

21
Experimental
4766 nlx-group/Shortcutted-Commonsense-Reasoning

Code for the article "Shortcutted Commonsense: Data Spuriousness in Deep...

21
Experimental
4767 tanishqmudaliar/Silver-Guard-AI-Model-Training

TRAI‑aware Indian SMS scam detector that fine‑tunes MobileBERT on real +...

21
Experimental
4768 pacifikus/itmo_ods_nlp_course

NLP course materials at ITMO

21
Experimental
4769 EastTower16/LLMDataDistill

Distill large-scale web page text

21
Experimental
4770 frost-beta/llama2-high-level-cpp

Llama2 inference with high-level C++.

21
Experimental
4771 Eleanor-H/MUSTARD

Code & data for ICLR 2024 spotlight paper: 🍯MUSTARD: Mastering Uniform...

21
Experimental
4772 AKSW/LLMDatasetGenerator

LLM-based dataset generator for KGQA on user-defined knowledge graphs

21
Experimental
4773 Kcrypto126/Multi-Ai-Chat-App

chatting app

21
Experimental
4774 PRITHIVSAKTHIUR/Vit-Mature-Content-Detection

Vit-Mature-Content-Detection is an image classification vision-language...

21
Experimental
4775 itsvaibhav01/Immune

[CVPR2025] Official Repository for IMMUNE: Improving Safety Against...

21
Experimental
4776 melove297/reddit-factuality-detection

🧐 Detect factual reliability in Reddit posts using machine learning with...

21
Experimental
4777 ayinedjimi/ComplianceBot

AI-Powered Compliance Assistant with Transformers and Gradio

21
Experimental
4778 ahs95/restaurant-idea-generator

Offline‑first app that generates restaurant names, 3‑item menus, and...

21
Experimental
4779 SemanticWave-Hoyeon/NavtexRecovery

AI-powered restoration system for damaged NAVTEX (NAVigational TEleX)...

21
Experimental
4780 vtrnnhlinh/Graph-of-Models

My proposed idea to create a graph of models, or a network of models...

21
Experimental
4781 AndreiRoibu/Transformer-Classifiers

This repository contains NLP classification models built with the Hugging...

21
Experimental
4782 svjack/docvqa-gen

Document visual question answering dataset generator in English and Chinese

21
Experimental
4783 hadrienbdc/bert-sentiment-analysis-pytorch

Fine-tuning BERT for sentiment analysis with PyTorch

21
Experimental
4784 taeminlee/intent_classifier

Korean intent classifier with PyTorch Lightning ⚡

21
Experimental
4785 arpitpatelsitapur/ScholarLensAI

A FastAPI app for research paper recommendation and chat with those papers....

21
Experimental
4786 zzbright1998/SentenceKV

Official implementation of "SentenceKV: Efficient LLM Inference via...

21
Experimental
4787 jaisenbe58r/NLP-Transformer_Translator

Transformers implementation, adapted from the course: "Procesamiento del...

21
Experimental
4788 abc1203/transformer-model

An implementation of the transformer deep learning model, based on the...

21
Experimental
4789 CatnipCoders/Lambda-Driver

Lambda-Driver optimizes a small pre-trained model for resource-constrained...

21
Experimental
4790 akshantchaudhary09/YouTube-Transcript-Summarizer

A Chrome extension that can summarize the transcript of YouTube videos.

21
Experimental
4791 nininau/awesome-llm-services

🔍 Discover 106+ open-source LLM services and tools for AI, ideal for local...

21
Experimental
4792 Clinical-Quality-Artifical-Intelligence/NurseSim-RL

AI-powered clinical triage simulation using Manchester Triage System (MTS)....

21
Experimental
4793 axonura/axonura-X1

The First AI Model Of Axonura

21
Experimental
4794 theanasuddin/Deep-Learning-Fundamentals

Python implementations of deep learning fundamentals, from multilayer...

21
Experimental
4795 ruslanmv/ollabridge

OllaBridge transforms your laptop or workstation into a production-grade,...

21
Experimental
4796 Torim98/regime-switching-daa

Systematic comparison of econometric models and modern...

21
Experimental
4797 llamaplushiesYT/HTML-Games

Just some random HTML games that you can play in school or anywhere

21
Experimental
4798 MathewJobey/linux-logsummary-justpy

Turn raw Linux logs into executive insights using Drain3 and Ollama....

21
Experimental
4799 RichardScottOZ/geoscience-transformers-for-predictive-mapping-of-critical-minerals

First pass paper implementation

21
Experimental
4800 1010code/github-models-tutorial

GitHub Models API tutorial: try GPT-4o, Llama, and DeepSeek for free, with Colab example code

21
Experimental