All Transformer Models

7,795 models ranked by quality score · Page 38 of 78

Showing 3701–3800 of 7,795
# Model Score Tier
3701 Anu0408/Language_Translation_GenAI_App

Language Translator is an AI-powered tool for text and voice translation...

25
Experimental
3702 DrRuin/Lightweight-Fine-Tuning

Lightweight fine-tuning is one of the most important techniques for adapting...

25
Experimental
3703 yuki-2025/llama3-8b-fine-tuning-math

Fine-Tuning Llama 3-8B for Structured Math Reasoning: Fine-tuning Llama3 8b...

25
Experimental
3704 lucataco/cog-llama-3-vision-alpha

Cog wrapper for qresearch/llama-3-vision-alpha

25
Experimental
3705 ZhouYuxuanYX/Benchmarking-and-Guiding-Adaptive-Sampling-Decoding-for-LLMs

This is the official implementation of our ACL 2025 Main paper "Balancing...

25
Experimental
3706 nguynking/CS330

Assignment solutions for CS330: Deep Multi-Task and Meta Learning, Fall 2023...

25
Experimental
3707 deadlykitten4/ERC-SVD

ERC-SVD: Error-Controlled SVD for Large Language Model Compression

25
Experimental
3708 iiis-ai/TemplateMath

[ICLR 2025 DATA-FM] Training and Evaluating Language Models with...

25
Experimental
3709 Shreyas-Bhat/LMLF

Code for "Generating Novel Leads for Drug Discovery Using LLMs with Logical...

25
Experimental
3710 webnizam/alpaca-telegram-bot

Simplest way to host a local ChatGPT like model for Telegram.

25
Experimental
3711 osiriszjq/structured_init

Structured Initialization for Attention in Vision Transformers

25
Experimental
3712 VinniLP/Document-Similarity-Finding-using-BERT

Document-Similarity-Finding-using-BERT

25
Experimental
3713 tuan3w/llama-raycast

Chat with LLaMa in Raycast

25
Experimental
3714 s-omranpour/Shirin-Sokhan

A Persian Poet Transformer! (finetuned GPT2 on Ganjoor data)

25
Experimental
3715 alphadl/OOP-eval

The first Object-Oriented Programming (OOP) Evaluation Benchmark for LLMs

25
Experimental
3716 thunlp/cost-optimal-gqa

The code for the paper "Cost-Optimal Grouped-Query Attention for...

25
Experimental
3717 Trustworthy-ML-Lab/ThinkEdit

[EMNLP 25] An effective and interpretable weight-editing method for...

25
Experimental
3718 joshvoigts/llmctx

LLM context builder

25
Experimental
3719 raghavagps/il2pred

Prediction of IL2 inducing peptides

25
Experimental
3720 open-compass/Ada-LEval

The official implementation of "Ada-LEval: Evaluating long-context LLMs with...

25
Experimental
3721 TirendazAcademy/Bert-Text-Classification-Gradio-App

End-to-end text classification project with Transformers, Comet ML, and Gradio

25
Experimental
3722 SrikarVeluvali/Astor-AI

AstorAI is a user-friendly medical chatbot powered by Retrieval-Augmented...

25
Experimental
3723 didar00/Final-Project

SELFIES-Transformer: Learning the Representation of Chemical Space for...

25
Experimental
3724 mala-lab/HaMI

[NeurIPS 2025] Official implementation for "Robust Hallucination Detection...

25
Experimental
3725 horenbergerb/llamagotchi

A bunch of LLaMa model investigations, including recreating generative...

25
Experimental
3726 IbrahimSobh/askpdf

In this tutorial we will see 💡 How to get answers from a PDF file using...

25
Experimental
3727 PCfVW/plip-rs

Mechanistic interpretability toolkit for code LLMs, in Rust. Analysis of...

25
Experimental
3728 KimDaeUng/PLM-Implementation

NLP Pretrained Language Models Implementation Study

25
Experimental
3729 oneonlee/KoAirBERT

🤗 A Korean BERT model specialized for the aviation safety domain ✈️

25
Experimental
3730 basaanithanaveenkumar/HaloBlocks

Python library designed to make model experimentation seamless and fast. The...

25
Experimental
3731 RLHF-V/RLHF-V

[CVPR'24] RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from...

25
Experimental
3732 cattolatte/reflective-reasoning-transformer

🧠 R2T Prototype: An LLM pre-trained on causal graphs (not just text) to...

25
Experimental
3733 arcxteam/gguf-convert-model

Auto GGUF Converter for HuggingFace Hub Models with Multiple Quantizations...

25
Experimental
3734 Mattbusel/llm-cpp

The C++ LLM toolkit. 26 single-header libraries for streaming, caching, cost...

25
Experimental
3735 balaji1233/AI-Radiology-Reporting

Using MAIRA-2 multimodal transformer designed for the generation of...

25
Experimental
3736 minjiyoon/MMGL

Multimodal Graph Learning: how to encode multiple multimodal neighbors with...

25
Experimental
3737 EmbeddedLLM/embeddedllm

EmbeddedLLM: API server for Embedded Device Deployment. Currently support...

25
Experimental
3738 blayyyyyk/cs478

For the duration of my Independent Study course, I have been tasked with...

25
Experimental
3739 sajidkhan2067/LLMOnAWS

Deploy smaller LLM on AWS Lambda: Phi-2, cost-effective language model

25
Experimental
3740 DFKI-NLP/gevalm

Code and data for the paper "Evaluating German Transformer Language Models...

25
Experimental
3741 Khaeldur/NeuralForge

On-device LLM fine-tuning for Apple Silicon (ANE)

25
Experimental
3742 godofpdog/ViT_PyTorch

This is a simple PyTorch implementation of Vision Transformer (ViT)...

25
Experimental
3743 jranaraki/vllm-fit

A CLI tool designed to simply recommend (conservative), and/or profile (to...

25
Experimental
3744 VincLee8188/Spatio-temporal-forecasting-PyTorch

Leverage recent advances in graph convolution and sequence modeling to...

25
Experimental
3745 cronenberg64/SciBERT-CTFT

SciBERT-based scientific abstract classification using SetFit framework with...

24
Experimental
3746 sc0v0ne/udemy_course_mastering_ollama_build_private_local_llm_apps_with_python

Udemy Course Mastering Ollama Build Private Local LLM Apps with Python

24
Experimental
3747 ffreemt/convbot

A conversational bot based on huggingface transformers

24
Experimental
3748 OFA-Sys/DiverseEvol

Self-Evolved Diverse Data Sampling for Efficient Instruction Tuning

24
Experimental
3749 princeton-nlp/dyck-transformer

[ACL 2021] Self-Attention Networks Can Process Bounded Hierarchical Languages

24
Experimental
3750 UNITES-Lab/HEXA-MoE

Official code for the paper "HEXA-MoE: Efficient and Heterogeneous-Aware MoE...

24
Experimental
3751 PRITHIVSAKTHIUR/Qwen-Image-LoRA-DLC

Qwen-Image model with various LoRA (Low-Rank Adaptation) styles. This tool...

24
Experimental
3752 ansh-info/Titans-Learning-to-Memorize-at-Test-Time-with-Manim

Visual animated walkthroughs of the DeepMind "Titans: Learning to Memorize...

24
Experimental
3753 MBadriNarayanan/ClickbaitClassification

Classifying clickbaits: articles with potentially misleading titles, using a...

24
Experimental
3754 MMStar-Benchmark/MMStar

[NeurIPS 2024] This repo contains evaluation code for the paper "Are We on...

24
Experimental
3755 RufelleEmmanuelPactol/Mixture-of-Experts-Transcript-Evaluator

A mixture of experts inspired transcript evaluator using LLM fine-tuning....

24
Experimental
3756 symfony/ai-transformers-php-platform

TransformersPhp platform bridge for Symfony AI

24
Experimental
3757 hululuzhu/llama-lora-chinese-couplet

llama-lora e2e example to demo a Chinese Couplet AI in 10 mins. Some...

24
Experimental
3758 Navya0203/Abstractive-Text-Summarization-Using-RNN-and-Transformers

This repository contains implementations of abstractive text summarization...

24
Experimental
3759 daniau23/LoRAfrica

LoRAfrica: Scaling LLM Fine Tuning for African History

24
Experimental
3760 zixi-liu/Transformers-Learning

Stanford CS25 - Transformer United and CS224n learning notes and code dump.

24
Experimental
3761 OpenNLPLab/Tnn

[ICLR 2023] Official implementation of Transnormer in our ICLR 2023 paper -...

24
Experimental
3762 Arnav-Sharmaa/Multilingual-Speech-to-Text-and-Speech-to-Speech-Content-Summarization-for-Indian-Languages

This project presents a multilingual pipeline for both speech-to-text and...

24
Experimental
3763 nateraw/discord-image-captioning-bot

A Discord bot for captioning images

24
Experimental
3764 ahmed19999520-alt/Veronica-X-Pro-open-source-code-2.0

Advanced AI system with real quantum computing integration, sophisticated...

24
Experimental
3765 codiceSpaghetti/numpyGPT

A from-scratch GPT built with NumPy and Python’s standard library. No...

24
Experimental
3766 nsourlos/LLM_evaluation_framework

Evaluate performance of LLM models for Q&A in any domain

24
Experimental
3767 ertugrulakben/NEURON

Hybrid memory architecture combining exact recall with infinite-capacity...

24
Experimental
3768 Nutanpatil06/Fine-Tuning-LLM-with-LLaMA-Factory

Complete LoRA/QLoRA implementation using LLaMA Factory. Fine-tune models...

24
Experimental
3769 opencodeiiita/Finetuning_Llama

Fine-Tuning LLaMA for Indian Laws

24
Experimental
3770 happydasch/llm_advisory

Modular framework for building topic-specific advisors powered by large...

24
Experimental
3771 1tangerine1day/chinese-QA-chatbot

A simple Chinese QA chatbot implemented with PyTorch and a transformer, trained...

24
Experimental
3772 styfeng/SMERTI

Code for SMERTI for Semantic Text Exchange.

24
Experimental
3773 Bradley-Butcher/Conformers

Unofficial implementation of Conformal Language Modeling by Quach et al

24
Experimental
3774 harshpimpale/AyurvedaGPT

A Streamlit-based platform offering Ayurvedic remedies. Users can ask...

24
Experimental
3775 waybarrios/dgx-spark-finetune-llm

LLM fine-tuning with LoRA + NVFP4/MXFP8 on NVIDIA DGX Spark (Blackwell GB10)

24
Experimental
3776 xufangzhi/Symbol-LLM

[ACL 2024] The project of Symbol-LLM

24
Experimental
3777 ArturPen/ab-transformers-timeskip-exploit

Python + ADB automation script for the Time Skip exploit in Angry Birds Transformers.

24
Experimental
3778 AstraBert/DebateLLM-Championship

5 LLMs, 1vs1 matches to produce the most convincing argumentation in favor...

24
Experimental
3779 rashomon-gh/attention-visualiser

a module to visualise attention layer activations from transformer based...

24
Experimental
3780 smsnobin77/Awesome-Multimodal-Unlearning

This repo presents a survey of multimodal unlearning across vision,...

24
Experimental
3781 Comrade-1729/lex-brief-ai

Safety-first legal NLP system with hierarchical long-document processing,...

24
Experimental
3782 LGDiMaggio/few-shot-fault-diagnosis-multimodal-LLM

Few-shot bearing fault diagnosis using multimodal LLMs and prototypical networks

24
Experimental
3783 ilanaliouchouche/KANBert

Implementation of an Encoder only MoE usable as an Embedding Model,...

24
Experimental
3784 SauravP97/toy-transformer

A decoder only Transformer implementing masked attention

24
Experimental
3785 m-rishab/Research-Paper-Recommendation

This project aims to build a research paper recommendation system. Given a...

24
Experimental
3786 nitrictech/pycasts

A text to Podcast inference API

24
Experimental
3787 Dim10p/relation-extraction-on-financial-documents

This repository contains all the scripts and methodology for the Relations...

24
Experimental
3788 Strong-AI-Lab/Explanation-Generation

We introduce "ILearner-LLM" a framework that uses iterative enhancement with...

24
Experimental
3789 hank0316/AdaSearch

This includes the original implementation of "AdaSearch: Balancing...

24
Experimental
3790 YASSER-27/LLMs

A high-performance, cross-platform desktop application for chatting with...

24
Experimental
3791 Johandaonis1/OMG-Agent

🤖 Automate Android operations with OMG-Agent, an open-source Mobile GUI...

24
Experimental
3792 horde-research/horde-common

Shared scripts for offline Kazakh LLM eval—run inference, auto-score, and...

24
Experimental
3793 NS027/medical_chatbot_project_genAI

Multimodal AI-powered medical assistant with LLMs, speech, and image understanding.

24
Experimental
3794 MDalamin5/Build-and-Finetune-LLM-From-Scratch-Deploy-via-vLLM-AWS-GCP

A complete end-to-end learning repo covering everything from building Large...

24
Experimental
3795 bendsouza2/yt-translator

This project aims to provide free and accessible language learning resources...

24
Experimental
3796 Rin313/StegLLM

Offline LLM text steganography program.

24
Experimental
3797 mourga/transformer-uncertainty

Code for evaluating uncertainty estimation methods for Transformer-based...

24
Experimental
3798 gxcsoccer/alloy

Hybrid SSM-Attention language model on Apple Silicon with MLX — interleaving...

24
Experimental
3799 korovod/kenotron

Experimental fork of Nanotron, a minimalistic large language model...

24
Experimental
3800 frikishaan/glama-124m

GLaMA is a small-scale autoregressive transformer model inspired by...

24
Experimental