All Transformer Models

7,795 models ranked by quality score · Page 68 of 78

Showing 6701–6800 of 7,795
# Model Score Tier
6701 jacoboromerodiaz/context-mixing-audio-text

Attribution framework for analyzing audio–text context mixing in...

13
Experimental
6702 harishm17/build-llm-from-scratch

From‑scratch LLM notebooks: Transformers, BPE tokenizer, PyTorch...

13
Experimental
6703 Jason-Wang313/Drift-Bench

Quantifying the "Safety Half-Life" of LLMs: A framework to measure how...

13
Experimental
6704 fabiantoh98/llm-preference-learning

End-to-end LLM preference learning pipeline: training, evaluation, and...

13
Experimental
6705 cluebbers/dpo-rlhf-paraphrase-types

Enhancing paraphrase-type generation using Direct Preference Optimization...

13
Experimental
6706 juansalnac/API-mega-list

🌐 Discover a comprehensive collection of APIs to enhance your projects and...

13
Experimental
6707 tahangz/transformer-chatbot

A simple chatbot built using the vanilla Transformer architecture (Vaswani...

13
Experimental
6708 Jkanishkha0305/LLMs-from-Scratch

A curated collection of Large Language Models(LLMs), Small Language...

13
Experimental
6709 seoyeon9646/MLM-data-augmentation

Masked Language Modeling for data augmentation

13
Experimental
6710 tbogdala/ai_notepad

A lightweight Rust application to test interaction with large language...

13
Experimental
6711 Yousefbadr0/GPT-Neo_Medical_Fine-Tuning_using_LoRA

Fine-tuning GPT-Neo-125M using LoRA on a medical QA dataset, achieving...

13
Experimental
6712 AparnaRoy76/LLM-finetuning

A comprehensive toolkit for fine-tuning Large Language Models (LLMs) using...

13
Experimental
6713 Abdullahali77/AI_Testing_CLI

A specialized command-line tool that generates Python unit tests for your...

13
Experimental
6714 NavodPeiris/node_llama

run llama models using llamafile and communicate with llama models through...

13
Experimental
6715 ltouati/tiny-llm

A tiny llm writen using rust candle

13
Experimental
6716 Bristiii/AI-RESUME-AND-PORTFOLIO-BUILDER

An AI-powered web app built using Streamlit, Hugging Face Transformers, and...

13
Experimental
6717 nglguarino/code-completion

Fine-tuned 3 LLMs (Phi-2, Gemma, Llama2) on 100K+ instruction CodeInstruct...

13
Experimental
6718 PrateekKacham/mistral-7b-text2sql-finetuning

Fine-tuning Mistral 7B for Text-to-SQL generation using QLoRA — 200%...

13
Experimental
6719 Travor278/SEED-LLaVA

Single-GPU reproduction of SEED for hallucination mitigation in...

13
Experimental
6720 romizone/simulasiLLM

🧠 Interactive LLM Attention Simulation — Visualize how GPT-2 transformers...

13
Experimental
6721 alfredang/finetuning-llm-huggingface

🤖 Fine-tune Qwen3-0.6B for IT support ticket routing using LoRA + Unsloth....

13
Experimental
6722 johnayoung/eth-finetuning-cookbook

Educational cookbook for fine-tuning LLMs on Ethereum transaction data using QLoRA

13
Experimental
6723 punyamodi/lora-finetune-studio

Full-stack LoRA fine-tuning studio for large language models with Gradio UI,...

13
Experimental
6724 Dhyani2206/Domain_Specialized_LLaMA

Fine-tuning LLaMA-3, Mistral-7B, and Phi-3 using QLoRA on a curated Data...

13
Experimental
6725 SCCSMARTCODE/Deep-Learning-03-LLM-FineTuning

Scalable and modular framework for fine-tuning large language models (LLMs)...

13
Experimental
6726 mxagar/llm_peft_fine_tuning_example

Example project in which a Large Language Model is fine-tuned using PEFT.

13
Experimental
6727 IlyyinKashaf/MarketingMuse

Fine-tuned TinyLlama-1.1B (Decoder-Only) via 3-phase training (domain...

13
Experimental
6728 DarkFoot101/Smart-Product-Pricing

Built a multimodal pricing system combining numerical, text, and image...

13
Experimental
6729 sdtrkl/lightweight-fine-tuning

This project is part of Generative AI Nanodegree by Udacity

13
Experimental
6730 Mo-Shakib/llama3.2-3b-lora-finetuning-kit

Fast, memory-efficient LoRA fine-tuning toolkit for...

13
Experimental
6731 SamsungSAILMontreal/mulo

μLO: Compute-Efficient Meta-Generalization of Learned Optimizers [to appear...

13
Experimental
6732 rizalsimb1/context-manager

Fine-tune large language models (Llama 3, Mistral, Phi-3) with LoRA and...

13
Experimental
6733 anilsrml/LLM-FineTuning-QLora

Kumru-2B büyük dil modelinin, tıbbi veriler (TUS sınavı) üzerinde QLoRA...

13
Experimental
6734 Rishi625/LLM-Finetune-Pipeline

Production-grade ML pipeline for Llama 3.2 fine-tuning with LoRA/QLoRA,...

13
Experimental
6735 ZeeetOne/bioinstruct-finetuning-experiment

LoRA fine-tuning experiment: Llama-3.2-1B-Instruct + BioInstruct dataset...

13
Experimental
6736 Shoaib-33/Web-Scrapper-using-LLM

A web scraping tool using LLM

13
Experimental
6737 Yaswanth1702/AI-Clinical-Assistant

Adaptive Recommendations Based on Individual Health Profiles​

13
Experimental
6738 2pa4ul2/Easygen-v2

Exam Generation With Large Language Model (LLMs)

13
Experimental
6739 Tahirahmad1002/Task_05_Mental_Health_Chatbot

The goal of this project is to bridge the gap between AI and human empathy....

13
Experimental
6740 froge159/belief-project-sef

Activation-Space Interventions for Causal Control of Belief Representations...

13
Experimental
6741 pratheeksha-s-devadiga/multimodal-medical-assistant

Multimodal AI system combining speech recognition, vision models, and LLMs...

13
Experimental
6742 reveurmichael/space_mining

SpaceMining: a novel RL environment beyond LLM priors

13
Experimental
6743 mattzzz/shakeLLM

Exploration of LLMs using complete works of Shakespeare

13
Experimental
6744 wahab-cide/african_languages_llm_project

Training multilingual language models on African languages including...

13
Experimental
6745 avirupc/nlp

A curated collection of my learning path in NLP and LLMs. Contains my notes,...

13
Experimental
6746 Blue-No1/open-weight-collection

Tracking open-weight LLMs for research, experiments, and inference comparisons.

13
Experimental
6747 Adityaram0001/LLM-DeepLearning

A deep dive into the theory and practice of Large Language Models. This...

13
Experimental
6748 priyanshujiiii/awesome_LLM

A curated list of papers, datasets, and resources on Large Language Models (LLMs)

13
Experimental
6749 gokhaneraslan/llm-dataset-generator

Custom dataset generator from text and pdf

13
Experimental
6750 metaskills/fast-llama-inference

Exploring Accelerated Compound AI Systems with SambaNova & Llama 3.3-70B

13
Experimental
6751 Blue-No1/llm-research-notes

Notes & experiments on LLMs, open-weight models, multimodal systems, and...

13
Experimental
6752 Aananda-giri/llm-in-loop-blog

Blog: LLM in Loop (Minimal Claude Code and minimal openclaw implementation)

13
Experimental
6753 dsindex/transformers_examples

reference pytorch code for huggingface transformers

13
Experimental
6754 dacarlin/protein-transformers

Use generative ML to design new proteins using this simple, hackable...

13
Experimental
6755 ParthaPRay/neuro-symbolic_abductive_reasoning_ollama_fault_diagnosis

This repo presents codes that allows user to run localized Ollama based...

13
Experimental
6756 Sahar-Sheikhi/CRM-Data-Automation-Llama-3.2-Finetuned-

A memory-efficient fine-tuning pipeline using Llama-3.2-3B and QLoRA to...

13
Experimental
6757 Mervecaliskann/AI-Data-Analyst

A hybrid AI agent for automated data cleaning. Combines Pandas for...

13
Experimental
6758 Amodni007/Sales_Call_Analyser

AI-powered sales coaching tool that scores call transcripts using the SPIN...

13
Experimental
6759 echenim/hf-batch-downloader

Automate bulk downloads of Hugging Face LLMs with retry logic, manifest...

13
Experimental
6760 RohitPawar001/GPT-2-Implementation

This repository contains the implementation of OpenAI's GPT-2 with LORA,...

13
Experimental
6761 virtualramblas/FlexLLMGenMPS

Running large language models on a single M1/M2 GPU for throughput-oriented...

13
Experimental
6762 joeddav/illustrated-training-cluster

[WIP] Interactive visualization of LLM training parallelism across GPU clusters

13
Experimental
6763 ZeeetOne/llm-inference-deployment

Practical example of deploying fine-tuned LLMs locally with FastAPI....

13
Experimental
6764 G-B-KEVIN-ARJUN/runtime-inference

"Faster AI: Accelerating Qwen 2.5 from 7 t/s to 82 t/s on a single RTX 4060...

13
Experimental
6765 samidala/polyglot-llm-benchmark

A production-ready system to benchmark local LLM inference performance with...

13
Experimental
6766 RelaxUI/RelaxUI.github.io

Browser LLM workflow.

13
Experimental
6767 newblood542/play-chess

♟️ Play chess online in real-time with friends, track your progress, and...

13
Experimental
6768 Ddarkbooked/booked_utils

A set of practical utilities for building Flutter apps faster: typed...

13
Experimental
6769 lazinessllama/llamahost

Deploy stuff while llama is doing all the work for you. Easier solution to self-host.

13
Experimental
6770 bbTwilio/cr-meta-llama

Twilio ConversationRelay with Meta Llama integration for voice AI assistant

13
Experimental
6771 StarDust130/Sarathi-Ai

🪷✨ Sarathi AI: your smart, Krishna-inspired companion 🤖💬 Talk, reflect, and...

13
Experimental
6772 Kashif-Rezwi/better-dev-api

Core server and API for Better DEV, an intelligent AI chat platform with...

13
Experimental
6773 vantuan88291/ai-local-react-native

A powerful React Native mobile application that enables users to chat with...

13
Experimental
6774 xakervrakax522/TempFS

🗂️ Streamline your development with TempFS, a temporary file system for...

13
Experimental
6775 Agent-One-Lab/chat-bricks

Chat template support for training.

13
Experimental
6776 0xnu/rewsury

It helps users interact with multiple AI models directly through Telegram,...

13
Experimental
6777 SadmanMahi67/discord-architect

AI-powered Discord bot that builds entire server structures from a...

13
Experimental
6778 Programmercito/bot-microia

Bot MicroIA es un bot inteligente para Telegram, diseñado para ofrecer...

13
Experimental
6779 ShreyasBh02/Local-LLM-Manual-Test-Case-Generator

🤖 A privacy-first tool that generates structured manual test cases using...

13
Experimental
6780 sandeepbhoir/TokenForge-ERC20

🪙 Create customizable ERC20 tokens with TokenForge, a framework for managing...

13
Experimental
6781 mdaamir2005/GetKickBearerToken-extension

🎟️ Generate and manage bearer tokens for secure API access in your...

13
Experimental
6782 mss-easy/BERTSentimentAnalysis

A project on BERT fine tuning for Sentiment Analysis

13
Experimental
6783 molereddy/Alternate-Preference-Optimization

[COLING 2025] code for "Alternate Preference Optimization for Unlearning...

13
Experimental
6784 hemhemoh/Wazobia-Wellness

Wazobia-Wellness is a multilingual mental health chatbot that provides...

13
Experimental
6785 NeelM47/video-to-book

AI-powered pipeline to transform YouTube videos into polished Bionic Reading...

13
Experimental
6786 winstxnhdw/ct2hf

A friendly CLI tool for converting and uploading transformers for CTranslate2.

13
Experimental
6787 supersimple33/Scaling-Laws

A method for calculating scaling laws for LLMs from publicly available models

13
Experimental
6788 gstenzel/PyTruffle

Block-level Code Retrieval using LLMs

13
Experimental
6789 tusharsinghsoam/building-with-llms

This repository focuses on understanding Large Language Models (LLMs) and...

13
Experimental
6790 nherx/free-llm-api-resources

🤖 Discover free API access and credits for various legitimate large language...

13
Experimental
6791 jakubstetz/resume-scanner

AI-powered résumé analysis tool that evaluates job descriptions against...

13
Experimental
6792 hrswatirai-debug/-Conversational_Chatbot_Using_Mistral-

Run a fully conversational Mistral 7B chatbot entirely on a free Google...

13
Experimental
6793 santiag0m/traveling-words

Code repository for the paper "Traveling Words: A Geometric Interpretation...

13
Experimental
6794 rashi-bhansali/encoder-decoder-transformer-variants-from-scratch

PyTorch implementation of Transformer encoder and GPT-style decoder with...

13
Experimental
6795 nurgalive/nurgavoice

AI Transcription & Summarization Service build with open-source models.

13
Experimental
6796 praveena2j/LAVViT

"ICASSP 2025" : Latent Audio-Visual Vision Transformers for Speaker Verification

13
Experimental
6797 biraj21/llm-server-from-scratch

FastAPI server for locally serving Gemma 3 270M & OpenAI Whisper with...

13
Experimental
6798 d-lab/ecir26-summarisation-llm-relevance-judgement

Code and experiments for The Effect of Document Summarization on LLM-Based...

13
Experimental
6799 CodeClairvoyant/neuralkit

The simplest, fastest repository for training/fine-tuning medium-sized...

13
Experimental
6800 actypedef/AURA

AURA: Augmented Representation for Unified Accuracy-aware Quantization

13
Experimental
« Prev 1 2 3 66 67 68 69 70 76 77 78 Next »