All Transformer Models

7,795 models ranked by quality score · Page 66 of 78

Showing 6501–6600 of 7,795
# Model Score Tier
6501 Bladerex24/simple-llm

🚀 Explore a minimal, extensible LLM inference engine for efficient AI model...

13
Experimental
6502 Himanshu0508Raturi/Fine_Tuning-LLM

This repository shows how to fine tune Llama3.2-Instruct model on custom...

13
Experimental
6503 itsmyfacade/itsmyfacade

Production-grade machine learning systems, model inference pipelines, and...

13
Experimental
6504 Afhrodite/Audio-LLM-Playground

A collection of audio transcription and summarization tools developed during...

13
Experimental
6505 hershdoshi55/proactive-llms

3-tier interruptible chatbot research NLP pipeline: fine-tuned DistilBERT...

13
Experimental
6506 ModouLaminJagne/Naive-RAG-Chatbot-RAG-BootCamp-Task-1

Build and deploy your own Naive Retrieval-Augmented Generation (RAG) system...

13
Experimental
6507 MujahidMalik7/fake-news-detection-experiments

Experiments with fake news detection using transformer models and classical...

13
Experimental
6508 mazsola2k/genaiprompt

GenAI LLM Python SDK-s

13
Experimental
6509 zanvari/llm-lab

Projects on BERT, LLMs, RAG, Document AI, and GenAI using HuggingFace,...

13
Experimental
6510 lawrenceokolo1/vit-faiss-product-recommendation

Production-grade visual product recommendation using ViT + FAISS on Amazon...

13
Experimental
6511 AndreaTribotti/Multi-Label-Story-Classification

Comparative analysis of 3 ML models for Multi-Label Story Classification on...

13
Experimental
6512 kanenorman/grassmann

Attempt at reproducing "Attention Is Not What You Need: Grassmann Flows as...

13
Experimental
6513 nikolareljin/finetorch

Rust-native LLM finetuning toolkit for LoRA/QLoRA, dataset preparation,...

13
Experimental
6514 enigmatronix13/Neural-Style-Transfer

Flask-based web app that performs Neural Style Transfer (NST) using...

13
Experimental
6515 maliknaik16/machine-learning

ML journey to explore concepts and framework through code and math. It...

13
Experimental
6516 jpgramajo/llm_db_memory

Efficient Long Short Term Memory for LLMs

13
Experimental
6517 AMfeta99/NLP_LLM

This repository is dedicated to small projects and some theoretical material...

13
Experimental
6518 Samya-S/Working-with-LLMs

Comprehensive collection of materials and code examples for working with...

13
Experimental
6519 Magnicord/llm-env-templates

A list of uv environments templates for LLM development.

13
Experimental
6520 charanpool/llm-cogs-optmizer

Intelligent middleware that reduces LLM COGS by routing queries between...

13
Experimental
6521 insooeric/LLM_Small

Long Language Model from SCRATCH

13
Experimental
6522 FlorinAndrei/llm-social-media-cheap

LLMs fine-tuned with social media comments on cheap hardware

13
Experimental
6523 oskarfernlund/noskGPT

Simple transformer-based language model which generates Shakespearian dialogue.

13
Experimental
6524 mominalix/LLM-Model-Distillation-for-Text-Classification-Models-GUI

GUI application that performs knowledge distillation from OpenAI models to...

13
Experimental
6525 Shengyu-Feng/TSMC4MATH

[ICLR2025] Step-by-Step Reasoning for Math Problems via Twisted Sequential...

13
Experimental
6526 rodgersmag/tinyllm

TinyLLM is a research project focused on developing and training compact,...

13
Experimental
6527 ayus1234/Text-Generation-with-GPT-2

A comprehensive toolkit for fine-tuning GPT-2 language models and generating...

13
Experimental
6528 patrikwolf/ttt_theory

Specialization after Generalization

13
Experimental
6529 OptimAI-Lab/RoSTE

[ICML 2025] Official code for the paper "RoSTE: An Efficient...

13
Experimental
6530 nabilshadman/llms-on-supercomputers

Jupyter Notebooks containing exercises and lectures for the Foundations of...

13
Experimental
6531 Fabio295/tinysafe-1

Detect harmful content with a 71M-parameter safety classifier using...

13
Experimental
6532 Ranjit2111/Transformer-NMT

A PyTorch implementation of the Transformer architecture from "Attention Is...

13
Experimental
6533 taljindergill78/AI-Indian-Recipe-Generator

AI-powered system that generates authentic Indian recipes using GPT-2 and...

13
Experimental
6534 DongmingShenDS/Mistral_From_Scratch

Mistral and Mixtral (MoE) from scratch

13
Experimental
6535 eshaaaan/tinygpt

🤖 Simplify understanding of large language models with TinyGPT, featuring a...

13
Experimental
6536 MyriamBA/Dialogue-Summarizer

An End-to-End Dialogue Summarization Project using LLMs.

13
Experimental
6537 ledesma-ivan/How-Transformer-LLMs-Work

Understand the architecture behind modern Large Language Models. This...

13
Experimental
6538 sayandeepmaity/luminator

Microphone Array-Based Direction of Arrival of Gunshot Detection .Gun...

13
Experimental
6539 VanCoconut/Customer-Satisfaction

A marketing analytics pipeline using Large Language Models (LLMs) to...

13
Experimental
6540 Aryan246cs/Log-Classification-System-GenAI

Hybrid log classification system combining Regex, Sentence Transformers +...

13
Experimental
6541 ShiningLab/CON2LM

This repository is for the paper Word Surprisal Correlates with Sentential...

13
Experimental
6542 FromZeroToFanatic/LLM_Practical_Implementation_Demo4

大模型实战学习路线阶段4:大模型应用实战(基于langchain搭建LLMs应用)与实战

13
Experimental
6543 capecoder08/llm-playground

Playing with tokenizers, transformers, and LLMs

13
Experimental
6544 husayni/gsm-u

Novel benchmark for underspecified queries

13
Experimental
6545 Farbod-Siahkali/Neural-Networks-and-Deep-Learning

University of Tehran Neural Networks & Deep Learning Course Projects

13
Experimental
6546 sourize/Decodex

This project implements a decoder-only GPT model from scratch using PyTorch.

13
Experimental
6547 krishnakoushik225/CLAP-Optimized-Text-to-Audio-Generation-AudioLDM-

Inference-time optimization for diffusion-based text-to-audio generation...

13
Experimental
6548 Riya-l209/ImageCaptioning_Segmentation

AI-powered Image Captioning & Segmentation | ViT-GPT2 + Mask R-CNN |...

13
Experimental
6549 ai-art-dev99/vision-language-caption-vqa

End-to-end BLIP + LLaVA project for image captioning and VQA with...

13
Experimental
6550 devtitus/Image-Caption-with-Pretrained-model

A simple yet powerful image captioning application that uses Salesforce's...

13
Experimental
6551 albertkjoller/transformer-redundancy

Code for the paper "How Redundant Is the Transformer Stack in Speech...

13
Experimental
6552 boomer3boom/ViT_Skin-Cancer-Diagnosis

Repository for deep learning course on the use of Vision Transformer to make...

13
Experimental
6553 nestivi/bachelor

This project implements an advanced Cascade Convolutional Neural Network...

13
Experimental
6554 DeborahAdedigba/Melanoma-Detection-using-Transformer

Implementation of a transformer-CNN ensemble framework for multi-class...

13
Experimental
6555 galassoandrea/skin-lesion-classification

Fine-tuning of a Transformer-based model for Skin Lesion Classification in...

13
Experimental
6556 usama13o/SwinCup

SwinCup Pytorch implementation of SwinCup: Cascaded Swin Transformer for...

13
Experimental
6557 MHosseinHashemi/NBML_BrTD

Implementation of "Realism in action: anomaly-aware diagnosis of brain...

13
Experimental
6558 Nahom32/ViT

An implementation of the vision transformer using CIFAR-10.

13
Experimental
6559 Moses-Omondi/ai-recruitment-assistant

Fine-tuned Llama 3.1 8B model for professional recruitment communications -...

13
Experimental
6560 zurielsingh/Resume-Evaluator

Transformer-based résumé scoring system using BERT regression to generate...

13
Experimental
6561 prakadeesh01/deepmatch-x

DeepMatch is an AI-powered pipeline that extracts structured data from...

13
Experimental
6562 DhruvilDhorajiya/TalentScout-AI

TalentScout is an AI-powered hiring assistant provide intelligent recruiting...

13
Experimental
6563 gustavoallves/cv-analyzer

Sistema de análise de currículos com IA usando Java, Spring Boot, Spring AI...

13
Experimental
6564 CptAswadu/LLMInsuranceWorkflow

This is a repo for code & data for 'Evaluating Large Language Model Agents...

13
Experimental
6565 CheongWoong/knowledge_probing

A repository for factual knowledge probing with large language models.

13
Experimental
6566 jklsnt/dictembed

A model!

13
Experimental
6567 nopgae/nlp-text-embedding-comparison

From N-grams to CLIP: comparing NLP embedding techniques including Word2Vec,...

13
Experimental
6568 Vishnusai17/NLP

Natural Language Processing projects implementing Transformers, BERT, and...

13
Experimental
6569 snairaadarsh/pdf-semantic-comparison

PDF comparison tool that uses transformer-based embeddings to identify...

13
Experimental
6570 Adithya1209/slm-architecture-benchmarks

Comparative study of Linear, MLP, Attention, and Transformer architectures...

13
Experimental
6571 LMLK-seal/ModelQuants

Professional Model Quantization Converter for HuggingFace Transformers

13
Experimental
6572 j341nono/LLMGusser

CLI guessing game to identify which LLM (Llama vs Gemma) generated text,...

13
Experimental
6573 YeonwooSung/KaggleTools

Tools for DataScience and AI

13
Experimental
6574 oalvarobraz/deep-learning-fundamentals

Mastering Deep Learning with PyTorch: A hands-on journey from basic tensors...

13
Experimental
6575 flaviagiammarino/time-series-llms

Time series LLMs sample notebooks

13
Experimental
6576 Kecski/introduction_to_machine_learning

Introduction to Machine Learning course projects at ETH Zürich

13
Experimental
6577 PeteNaJaXD/mt-code

🖥️ Build and customize your coding environment with mt-code, a lightweight...

13
Experimental
6578 rugwed9/neuron-academy

The most comprehensive free data science, ML & AI learning platform....

13
Experimental
6579 basicv8vc/LLM-Tool-Integrated-Reasoning-TIR-Papers

A curated collection of research papers on LLM Tool-Integrated Reasoning...

13
Experimental
6580 pandabeliz/UniversityProjects

Academic portfolio: ML, NLP & AI projects from my Bachelor's (Cognitive...

13
Experimental
6581 panastasiadis/ai-iot-course-projects

This repository contains three AI and IoT application projects developed as...

13
Experimental
6582 IbraahimLab/Mine

A hands-on AI learning and experimentation repository focused on large...

13
Experimental
6583 Rekhii/Deep-Learning

Daily ML practice notebooks covering tabular data, deep learning, and...

13
Experimental
6584 gwaisey/Analysis-LLM-Implementation

Technical analysis of LLM architectures and their impact on HCI, featuring...

13
Experimental
6585 mkdirer/Artificial-Neural-Networks-and-Machine-Learning-Laboratory

Examples of labs from studies

13
Experimental
6586 vpgits/sdgp-ml

This repository contains notebooks and resources related to the Software...

13
Experimental
6587 linzeyang/competitions

info/data/solutions for various competitions

13
Experimental
6588 AnastasiaZAYU/ml-fundamentals-labs

Data Analysis & Machine Learning: from classical Scikit-learn algorithms to...

13
Experimental
6589 ml-dev-world/the-era-of-foundation-models

Comprehensive course covering modern AI foundation models, including...

13
Experimental
6590 popcorn0118/abdulvahapmutlu

🚀 Build and enhance machine learning systems with practical tools and...

13
Experimental
6591 AbhijitKumarJ/Learn_AcceleratedComputing

Learn Accelerated Computing

13
Experimental
6592 jubaiya12-glitch/code_switching_translator

A multilingual NLP-based translator that detects and translates...

13
Experimental
6593 AlirezaKamyab/NMT-Project

Neural Machine translation project using Transformers to translate English to Persian

13
Experimental
6594 Abdelrahman-Elshahed/English-Arabic-NLP-Translator

A dual-approach English to Arabic translation system implementing both...

13
Experimental
6595 KaushiML3/Transformer_Language-translation

This repository contains a end-to-end Transformer-based language translation...

13
Experimental
6596 MBadriNarayanan/MachineTranslation

English to French neural machine translator built using Transformer...

13
Experimental
6597 Wojtekb30/GPT-2-B200-pre-trainier

Code for pre-training a GPT-2 model on (eight) NVIDIA DGX B200 GPUs and...

13
Experimental
6598 totoberni/COMP6258-RCoTFormer

Reproducing Mohtashami et al's ICLR 2025 Paper. See website for source.

13
Experimental
6599 mcquerol/a-critical-assesment-of-electrical-machines-engineering

Simulations and analysis related to electrical machines engineering using...

13
Experimental
6600 Aishwar1/Search-Autocomplete-System

A real-time search autocomplete system that predicts and ranks multi-word...

13
Experimental
« Prev 1 2 3 64 65 66 67 68 76 77 78 Next »