All Transformer Models

7,795 models ranked by quality score · Page 49 of 78

Showing 4801–4900 of 7,795
# Model Score Tier
4801 Maryamm-2/Emotion-Classification-with-Explainable-AI-Techniques

A modular framework for emotion classification and explainability, comparing...

21
Experimental
4802 snevil/news-article-classification-robust-ensembles

A robust news article classification system combines a two-stage linear...

21
Experimental
4803 viochris/Insightify-Sentiment-API

A robust NLP microservice powering Insightify. Leverages FastAPI and...

21
Experimental
4804 nv-legate/multimesh-jax

PjRt plugin and Python APIs for MPMD workflows in Jax

21
Experimental
4805 AleNard89/py-pytorch-invoice

Automated invoice data extraction using LayoutLMv3 (PyTorch) with PyQt6...

21
Experimental
4806 invergent-ai/surogate-website

Website for surogate.ai

21
Experimental
4807 nisalgunawardhana/Github-Models-Demo

This repository is part of the GitHub Models session and is designed to help...

21
Experimental
4808 carlomarxdk/tab2seq

Transform tabular event data into sequences ready for Transformer and...

21
Experimental
4809 rbourgeat/llm-rp

✨ Your Custom Offline Role Play with LLM and Stable Diffusion on Mac and...

21
Experimental
4810 Tek233/Book-Recommender

A semantic book recommender

21
Experimental
4811 MonitooDev/indiedroid-nova-llm

🚀 Benchmark local LLMs like Llama 3.1 on the Indiedroid Nova with RK3588...

21
Experimental
4812 ldr7/language_model_from_scratch

Build a language model from scratch.

21
Experimental
4813 Cobkgukgg/forgenn

Modern neural networks in pure NumPy - Transformers, ResNet, and more

21
Experimental
4814 abdulvahapmutlu/abdulvahapmutlu

My Profile

21
Experimental
4815 bijinc/speculoos

efficient speculative sampling for language models

21
Experimental
4816 CS-433/ml-project-2-mlp

Advancing Homepage2Vec with LLM-Generated Datasets for Multilingual Website...

20
Experimental
4817 groloch/LocalLlm

Drop-in and advanced solutions to experiment with open source LLM !

20
Experimental
4818 tbogdala/woolyrust

A high-level Rust wrapper around llama.cpp for text generation AI with LLMs.

20
Experimental
4819 smri29/SolarTwinUstt

Features a real-time Streamlit Digital Twin dashboard, pre-trained USTT...

20
Experimental
4820 HEMANGANI/Fine-Tuning-LLM-for-QA

Fine-Tuning Large Language Models for Question Answering

20
Experimental
4821 llamajs/llama

A dynamic logger for the dynamic developer

20
Experimental
4822 kantkrishan0206-crypto/LoRAForge-

Build a production‑grade, modular pipeline for fine‑tuning large language...

20
Experimental
4823 mbkma/Lexi-Voice-Assistant

A fully functional 100% offline voice assistant with multi-language support.

20
Experimental
4824 gdiaz384/py3TranslateLLM

Translates text using Artificial Intelligence (AI) that supports both NMTs and LLMs.

20
Experimental
4825 xxxbf0222/LlamaDeck

A command-line tool for quickly managing and experimenting with multiple...

20
Experimental
4826 ammarhydr/MobilityGPT

PyTorch implementation of MobilityGPT model: https://arxiv.org/abs/2402.03264

20
Experimental
4827 RobinSmits/Dutch-NLP-Experiments

This repository contains a number of experiments with Multi Lingual...

20
Experimental
4828 ShadowMonarchX/Finetuning-LLM-main

A practical repo for fine-tuning LLMs using QLoRA, PEFT, and other efficient...

20
Experimental
4829 gmongaras/Cottention_Transformer

Code for the paper "Cottention: Linear Transformers With Cosine Attention"

20
Experimental
4830 romanyn36/whisperx-asr-with-fastapi

WhisperX ASR is a FastAPI-based application for automatic speech...

20
Experimental
4831 Anri-Lombard/Mamba-SAFE

Generating Molecules with the Mamba architecture

20
Experimental
4832 kylebrussell/cap-rlvr

CAP RLVR: Reinforcement Learning from Human Feedback for Legal Reasoning...

20
Experimental
4833 karan51ngh/LocalLLMtextClassification

This project implements a text classification system powered by Large...

20
Experimental
4834 lucataco/cog-phi-3-mini-4k-instruct

Cog wrapper for the Transformers implementation of microsoft/Phi-3-mini-4k-instruct

20
Experimental
4835 mazurkin/ptn

train own virtual "PTN" LLM model

20
Experimental
4836 bassrehab/speculative-decoding

Reference implementation of LLM inference acceleration techniques. Includes...

20
Experimental
4837 ghassenov/llm_from_scratch

A GPT-2 model from scratch built to explore the inner workings of...

20
Experimental
4838 dcarpintero/pangolin-guard

Open, Lightweight Model for AI Safety.

20
Experimental
4839 hideyuki001/translation-os

Structure-first, language-agnostic Translation OS for deterministic,...

20
Experimental
4840 asigalov61/MIDI-TXT-MIDI

A much-needed implementation of a bi-directional MIDI processor for symbolic...

20
Experimental
4841 simocolo/nnDrain

A PyTorch implementation for structural pruning applied to neural networks...

20
Experimental
4842 iandennismiller/calm

A peaceful user experience for Large Language Models. Calm automatically...

20
Experimental
4843 Yangyi-Chen/LM-TOAST

Source code for ACL 2023 Findings paper "Making Pre-trained Language Models...

20
Experimental
4844 Lucasc-99/NoTorch

A from-scratch neural network and transformers library, with speeds rivaling PyTorch

20
Experimental
4845 QuantiusBenignus/Zshelf

Zsh-centric command-line interface for interacting with local Large Language...

20
Experimental
4846 nsi319/Question-Answering-KG

This module is for QA generation from legal documents. The information is...

20
Experimental
4847 Eden-Eldith/WiggleGPT

WiggleGPT is an language model that integrates bio-inspired neural...

20
Experimental
4848 designer-coderajay/induction-head-detector

Mechanistic interpretability tool to detect induction heads in GPT-2 using...

20
Experimental
4849 iug-htw/GPTAndPrejudice

Research framework for training and interpreting a custom GPT-style language...

20
Experimental
4850 OthmanMohammad/Longformer-Learning-Next-Generation-Sentiment-Analysis

This project applies the Longformer model to sentiment analysis using the...

20
Experimental
4851 marlo-z/reversal_curse_analysis

Code for 'Towards a Theoretical Understanding of the 'Reversal Curse' via...

20
Experimental
4852 313mystery303/vla0-trl

🔍 Explore a minimal reimplementation of VLA-0 with TRL, achieving 90% LIBERO...

20
Experimental
4853 dhcode-cpp/cut-cross-entropy-pytorch

pytorch notebook for implemention for cut-cross-entropy LLM training.

20
Experimental
4854 lorenzomaiuri-dev/quantum-gpt

A hybrid Quantum-Classical Transformer implementation based on nanoGPT,...

20
Experimental
4855 kyegomez/GATS

Implementation of GATS from the paper: "GATS: Gather-Attend-Scatter" in...

20
Experimental
4856 EngrEeshaKhan/Learning-Agency-Lab---Automated-Essay-Scoring-2.0

Improve upon essay scoring algorithms to improve student learning outcomes

20
Experimental
4857 foyzulkarim/transformers-tasks

Examples and tutorials demonstrating various NLP tasks using HuggingFace...

20
Experimental
4858 arunpshankar/VAI-FineTuning-LLMs

"Clean and comprehensive examples for fine-tuning LLMs supported by Vertex...

20
Experimental
4859 Vadimbuildercxx/looped_transformer

Experimental implementation of "Looped Transformers are Better at Learning...

20
Experimental
4860 JosephTLucas/llm_test

A suite of tests to verify bias, safety, trust, and security concerns for LLMs.

20
Experimental
4861 Ari-S-123/pii-masking

Improved PII masking performance in adversarial conditions and diverse...

20
Experimental
4862 kyegomez/AttnWithConvolutions

Interleaved Attention's with convolutions for text modeling

20
Experimental
4863 lorenzflow/robust-moa

This is the official repository for the paper: This is your Doge: Exploring...

20
Experimental
4864 RoboGamer1HD/Transformers-Demonware-Server

Transformers Games by High Moon Studios. Transformers ReEnergized Server...

20
Experimental
4865 kyopark2014/LLM-LangChain

It shows how to use langchain for sagemaker endpoint.

20
Experimental
4866 snoop2head/Deep-Encoder-Shallow-Decoder

🤗 Huggingface Implementation of Kasai et al(2020) "Deep Encoder, Shallow...

20
Experimental
4867 timweri/alpaca.cpp-bot

Serve alpaca.cpp as chat bots

20
Experimental
4868 Spectrewolf8/PHi-3-SQL-generation-fine-tune-experiment

A fine-tuned version of Phi-3-mini-4k-instruct for generating SQL queries...

20
Experimental
4869 hello-shohanur/Fine-Tuning-Llama-on-Bengali-Empathetic-Conversations

A fine-tuned LLaMA 3.1-8B-Instruct to generate empathetic responses in...

20
Experimental
4870 sorcero/ingestum

Read-only mirror of https://gitlab.com/sorcero/community/ingestum

20
Experimental
4871 yc-w-cn/llm-leaderboard

LLM模型对比排行榜 - 帮助用户快速比较不同大语言模型的性能指标、价格和规格

20
Experimental
4872 quentinwendegass/tiny-llm

Research project for LLMs

20
Experimental
4873 Dylsimple60/RLHF_learn

🤖 Enhance reinforcement learning stability and efficiency with advanced...

20
Experimental
4874 shreyanmitra/CandyLLM

A simple, easy-to-use framework for HuggingFace and OpenAI text-generation...

20
Experimental
4875 dhargopala/xplain

Python Library to compute the XPLAIN score for LLM expainability.

20
Experimental
4876 khu-bot/ai-essayist

SKKU AI X Bookathon 4회 [쿠봇] 팀의 레포지토리입니다.

20
Experimental
4877 xyproto/usermodel

Get per-task Ollama models

20
Experimental
4878 141forever/UncerSema4HalluDetec

This is the repository for the paper 'Enhancing Uncertainty Modeling with...

20
Experimental
4879 Harshalnikumbh/RAG-Rathee

RAGRathee lets users ask questions about Dhruv Rathee’s videos and get...

20
Experimental
4880 kyegomez/VO-ROPE

An implementation of the all-new rope from jianlin

20
Experimental
4881 kyegomez/LongVit

A simplistic pytorch implementation of LongVit using my previous...

20
Experimental
4882 AndrewBoessen/neural-game-engine

Neural network approach for modeling interactive game environments using...

20
Experimental
4883 rishub-tamirisa/transformer-mlm

Implementation of Transformer Encoders / Masked Language Modeling Objective

20
Experimental
4884 torotoki/reasoning-minimal

Minimal code to train reasoning model with reinforcement learning.

20
Experimental
4885 perceivelab/Transformer-IPMN

Official PyTorch implementation for paper: "Neural Transformers for...

20
Experimental
4886 Ashish-kharde1/Micro-Reasoner-Qwen

Lightweight reasoning-capable LLM built on Qwen3-4B using LoRA and 4-bit inference

20
Experimental
4887 MyDarapy/gpt-1-from-scratch

Rewriting and pretraining GPT-1 from scratch. Implementing Multihead...

20
Experimental
4888 nerdimite/abstract-to-title-generator

A T5-based Seq2Seq Model that Generates Titles for Machine Learning Papers...

20
Experimental
4889 zaaachos/Thesis-Diagnostic-Captioning

B.Sc. Thesis Deep Learning & NLP research on Medical Image Captioning

20
Experimental
4890 Optum/long-medical-document-lms

Explain and train language models that extract information from long medical...

20
Experimental
4891 safouaneelg/zeroshot-reasoning

Ollama structured output for visual zeroshot reasoning

20
Experimental
4892 ducnh279/Align-LLMs-with-DPO

Align a Large Language Model (LLM) with DPO loss

20
Experimental
4893 USP-Open-Code/icmc-open-chatbot

Este projeto tem como objetivo desenvolver um chatbot para ajudar alunos a...

20
Experimental
4894 samestrin/llm-translate

LLM Translate is an open-source translation tool that uses Large Language...

20
Experimental
4895 nbathreya/Signal-to-Sequence-Transformer

Deep learning classifier for 1D signal data with transformer architecture.

20
Experimental
4896 isshiki-dev/docker-model-runner

Self-hosted Anthropic API Compatible Inference Server with Claude Code...

20
Experimental
4897 tejasvaidhyadev/ALBERT.jl

ALBERT(A Lite BERT for Self-Supervised Learning of Language Representations)...

20
Experimental
4898 augstentatious/TRuCAL

TRuCAL: Truth-Recursive universal Correction Attention Layer An open-source...

20
Experimental
4899 yangzhch6/AlignedCoT

Implementation of our paper "Speak Like a Native: Prompting Large Language...

20
Experimental
4900 codewithdark-git/Bias

Steer Language Models with Interpretable SAE Features

20
Experimental
« Prev 1 2 3 47 48 49 50 51 76 77 78 Next »