All Transformer Models

7,795 models ranked by quality score · Page 31 of 78

Showing 3001–3100 of 7,795
# Model Score Tier
3001 plusnli/medical-knowledge-judgment

Codes and data for paper "Fact or Guesswork? Evaluating Large Language...

29
Experimental
3002 Moeinh77/Virus-DNA-classification-BERT

Classification of 6 viruses including covid-19 based on their DNA sequences...

29
Experimental
3003 PRITHIVSAKTHIUR/Fire-Detection-Siglip2

Fire-Detection-Siglip2 is an image classification vision-language encoder...

29
Experimental
3004 Shinichi0713/LLM-fundamental-study

this site is the fundamental page of LLM-mechanism

29
Experimental
3005 SWCapstone2021/NLP

2021 Ajou University Spring SW capstone design - FindU NLP (Winning the gold...

29
Experimental
3006 sermare/DeepOff

Deep Learning to predict phenotype score associated with heritable gene...

29
Experimental
3007 laelhalawani/glai

glai - GGUF LLAMA AI - Package for simplified model handling and text...

29
Experimental
3008 BhavikBhindora/SmartyPantsAIChatBot

This repo contains GLM model for generating text, analyzing...

29
Experimental
3009 DreamerGPT/DreamerGPT

🌱 梦想家(DreamerGPT):中文大语言模型指令精调

29
Experimental
3010 mohrez86/pyllmut

A research-based LLM-driven mutant generator library for Python

29
Experimental
3011 zer0int/CLIP-fine-tune-registers-gated

Vision Transformers Needs Registers. And Gated MLPs. And +20M params. Tiny...

29
Experimental
3012 eslambakr/LAR-Look-Around-and-Refer

This is the official implementation for our paper;"LAR:Look Around and Refer".

29
Experimental
3013 AniketRajpoot/Automated-Headline-and-Sentiment-Generator

A very simple repo for Text Classification, Sentiment Identification and...

29
Experimental
3014 UKPLab/arxiv2025-misleading-visualizations

Code and datasets accompanying the arXiv preprint: "Protecting multimodal...

29
Experimental
3015 Cabbagito/Generating-South-Park-Episodes

Screw You Guys I'm Going Home

29
Experimental
3016 WooooDyy/LLM-Reverse-Curriculum-RL

Implementation of the ICML 2024 paper "Training Large Language Models for...

29
Experimental
3017 phkhanhtrinh23/spelling_correction_project

This spelling correction project helps people fix English spelling mistakes....

29
Experimental
3018 WisconsinAIVision/YoLLaVA

🌋👵🏻 Yo'LLaVA: Your Personalized Language and Vision Assistant (NeurIPS 2024)

29
Experimental
3019 VPanjeta/PyLLaMa-CPU

Fast LLaMa inference on CPU using llama.cpp for Python

29
Experimental
3020 CRGBS/DiscordLLaMABOT

let's on discord use llama large lanuage model

29
Experimental
3021 EagleW/Chem-FINESE

Official implementation of the EACL Findings 2024 paper: Chem-FINESE:...

29
Experimental
3022 P-r-e-m-i-u-m/PROXY

Self-hosted OpenAI-compatible reverse proxy with multi-provider load balancing

29
Experimental
3023 calcuis/gguf-selector

GGUF selector

29
Experimental
3024 Atenrev/forocoches-language-generation

This is a PyTorch implementation of a decoder only transformer inspired on...

29
Experimental
3025 Ludobico/KakaoChatData

카카오톡 대화 데이터셋

29
Experimental
3026 liuqidong07/LLM4CDSR-pytorch

[SIGIR'25] The official implementation code of LLM4CDSR

29
Experimental
3027 fabienfrfr/tptt

😊 TPTT: Transforming Pretrained Transformers into Titans

29
Experimental
3028 allenai/staged-training

Staged Training for Transformer Language Models

29
Experimental
3029 CoffeeVampir3/ez-trainer

Train Llama Loras Easily

29
Experimental
3030 corl-team/lime

Official implementation of the paper "You Do Not Fully Utilize Transformer's...

29
Experimental
3031 wschella/llm-reliability

Code for the paper "Larger and more instructable language models become less...

29
Experimental
3032 1ucky40nc3/TREX

🦖 : Technology for Reliable Extensive Chatbot Systems

29
Experimental
3033 gentaiscool/few-shot-lm

The source code of "Language Models are Few-shot Multilingual Learners" (MRL...

29
Experimental
3034 zwhe99/X-SIR

[ACL 2024] Can Watermarks Survive Translation? On the Cross-lingual...

29
Experimental
3035 emrecncelik/zeroshot-turkish

Evaluation of zero-shot classification models on Turkish datasets.

29
Experimental
3036 abhijitpal1247/TripplannerBot

This a streamlit app with langchain. It makes use of Bing maps API,...

29
Experimental
3037 Gary3410/TaPA

[arXiv 2023] Embodied Task Planning with Large Language Models

29
Experimental
3038 UgurkanTech/ArchNetAI

ArchNetAI is a Python library that leverages the Ollama API for generating...

29
Experimental
3039 wafflecomposite/langchain-ask-pdf-local

An AI-app that allows you to upload a PDF and ask questions about it. It...

29
Experimental
3040 dependentsign/Awesome-LLM-based-Evaluators

✨✨Latest Papers about LLM-based Evaluators

29
Experimental
3041 alex-snd/TRecover

📜 A python library for distributed training of a Transformer neural network...

29
Experimental
3042 awilliamson10/clipora

Clipora is a powerful toolkit for fine-tuning OpenCLIP models using Low Rank...

29
Experimental
3043 chuksoo/imdb_movie_sentiment_analysisNLP

Practicum by Yandex Project 13: In this natural language processing project,...

29
Experimental
3044 antofuller/configaformers

A python library for highly configurable transformers - easing model...

29
Experimental
3045 Betswish/Cross-Lingual-Consistency

Easy-to-use framework for evaluating cross-lingual consistency of factual...

29
Experimental
3046 SteveKGYang/MetaAligner

Models, data, and codes for the paper: MetaAligner: Towards Generalizable...

29
Experimental
3047 YRL-AIDA/RuTaBERT

RuTaBERT is a framework for solving column type and property annotation...

29
Experimental
3048 Atomic-man007/falcon-7b-lora-fine-tuning

falcon-7b-lora-fine-tuning

29
Experimental
3049 QKV-Core/QKV-Core

"Adaptive Hybrid Quantization Framework for deploying 7B+ LLMs on low-VRAM...

29
Experimental
3050 avilum/llama-saas

A client/server for LLaMA (Large Language Model Meta AI) that can run ANYWHERE.

29
Experimental
3051 PeterGriffinJin/Patton

Patton: Language Model Pretraining on Text-rich Networks (ACL 2023 main oral)

29
Experimental
3052 procesaur/Scratch2LM

Training transformer models (e.g. RoBERTa, GPT2 and GPT-J) from scratch.

29
Experimental
3053 bradym05/Bitcoin-Trader-ML

Automated 24/7 bitcoin trader for Coinbase using Transformer Neural Networks

29
Experimental
3054 ritaranx/BMRetriever

[EMNLP 2024] This is the code for our paper "BMRetriever: Tuning Large...

29
Experimental
3055 ianchute/generative-reflections

A two-model system for reasonable text generation

29
Experimental
3056 kyegomez/MMCA-MGQA

Experiments around using Multi-Modal Casual Attention with Multi-Grouped...

29
Experimental
3057 ITMO-NSS-team/sea_ice_transformers

This repository contains code for the research of transformer effectiveness...

29
Experimental
3058 Victorwz/MLM_Filter

Official implementation of our paper "Finetuned Multimodal Language Models...

29
Experimental
3059 xvyaward/owq

Code for the AAAI 2024 Oral paper "OWQ: Outlier-Aware Weight Quantization...

29
Experimental
3060 gperdrizet/llms-demo

Demonstration of LLM hosting strategies and framesworks for simple chatbots

29
Experimental
3061 kanad13/MultiAI-Query

MultiAI-Query: Work with multiple AI models with unified API calls.

29
Experimental
3062 nuhmanpk/Awesome-open-LLM

Awesome-Open-LLM : a curated list of open-source Large Language Models (LLMs)

29
Experimental
3063 aju22/LLaMA2

This repository contains an implementation of the LLaMA 2 (Large Language...

29
Experimental
3064 HSaurabh0919/CTransformers

Implementing wide variety of transformers, fine tuning as well as trying...

29
Experimental
3065 pluja/maestro

Turn natual language into commands. Your CLI tasks, now as easy as a...

29
Experimental
3066 Merserk/Caption-Creator

Caption Creator is a fast and portable tool for generating high-quality...

29
Experimental
3067 cokeshao/HoliTom

[NeurIPS 2025] HoliTom: Holistic Token Merging for Fast Video Large Language Models

29
Experimental
3068 uSaiPrashanth/gpt-j-finetune

Parallelizes finetuning of gpt-j on P3 dataset across multiple gpu nodes

29
Experimental
3069 muhac/llm-actions

Run LLMs for inference in GitHub Actions - add to your workflow!

29
Experimental
3070 DataArcTech/ChartMoE

[ICLR2025 Oral] ChartMoE: Mixture of Diversely Aligned Expert Connector for...

29
Experimental
3071 michaelhly/FarGlot

A Transformer-based SocialNLP toolkit for Farcaster

29
Experimental
3072 NebeyouMusie/AI-Blog-Assistant

In this project I've built a streamlit web app that leverages the LLAMA3.1...

29
Experimental
3073 HamzaG737/Sentence-segmentation

Distilbert model for sentence segmentation.

29
Experimental
3074 xlang-ai/text2reward

[ICLR 2024 Spotlight] Text2Reward: Reward Shaping with Language Models for...

29
Experimental
3075 quarozox/happy-llm

🚀 Explore Happy-LLM, a tool designed to enhance interactions with language...

29
Experimental
3076 gaomingzhao666/AI-Prompts

A fast and modern web page that lists useful and favorite AI/GPT prompts,...

29
Experimental
3077 bminixhofer/zett

Code for Zero-Shot Tokenizer Transfer

29
Experimental
3078 BeinuoYang/Awesome-LLM4Opt

A curated list of Large Language Models (LLMs) for optimization problem...

29
Experimental
3079 Adriankhl/godot-llm-template

Godot LLM Template/Demo

29
Experimental
3080 brucewlee/nutcracker

Large Model Evaluation Experiments

29
Experimental
3081 WangRongsheng/Chinese-LLaMA-Alpaca-Usage

📔 对Chinese-LLaMA-Alpaca进行使用说明和核心代码注解

29
Experimental
3082 ksm26/Function-Calling-and-Data-Extraction-with-LLMs

Master the techniques of function-calling and structured data extraction...

29
Experimental
3083 roychowdhuryresearch/gsw-memory

Long term Structured Memory for Large Language Models

29
Experimental
3084 pranavmangal/termq

A simple tool to query LLMs from the terminal

29
Experimental
3085 akanyaani/Illustrated_GPT2_With_Code

Explained GPT-2 Transformer model step by step with code.

29
Experimental
3086 Merkoba/Meltdown

An interface for llama.cpp, ChatGPT, Gemini, and Claude

29
Experimental
3087 affjljoo3581/Inverse-DALL-E-for-Optical-Character-Recognition

Inverse DALL-E for Optical Character Recognition

29
Experimental
3088 vicgalle/merging-self-critique-jailbreaks

"Merging Improves Self-Critique Against Jailbreak Attacks", code and models

29
Experimental
3089 Nkluge-correa/Model-Library

The Model Library is a project that maps the risks associated with modern...

29
Experimental
3090 Ketis21/KetisBot

KetisBot is a powerful Discord AI chatbot using KoboldCpp for text...

29
Experimental
3091 gmongaras/Wizard_QLoRA_Finetuning

Finetuning Some Wizard Models With QLoRA

29
Experimental
3092 eltoto1219/vltk

A toolkit for vision-language processing to support the increasing...

29
Experimental
3093 SALT-NLP/Adaptive-Compositional-Modules

Code for the ACL 2022 paper "Continual Sequence Generation with Adaptive...

29
Experimental
3094 astorfi/LLM-Alignment-Project

A comprehensive template for aligning large language models (LLMs) using...

29
Experimental
3095 lucianoayres/nino-cli

Nino is a CLI tool that interacts with local language models via Ollama's...

29
Experimental
3096 mehdihosseinimoghadam/AVA-Llama-3

Fine-Tuned Llama 3 Persian Large Language Model LLM / Persian Llama 3

29
Experimental
3097 AndyyyYuuu/lm-is-compressor

An accurate language model is a high-compression, lossless data compressor

29
Experimental
3098 jinzhuoran/RWKU

RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language...

29
Experimental
3099 rameshvarun/magic-lamp

Magic LLM-powered Python functions that return anything you ask for. Many caveats.

29
Experimental
3100 nobel-postech/mirror

Code and data for "MIRROR: Multimodal Cognitive Reframing Therapy for...

29
Experimental
« Prev 1 2 3 29 30 31 32 33 76 77 78 Next »