All Transformer Models

7,795 models ranked by quality score · Page 27 of 78

Showing 2601–2700 of 7,795

#	Model	Score	Tier	Category	Stars	Language
2601	arshadshk/Last_Query_Transformer_RNN-PyTorch Implementation of the paper "Last Query Transformer RNN for knowledge...	32	Emerging	transformer-architecture-tutorials	44	Python
2602	HiThink-Research/BizFinBench A Business-Driven Real-World Financial Benchmark for Evaluating LLMs	32	Emerging	domain-specific-benchmarks	211	Python
2603	katha-ai/EmoTx-CVPR2023 [CVPR 2023] Official code repository for "How you feelin'? Learning Emotions...	32	Emerging	emotion-detection-transformers	58	Python
2604	Mmorgan-ML/Phase-Slip-Sampler Phase-Slip is a stochastic intervention architecture that operates on the...	32	Emerging	llm-implementation-from-scratch	6	Python
2605	varchasvee108/vision-transformer-maze-agent Vision Transformer agent that learns to navigate mazes while visualizing...	31	Emerging	vision-transformer-implementations	3	Python
2606	asiff00/Bangla-Llama Fine tuned llama 3 models for context based question answering in bengali language.	31	Emerging	local-rag-frameworks	18	Jupyter Notebook
2607	ai-art-dev99/llm-from-scratch Build a Large Language Model From Scratch	31	Emerging	llm-implementation-from-scratch	22	Jupyter Notebook
2608	xiaoachen98/Open-LLaVA-NeXT An open-source implementation for training LLaVA-NeXT.	31	Emerging	vision-language-instruction-tuning	436	Python
2609	catherinesyeh/story-viz Reimagining storyline visualizations with LLMs (VIS 2025)	31	Emerging	llm-data-visualization	7	Jupyter Notebook
2610	prateekralhan/Deep-Question-Answering-System A deep learning based Q&A system built using RoBerTa model from huggingface...	31	Emerging	question-answering-systems	4	Python
2611	laclouis5/uform-coreml-converters CLI for converting UForm models to CoreML.	31	Emerging	transformer-frameworks-wrappers	3	Python
2612	conceptofmind/PaLM-flax Implementation of the SOTA Transformer architecture from PaLM - Scaling...	31	Emerging	transformer-frameworks-wrappers	14	Python
2613	patricia-pereira/cd-erc Code for the paper: Context-Dependent Embedding Utterance Representations...	31	Emerging	emotion-detection-transformers	3	Python
2614	john-osborne-j/quantized-clinicalbert This repository contains a 4-bit quantized ClinicalBERT model for disease...	31	Emerging	clinical-text-classification	4	Python
2615	Katashynskyi/Voice_assistant_UA_EN No api-keys | local | llama3.1 For language studying and live translation	31	Emerging	conversational-chatbot-applications	3	Python
2616	maxxxzdn/erwin Erwin: A Tree-based Hierarchical Transformer for Large-scale Physical...	31	Emerging	transformer-architecture-tutorials	112	Python
2617	ropensci/pangoling An R package for estimating the log-probabilities of words in a given...	31	Emerging	model-evaluation-diagnostics	12	R
2618	NC0DER/GreekT5 A series of Greek News Summarization Sequence-to-Sequence Models built with...	31	Emerging	text-summarization-transformers	2	Python
2619	ASK-03/Reverse-Chain Implementation of paper - Reverse Chain: A Generic-Rule for LLMs to Master...	31	Emerging	langchain-integration-patterns	16	Python
2620	vcanchik/robotmem Robot memory	31	Emerging	agent-memory-systems	—	Python
2621	asiff00/Bengali-Sentence-Error-Correction Fine-tune mBart 50 for Bengali Sentence Error Correction	31	Emerging	bert-model-implementations	4	Jupyter Notebook
2622	dsdanielpark/hf-transllm LLMtranslator translates and generates text in multiple languages.	31	Emerging	llm-translation-tools	45	Jupyter Notebook
2623	RaptorMai/MLLM-CompBench [NeurIPS'25] MLLM-CompBench evaluates the comparative reasoning of MLLMs...	31	Emerging	domain-specific-benchmarks	44	Jupyter Notebook
2624	PRITHIVSAKTHIUR/Nvidia-Cosmos-Reason1-Demo Physical AI models understand physical common sense and generate appropriate...	31	Emerging	multimodal-fusion-transformers	2	Python
2625	Merterm/Modeling-Intensification-for-SLG Public repo for the paper: "Modeling Intensification for Sign Language...	31	Emerging	3d-vision-transformers	14	Python
2626	SCRN-VRC/Language-Translation-with-Fragment-Shaders EN to JP and JP to EN with transformer models	31	Emerging	neural-machine-translation	98	ShaderLab
2627	Qwen-Applications/CLIPO CLIPO: Contrastive Learning in Policy Optimization Generalizes RLVR	31	Emerging	llm-reasoning-research	10	Python
2628	Curated-Awesome-Lists/Awesome-Llama3 A curated, awesome list of resources, tools, and projects for the AI Large...	31	Emerging	multilingual-llm-adaptation	3	—
2629	bobazooba/shurale Conversation AI model for open domain dialogs	31	Emerging	conversational-chatbot-applications	4	Python
2630	ryokamoi/llm-self-correction-papers List of papers on Self-Correction of LLMs.	31	Emerging	math-reasoning-datasets	80	—
2631	KhaledSharif/robot-transformers Train and evaluate an Action Chunking Transformer (ACT) to perform...	31	Emerging	transformer-architecture-tutorials	17	Python
2632	curtisgray/wingman Wingman is the fastest and easiest way to run Llama models on your PC or Mac.	31	Emerging	llm-terminal-automation	44	TypeScript
2633	ItzDerock/llama-playground A simple to use and powerful web-interface to mess around with Meta's LLaMA LLM.	31	Emerging	interactive-ai-chat-uis	16	TypeScript
2634	avatsaev/av-local-llm-api Allows to easily run local REST API with a custom LLM, running locally or...	31	Emerging	local-llm-deployment	4	Python
2635	baldoarbol/BodyShapeGPT Fine-tuned LLMs generate accurate 3D human avatars from textual descriptions...	31	Emerging	multimodal-vision-language	37	Python
2636	akshat0123/GPT-1 Pytorch implementation of GPT-1	31	Emerging	gpt-multilingual-training	35	Python
2637	azzeddineCH/flash-nanoGPT Jax/Flax re-write of @karpathy 🐐 NanoGPT using some of the common Jax...	31	Emerging	transformer-frameworks-wrappers	3	Python
2638	longyuewangdcu/Chinese-Llama-2 improve Llama-2's proficiency in comprehension, generation, and translation...	31	Emerging	llm-frameworks-libraries	441	Python
2639	tongnie/ImputeFormer [KDD 2024] "ImputeFormer: Low Rankness-Induced Transformers for...	31	Emerging	transformer-interpretability-mechanistic	51	Python
2640	bentoml/transformers-nlp-service Online Inference API for NLP Transformer models - summarization, text...	31	Emerging	llm-inference-serving	45	Python
2641	JinXins/Awesome-Token-Merge-for-MLLMs A paper list about Token Merge, Reduce, Resample, Drop for MLLMs.	31	Emerging	llm-training-experimentation	86	—
2642	ntropy-network/enrichment_models This repository benchmarks Ntropy API against different Large Language Models...	31	Emerging	llama-model-implementations	34	Jupyter Notebook
2643	AhmetZamanis/DeepLearningEnergyForecasting Time series forecasting on an hourly energy dataset, with LSTM & Transformer...	31	Emerging	time-series-forecasting-transformers	3	Jupyter Notebook
2644	codeastra2/llm-feat Automated feature engineering using Large Language Models (LLMs) for tabular data	31	Emerging	llm-data-labeling	3	Jupyter Notebook
2645	naity/finetune-esm Scalable Protein Language Model Finetuning with Distributed Learning and...	31	Emerging	llm-fine-tuning	34	Jupyter Notebook
2646	vmarinowski/infini-attention An unofficial pytorch implementation of 'Efficient Infinite Context...	31	Emerging	transformer-architecture-tutorials	55	Python
2647	ImplicitLayer/multiagent_environments Environments for NLP multiagent tasks	31	Emerging	ml-foundations-curricula	2	Python
2648	liziniu/policy_optimization Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)	31	Emerging	rlhf-alignment-training	28	Python
2649	JiauZhang/nnm Neural Network Models	31	Emerging	llm-learning-resources	7	Python
2650	Relaxed-System-Lab/HexGen [ICML 2024] Serving LLMs on heterogeneous decentralized clusters.	31	Emerging	llm-inference-engines	34	Python
2651	Koziev/LM-pretrain Char-level language model pretraining code and scripts	31	Emerging	llm-training-experimentation	3	Python
2652	Utshav-paudel/LLM-Zero-to-Hero This repo contains the resources, projects and documentation of mine while...	31	Emerging	llm-implementation-tutorials	34	Jupyter Notebook
2653	prajjwal1/generalize_lm_nli Code for the EMNLP 2021 workshop paper "Generalization in NLI: Ways...	31	Emerging	model-evaluation-diagnostics	34	Jupyter Notebook
2654	crscardellino/argumentation-mining-transformers Argumentation Mining Transformers Module (AMTM) implementation.	31	Emerging	transformer-architecture-tutorials	2	Python
2655	Basel-anaya/LoreWeaver LoreWeaver is a Novel Generation Multimodal LLM based on Mistral 7B LLM	31	Emerging	llm-training-experimentation	3	Jupyter Notebook
2656	yuchen0515/2022-Competition-CUDAOutOfMemory Our team placed 6th out of 119 teams in E.SUN AI Open Competition Summer...	31	Emerging	essay-scoring-grading	2	Python
2657	lazy-guy/chess-llama Tiny Llama model trained to play chess	31	Emerging	interactive-ai-chat-uis	29	CSS
2658	yyDing1/GNER [ACL 2024 Findings] Code implementation of Paper "Rethinking Negative...	31	Emerging	mathematical-reasoning-transformers	60	Python
2659	misko/spf Signal Processing Fun (in the sun)	31	Emerging	ml-foundations-curricula	9	Jupyter Notebook
2660	j-webtek/Local-LLM_FineTune Finetune Your Local LLM	31	Emerging	llm-fine-tuning	18	Python
2661	muna-ai/muna-predictors Interesting Python functions compiled to run anywhere with Muna.	31	Emerging	llm-implementation-tutorials	11	Python
2662	jshuadvd/LongRoPE Implementation of the LongRoPE: Extending LLM Context Window Beyond 2...	31	Emerging	transformer-training-optimization	151	Python
2663	jordddan/Pruning-LLMs The framework to prune LLMs to any size and any config.	31	Emerging	llm-compression-optimization	95	Python
2664	makllama/makllama MaK(Mac+Kubernetes)llama - Running and orchestrating large language models...	31	Emerging	local-llm-deployment	45	Go
2665	SciCrunch/bio_electra Bio-Electra - Small and efficient discriminatively pre-trained language...	31	Emerging	korean-language-models	4	Python
2666	Giyanellow/llama-chatbot-with-ui This project provides a comprehensive template for self-hosting a Large...	31	Emerging	interactive-ai-chat-uis	1	TypeScript
2667	Aradhye2002/selective-peft-toolkit Official implementation of the paper "Step-by-Step Unmasking for...	31	Emerging	llm-finetuning-frameworks	9	Python
2668	shinomakoi/magi_llm_gui A Qt GUI for large language models	31	Emerging	interactive-ai-chat-uis	45	Python
2669	wassemgtk/llm.scala Extensible implementation of a Language Model (LLM) training framework in Scala.	31	Emerging	llm-frameworks-libraries	34	Scala
2670	koudounasalkis/CLUES This repo contains the code for "A Contrastive Learning Approach to Mitigate...	31	Emerging	bias-detection-transformers	3	Python
2671	raymin0223/fast_robust_early_exit Fast and Robust Early-Exiting Framework for Autoregressive Language Models...	31	Emerging	compositional-reasoning-embeddings	65	Python
2672	tripathiarpan20/self-improvement-4all Private self-improvement coaching with open-source LLMs	31	Emerging	llm-training-experimentation	16	Python
2673	tenghuilee/ScalingCapFusedVisionLM number of tokens <=> performance to a vision language model	31	Emerging	multimodal-vision-language	2	Python
2674	swapUniba/LaikaLLM A hub for training and evaluating LLMs, following the multitask paradigm, in...	31	Emerging	llm-recommendation-systems	4	Python
2675	xmed-lab/TAM [ICCV25 Oral] Token Activation Map to Visually Explain Multimodal LLMs	31	Emerging	transformer-interpretability-mechanistic	180	Python
2676	cui-shaobo/defeasibility-in-causality exploring the defeasibility inside causality	31	Emerging	llm-reasoning-research	4	Python
2677	qiqiApink/MotionGPT The official PyTorch implementation of the paper "MotionGPT: Finetuned LLMs...	31	Emerging	gpt-multilingual-training	238	Python
2678	just-ctrlC-ctrlV/Mechanical-Assistant Imagine a world where your mechanical tasks are streamlined and optimized by...	31	Emerging	conversational-chatbot-applications	2	Python
2679	alan-turing-institute/prompto An open source library for asynchronous querying of LLM endpoints	31	Emerging	prompt-engineering-security	36	Python
2680	ai4sd/multiscale-byte-lm A hierarchical LM that scales to training on context windows of +5M tokens	31	Emerging	llm-finetuning-frameworks	9	Python
2681	cleopatra-itn/claim_detection Code for tasks in the paper "Check_square at CheckThat! 2020: Claim...	31	Emerging	fake-news-detection	2	Python
2682	kyegomez/Open-NAMM An open source implementation of the paper: "AN EVOLVED UNIVERSAL TRANSFORMER MEMORY"	31	Emerging	transformer-architecture-tutorials	6	Python
2683	VidhyaVarshanyJS/EnsembleX EnsembleX utilizes the Knapsack algorithm to optimize Large Language Model...	31	Emerging	streamlit-langchain-apps	4	Python
2684	ziansu/codeart Official repo for FSE'24 paper "CodeArt: Better Code Models by Attention...	31	Emerging	transformer-architecture-tutorials	18	Python
2685	lrusso/llama3pure Three inference engines for Llama 3: pure C for desktop systems, pure...	31	Emerging	local-llm-deployment	21	HTML
2686	IParraMartin/An-Explanation-Is-All-You-Need The original transformer implementation from scratch. It contains...	31	Emerging	transformer-architecture-education	44	Python
2687	nlp-uoregon/Okapi Okapi: Instruction-tuned Large Language Models in Multiple Languages with...	31	Emerging	rlhf-alignment-training	96	Python
2688	hplt-project/monolingual-multilingual-instruction-tuning Monolingual or Multilingual Instruction Tuning: Which Makes a Better Alpaca	31	Emerging	instruction-tuning-datasets	9	Python
2689	codefuse-ai/GALLa [ACL 2025] Graph Aligned Large Language Models for Improved Source Code Understanding	31	Emerging	graph-language-models	43	Python
2690	Orion-AI-Lab/televit Teleconnection-driven vision transformers for improved long-term forecasting	31	Emerging	vit-image-classification	35	Python
2691	HenryCai11/LLM-Self-Control The official repo of paper "Self-Control of LLM Behaviors by Compressing...	31	Emerging	llm-learning-resources	18	Jupyter Notebook
2692	M4TH1EU/llama-assist Manage your smart home in Home Assistant with local LLMs running with llama.cpp	31	Emerging	conversational-chatbot-applications	1	Python
2693	AshutoshDongare/softskill-NER Fine tuning 🤗 transformer model for softskill NER task	31	Emerging	bert-model-implementations	3	Jupyter Notebook
2694	camelop/NLP-Robustness OOD Generalization and Detection (ACL 2020)	31	Emerging	academic-thesis-repositories	59	Python
2695	zeroxt32/Forex-Expert-Advisor-Python Forex Bot Agents Using Machine Learning Implementations. Custom Forex Environments	31	Emerging	financial-return-prediction	3	Jupyter Notebook
2696	nghiempt/llm-analysis-privacy-policy Unveiling Discrepancies in Android App Data Safety Declarations and Privacy...	31	Emerging	vulnerability-detection-llm	3	Jupyter Notebook
2697	vipulraheja/coedit Official implementation of the paper "CoEdIT: Text Editing by Task-Specific...	31	Emerging	llm-knowledge-editing	138	Shell
2698	yfedoseev/llmkit Production-grade LLM client - Rust, Python, TypeScript. 100+ providers,...	31	Emerging	local-llm-deployment	12	Rust
2699	ivanovitchm/PPGEEC2318 Repository for EEC2318, a graduate course on PPgEEC about Machine Learning	31	Emerging	ml-foundations-curricula	31	Jupyter Notebook
2700	TamSiuhin/LLM-UM-Reading A list of large language models for user modeling (LLM-UM) papers, based on...	31	Emerging	llm-research-curation	151	—
