All Transformer Models

7,795 models ranked by quality score · Page 26 of 78

Showing 2501–2600 of 7,795

« Prev Next »

#	Model	Score	Tier	Category	Stars	Language
2501	rti/gptvis Understanding Transformers Using A Minimal Example	32	Emerging	nano-gpt-variants	52	Python
2502	EternityYW/BiasEval-LLM-MentalHealth Unveiling and Mitigating Bias in Mental Health Analysis with Large Language Models	32	Emerging	llm-bias-evaluation	12	Jupyter Notebook
2503	kennethleungty/DeepSeek-R1-Ollama-Simple-Evals Run and Evaluate DeepSeek-R1 Distilled Models Locally with Ollama and...	32	Emerging	llm-inference-engines	2	Jupyter Notebook
2504	m3hrdadfi/news-headline-generation A Bert2Bert model which able to generate headlines!	32	Emerging	text-summarization-transformers	12	Python
2505	MurtyShikhar/TreeProjections Tool to measure tree-structuredness of the internal algorithm learnt by a...	32	Emerging	transformer-architecture-tutorials	12	Python
2506	affjljoo3581/polyglot-jax-inference TPU에서 한국어용 LLM 추론을 위한 Jax/Flax 구현체입니다.	32	Emerging	transformer-frameworks-wrappers	12	Python
2507	BerkeliumLabs/Berkelium-labs Your personal AI Lab, accessible everywhere! Explore, experiment, and...	32	Emerging	local-llm-deployment	2	TypeScript
2508	softsys4ai/differentiable-proving Code and data for the paper "Pretrained Language Models are Symbolic...	32	Emerging	mathematical-reasoning-transformers	12	Python
2509	QwenLM/ParScale Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling	32	Emerging	llm-scaling-architecture	476	Python
2510	kyegomez/VLM-Mamba We introduce VLM-Mamba, the first Vision-Language Model built entirely on...	32	Emerging	3d-vision-transformers	14	Python
2511	jose-compu/cerebras-coding-agent A Cerebras AI LLM coding agent for the command line	32	Emerging	multi-agent-orchestration	4	Python
2512	pleisto/yuren-13b Yuren 13B is an information synthesis large language model that has been...	32	Emerging	mistral-ai-tools	15	Python
2513	rd-serendipity/ai-research-paper-explainer AI-powered tool that transforms complex research papers into clear,...	32	Emerging	ai-powered-business-analytics	3	Python
2514	HyperMink/inferenceable Scalable AI Inference Server for CPU and GPU with Node.js \| Utilizes...	32	Emerging	llm-inference-engines	15	JavaScript
2515	rajaswa/indic-syntax-evaluation Vyākarana: A Colorless Green Benchmark for Syntactic Evaluation in Indic Languages	32	Emerging	model-evaluation-diagnostics	15	Jupyter Notebook
2516	taesiri/ArXivQA WIP - Automated Question Answering for ArXiv Papers with Large Language...	32	Emerging	question-answering-systems	377	Python
2517	pyladiesams/llm-guardrails-jul2024 Dive into the world of LLM Guardrails using tools like NVIDIA’s NeMo...	32	Emerging	llm-learning-resources	2	Jupyter Notebook
2518	kanchengw/cnllm 统一的中文大模型适配库，将主流中国大模型 API 输出封装为 OpenAI 格式，无缝协作openai、langchain等大多数openai结构适配的python库	32	Emerging	llm-evaluation-benchmarking	1	Python
2519	clip-italian/clip-italian CLIP (Contrastive Language–Image Pre-training) for Italian	32	Emerging	clip-image-embeddings	185	Jupyter Notebook
2520	namgyu-youn/PyTorch-Pruning Benchmark and profile pruning researches and open-sources	32	Emerging	llm-pruning-compression	4	Python
2521	amazon-science/wqa-contextual-qa Coala is a python package for Contextual Answer Sentence Selection.	32	Emerging	question-answering-systems	15	Python
2522	lxe/llavavision A simple "Be My Eyes" web app with a llama.cpp/llava backend	32	Emerging	interactive-ai-chat-uis	493	JavaScript
2523	xmindflow/SSCT [ICCV 2023] Self-supervised Semantic Segmentation: Consistency over Transformation	32	Emerging	medical-image-segmentation-transformers	26	Jupyter Notebook
2524	asigalov61/Google-Magenta-Piano-Transformer-Colab [DEAD/NOT SUPPORTED ANYMORE] This is the only fully working and functioning...	32	Emerging	music-generation-transformers	25	Jupyter Notebook
2525	microsoft/encoder-decoder-slm Efficient encoder-decoder architecture for small language models (≤1B...	32	Emerging	llm-implementation-tutorials	32	Python
2526	BoHuangLab/CELL-E_2 Multimodal encoder-only transformer model for image-based protein predictions	32	Emerging	protein-transformers-ml	15	Python
2527	PeterGriffinJin/Heterformer Heterformer: Transformer-based Deep Node Representation Learning on...	32	Emerging	graph-language-models	28	Jupyter Notebook
2528	ksm26/Pretraining-LLMs Master the essential steps of pretraining large language models (LLMs)....	32	Emerging	llm-implementation-tutorials	27	Jupyter Notebook
2529	ZhengaoLi/DISP-LLM-Dimension-Independent-Structural-Pruning An implementation of the DISP-LLM method from the NeurIPS 2024 paper:...	32	Emerging	llm-pruning-compression	25	Python
2530	HeegyuKim/language-model 한국어 언어 모델 학습을 위한 프로젝트(Flax, Pytorch with Huggingface Accelerate)	32	Emerging	bert-model-implementations	32	Jupyter Notebook
2531	AspirinCode/AlphaPPImd Exploring the conformational ensembles of protein-protein complexes with...	32	Emerging	protein-transformers-ml	33	Jupyter Notebook
2532	gia-uh/cecilia The Cuban Language Model	32	Emerging	llm-frameworks-libraries	27	TeX
2533	AbhinaavRamesh/ollama-local-serve Local LLM infrastructure for distributed AI applications. Serve...	32	Emerging	llm-docker-deployments	4	Python
2534	psychbruce/FMAT 😷 The Fill-Mask Association Test (FMAT): Measuring Propositions in Natural Language.	32	Emerging	bert-model-implementations	16	R
2535	anyantudre/Florence-2-Vision-Language-Model Florence-2 is a novel vision foundation model with a unified, prompt-based...	32	Emerging	multimodal-search-engines	159	Jupyter Notebook
2536	Bruce-Lee-LY/cutlass_gemm Multiple GEMM operators are constructed with cutlass to support LLM inference.	32	Emerging	llm-cuda-optimization	19	C++
2537	The-Martyr/CausalMM [ICLR 2025] Mitigating Modality Prior-Induced Hallucinations in Multimodal...	32	Emerging	llm-reasoning-research	61	Python
2538	AntonGuan/TimeOmni-1 [ICLR 2026] Official implementation of " 🦙 TimeOmni-1: Incentivizing Complex...	32	Emerging	multimodal-vision-language	18	Python
2539	tommasocerruti/detllm Deterministic-mode checks for LLM inference: measure run/batch variance,...	32	Emerging	llm-inference-engines	18	Python
2540	Simplifine-gamedev/Simplifine 🚀 Easy, open-source LLM finetuning with one-line commands, seamless cloud...	32	Emerging	lora-qlora-fine-tuning	96	Python
2541	MaxwellYaoNi/PACE [NeurIPS 2024 Spotlight] Official implementation for "PACE: marrying...	32	Emerging	lora-qlora-fine-tuning	18	Python
2542	mahsasheikh/DrugGen DrugGen: Advancing Drug Discovery with Large Language Models and...	32	Emerging	molecular-generation-transformers	21	Python
2543	rabilrbl/llamafile-builder A simple github actions script to build a llamafile and uploads to huggingface	32	Emerging	local-llm-deployment	17	Python
2544	zTgx/llmweb-rs Webpage to structured data in Rust & LLM	32	Emerging	local-llm-deployment	16	Rust
2545	ybubnov/metalchat Pure C++23 Llama inference for Apple Silicon chips	32	Emerging	llm-inference-engines	19	C++
2546	voidism/Lookback-Lens Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual...	32	Emerging	llm-hallucination-mitigation	147	Python
2547	juzhengz/LoRI [COLM 2025] LoRI: Reducing Cross-Task Interference in Multi-Task Low-Rank Adaptation	32	Emerging	llm-fine-tuning	171	Python
2548	ShengcaiLiao/TransMatcher [NeurIPS 2021] TransMatcher: Deep Image Matching Through Transformers for...	32	Emerging	3d-vision-transformers	29	Python
2549	KasraAhmadi/PII-360 An open-source Chrome Extension that identifies Personally Identifiable...	32	Emerging	pii-redaction-anonymization	3	JavaScript
2550	mddunlap924/PyTorch-LLM Fine-tuning an LLM using a Generic Workflow and Best Practices with PyTorch	32	Emerging	llm-fine-tuning	28	Jupyter Notebook
2551	guanwei49/DABL DABL: Detecting Semantic Anomalies in Business Processes Using Large Language Models	32	Emerging	llm-research-curation	6	Python
2552	oxidized-transformers/oxidized-transformers Modular Rust transformer/LLM library using Candle	32	Emerging	browser-based-ml-inference	38	Rust
2553	ShiZhengyan/InstructionModelling [NeurIPS 2024 Main Track] Code for the paper titled "Instruction Tuning With...	32	Emerging	compositional-reasoning-embeddings	38	Python
2554	leondz/lm_risk_cards Risks and targets for assessing LLMs & LLM vulnerabilities	32	Emerging	llm-pentest-automation	34	Python
2555	shunk031/allennlp-shiba-model AllenNLP integration for Shiba: Japanese CANINE model	32	Emerging	model-evaluation-diagnostics	12	Python
2556	Tebmer/Rereading-LLM-Reasoning EMNLP 2024 "Re-reading improves reasoning in large language models". Simply...	32	Emerging	llm-reasoning-research	29	Python
2557	myscience/x-lstm Pytorch implementation of the xLSTM model by Beck et al. (2024)	32	Emerging	llm-implementation-tutorials	183	Python
2558	MusadiqPasha/Turkish-Hate-Speech-Classification-Explanation Classify, explain, and rewrite Turkish hate speech tweets using BERT, SHAP,...	32	Emerging	hate-speech-detection	3	Jupyter Notebook
2559	BFCmath/FinetuneAI_Learning How to effectively finetune CV/LLM models (without local gpu)	32	Emerging	llm-fine-tuning	38	Jupyter Notebook
2560	bayer-science-for-a-better-life/data2text-bioleaflets Biomedical Data-to-Text Generation via Fine-Tuning Transformers	32	Emerging	cybersecurity-threat-detection	29	Python
2561	xdevfaheem/Transformers A Comprehensive Implementation of Transformers Architecture from Scratch	32	Emerging	transformer-architecture-tutorials	4	Python
2562	samadon1/LLM-From-Scratch Medical Language Model fine-tuned using pretraining, instruction tuning, and...	32	Emerging	llm-fine-tuning	29	Jupyter Notebook
2563	kodejuice/ai-trade A smart AI-powered trading assistant that uses large language models (LLMs)...	32	Emerging	ai-powered-business-analytics	6	JavaScript
2564	prakash-aryan/debatebrawl-app A sophisticated AI-powered debate platform that integrates Large Language...	32	Emerging	multi-agent-orchestration	3	Python
2565	anas-zafar/LLM-Survey The official GitHub page for the survey paper "A Survey on Large Language...	32	Emerging	llm-finetuning-frameworks	40	—
2566	yaodongC/awesome-instruction-dataset A collection of open-source dataset to train instruction-following LLMs...	32	Emerging	multilingual-llm-adaptation	1,145	—
2567	RakePants/nerdless Conversational AI Telegram bot based on a finetuned language model	32	Emerging	conversational-chatbot-applications	3	Jupyter Notebook
2568	didier-durand/llms-in-clouds Experiments with LLMs in clouds (powered by SGLang)	32	Emerging	local-llm-deployment	6	Python
2569	systems-genomics-lab/deeptaxa A deep learning framework for hierarchical taxonomy classification of 16S...	32	Emerging	text-classification-transformers	9	Python
2570	ScottCampit/personalized-marketing-chatbot personalized marketing chatbot	32	Emerging	therapeutic-chatbot-applications	5	Python
2571	rezazad68/TMUnet Contextual Attention Network: Transformer Meets U-Net	32	Emerging	medical-image-segmentation-transformers	95	Jupyter Notebook
2572	azminewasi/Awesome-LLMs-ICLR-24 It is a comprehensive resource hub compiling all LLM papers accepted at the...	32	Emerging	llm-research-curation	66	—
2573	yinzhangyue/SelfAware Do Large Language Models Know What They Don’t Know?	32	Emerging	llm-interpretability-explainability	102	Python
2574	Buyun-Liang/SECA [NeurIPS 2025] SECA: Semantically Equivalent and Coherent Attacks for...	32	Emerging	llm-hallucination-mitigation	68	Python
2575	XunshanMan/MVGFormer This is the official implementation of the work presented at CVPR 2024,...	32	Emerging	3d-vision-transformers	68	Python
2576	cmu-flame/FLAME-MoE Official repository for FLAME-MoE: A Transparent End-to-End Research...	32	Emerging	mixture-of-experts-llms	33	Jupyter Notebook
2577	smvorwerk/xlstm-cuda Cuda implementation of Extended Long Short Term Memory (xLSTM) with C++ and...	32	Emerging	llm-cuda-optimization	91	C++
2578	open-compass/ANAH [ACL 2024] ANAH & [NeurIPS 2024] ANAH-v2 & [ICLR 2025] Mask-DPO	32	Emerging	llm-hallucination-mitigation	63	Python
2579	synlp/R2-LLM The official GitHub repository of the AAAI-2024 paper "Bootstrapping Large...	32	Emerging	clinical-llm-tools	66	Python
2580	HKUNLP/efficient-attention [EVA ICLR'23; LARA ICML'22] Efficient attention mechanisms via control...	32	Emerging	attention-mechanism-implementations	87	Python
2581	Nota-NetsPresso/shortened-llm Compressed LLMs for Efficient Text Generation [ICLR'24 Workshop]	32	Emerging	llm-compression-optimization	90	Python
2582	bernardoleite/fairytaleqa-translated Code for paper "FairytaleQA Translated: Enabling Educational Question and...	32	Emerging	question-answering-systems	2	Python
2583	deep-symbolic-mathematics/llm-srbench [ICML2025 Oral] LLM-SRBench: A New Benchmark for Scientific Equation...	32	Emerging	domain-specific-benchmarks	94	Python
2584	SafeAILab/RAIN [ICLR'24] RAIN: Your Language Models Can Align Themselves without Finetuning	32	Emerging	llm-knowledge-editing	98	Python
2585	AILab-CVC/M2PT [CVPR 2024] Multimodal Pathway: Improve Transformers with Irrelevant Data...	32	Emerging	multimodal-fusion-transformers	101	Python
2586	dsdanielpark/open-llm-datasets Repository for organizing datasets and papers used in Open LLM.	32	Emerging	llm-domain-datasets	101	—
2587	BorealisAI/flora-opt This is the official repository for the paper "Flora: Low-Rank Adapters Are...	32	Emerging	transformer-frameworks-wrappers	106	Python
2588	zubair-irshad/NeRF-MAE [ECCV 2024] Pytorch code for our ECCV'24 paper NeRF-MAE: Masked AutoEncoders...	32	Emerging	3d-vision-transformers	104	Python
2589	alvion427/PerroPastor Run Llama based LLMs in Unity entirely in compute shaders with no dependencies	32	Emerging	local-llm-deployment	106	C#
2590	ymoslem/Adaptive-MT-LLM Adaptive Machine Translation with Large Language Models	32	Emerging	llm-scaling-architecture	32	JavaScript
2591	mlverse/mall Run multiple LLM predictions against a data frame with R and Python	32	Emerging	llm-frameworks-libraries	120	R
2592	BillChan226/HALC [ICML 2024] Official implementation for "HALC: Object Hallucination...	32	Emerging	llm-hallucination-mitigation	110	Python
2593	rasbt/faster-pytorch-blog Outlining techniques for improving the training performance of your PyTorch...	32	Emerging	transformer-training-optimization	128	Python
2594	CJReinforce/PURE Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is...	32	Emerging	rlhf-alignment-training	160	Python
2595	TIGER-AI-Lab/TIGERScore "TIGERScore: Towards Building Explainable Metric for All Text Generation...	32	Emerging	evaluation-frameworks-metrics	32	Jupyter Notebook
2596	alexliap/greek_gpt MoE Decoder Transformer implementation with MLX	32	Emerging	mathematical-reasoning-transformers	6	Python
2597	Niez-Gharbi/Youtube-Summariser Summarize your youtube videos with BART on streamlit app.	32	Emerging	youtube-video-summarization	2	Python
2598	xmartlabs/spoter-embeddings Create embeddings from sign pose videos using Transformers	32	Emerging	3d-vision-transformers	32	Python
2599	fvliang/DART Official Implementation of DART (DART: Diffusion-Inspired Speculative...	32	Emerging	diffusion-language-models	45	Python
2600	AIRI-Institute/Probing_framework Framework for probing tasks	32	Emerging	mathematical-reasoning-transformers	31	Python

« Prev 1 2 3 … 24 25 26 27 28 … 76 77 78 Next »