All Transformer Models

7,795 models ranked by quality score · Page 30 of 78

Showing 2901–3000 of 7,795
# Model Score Tier
2901 sodascience/social_science_inferences_with_llms

Addressing LLM-related measurement error in social science modeling research.

30
Emerging
2902 RohitMacherla3/wikiHow_text_summarization_llms

The project aims to utilize pre-trained Large Language Models (LLMs) for...

30
Emerging
2903 fcakyon/gpt2-shakespeare

A tutorial on GPT2 language model training with texts from Shakespeare

30
Emerging
2904 DrHB/rna-stanford

Transformer + GAT for RNA chemical reactivity prediction| Stanford Ribonanza

30
Emerging
2905 lordtt13/transformers-experiments

All my experiments with the various transformers and various transformer...

30
Emerging
2906 mihirchhiber/LLM-ABM-StockSim

LLM-DRIVEN AGENT STOCK MARKET SIMULATION: Built an agent-based simulation...

30
Emerging
2907 aaaastark/Pretrain_Finetune_Transformers_Pytorch

Pre-Training and Fine-Tuning transformer models using PyTorch and the...

30
Emerging
2908 seanbenhur/resusable_text_classification_template

A complete reusable pipeline for text classification using different...

30
Emerging
2909 JawherKl/deep-dive-into-llm

Deep Dive into Large Language Models (LLMs) – A comprehensive study of Large...

30
Emerging
2910 jaketae/tupe

PyTorch implementation of Rethinking Positional Encoding in Language Pre-training

30
Emerging
2911 neerajtiwari360/understand_LLM

A comprehensive guide and tools for running large language models (LLMs) on...

30
Emerging
2912 kmkrofficial/LiteGPT

LiteGPT: A 124M Small Language Model (SLM) pre-trained on FineWeb and...

30
Emerging
2913 Followb1ind1y/Medical-LLM-Fine-tuning

Fine-tunes LLaMA-3-8B on PubMedQA with QLoRA, optimized via DeepSpeed and...

30
Emerging
2914 ksm26/Reinforcement-Learning-from-Human-Feedback

Embark on the "Reinforcement Learning from Human Feedback" course and align...

30
Emerging
2915 naderabdelghany/project-rev

A proof-of-concept audio-interactive personalized chatbot based on Ted...

30
Emerging
2916 phonism/llm4cp

Large Language Model for Competitive Programming

30
Emerging
2917 BenGJ10/Complete-Machine-Learning-Notes

A complete collection of handwritten notes and learning resources for...

30
Emerging
2918 autobotasia/vitone

Tự động thêm dấu tiếng việt dùng Transformer model

30
Emerging
2919 Mahesh3394/clinical_text_classification

Text classification with fine tuned LLM model. Bert model fine tuned on...

30
Emerging
2920 antonalth/cs2-transformer-agent

Training a Transformer to play Counter Strike

30
Emerging
2921 nawnoes/pytorch-gpt-x

An implementation of an autoregressive language model using an improved...

30
Emerging
2922 GregorKobsik/ImageTransformer

This notebook shows a basic implementation of a transformer (decoder)...

30
Emerging
2923 semaj87/llm-post-generator

Using LLMs & the SERP API to retrieve information on a given topic, which is...

30
Emerging
2924 Ajax0564/VyomAI

VyomAI: state-of-the-art NLP LLM Vision MultiModel transformers ...

30
Emerging
2925 Orion-zhen/transAPI

OpenAI compatible API purely based on Transformers

30
Emerging
2926 fshnkarimi/Fine-tuning-an-LLM-using-LoRA

📚 Text Classification with LoRA (Low-Rank Adaptation) of Language Models -...

30
Emerging
2927 HySonLab/LANTERN

LANTERN: Leveraging Large Language Models And Transformer For Enhanced...

30
Emerging
2928 davide-coccomini/TimeSformer-Video-Classification

The notebook explains the various steps to obtain the results of...

30
Emerging
2929 kardSIM/Trading_RL_agent_with_transformers

An RL agent that can trade using Deep Q-Network (DQN) and a decoder-only...

30
Emerging
2930 Fisseha-Estifanos/LLM-API

A repository to demonstrate some of the concepts behind large language...

30
Emerging
2931 LazerLambda/Promptzl

Turn LLMs into zero-shot PyTorch classifiers!

30
Emerging
2932 gmontamat/poor-mans-transformers

Implement Transformers (and Deep Learning) from scratch in NumPy

30
Emerging
2933 visresearch/LLaVA-STF

The official implementation of "Learning Compact Vision Tokens for Efficient...

30
Emerging
2934 ToddThomson/Mila

Achilles Mila Deep Neural Network library provides a comprehensive API to...

30
Emerging
2935 gersteinlab/Struc-Bench

[NAACL 2024] Struc-Bench: Are Large Language Models Good at Generating...

30
Emerging
2936 SinclairCoder/Instruction-Tuning-Papers

Reading list of Instruction-tuning. A trend starts from Natrural-Instruction...

30
Emerging
2937 ma2za/torch-adapters

Small Library of PyTorch Adaptation modules

30
Emerging
2938 ambideXtrous9/GRPO-and-SFT-Finetune-Qwen3-using-Unsloth-Reasoning-and-Non-Reasoning-Dataset

GRPO and SFT Finetune Qwen3 using Unsloth : Reasoning and Non-Reasoning Dataset

30
Emerging
2939 yeasy/llm_internals

深入剖析大语言模型架构、原理到训练部署 | How LLM works, including Design, Architecture and...

30
Emerging
2940 sukanyabag/Finetuning-Qwen2-7B-VQA-on-Radiology-Scans

This repository is doing the finetuning of the Qwen2 7B VLM for performing...

30
Emerging
2941 amazon-science/isometric-slt

Isometric Spoken Language Translation - Isometric SLT.

30
Emerging
2942 VectorInstitute/VLDBench

VLDBench: A large-scale benchmark for evaluating Vision-Language Models...

30
Emerging
2943 amazon-science/THRONE

Code release for THRONE, a CVPR 2024 paper on measuring object...

30
Emerging
2944 DianaDorobantu/legal-llm

Develop a Romanian legal domain Large Language Model (LLM) using pre-trained...

30
Emerging
2945 Riccorl/llama-trainer

Llama Trainer Utility

30
Emerging
2946 RAravindDS/CharLLMs

Implementing easy to use "Character Level Language Models" 🕺🏽

30
Emerging
2947 cool-japan/trustformers

High-performance, memory-safe Rust implementation of Hugging Face...

30
Emerging
2948 StringNLPLAB/MGS

Repository for the paper "Advancing General-Purpose Reasoning Models with...

30
Emerging
2949 tahaabbas/dictator

Dictator – Supercharge Cursor Chat with voice-to-text, custom AI prompts,...

30
Emerging
2950 CharlieBrown-v1/KALM

[NeurIPS'24] KALM: Knowledgeable Agents by Offline Reinforcement Learning...

30
Emerging
2951 sitammeur/gliner-litserve

Leverage ModernGLiNER's capabilities using LitServe.

30
Emerging
2952 MehnaazAsad/NLP_summarization_bart

NLP summarization task with the Bart LLM

30
Emerging
2953 Subconscious-ai/sublime

🧠Behavior Change as a Service🌞

30
Emerging
2954 GaryYufei/AlignLLMHumanSurvey

Aligning Large Language Models with Human: A Survey

30
Emerging
2955 yinzhangyue/EoT

Exchange-of-Thought: Enhancing Large Language Model Capabilities through...

30
Emerging
2956 NellyW8/VeriReason

This is the Github Repo for the paper: VeriReason: Reinforcement Learning...

30
Emerging
2957 farhan0167/BankAIAgent

A tool to convert bank statements into Excel files

30
Emerging
2958 mrseanryan/gpt-local

Local GPT (llama 2 or dolly or gpt etc.) via Python - using ctransforers project

30
Emerging
2959 skpig/MPSC

[ACL 2024] Enhancing Large Language Models in Coding Through...

30
Emerging
2960 lucky-verma/SaastIE

Document understanding system using Donut transformer architecture

30
Emerging
2961 renan-siqueira/image-to-text-tool

This tool processes images and generates textual descriptions using advanced...

30
Emerging
2962 rochitasundar/Generative-AI-with-Large-Language-Models

This repository contains the lab work for Coursera course on "Generative AI...

30
Emerging
2963 Mechres/text-summarize

Flask-based API that provides a user-friendly interface to summarize text in...

30
Emerging
2964 franciellevargas/MOL

Multilingual Offensive Lexicon consists of the first contextual lexicon for...

30
Emerging
2965 zjukg/KnowPAT

[Paper][ACL 2024 Findings] Knowledgeable Preference Alignment for LLMs in...

30
Emerging
2966 kyegomez/Simba

A simpler Pytorch + Zeta Implementation of the paper: "SiMBA: Simplified...

30
Emerging
2967 TwistingTwists/json_partial

json_parser for LLM outputs -> it fixes the malformed json and produces correct json

30
Emerging
2968 relign-ai/relign

post train language models on multi-step reasoning with reinforcement learning

30
Emerging
2969 Jacksonlark/open-mllms

open llm for multimodal

30
Emerging
2970 iqbal-sk/Detecting-Persuasion-Techniques-in-Memes

Hierarchical, multilingual, multimodal detection of persuasion techniques in...

30
Emerging
2971 centre-for-humanities-computing/stormtrooper

Zero/few shot learning components for scikit-learn pipelines with LLMs and...

30
Emerging
2972 honghanhh/semeval8

L3i++ at SemEval2024-task8: Multidomain, Multimodel and Multilingual...

30
Emerging
2973 chandar-lab/CAIRO

We explain why fairness metrics don't correlate and propose CAIRO to make...

30
Emerging
2974 unisa-hpc/llm.sycl

The sycl version of llm.c (for the final project of HPC course 2024, UNISA)

30
Emerging
2975 do-me/qdrant-frontend

A universal Qdrant table frontend based on transformers.js

30
Emerging
2976 murphyhoucn/llm-dev

LLM Dev

30
Emerging
2977 Zhang-Yihao/Adversarial-Representation-Engineering

Official implementation repository for the paper Towards General Conceptual...

30
Emerging
2978 0xJakuzya/sentiment-analysis-tg-news

Sentiment analysis tool for Telegram news: scraping with Telethon, text...

30
Emerging
2979 rishabkr/Attention-Is-All-You-Need-Explained-PyTorch

A paper implementation and tutorial from scratch combining various great...

30
Emerging
2980 neeleshbhalla/transformers_for_time_series_forecasting

Inferencing 'PatchTST' and 'Informer' to harness the power of transformers...

30
Emerging
2981 icon-lab/HST

Official implementation of Hierarchical Spectrogram Transformers (HST)

30
Emerging
2982 SCZwangxiao/RTQ-MM2023

ACM Multimedia 2023 (Oral) - RTQ: Rethinking Video-language Understanding...

30
Emerging
2983 DarshanAdiga/idiom-principle-on-magpie-corpus

Idiom Principle on MAGPIE dataset

30
Emerging
2984 seanpm2001/DALL-E_LLaMA

🤖️🦙️🧠️ DALL-E LLaMA is a combination of DALL-E and LLaMA (Large Language...

30
Emerging
2985 RiccardoSpolaor/Question-Answering

Question answering through pre-trained transformer-based models from Hugging Face.

30
Emerging
2986 Someshog/greenwashing-detection-app

An AI-powered Streamlit web app to detect greenwashing in sustainability...

30
Emerging
2987 avrtt/QASATIK

LLM-based Q&A on preloaded docs, raw data, Wikipedia articles and scraped...

30
Emerging
2988 D1ffic00lt/ai-pastproof

PastProof AI – ML core for automated fact-checking: ingests raw text, finds...

30
Emerging
2989 th789/mbr-for-nmt

Characterizing the performance of minimum Bayes risk (MBR) decoding for...

30
Emerging
2990 seanpm2001/DALL-E_LLaMA_Docs

🤖️🦙️🧠️📖️ The official documentation source repository for DALL-E LLaMA, a...

30
Emerging
2991 pranavsinghps1/CASS

Official PyTorch implementation of CASS, from the following paper: CASS:...

30
Emerging
2992 francoislanc/midistral

LLM finetuned for generating symbolic music

29
Experimental
2993 datasig-ac-uk/nlpsig

Package for constructing paths of embeddings obtained from transformers.

29
Experimental
2994 cliang1453/SAGE

No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for...

29
Experimental
2995 zchuz/TimeBench

The repository for ACL 2024 paper "TimeBench: A Comprehensive Evaluation of...

29
Experimental
2996 XingLuxi/Cal-FLOPs-for-PLM

Calculating FLOPs of Pre-trained Models in NLP

29
Experimental
2997 rubencart/LIIR-TextGraphs-14

Code for KU Leuven LIIR lab's submission to the TextGraphs-14 shared task on...

29
Experimental
2998 LoserCheems/WonderfulMatrices

Wonderful Matrices to Build Small Language Models

29
Experimental
2999 LlamaGenAI/LlamaGen

AI Comic Factory - Generate Comics with AI, 🦙 Llama for Scalable Anime...

29
Experimental
3000 andresC98/TSF_Transformers_TFM

Repository containing my Master Thesis for the M.Sc. Big Data Analytics,...

29
Experimental
« Prev 1 2 3 28 29 30 31 32 76 77 78 Next »