Variational Autoencoders NLP NLP Tools

Tools implementing VAEs and related latent variable models for NLP tasks including text generation, language modeling, and conversation modeling. Does NOT include standard autoencoders without probabilistic latents, or VAEs applied primarily to non-text domains like speech or vision.

There are 53 variational autoencoders nlp tools tracked. 1 score above 50 (established tier). The highest-rated is yaserkl/RLSeq2Seq at 51/100 with 768 stars.

Get all 53 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=nlp&subcategory=variational-autoencoders-nlp&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

#	Tool	Score	Tier	Stars	Language
1	yaserkl/RLSeq2Seq Deep Reinforcement Learning For Sequence to Sequence Models	51	Established	768	Python
2	kefirski/pytorch_RVAE Recurrent Variational Autoencoder that generates sequential data implemented...	49	Emerging	357	Python
3	ctr4si/A-Hierarchical-Latent-Structure-for-Variational-Conversation-Modeling PyTorch Implementation of "A Hierarchical Latent Structure for Variational...	47	Emerging	172	Python
4	georgian-io/Multimodal-Toolkit Multimodal model for text and tabular data with HuggingFace transformers as...	47	Emerging	618	Python
5	nurpeiis/LeakGAN-PyTorch A simple implementation of LeakGAN in PyTorch	46	Emerging	63	Python
6	facebookresearch/large_concept_model Large Concept Models: Language modeling in a sentence representation space	45	Emerging	2,341	Python
7	rdspring1/PyTorch_GBW_LM PyTorch Language Model for 1-Billion Word (LM1B / GBW) Dataset	44	Emerging	123	Python
8	justinphan3110/SciFive SciFive: a text-text transformer model for biomedical literature	43	Emerging	98	Jupyter Notebook
9	naist-nlp/mbrs A library for minimum Bayes risk (MBR) decoding	43	Emerging	52	Python
10	CR-Gjx/LeakGAN The codes of paper "Long Text Generation via Adversarial Training with...	43	Emerging	576	Python
11	amirbar/rnn.wgan Code for training and evaluation of the model from "Language Generation with...	41	Emerging	252	Python
12	lyeoni/pretraining-for-language-understanding Pre-training of Language Models for Language Understanding	41	Emerging	83	Python
13	Khamies/LSTM-Variational-AutoEncoder A PyTorch Implementation of Generating Sentences from a Continuous Space by...	40	Emerging	33	Python
14	jiacheng-xu/vmf_vae_nlp Code for EMNLP18 paper "Spherical Latent Spaces for Stable Variational Autoencoders"	40	Emerging	171	Python
15	tanyuqian/progressive-generation NAACL 2021 - Progressive Generation of Long Text	40	Emerging	82	Python
16	Sea-Snell/Implicit-Language-Q-Learning Official code from the paper "Offline RL for Natural Language Generation...	40	Emerging	211	Python
17	maum-ai/pnlp-mixer Unofficial PyTorch Implementation for pNLP-Mixer: an Efficient all-MLP...	37	Emerging	66	Python
18	williamSYSU/CatGAN Codes for Category-aware Generative Adversarial Networks (AAAI 2020)	37	Emerging	19	—
19	hjian42/automatic-personality-prediction [AAAI SAP 2020] Modeling Personality with Attentive Networks and Contextual...	35	Emerging	61	Python
20	X-czh/SeqGAN-PyTorch Implementation of Sequence Generative Adversarial Nets with Policy Gradient...	34	Emerging	54	Jupyter Notebook
21	machine-learning-apps/IssuesLanguageModel A Language model trained on a large corpus of GitHub Issues	33	Emerging	7	Jupyter Notebook
22	FranxYao/Deep-Generative-Models-for-Natural-Language-Processing DGMs for NLP. A roadmap.	33	Emerging	396	—
23	liang8qi/Data-to-Text-Generation Some papers and datasets about Data-To-Text Generation	32	Emerging	108	—
24	ksgr5566/AutoTuneNLP A comprehensive toolkit for seamless data generation and fine-tuning of NLP...	31	Emerging	3	Python
25	torotoki/mamba-lm A minimal implementation of Mamba using PyTorch and Hugging Face Transformers	31	Emerging	6	Python
26	long8v/torch_study torch tutorial and paper implementation mainly about NLP	30	Emerging	32	—
27	hassyGo/NLG-RL Accelerated Reinforcement Learning for Sentence Generation by Vocabulary Prediction	30	Emerging	70	Python
28	ImKeTT/CTG-latentAEs [Paperlist] Awesome paper list of controllable text generation via latent...	29	Experimental	52	—
29	AlaFalaki/AttentionVisualizer A simple library to showcase highest scored words using RoBERTa model	29	Experimental	18	Python
30	DSKSD/Pytorch_models PyTorch study	28	Experimental	14	Jupyter Notebook
31	ImKeTT/PCAE [KBS] PCAE: A Framework of Plug-in Conditional Auto-Encoder for Controllable...	27	Experimental	26	Python
32	brandokoch/pytorch-sequence-models This repository implements a variety of sequence model architectures from...	27	Experimental	12	Python
33	KunHanKH/GE2E_Speaker_Verification Most Complete Pytorch Imeplementation "GENERALIZED END-TO-END LOSS FOR...	26	Experimental	10	Jupyter Notebook
34	shivakiran-ai/llm-from-scratch Building Large Language Models from scratch — every component implemented in...	26	Experimental	2	Jupyter Notebook
35	Ighina/VQ-VAE_Topic An implementation of the paper [Vector-Quantization-Based Topic...	24	Experimental	4	Jupyter Notebook
36	lyj20071013/Sparse-MoE-Language-Model-v1 This repository contains an implementation of a Sparse Mixture of Experts...	22	Experimental	2	Python
37	Jaykef/min-patchnizer Minimal, clean code for video/image "patchnization" - a process commonly...	20	Experimental	11	Python
38	Satyamkumarnavneet/NeuralTextPredictor Neural Language Model in PyTorch, trained on “Pride and Prejudice” with...	20	Experimental	1	Jupyter Notebook
39	hairav/SlideSpawn Automatic generation of presentation for an academic paper.	20	Experimental	8	Jupyter Notebook
40	zinengtang/ContinuousFlowNLG Pytorch version of Continuous Language Generative Flow (ACL 2021)	20	Experimental	11	Python
41	HenryNdubuaku/autonomous-vehicle-fgpa Progressive Language Enhancement Algorithm Using Masking Filling...	19	Experimental	4	Python
42	kasnerz/lightnlg A minimalistic codebase for training NLG models from HuggingFace...	19	Experimental	3	Python
43	YecanLee/Decoding-Decoded [COLING 2025] Official PyTorch Implementation of "Decoding Decoded:...	19	Experimental	4	Python
44	ImKeTT/FET-LM [TNNLS, to appear] FET-LM: Flow Enhanced Variational Auto-Encoder for...	19	Experimental	14	Python
45	altescy/textvae VAE implementation for text generation with PyTorch	18	Experimental	2	Python
46	justinhchae/transformer_project_pytorch A starter project to get going with transformers and deep learning for NLP...	17	Experimental	1	Python
47	SavinRazvan/attention Visualize BERT's attention mechanism with a user-friendly script. Input text...	17	Experimental	1	Jupyter Notebook
48	lif31up/encoder-only-transformer-adapter PETL adaptor for pretrained BERT	15	Experimental	—	Python
49	dayyass/language-modeling Pipeline for training Language Models using PyTorch.	13	Experimental	12	Python
50	AMDonati/RL-NLP Code for the paper "Learning Natural Language Generation with Truncated...	12	Experimental	6	Jupyter Notebook
51	paraglondhe098/llm-from-scratch Build a transformer architecture from scratch and made an end-to-end...	11	Experimental	3	Jupyter Notebook
52	ShubbhRM/Behavioral-Dynamics-Prediction Attention-based neural point process predicting forum reply behavior...	10	Experimental	1	Jupyter Notebook
53	VodLM/vod-gradients Variational Open-Domain (VOD) - core methods (priority sampling, gradients)	10	Experimental	2	Python