Variational Autoencoders NLP NLP Tools

Tools implementing VAEs and related latent variable models for NLP tasks including text generation, language modeling, and conversation modeling. Does NOT include standard autoencoders without probabilistic latents, or VAEs applied primarily to non-text domains like speech or vision.

There are 53 variational autoencoders nlp tools tracked. 1 score above 50 (established tier). The highest-rated is yaserkl/RLSeq2Seq at 51/100 with 768 stars.

Get all 53 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=nlp&subcategory=variational-autoencoders-nlp&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Tool Score Tier
1 yaserkl/RLSeq2Seq

Deep Reinforcement Learning For Sequence to Sequence Models

51
Established
2 kefirski/pytorch_RVAE

Recurrent Variational Autoencoder that generates sequential data implemented...

49
Emerging
3 ctr4si/A-Hierarchical-Latent-Structure-for-Variational-Conversation-Modeling

PyTorch Implementation of "A Hierarchical Latent Structure for Variational...

47
Emerging
4 georgian-io/Multimodal-Toolkit

Multimodal model for text and tabular data with HuggingFace transformers as...

47
Emerging
5 nurpeiis/LeakGAN-PyTorch

A simple implementation of LeakGAN in PyTorch

46
Emerging
6 facebookresearch/large_concept_model

Large Concept Models: Language modeling in a sentence representation space

45
Emerging
7 rdspring1/PyTorch_GBW_LM

PyTorch Language Model for 1-Billion Word (LM1B / GBW) Dataset

44
Emerging
8 justinphan3110/SciFive

SciFive: a text-text transformer model for biomedical literature

43
Emerging
9 naist-nlp/mbrs

A library for minimum Bayes risk (MBR) decoding

43
Emerging
10 CR-Gjx/LeakGAN

The codes of paper "Long Text Generation via Adversarial Training with...

43
Emerging
11 amirbar/rnn.wgan

Code for training and evaluation of the model from "Language Generation with...

41
Emerging
12 lyeoni/pretraining-for-language-understanding

Pre-training of Language Models for Language Understanding

41
Emerging
13 Khamies/LSTM-Variational-AutoEncoder

A PyTorch Implementation of Generating Sentences from a Continuous Space by...

40
Emerging
14 jiacheng-xu/vmf_vae_nlp

Code for EMNLP18 paper "Spherical Latent Spaces for Stable Variational Autoencoders"

40
Emerging
15 tanyuqian/progressive-generation

NAACL 2021 - Progressive Generation of Long Text

40
Emerging
16 Sea-Snell/Implicit-Language-Q-Learning

Official code from the paper "Offline RL for Natural Language Generation...

40
Emerging
17 maum-ai/pnlp-mixer

Unofficial PyTorch Implementation for pNLP-Mixer: an Efficient all-MLP...

37
Emerging
18 williamSYSU/CatGAN

Codes for Category-aware Generative Adversarial Networks (AAAI 2020)

37
Emerging
19 hjian42/automatic-personality-prediction

[AAAI SAP 2020] Modeling Personality with Attentive Networks and Contextual...

35
Emerging
20 X-czh/SeqGAN-PyTorch

Implementation of Sequence Generative Adversarial Nets with Policy Gradient...

34
Emerging
21 machine-learning-apps/IssuesLanguageModel

A Language model trained on a large corpus of GitHub Issues

33
Emerging
22 FranxYao/Deep-Generative-Models-for-Natural-Language-Processing

DGMs for NLP. A roadmap.

33
Emerging
23 liang8qi/Data-to-Text-Generation

Some papers and datasets about Data-To-Text Generation

32
Emerging
24 ksgr5566/AutoTuneNLP

A comprehensive toolkit for seamless data generation and fine-tuning of NLP...

31
Emerging
25 torotoki/mamba-lm

A minimal implementation of Mamba using PyTorch and Hugging Face Transformers

31
Emerging
26 long8v/torch_study

torch tutorial and paper implementation mainly about NLP

30
Emerging
27 hassyGo/NLG-RL

Accelerated Reinforcement Learning for Sentence Generation by Vocabulary Prediction

30
Emerging
28 ImKeTT/CTG-latentAEs

[Paperlist] Awesome paper list of controllable text generation via latent...

29
Experimental
29 AlaFalaki/AttentionVisualizer

A simple library to showcase highest scored words using RoBERTa model

29
Experimental
30 DSKSD/Pytorch_models

PyTorch study

28
Experimental
31 ImKeTT/PCAE

[KBS] PCAE: A Framework of Plug-in Conditional Auto-Encoder for Controllable...

27
Experimental
32 brandokoch/pytorch-sequence-models

This repository implements a variety of sequence model architectures from...

27
Experimental
33 KunHanKH/GE2E_Speaker_Verification

Most Complete Pytorch Imeplementation "GENERALIZED END-TO-END LOSS FOR...

26
Experimental
34 shivakiran-ai/llm-from-scratch

Building Large Language Models from scratch — every component implemented in...

26
Experimental
35 Ighina/VQ-VAE_Topic

An implementation of the paper [Vector-Quantization-Based Topic...

24
Experimental
36 lyj20071013/Sparse-MoE-Language-Model-v1

This repository contains an implementation of a Sparse Mixture of Experts...

22
Experimental
37 Jaykef/min-patchnizer

Minimal, clean code for video/image "patchnization" - a process commonly...

20
Experimental
38 Satyamkumarnavneet/NeuralTextPredictor

Neural Language Model in PyTorch, trained on “Pride and Prejudice” with...

20
Experimental
39 hairav/SlideSpawn

Automatic generation of presentation for an academic paper.

20
Experimental
40 zinengtang/ContinuousFlowNLG

Pytorch version of Continuous Language Generative Flow (ACL 2021)

20
Experimental
41 HenryNdubuaku/autonomous-vehicle-fgpa

Progressive Language Enhancement Algorithm Using Masking Filling...

19
Experimental
42 kasnerz/lightnlg

A minimalistic codebase for training NLG models from HuggingFace...

19
Experimental
43 YecanLee/Decoding-Decoded

[COLING 2025] Official PyTorch Implementation of "Decoding Decoded:...

19
Experimental
44 ImKeTT/FET-LM

[TNNLS, to appear] FET-LM: Flow Enhanced Variational Auto-Encoder for...

19
Experimental
45 altescy/textvae

VAE implementation for text generation with PyTorch

18
Experimental
46 justinhchae/transformer_project_pytorch

A starter project to get going with transformers and deep learning for NLP...

17
Experimental
47 SavinRazvan/attention

Visualize BERT's attention mechanism with a user-friendly script. Input text...

17
Experimental
48 lif31up/encoder-only-transformer-adapter

PETL adaptor for pretrained BERT

15
Experimental
49 dayyass/language-modeling

Pipeline for training Language Models using PyTorch.

13
Experimental
50 AMDonati/RL-NLP

Code for the paper "Learning Natural Language Generation with Truncated...

12
Experimental
51 paraglondhe098/llm-from-scratch

Build a transformer architecture from scratch and made an end-to-end...

11
Experimental
52 ShubbhRM/Behavioral-Dynamics-Prediction

Attention-based neural point process predicting forum reply behavior...

10
Experimental
53 VodLM/vod-gradients

Variational Open-Domain (VOD) - core methods (priority sampling, gradients)

10
Experimental