rdspring1/PyTorch_GBW_LM
PyTorch Language Model for 1-Billion Word (LM1B / GBW) Dataset
This project offers a pre-trained language model that helps with natural language processing tasks requiring an understanding of English text. It takes large datasets of English sentences and words as input and outputs a trained model capable of predicting the next word in a sequence. Researchers and developers working on advanced NLP systems would use this to build foundational language understanding capabilities.
123 stars. No commits in the last 6 months.
Use this if you are a researcher or developer who needs a robust, high-performance language model foundation, specifically for the English language, to integrate into larger NLP applications.
Not ideal if you are looking for a plug-and-play solution for general text analysis or if your work does not involve large-scale language modeling development.
Stars
123
Forks
21
Language
Python
License
Apache-2.0
Category
Last pushed
Aug 22, 2019
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/rdspring1/PyTorch_GBW_LM"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
yaserkl/RLSeq2Seq
Deep Reinforcement Learning For Sequence to Sequence Models
kefirski/pytorch_RVAE
Recurrent Variational Autoencoder that generates sequential data implemented with pytorch
georgian-io/Multimodal-Toolkit
Multimodal model for text and tabular data with HuggingFace transformers as building block for text data
ctr4si/A-Hierarchical-Latent-Structure-for-Variational-Conversation-Modeling
PyTorch Implementation of "A Hierarchical Latent Structure for Variational Conversation...
nurpeiis/LeakGAN-PyTorch
A simple implementation of LeakGAN in PyTorch