Atenrev/forocoches-language-generation
This is a PyTorch implementation of a decoder only transformer inspired on GPT-2. The model was trained from scratch on a custom dataset of over 1 million threads from the Spanish forum ForoCoches. The dataset is publicly available.
This project helps researchers and natural language processing practitioners explore text generation specifically within the context of the Spanish ForoCoches online forum. You provide a prompt in Spanish, and the system generates new text that mimics the style and content found on ForoCoches. This is ideal for those studying online community language or specialized text generation.
No commits in the last 6 months.
Use this if you are a researcher or developer who needs to generate text in the unique, informal, and potentially offensive style of the Spanish ForoCoches forum, or if you want to experiment with a pre-trained model on a specialized social media dataset.
Not ideal if you need to generate polite, professional, or general-purpose Spanish text, as the model is specifically trained on and reflects the potentially offensive language of the ForoCoches forum.
Stars
7
Forks
1
Language
Python
License
GPL-3.0
Category
Last pushed
Jun 13, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/Atenrev/forocoches-language-generation"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
tabularis-ai/be_great
A novel approach for synthesizing tabular data using pretrained large language models
EleutherAI/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron...
shibing624/textgen
TextGen: Implementation of Text Generation models, include LLaMA, BLOOM, GPT2, BART, T5, SongNet...
ai-forever/ru-gpts
Russian GPT3 models.
AdityaNG/kan-gpt
The PyTorch implementation of Generative Pre-trained Transformers (GPTs) using Kolmogorov-Arnold...