Atenrev/forocoches-language-generation

This is a PyTorch implementation of a decoder only transformer inspired on GPT-2. The model was trained from scratch on a custom dataset of over 1 million threads from the Spanish forum ForoCoches. The dataset is publicly available.

29
/ 100
Experimental

This project helps researchers and natural language processing practitioners explore text generation specifically within the context of the Spanish ForoCoches online forum. You provide a prompt in Spanish, and the system generates new text that mimics the style and content found on ForoCoches. This is ideal for those studying online community language or specialized text generation.

No commits in the last 6 months.

Use this if you are a researcher or developer who needs to generate text in the unique, informal, and potentially offensive style of the Spanish ForoCoches forum, or if you want to experiment with a pre-trained model on a specialized social media dataset.

Not ideal if you need to generate polite, professional, or general-purpose Spanish text, as the model is specifically trained on and reflects the potentially offensive language of the ForoCoches forum.

Spanish online forums Social media text generation Linguistic research Informal language processing Community content simulation
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 4 / 25
Maturity 16 / 25
Community 9 / 25

How are scores calculated?

Stars

7

Forks

1

Language

Python

License

GPL-3.0

Last pushed

Jun 13, 2023

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/Atenrev/forocoches-language-generation"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.