HeegyuKim/language-model
A project for training Korean language models (Flax, PyTorch with Hugging Face Accelerate)
This project provides pre-trained Korean language models based on the GPT-J and GPT-2 architectures, along with the code to train them. It takes raw Korean text as input and produces language models that can process and generate fluent Korean text. It is aimed at data scientists, machine learning engineers, and researchers working on natural language processing for Korean.
No commits in the last 6 months.
Use this if you need pre-trained Korean language models for tasks like dialogue generation or summarization, or if you want to train your own custom Korean language models efficiently on cloud TPUs.
Not ideal if your project does not involve the Korean language or if you are looking for ready-to-use applications rather than foundational language models.
Stars: 32
Forks: 3
Language: Jupyter Notebook
License: MIT
Category
Last pushed: Sep 13, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/HeegyuKim/language-model"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
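The same request can be made from Python. The following is a minimal sketch using only the standard library. It assumes the endpoint returns JSON, which the listing does not state, and the helper names `quality_url` and `fetch_quality` are hypothetical, introduced here for illustration only.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl command in the listing.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/transformers"


def quality_url(owner: str, repo: str) -> str:
    """Build the request URL for a repository (hypothetical helper)."""
    # Percent-encode each path segment so unusual characters stay inside the segment.
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Fetch the listing data for a repository.

    Assumption: the response body is JSON. The response schema is not
    documented in the listing, so the parsed value is returned as-is.
    """
    request = urllib.request.Request(
        quality_url(owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Example call (performs a network request and counts against the daily limit):
# data = fetch_quality("HeegyuKim", "language-model")
# print(json.dumps(data, indent=2, ensure_ascii=False))
```

Keeping URL construction separate from the network call makes it possible to check the URL without spending any of the 100 keyless requests per day.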
Higher-rated alternatives
Tongjilibo/bert4torch
An elegant PyTorch implementation of transformers
nyu-mll/jiant
jiant is an NLP toolkit
lonePatient/TorchBlocks
A PyTorch-based toolkit for natural language processing
monologg/JointBERT
PyTorch implementation of JointBERT: "BERT for Joint Intent Classification and Slot Filling"
grammarly/gector
Official implementation of the papers "GECToR – Grammatical Error Correction: Tag, Not Rewrite"...