codertimo/BERT-pytorch
Google AI 2018 BERT pytorch implementation
This project helps natural language processing practitioners pre-train custom BERT language models on their own text data. You supply a large collection of text documents, and it produces a pre-trained BERT model. That model can then serve as the foundation for downstream tasks such as text classification, question answering, or sentiment analysis, which makes it useful for researchers and data scientists working with language.
6,517 stars. No commits in the last 6 months. Available on PyPI.
Use this if you need to train a foundational language model (BERT) from scratch or fine-tune it on your unique dataset, especially when working with specialized jargon or a particular domain of text.
Not ideal if you're looking for a ready-to-use, pre-trained BERT model for common language understanding tasks: this project requires you to prepare your own corpus and run the training yourself.
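For readers wondering what "prepare your own corpus and perform the training yourself" involves, here is a minimal sketch of the masked-token step used in BERT-style pre-training as described in the original BERT paper: about 15% of token positions are selected, and of those 80% become `[MASK]`, 10% become a random vocabulary token, and 10% stay unchanged. This is an illustration only and is not taken from the codertimo/BERT-pytorch code; the function name `mask_tokens`, the whitespace tokenization, and the toy vocabulary are hypothetical.

```python
import random

MASK = "[MASK]"  # placeholder token used by BERT-style masked language modeling


def mask_tokens(tokens, vocab, mask_prob=0.15, rng=None):
    """Return (masked_tokens, labels) for one tokenized sentence.

    labels[i] holds the original token at positions selected for prediction
    and None everywhere else. Hypothetical helper, for illustration only.
    """
    if rng is None:
        rng = random.Random()
    masked, labels = [], []
    for tok in tokens:
        if rng.random() < mask_prob:
            labels.append(tok)  # the model must predict this token
            roll = rng.random()
            if roll < 0.8:
                masked.append(MASK)               # 80%: replace with [MASK]
            elif roll < 0.9:
                masked.append(rng.choice(vocab))  # 10%: replace with a random token
            else:
                masked.append(tok)                # 10%: keep the original token
        else:
            labels.append(None)  # position is not part of the training loss
            masked.append(tok)
    return masked, labels


if __name__ == "__main__":
    sentence = "the patient was given five milligrams of warfarin".split()
    toy_vocab = sorted(set(sentence))  # assumption: a real vocabulary is far larger
    print(mask_tokens(sentence, toy_vocab, rng=random.Random(0)))
```

A real pipeline would run a step like this over every sentence in a domain corpus, using a subword vocabulary built from that corpus rather than whitespace tokens.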
Stars: 6,517
Forks: 1,328
Language: Python
License: Apache-2.0
Category:
Last pushed: Sep 15, 2023
Commits (30d): 0
Dependencies: 3
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/codertimo/BERT-pytorch"
Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000/day.
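The same request can be made from Python with only the standard library. The sketch below rebuilds the URL shown in the curl command; it assumes that the `nlp` path segment is a category name and that the endpoint returns JSON, neither of which this page confirms. The helper names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request

API_BASE = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(owner, repo, category="nlp"):
    """Build the endpoint URL; treating 'nlp' as a category is an assumption."""
    return f"{API_BASE}/{category}/{owner}/{repo}"


def fetch_quality(owner, repo, category="nlp", timeout=10):
    """Fetch the data for one repository (assumes the response body is JSON)."""
    url = quality_url(owner, repo, category)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # Performs a real network request and counts against the daily limit.
    print(fetch_quality("codertimo", "BERT-pytorch"))
```

With a limit of 100 unauthenticated requests per day, it is worth caching responses locally rather than calling `fetch_quality` in a loop.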
Related tools
JayYip/m3tl
BERT for Multitask Learning
920232796/bert_seq2seq
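PyTorch implementation of BERT for seq2seq tasks using the UniLM scheme. It now also handles automatic summarization, text classification, sentiment analysis, NER, part-of-speech tagging, and other tasks, and supports T5 models as well as GPT-2 for article continuation.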
sileod/tasknet
Easy modernBERT fine-tuning and multi-task learning
graykode/toeicbert
TOEIC (Test of English for International Communication) solving using the pytorch-pretrained-BERT model.
guotong1988/BERT-pre-training
Multi-GPU pre-training on one machine for BERT without Horovod (data parallelism)