microsoft/COCO-LM

[NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining

/ 100

Emerging

This project offers tools to fine-tune existing COCO-LM language models for specific natural language understanding (NLU) tasks. You provide a pre-trained COCO-LM model and your domain-specific text datasets, and the output is a more accurate language model tailored to tasks like text classification, question answering, or natural language inference. It's designed for researchers and practitioners working on advanced AI applications involving text understanding.

118 stars. No commits in the last 6 months.

Use this if you are a machine learning researcher or engineer looking to improve the performance of language models for specific NLU benchmarks or custom text-based applications.

Not ideal if you are an end-user without a technical background in machine learning or if you need a ready-to-use application rather than a model fine-tuning toolkit.

natural-language-processing text-classification question-answering AI-research machine-learning-engineering

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 13 / 25

How are scores calculated?

Stars

118

Forks

Language

Python

License

MIT

Higher-rated alternatives

galilai-group/stable-pretraining

Reliable, minimal and scalable library for pretraining foundation and world models

CognitiveAISystems/MAPF-GPT

[AAAI-2025] This repository contains MAPF-GPT, a deep learning-based model for solving MAPF...

UKPLab/gpl

Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled...

larslorch/avici

Amortized Inference for Causal Structure Learning, NeurIPS 2022

svdrecbd/mhc-mlx

MLX + Metal implementation of mHC: Manifold-Constrained Hyper-Connections by DeepSeek-AI.

Explore Transformer Models

All categories Trending Transformer directory Insights