oooranz/Baby-CoThought
🍼 Baby's CoThought: Leveraging LLMs for Enhanced Reasoning in Compact Models (BabyLM Challenge)
This project helps AI researchers and developers train more efficient, compact language models from human-scale data. It takes diverse, smaller text corpora, processes them with larger language models to generate new natural language understanding examples, and then uses those examples to pretrain a smaller RoBERTa-like model. The output is a "Baby Language Model" that demonstrates enhanced reasoning capabilities despite less training data.
No commits in the last 6 months.
Use this if you are an NLP researcher or machine learning engineer looking to develop small, sample-efficient language models that still possess strong reasoning abilities, mirroring human language acquisition.
Not ideal if you need to train a full-scale, cutting-edge large language model for production use, as this project focuses on compact models and sample efficiency rather than maximizing overall performance.
Stars: 17
Forks: 3
Language: Python
License: —
Category:
Last pushed: Jan 10, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/oooranz/Baby-CoThought"
Open to everyone: 100 requests/day with no key required. A free key raises the limit to 1,000/day.
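For scripted access, the endpoint can be built programmatically. A minimal sketch follows; `quality_url` is a hypothetical helper, and the `/api/v1/quality/<category>/<owner>/<repo>` path layout is inferred from the curl example above rather than from published API documentation:

```python
# Hypothetical helper for the pt-edge quality API.
# Assumption: the path pattern /api/v1/quality/<category>/<owner>/<repo>
# is generalized from the single curl example above.
BASE = "https://pt-edge.onrender.com/api/v1/quality"

def quality_url(category: str, owner: str, repo: str) -> str:
    """Return the quality-data endpoint for a given repository."""
    return f"{BASE}/{category}/{owner}/{repo}"

print(quality_url("transformers", "oooranz", "Baby-CoThought"))
# → https://pt-edge.onrender.com/api/v1/quality/transformers/oooranz/Baby-CoThought
```

The returned URL can then be fetched with any HTTP client (for example, the curl command shown above); the response schema is not documented here, so treat field names as unknown until inspected.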
Higher-rated alternatives
ShiZhengyan/InstructionModelling
[NeurIPS 2024 Main Track] Code for the paper titled "Instruction Tuning With Loss Over Instructions"
raymin0223/fast_robust_early_exit
Fast and Robust Early-Exiting Framework for Autoregressive Language Models with Synchronized...
SALT-NLP/Adaptive-Compositional-Modules
Code for the ACL 2022 paper "Continual Sequence Generation with Adaptive Compositional Modules"
joisino/zeh
Code for "Even GPT-5.2 Can’t Count to Five: The Case for Zero-Error Horizons in Trustworthy LLMs"
yhy1117/X-Mixup
Implementation of ICLR 2022 paper "Enhancing Cross-lingual Transfer by Manifold Mixup".