kvignesh1420/cot-icl-lab
[ACL 2025] Official implementation of the "CoT-ICL Lab" framework
This framework helps AI researchers and machine learning engineers study how large language models learn complex reasoning steps from examples. It generates synthetic, tokenized datasets based on customizable graph structures (DAGs) and token configurations. The output is a dataset ready for training and evaluating transformer models, complete with input IDs, attention masks, and chain-of-thought elements. This allows researchers to rigorously test hypotheses about in-context learning.
No commits in the last 6 months.
Use this if you are a machine learning researcher or engineer who needs to generate controlled synthetic datasets to understand how transformer models perform chain-of-thought reasoning from in-context examples.
Not ideal if you need to work with real-world, non-synthetic datasets or are looking for a general-purpose fine-tuning framework for pre-trained language models.
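To make the idea concrete, here is a minimal illustrative sketch of the kind of pipeline the framework automates: sample a DAG, seed the source nodes with random tokens, and derive each downstream node's token from its parents so that the topological order of derived tokens forms a chain of thought. This is not the repository's actual API; all function names and the sum-mod-vocab token rule are assumptions for illustration only.

```python
import random

def random_dag(num_nodes, seed=0):
    """Sample a DAG by only allowing edges from lower- to
    higher-indexed nodes (illustrative, not cot-icl-lab's sampler)."""
    rng = random.Random(seed)
    parents = {0: []}  # node 0 is a source node
    for node in range(1, num_nodes):
        k = rng.randint(1, min(2, node))
        parents[node] = rng.sample(range(node), k)
    return parents

def generate_example(parents, vocab_size=50, seed=0):
    """Assign random tokens to source nodes, derive the rest from
    their parents; derived tokens form the chain-of-thought."""
    rng = random.Random(seed)
    values, chain = {}, []
    for node in sorted(parents):
        if not parents[node]:
            values[node] = rng.randrange(vocab_size)  # input token
        else:
            # hypothetical token-processing rule: sum of parents mod vocab
            values[node] = sum(values[p] for p in parents[node]) % vocab_size
            chain.append(values[node])
    return values, chain

parents = random_dag(5, seed=0)
values, chain = generate_example(parents, vocab_size=50, seed=1)
```

Tokenizing `values` of the source nodes as the prompt and `chain` as the target reasoning trace yields one synthetic in-context example; the real framework additionally emits input IDs and attention masks ready for transformer training.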
Stars: 11
Forks: 1
Language: Python
License: MIT
Category:
Last pushed: Oct 10, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/kvignesh1420/cot-icl-lab"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
Higher-rated alternatives
PaddlePaddle/PaddleNLP
Easy-to-use and powerful LLM and SLM library with awesome model zoo.
meta-llama/llama-cookbook
Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: Getting started...
arcee-ai/mergekit
Tools for merging pretrained large language models.
changyeyu/LLM-RL-Visualized
100+ original LLM/RL algorithm diagrams by the author of the book "Large Model Algorithms" (100+ LLM/RL Algorithm Maps)
mindspore-lab/step_into_llm
MindSpore online courses: Step into LLM