OpenBioLink/ThoughtSource
A central, open resource for data and tools related to chain-of-thought reasoning in large language models. Developed @ Samwald research group: https://samwald.info/
This resource provides structured datasets of questions, answers, and detailed reasoning steps for various domains. It takes in raw question-answering datasets and outputs them with enhanced 'chain-of-thought' reasoning, ideal for training and evaluating large language models. This is for AI researchers and developers who are building or improving AI systems that need to explain their decision-making process.
1,015 stars. No commits in the last 6 months.
Use this if you are developing AI models and need high-quality, pre-processed datasets with explicit reasoning steps to enhance your model's ability to think step-by-step.
Not ideal if you are a casual user looking for a ready-to-use AI application, as this is a foundational resource for AI development, not an end-user tool.
Stars
1,015
Forks
82
Language
Jupyter Notebook
License
MIT
Category
Last pushed
Dec 16, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/OpenBioLink/ThoughtSource"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
open-mmlab/mmpretrain
OpenMMLab Pre-training Toolbox and Benchmark
facebookresearch/mmf
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
HuaizhengZhang/Awsome-Deep-Learning-for-Video-Analysis
Papers, code and datasets about deep learning and multi-modal learning for video analysis
KaiyangZhou/pytorch-vsumm-reinforce
Unsupervised video summarization with deep reinforcement learning (AAAI'18)
adambielski/siamese-triplet
Siamese and triplet networks with online pair/triplet mining in PyTorch