HomoScriptor-Project/HomoScriptor

Fuel innovation and advance language models with HomoScriptor: A vibrant, community-driven dataset for fine-tuning large language models.

/ 100

Emerging

This project provides a collection of human-written text examples, organized into categories such as language tasks (rhyming, poetry, summarization) and logic tasks (puzzles, riddles). It is intended for researchers and data scientists who are developing or improving large language models. You supply the dataset as fine-tuning data, and the human-written examples help the model learn to produce more human-like, nuanced text.
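As a rough illustration of the workflow described above, the sketch below turns categorized human-written examples into prompt/completion records in JSON Lines form, a shape many fine-tuning pipelines accept. The field names (`category`, `prompt`, `response`), the sample records, and the helper functions are all hypothetical and chosen for illustration; they are not taken from the HomoScriptor repository, so check the actual file layout before adapting this.

```python
import json

# Hypothetical records: these field names and sample texts are assumptions
# for illustration only, not the actual HomoScriptor schema or data.
examples = [
    {
        "category": "language/rhyming",
        "prompt": "Give a word that rhymes with 'light'.",
        "response": "Night.",
    },
    {
        "category": "logic/riddles",
        "prompt": "What has keys but cannot open locks?",
        "response": "A piano.",
    },
]


def to_training_record(example):
    """Turn one human-written example into a prompt/completion pair."""
    return {
        "prompt": example["prompt"].strip(),
        "completion": example["response"].strip(),
    }


def to_jsonl(records):
    """Serialize records as JSON Lines, one training record per line."""
    return "\n".join(
        json.dumps(to_training_record(r), ensure_ascii=False) for r in records
    )


jsonl_text = to_jsonl(examples)
print(jsonl_text)
```

The resulting text can be written to a `.jsonl` file and passed to whichever fine-tuning tool you use; the exact record keys that tool expects may differ from the ones assumed here.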

No commits in the last 6 months.

Use this if you are a language model researcher or data scientist looking for diverse, human-written examples to fine-tune your large language models and improve their performance on various tasks.

Not ideal if you are looking for a tool to directly apply language models or if you need highly specialized domain-specific data outside of general language and logic tasks.

AI research, language model training, natural language processing, machine learning datasets, data science for AI

Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 16 / 25

Community 12 / 25

License

Apache-2.0

Higher-rated alternatives

axolotl-ai-cloud/axolotl

Go ahead and axolotl questions

google/paxml

Pax is a Jax-based machine learning framework for training large scale models. Pax allows for...

JosefAlbers/PVM

Phi-3.5 for Mac: Locally-run Vision and Language Models for Apple Silicon

iamarunbrahma/finetuned-qlora-falcon7b-medical

Finetuning of Falcon-7B LLM using QLoRA on Mental Health Conversational Dataset

h2oai/h2o-wizardlm

Open-Source Implementation of WizardLM to turn documents into Q:A pairs for LLM fine-tuning
