HomoScriptor-Project/HomoScriptor
Fuel innovation and advance language models with HomoScriptor: A vibrant, community-driven dataset for fine-tuning large language models.
This project provides a collection of human-written text examples, organized into categories like language tasks (rhyming, poetry, summarization) and logic tasks (puzzles, riddles). It's designed to be used by researchers and data scientists who are developing or improving large language models. You input this dataset into your model's training process, and it helps the model learn to generate more human-like and nuanced text outputs.
No commits in the last 6 months.
Use this if you are a language model researcher or data scientist looking for diverse, human-written examples to fine-tune your large language models and improve their performance on various tasks.
Not ideal if you are looking for a tool to directly apply language models or if you need highly specialized domain-specific data outside of general language and logic tasks.
Stars
18
Forks
3
Language
—
License
Apache-2.0
Category
Last pushed
Oct 14, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/HomoScriptor-Project/HomoScriptor"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
axolotl-ai-cloud/axolotl
Go ahead and axolotl questions
google/paxml
Pax is a Jax-based machine learning framework for training large scale models. Pax allows for...
JosefAlbers/PVM
Phi-3.5 for Mac: Locally-run Vision and Language Models for Apple Silicon
iamarunbrahma/finetuned-qlora-falcon7b-medical
Finetuning of Falcon-7B LLM using QLoRA on Mental Health Conversational Dataset
h2oai/h2o-wizardlm
Open-Source Implementation of WizardLM to turn documents into Q:A pairs for LLM fine-tuning