angeluriot/French_instruct
A dataset of instructions and answers in natural language for machine learning.
This dataset provides a large collection of French-language instructions and corresponding answers, including multi-turn conversations. It is designed to help train conversational AI systems to understand and respond in natural French. Each example pairs a French prompt with a suitable French response, which makes the dataset useful to machine learning engineers building French-speaking chatbots or virtual assistants.
No commits in the last 6 months.
Use this if you are developing or fine-tuning a large language model specifically for French natural language processing and need a high-quality dataset of conversational exchanges.
Not ideal if your project focuses on languages other than French, or if you require very specific domain-focused conversations outside of general knowledge or coding assistance.
Stars: 26
Forks: 2
Language: Python
License: MIT
Category:
Last pushed: Jan 16, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/angeluriot/French_instruct"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
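The same request can be made from Python. The following is a minimal sketch using only the standard library, not an official client. The helper names `build_quality_url` and `fetch_quality` are hypothetical, reading the URL path as category/owner/repository is an assumption based on the curl example above, and the response format is not documented here, so the code tries JSON and falls back to raw text.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Assemble the endpoint URL in the same shape as the curl example.

    Assumption: the three path segments are category, owner, and repository.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return "/".join([BASE_URL, *parts])


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository (hypothetical helper).

    Assumption: the body is JSON. If it does not parse, the raw text is returned.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        body = response.read().decode("utf-8")
    try:
        return json.loads(body)
    except json.JSONDecodeError:
        return body


if __name__ == "__main__":
    # Performs a real network request and counts against the daily limit.
    print(fetch_quality("nlp", "angeluriot", "French_instruct"))
```

Because the keyless tier allows 100 requests per day, it is worth caching results locally rather than calling `fetch_quality` in a loop.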
Higher-rated alternatives
EQTPartners/PTEC
Code repository corresponding to the paper "Prompt Tuned Embedding Classification for...
ImadSaddik/BoDmaghDataset
BoDmagh dataset is a Supervised Fine-Tuning (SFT) dataset for the Darija language
chaoswork/sft_datasets
A curated collection of open-source SFT datasets, updated continually
andrewzamai/SLIMER
Show Less, Instruct More: Enriching Prompts with Definitions and Guidelines for Zero-Shot NER