john-hawkins/data-science-first
Repository of code examples for the book "Data Science First: Using Language Models in AI-Enabled Applications", broken down into case studies using data from media, education, telecommunications, and investment research.
This project provides practical code examples for data scientists and software engineers looking to integrate large language models (LLMs) into their existing data science workflows. It takes real-world datasets from industries such as media, education, telecommunications, and investment research as input, applies LLM-enhanced techniques for tasks such as customer churn analysis or fraud detection, and outputs improved data insights or AI-powered solutions. The resource is designed for practitioners who need to use modern AI tools effectively and reliably.
Use this if you are a data scientist or software engineer who wants to incorporate language models into your data science projects with robust, foundational methodologies, rather than relying on rapidly changing proprietary tools.
Not ideal if you want a simple drag-and-drop solution that requires neither writing code nor understanding the underlying data science principles.
Stars: 20
Forks: —
Language: Jupyter Notebook
License: —
Category:
Last pushed: Mar 16, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/john-hawkins/data-science-first"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
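The same request can be made from Python. The following is a minimal sketch using only the standard library. It assumes the path segments after `/quality/` are category, owner, and repository name (inferred from the single curl example above) and that the endpoint returns JSON, which this page does not confirm. The helper names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request

# Base path copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL in the same shape as the curl example.

    Assumption: the segments are category/owner/repo, inferred from one example.
    """
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record for one repository.

    Assumption: the response body is JSON; json.loads raises a ValueError
    subclass if it is not.
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("ml-frameworks", "john-hawkins", "data-science-first")` issues the same GET request as the curl command. Unauthenticated use counts against the 100 requests/day limit stated above.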
Higher-rated alternatives
xuedong/machine-learning-summer-schools
Curated materials for different machine learning related summer schools
danielsobrado/ml-animations
Learn Machine learning by doing exercises and intuitive animations
RealYoshiWaton/real-world-ai-projects
🤖 Explore real-world AI projects focused on machine learning techniques, offering practical...
kira23j/AI-Engineering
Hands-on AI engineering
birhanukassahunabuye-hue/freecodecamp-python
A collection of practice challenges and curriculum projects completed during my Python...