john-hawkins/data-science-first

Repository of code examples for the book "Data Science First: Using Language Models in AI-Enabled Applications" -- Broken down into case studies using data from media, education, telecommunications and investment research.

24
/ 100
Experimental

This project provides practical code examples for data scientists and software engineers looking to integrate large language models (LLMs) into their existing data science workflows. It takes real-world datasets from industries like media, education, and banking as input, applies LLM-enhanced techniques for tasks such as customer churn analysis or fraud detection, and outputs improved data insights or AI-powered solutions. The resource is designed for practitioners who need to leverage modern AI tools effectively and reliably.

Use this if you are a data scientist or software engineer who wants to incorporate language models into your data science projects with robust, foundational methodologies, rather than relying on rapidly changing proprietary tools.

Not ideal if you are looking for a simple drag-and-drop solution without needing to understand underlying data science principles or write code.

data science AI applications natural language processing customer analytics fraud detection
No License No Package No Dependents
Maintenance 13 / 25
Adoption 6 / 25
Maturity 5 / 25
Community 0 / 25

How are scores calculated?

Stars

20

Forks

Language

Jupyter Notebook

License

Last pushed

Mar 16, 2026

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/john-hawkins/data-science-first"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.