pyjanitor-devs/pyjanitor

Clean APIs for data cleaning. A Python implementation of the R package janitor.

Score: 76 / 100 (Verified)

This tool helps data analysts and scientists transform messy raw datasets into clean, usable formats for analysis or modeling. It takes common tabular data, like spreadsheets or database exports, and processes it by renaming columns, handling missing values, or restructuring information to produce a tidied dataset. Anyone working with data that requires preparation before it can be used effectively will find this project beneficial.

1,484 stars. Actively maintained with 10 commits in the last 30 days. Available on PyPI.

Use this if you regularly spend significant time manually cleaning and preparing data using pandas, and want a more efficient, readable, and consistent way to perform common data cleaning tasks.

Not ideal if you primarily work with data that is already perfectly structured and clean, or if you prefer to build all your data manipulation logic from scratch without relying on extended libraries.

data-preparation data-analysis data-wrangling business-intelligence statistical-modeling
Maintenance 20 / 25
Adoption 10 / 25
Maturity 25 / 25
Community 21 / 25

Stars: 1,484
Forks: 182
Language: Python
License: MIT
Last pushed: Mar 15, 2026
Commits (30d): 10
Dependencies: 5

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/data-engineering/pyjanitor-devs/pyjanitor"
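The same request can be sketched in Python using only the standard library. The helper names `quality_url` and `fetch_quality` are hypothetical, and the sketch assumes the endpoint returns JSON, which this page does not state. The path layout (category, owner, repository) is inferred from the curl URL above.

```python
import json
import urllib.request

# Base path taken from the curl example; the rest of the layout is inferred.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the request URL for one repository (hypothetical helper)."""
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository, assuming a JSON response body."""
    with urllib.request.urlopen(quality_url(category, owner, repo), timeout=timeout) as resp:
        return json.load(resp)


# Example call (performs a network request, so it is left commented out):
# data = fetch_quality("data-engineering", "pyjanitor-devs", "pyjanitor")
```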

Open to everyone: 100 requests/day with no key. A free key raises the limit to 1,000 requests/day.