aws/aws-sdk-pandas

pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).

70 / 100, Verified

This project helps data professionals working with AWS move data between pandas DataFrames and AWS services. You can read data from services such as S3, Athena, Redshift, and DynamoDB into a DataFrame, then write the processed DataFrame back to those services. This simplifies data preparation, analysis, and ETL (Extract, Transform, Load) workflows for data scientists, data engineers, and analysts who use Python.
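As a rough illustration of the read, transform, write loop described above, here is a minimal sketch. It assumes the library is imported as `awswrangler` and that AWS credentials are already configured. The helper names `s3_uri` and `round_trip`, and any bucket or prefix you pass in, are hypothetical and only serve the example.

```python
def s3_uri(bucket: str, key: str) -> str:
    """Build an s3:// URI from a bucket name and an object key or prefix."""
    return f"s3://{bucket}/{key.lstrip('/')}"


def round_trip(source_uri: str, target_uri: str) -> int:
    """Read Parquet from S3 into a DataFrame, clean it, and write it back.

    awswrangler is imported lazily so this module loads even where the
    package or AWS credentials are not available.
    Returns the number of rows written.
    """
    import awswrangler as wr  # import name of aws-sdk-pandas

    # S3 -> pandas DataFrame (reads every Parquet file under the prefix)
    df = wr.s3.read_parquet(path=source_uri)

    # Any ordinary pandas step can go here; dropping incomplete rows is
    # just a placeholder transformation.
    df = df.dropna()

    # pandas DataFrame -> S3, written as a Parquet dataset under the prefix
    wr.s3.to_parquet(df=df, path=target_uri, dataset=True)
    return len(df)
```

With a hypothetical bucket, a call might look like `round_trip(s3_uri("example-bucket", "raw/events/"), s3_uri("example-bucket", "clean/events/"))`. The same pattern applies to the other services, each through its own submodule.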

4,106 stars. Actively maintained with 16 commits in the last 30 days.

Use this if you need to integrate your pandas data processing with a wide range of AWS data storage and analytics services, treating AWS as a natural extension of your DataFrame operations.

Not ideal if your data workflows do not involve AWS services, or if you prefer to manage data interactions directly through the native AWS SDKs rather than through a pandas-centric layer.

data-engineering data-analysis cloud-data-warehousing ETL AWS-data-services
No Package No Dependents
Maintenance 20 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 24 / 25


Stars: 4,106
Forks: 723
Language: Python
License: Apache-2.0
Last pushed: Mar 18, 2026
Commits (30d): 16

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/data-engineering/aws/aws-sdk-pandas"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
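For use from a script, the curl request above could be reproduced in Python roughly as follows. This is a sketch under two assumptions the page does not confirm: that the URL path follows a category/owner/repository pattern (inferred from the single example URL) and that the response body is JSON. The function names `quality_url` and `fetch_quality` are made up for illustration.

```python
import json
import urllib.request

# Base path taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL (assumed pattern: category/owner/repo)."""
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record and parse it, assuming a JSON response."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_quality("data-engineering", "aws", "aws-sdk-pandas")` issues the same GET request as the curl command. Unauthenticated calls count against the 100 requests/day limit.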