aws/aws-sdk-pandas
pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
This project helps data professionals working with AWS to easily move data between their Python Pandas dataframes and various AWS services. You can read data from services like S3, Athena, Redshift, and DynamoDB into a Pandas DataFrame, and then write your processed DataFrame back to these services. This simplifies data preparation, analysis, and ETL (Extract, Transform, Load) workflows for data scientists, data engineers, and analysts who use Python.
4,106 stars. Actively maintained with 16 commits in the last 30 days.
Use this if you need to seamlessly integrate your Python Pandas data processing with a wide range of AWS data storage and analytics services, treating AWS as a natural extension of your DataFrame operations.
Not ideal if your data workflows do not involve AWS services or if you prefer to manage data interactions directly through native AWS SDKs without a Pandas-centric approach.
Stars
4,106
Forks
723
Language
Python
License
Apache-2.0
Category
Last pushed
Mar 18, 2026
Commits (30d)
16
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/data-engineering/aws/aws-sdk-pandas"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
PrefectHQ/prefect
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
growthbook/growthbook
Open Source Feature Flags, Experimentation, and Product Analytics
koopjs/koop
Transform, query, and download geospatial data on the web.
pathwaycom/pathway
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
dagster-io/dagster
An orchestration platform for the development, production, and observation of data assets.