awslabs/project-lakechain

Cloud-native, AI-powered, document processing pipelines on AWS.

Score: 53 / 100 (Established)

This project helps you build automated systems to process large volumes of documents, audio, and video files. You provide the raw media, and it uses AI to perform tasks like summarizing videos, transcribing audio, detecting faces in images, or extracting information from emails. This is for cloud architects, data engineers, or machine learning engineers who need to create scalable, automated media and document analysis workflows.

186 stars.

Use this if you need to build a custom, AI-powered pipeline to automatically process documents, images, audio, or video files at scale on AWS.

Not ideal if you're looking for a simple, off-the-shelf application to process a few documents manually, or if you're not working within the AWS cloud environment.

document-processing, media-analysis, information-extraction, ai-automation, cloud-architecture

No Package, No Dependents

Maintenance: 10 / 25
Adoption: 10 / 25
Maturity: 16 / 25
Community: 17 / 25

Stars: 186
Forks: 27
Language: TypeScript
License: Apache-2.0
Last pushed: Jan 22, 2026
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/mlops/awslabs/project-lakechain"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
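
For scripted use, the same request can be sketched in Python using only the standard library. This is a minimal sketch under stated assumptions: the path layout (category, then owner, then repository) is inferred from the single curl URL above, the response is assumed to be JSON, and the helper names `build_quality_url` and `fetch_quality` are hypothetical. The response fields are not documented here, so the sketch returns the parsed body without interpreting it.

```python
import json
import urllib.request

# Base path taken from the curl example; everything after it is assumed to be
# /<category>/<owner>/<repo>, inferred from that one URL.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Join the path segments seen in the curl example into a single URL."""
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """GET the endpoint and parse the body as JSON (assumed response format).

    No API key is sent, so the call counts against the keyless daily quota.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("mlops", "awslabs", "project-lakechain")` issues the same request as the curl command. Because the keyless tier allows 100 requests/day, it is worth caching the result rather than calling it in a loop.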