Ganeshsivakumar/langchain-beam

Integrates LLMs as PTransform in Apache Beam pipelines using LangChain

35
/ 100
Emerging

This is a developer library that helps data engineers integrate Large Language Models (LLMs) into their data pipelines. It allows you to feed structured or unstructured data into an LLM, apply instructions (like classification or summarization), and get back processed data as part of your existing data workflows. It's for data engineers who build and manage data processing systems.

Use this if you are a data engineer building Apache Beam pipelines and need to add AI capabilities like text classification, summarization, or data generation directly within your data processing workflow.

Not ideal if you are an end-user looking for a no-code solution to apply LLMs, or if you are not working with Apache Beam data pipelines.

data-engineering data-pipelines LLM-integration batch-processing streaming-data
No Package No Dependents
Maintenance 6 / 25
Adoption 7 / 25
Maturity 16 / 25
Community 6 / 25

How are scores calculated?

Stars

30

Forks

2

Language

Java

License

MIT

Last pushed

Oct 20, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/rag/Ganeshsivakumar/langchain-beam"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.