hyeonsangjeon/PDF2LLM-Tuning-Studio

PDF 문서에서 GPU 가속 처리로 고품질 질의응답(QA) 데이터를 자동 생성하고 LLM을 효율적으로 파인튜닝하는 솔루션입니다. Unstructured 라이브러리와 AWS Bedrock Claude로 도메인 특화 QA 쌍을 생성하고, LoRA 기법으로 경량 모델을 훈련합니다.

/ 100

Emerging

This tool helps subject matter experts and businesses extract specific knowledge from their PDF documents and train a custom AI chatbot. It takes your PDF files as input and automatically generates high-quality question-and-answer pairs, then uses these to fine-tune a large language model. The output is a specialized AI model that can answer questions accurately based on your proprietary documents.

Use this if you need to build a domain-specific AI chatbot or knowledge retrieval system from a large collection of internal PDF documents, such as legal contracts, research papers, or financial reports.

Not ideal if you only need general information extraction or summarization, or if you don't have access to GPU hardware for processing and model training.

knowledge-management document-intelligence information-retrieval custom-chatbot enterprise-ai

No License No Package No Dependents

Maintenance 10 / 25

Adoption 4 / 25

Maturity 7 / 25

Community 13 / 25

How are scores calculated?

Stars

Forks

Language

Jupyter Notebook

License

—

Higher-rated alternatives

eellak/glossAPI

Greek Dataset Production from PDF+

pymupdf/langchain-pymupdf4llm

An integration package connecting PyMuPDF4LLM to LangChain

KalyanM45/DocGenius-Revolutionizing-PDFs-with-AI

This is a Python application that allows you to load a PDF and ask questions about it using...

mozilla-ai/structured-qa

Blueprint by Mozilla.ai for answering questions about structured documents

alejandro-ao/langchain-ask-pdf

An AI-app that allows you to upload a PDF and ask questions about it. It uses OpenAI's LLMs to...

Explore LLM Tools

All categories Trending LLM Tool directory Insights