hyeonsangjeon/PDF2LLM-Tuning-Studio

PDF 문서에서 GPU 가속 처리로 고품질 질의응답(QA) 데이터를 자동 생성하고 LLM을 효율적으로 파인튜닝하는 솔루션입니다. Unstructured 라이브러리와 AWS Bedrock Claude로 도메인 특화 QA 쌍을 생성하고, LoRA 기법으로 경량 모델을 훈련합니다.

34
/ 100
Emerging

This tool helps subject matter experts and businesses extract specific knowledge from their PDF documents and train a custom AI chatbot. It takes your PDF files as input and automatically generates high-quality question-and-answer pairs, then uses these to fine-tune a large language model. The output is a specialized AI model that can answer questions accurately based on your proprietary documents.

Use this if you need to build a domain-specific AI chatbot or knowledge retrieval system from a large collection of internal PDF documents, such as legal contracts, research papers, or financial reports.

Not ideal if you only need general information extraction or summarization, or if you don't have access to GPU hardware for processing and model training.

knowledge-management document-intelligence information-retrieval custom-chatbot enterprise-ai
No License No Package No Dependents
Maintenance 10 / 25
Adoption 4 / 25
Maturity 7 / 25
Community 13 / 25

How are scores calculated?

Stars

7

Forks

2

Language

Jupyter Notebook

License

Category

pdf-qa-systems

Last pushed

Jan 22, 2026

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/hyeonsangjeon/PDF2LLM-Tuning-Studio"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.