ToxyBorg/llama_langchain_documents_embeddings

just testing langchain with llama cpp documents embeddings

/ 100

Experimental

This project helps developers and data scientists transform unstructured documents like PDFs into a format suitable for advanced search and question-answering systems. It takes a directory of documents, breaks them into smaller pieces, and then processes these pieces to create numerical representations (embeddings) and a searchable index. The output is a highly efficient system that can quickly find relevant document sections and answer specific questions based on the content.

No commits in the last 6 months.

Use this if you are a developer or data scientist building an application that needs to perform semantic search or question answering over a collection of private or specialized documents using local language models.

Not ideal if you are looking for an end-user application to search documents without needing to write code, or if you plan to use cloud-based language models and embedding services.

information-retrieval natural-language-processing semantic-search document-qa local-llm-deployment

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 16 / 25

Community 5 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

Unlicense

Higher-rated alternatives

Azure-Samples/azure-ai-document-processing-samples

A collection of samples demonstrating techniques for processing documents with Azure AI...

artitw/text2text

Text2Text Language Modeling Toolkit

aiplanethub/beyondllm

Build, evaluate and observe LLM apps

build-on-aws/langchain-embeddings

This repository demonstrates the construction of a state-of-the-art multimodal search engine,...

qianniuspace/llm_notebooks

AI 应用示例合集

Explore Embedding Tools

All categories Trending Embeddings directory Insights