yuanzhoulvpi2017/DocumentSearch
基于sentence transformers和chatglm实现的文档搜索工具
This tool helps you quickly find answers within a collection of your own documents, such as policy documents or reports. You input a folder containing PDF or DOCX files and then ask questions in natural language. The output is a direct answer to your question, along with the relevant snippets from your documents. This is ideal for researchers, policy analysts, or anyone who needs to extract specific information from a large set of local documents.
157 stars. No commits in the last 6 months.
Use this if you need to rapidly search and get answers from a local folder of PDF or DOCX documents without writing complex code.
Not ideal if you need to search '.doc' files, or if your documents are stored online and not in a local folder.
Stars
157
Forks
17
Language
Python
License
Apache-2.0
Category
Last pushed
Apr 17, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/yuanzhoulvpi2017/DocumentSearch"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related models
IAmPara0x/Yuno
Yuno is context based search engine for anime.
canjiali/PARADE
code and data to faciliate BERT/ELECTRA for document ranking. Details refer to the paper -...
gentaiscool/miners
MINERS ⛏️: The semantic retrieval benchmark for evaluating multilingual language models. (EMNLP...
D-Roberts/transformers-retrieval-ranking-nli-ECIR2021
Multilingual retrieval, ranking and natural language inference with transformers (mBERT);...
atapour/rank-over-class
Source code for the training pipeline of the text ranking model used in the paper entitled "Rank...