SignitDoc/semantic-file-retrieval

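A semantic file retrieval application based on LLM. (A lightweight, multimodal semantic file retrieval tool built on large-model parsing. Unlike traditional retrieval based on file names or metadata, it performs semantic retrieval over file content and supports mainstream document formats, images, audio, and video.)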

28 / 100 Experimental

This tool finds information in your files by analyzing their content rather than relying on file names. You add documents, images, audio, or video, then search their content with natural-language queries. It suits anyone who needs to locate specific information quickly across a large, varied collection of files.

No commits in the last 6 months.

Use this if you need to search the actual content of your documents, images, and other media using descriptive phrases, rather than just keywords in file names.

Not ideal if you only need to search files by their names or standard metadata, or if you require advanced features like batch uploads or support for scanned PDFs and Office documents (which are planned for future updates).

document-management information-retrieval knowledge-management multimedia-search content-discovery
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 5 / 25
Maturity 16 / 25
Community 7 / 25


Stars: 10
Forks: 1
Language: Python
License: Apache-2.0
Last pushed: Jan 15, 2025
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/SignitDoc/semantic-file-retrieval"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
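The same request can be made from Python. The following is a minimal sketch using only the standard library. It assumes the endpoint returns a JSON body, which this page does not state, and the function names `quality_url` and `fetch_quality` are hypothetical helpers introduced here for illustration. The URL pattern (owner and repository appended to the `embeddings` path) is inferred from the single curl example above.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example; treating the trailing
# "<owner>/<repo>" segments as variable is an assumption.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/embeddings"


def quality_url(owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository (hypothetical helper)."""
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality data for a repository.

    Assumes the response body is JSON; json.loads raises ValueError
    if the server returns something else.
    """
    request = urllib.request.Request(
        quality_url(owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Example call (performs a network request, counts against the daily limit):
# data = fetch_quality("SignitDoc", "semantic-file-retrieval")
```

Because unauthenticated access is limited to 100 requests per day, it is worth caching the result locally rather than calling `fetch_quality` in a loop.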