Riddhish1/CogniScrape

Intelligent Web Scraping Library with LLMs

/ 100

Emerging

This is a TypeScript library for developers who need to extract specific, structured data from websites and search results using large language models. It takes URLs, search queries, or even multiple web pages as input, applies intelligent parsing and AI reasoning, and outputs clean, validated data in formats like JSON or CSV. Developers can use this to build robust data collection tools for various business needs.

Available on npm.

Use this if you are a developer building applications that require precise, automated data extraction from dynamic websites or need to combine web scraping with AI analysis for tasks like market research, content aggregation, or lead generation.

Not ideal if you need a no-code solution for simple data extraction or are looking for a standalone application rather than a library to integrate into your codebase.

data-extraction web-automation content-aggregation market-intelligence lead-generation

Maintenance 6 / 25

Adoption 8 / 25

Maturity 22 / 25

Community 0 / 25

How are scores calculated?

Stars

Forks

—

Language

TypeScript

License

MIT

Higher-rated alternatives

carlosplanchon/spidercreator

Automated web scraping spider generation using Browser Use and LLMs. Streamline the creation of...

raznem/parsera

Lightweight library for scraping web-sites with LLMs

rednafi/html-to-text

Extract pure text from any webpage

supadata-ai/js

Official TypeScript/JavaScript SDK for the Supadata API.

yeahhe365/JustSearch

基于 Playwright 的自主 AI 搜索智能体。支持迭代式任务规划、深度网页爬取，以及带引用来源的多源知识整合。

Explore LLM Tools

All categories Trending LLM Tool directory Insights