Riddhish1/CogniScrape
Intelligent Web Scraping Library with LLMs
This is a TypeScript library for developers who need to extract specific, structured data from websites and search results using large language models. It takes URLs, search queries, or even multiple web pages as input, applies intelligent parsing and AI reasoning, and outputs clean, validated data in formats like JSON or CSV. Developers can use this to build robust data collection tools for various business needs.
Available on npm.
Use this if you are a developer building applications that require precise, automated data extraction from dynamic websites or need to combine web scraping with AI analysis for tasks like market research, content aggregation, or lead generation.
Not ideal if you need a no-code solution for simple data extraction or are looking for a standalone application rather than a library to integrate into your codebase.
Stars: 62
Forks: —
Language: TypeScript
License: MIT
Category:
Last pushed: Jan 05, 2026
Commits (30d): 0
Dependencies: 10
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/Riddhish1/CogniScrape"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
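The curl request above can also be issued from TypeScript. The following is a minimal sketch, not an official client: the helper names `buildQualityUrl` and `fetchQuality` are hypothetical, it assumes a runtime with a global `fetch` (Node 18+ or a browser), and it assumes the endpoint returns JSON. The response schema is not described on this page, so the result is typed as `unknown`. How an API key is supplied is also not described here, so the sketch makes keyless requests only.

```typescript
// Base URL copied from the curl command above.
const API_BASE = "https://pt-edge.onrender.com/api/v1/quality/llm-tools";

// Hypothetical helper: build the endpoint URL for an owner/repo pair.
// Each path segment is percent-encoded so unusual names stay valid.
function buildQualityUrl(owner: string, repo: string): string {
  return `${API_BASE}/${encodeURIComponent(owner)}/${encodeURIComponent(repo)}`;
}

// Hypothetical helper: request the record and parse the body as JSON.
// Assumption: the endpoint responds with JSON; the schema is unknown here.
async function fetchQuality(owner: string, repo: string): Promise<unknown> {
  const response = await fetch(buildQualityUrl(owner, repo));
  if (!response.ok) {
    // Surfaces non-2xx responses, which would include any rate-limit rejection.
    throw new Error(`Request failed with HTTP ${response.status}`);
  }
  return response.json();
}
```

Calling `fetchQuality("Riddhish1", "CogniScrape")` requests the same URL as the curl command. Because the return type is `unknown`, callers need to check the shape of the data before reading fields from it.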
Higher-rated alternatives
carlosplanchon/spidercreator
Automated web scraping spider generation using Browser Use and LLMs. Streamline the creation of...
raznem/parsera
Lightweight library for scraping websites with LLMs
rednafi/html-to-text
Extract pure text from any webpage
supadata-ai/js
Official TypeScript/JavaScript SDK for the Supadata API.
yeahhe365/JustSearch
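An autonomous AI search agent built on Playwright. Supports iterative task planning, deep web crawling, and multi-source knowledge synthesis with cited sources.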