Riddhish1/CogniScrape

Intelligent Web Scraping Library with LLMs

36
/ 100
Emerging

This is a TypeScript library for developers who need to extract specific, structured data from websites and search results using large language models. It takes URLs, search queries, or even multiple web pages as input, applies intelligent parsing and AI reasoning, and outputs clean, validated data in formats like JSON or CSV. Developers can use this to build robust data collection tools for various business needs.

Available on npm.

Use this if you are a developer building applications that require precise, automated data extraction from dynamic websites or need to combine web scraping with AI analysis for tasks like market research, content aggregation, or lead generation.

Not ideal if you need a no-code solution for simple data extraction or are looking for a standalone application rather than a library to integrate into your codebase.

data-extraction web-automation content-aggregation market-intelligence lead-generation
Maintenance 6 / 25
Adoption 8 / 25
Maturity 22 / 25
Community 0 / 25

How are scores calculated?

Stars

62

Forks

Language

TypeScript

License

MIT

Category

llm-web-scraping

Last pushed

Jan 05, 2026

Commits (30d)

0

Dependencies

10

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/Riddhish1/CogniScrape"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.