apify/rag-web-browser

RAG Web Browser is an Apify Actor to feed your LLM applications and RAG pipelines with up-to-date text content scraped from the web.


Emerging

This tool helps AI application developers supply up-to-date web content to their Large Language Models (LLMs) and retrieval-augmented generation (RAG) pipelines. It takes a search query or a specific URL as input, either runs the query through Google Search or fetches the URL directly, and returns the relevant pages as cleaned text or Markdown. It is aimed at developers building AI assistants, chatbots, or other RAG-enabled applications that need current information.
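The input handling described above (a search query or a specific URL) can be sketched as follows. This is a minimal illustration, not the Actor's own code: the input field names `query` and `maxResults`, the helper names `is_url`, `build_input`, and `fetch_pages`, and the use of Apify's synchronous run endpoint are all assumptions that should be checked against the Actor's input schema and the Apify API reference before use.

```python
import json
import urllib.request
from urllib.parse import urlparse

# Assumption: Apify's synchronous "run Actor and return dataset items" endpoint.
API_URL = (
    "https://api.apify.com/v2/acts/apify~rag-web-browser/"
    "run-sync-get-dataset-items"
)


def is_url(text: str) -> bool:
    """Return True when the input looks like an http(s) URL, not a search phrase."""
    parsed = urlparse(text.strip())
    return parsed.scheme in ("http", "https") and bool(parsed.netloc)


def build_input(query_or_url: str, max_results: int = 3) -> dict:
    """Build the Actor input (hypothetical helper; field names are assumptions)."""
    payload = {"query": query_or_url.strip()}
    # A result limit only makes sense for a search phrase, not a single URL.
    if not is_url(query_or_url):
        payload["maxResults"] = max_results
    return payload


def fetch_pages(query_or_url: str, token: str, max_results: int = 3) -> list:
    """POST the input to the assumed endpoint and return the parsed JSON items."""
    body = json.dumps(build_input(query_or_url, max_results)).encode("utf-8")
    request = urllib.request.Request(
        API_URL,
        data=body,
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=120) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_pages("latest news on LLMs", token)` would send a search-style input, while passing a full `https://` address would request that single page. The returned items would then be passed to the LLM as context.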

Use this if your AI application or LLM needs current information from the live web to improve its responses or extend its knowledge base.

Not ideal if your application relies only on static, predefined datasets or does not need to interact with the live web.

AI development, LLM applications, web data integration, chatbot development, RAG pipelines

No Package, No Dependents

Maintenance 6 / 25

Adoption 9 / 25

Maturity 16 / 25

Community 18 / 25

Language: TypeScript

License: Apache-2.0

Higher-rated alternatives

any4ai/AnyCrawl

AnyCrawl 🚀: A Node.js/TypeScript crawler that turns websites into LLM-ready data and extracts...

kreuzberg-dev/html-to-markdown

High performance and CommonMark compliant HTML to Markdown converter. Maintained by the...

ScrapeGraphAI/Scrapegraph-ai

Python scraper based on AI

adbar/trafilatura

Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping,...

paulpierre/markdown-crawler

A multithreaded 🕸️ web crawler that recursively crawls a website and creates a 🔽 markdown file...
