arthur3486/born2crawl

A highly performant and versatile crawling engine, designed with scalability and extensibility in mind.

20 / 100 (Experimental)

This toolkit helps developers build custom data collection systems. It takes an initial data source, such as a website, file system, or database, processes it with custom logic, and stores the extracted information. Developers can use it to build applications that automatically gather and organize data from complex or dynamic sources.
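The source-process-store flow described above can be pictured with a small, generic sketch. It is written in Python for brevity and is not born2crawl's API (the project itself is Kotlin): the names `crawl`, `process`, and `store` are hypothetical, and the breadth-first traversal is an assumption chosen only for illustration.

```python
from collections import deque
from typing import Callable, Dict, Iterable, List, Tuple

# Hypothetical illustration only; none of these names come from born2crawl.
ProcessFn = Callable[[str], Tuple[Dict, Iterable[str]]]


def crawl(seed: str, process: ProcessFn, store: List[Dict], max_items: int = 100) -> List[Dict]:
    """Visit sources breadth-first, starting from `seed`.

    `process` is the custom logic: it returns the extracted record for one
    source plus any further sources discovered while processing it.
    """
    queue = deque([seed])
    seen = {seed}
    while queue and len(store) < max_items:
        source = queue.popleft()
        record, discovered = process(source)
        store.append(record)  # "store the extracted information"
        for nxt in discovered:
            if nxt not in seen:  # never enqueue the same source twice
                seen.add(nxt)
                queue.append(nxt)
    return store
```

Here `process` stands in for whatever reads a page, directory, or table and returns both the extracted record and the next sources to visit; a real system would replace the in-memory list with a database or file writer.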

No commits in the last 6 months.

Use this if you are a software developer needing to build a custom, scalable, and high-performance system for extracting information from diverse and potentially large data sources.

Not ideal if you're a non-developer looking for an out-of-the-box solution for simple data extraction without writing code.

data-acquisition web-scraping database-indexing information-extraction content-aggregation
Stale 6m, No Package, No Dependents
Maintenance 0 / 25
Adoption 4 / 25
Maturity 16 / 25
Community 0 / 25

Stars: 8

Forks

Language: Kotlin
License: Apache-2.0
Category: scraper
Last pushed: Feb 11, 2024
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/perception/arthur3486/born2crawl"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
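For use from a script rather than the shell, the same request might look like the sketch below. It uses only the Python standard library; the helper names `quality_url` and `fetch_quality` are hypothetical, and it assumes the endpoint returns a JSON body, which this page does not state.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/perception"


def quality_url(owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository (hypothetical helper)."""
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Request the data without an API key.

    Assumption: the response body is JSON. If it is not, json.loads raises
    a ValueError and the raw bytes should be inspected instead.
    """
    with urllib.request.urlopen(quality_url(owner, repo), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_quality("arthur3486", "born2crawl")` performs the same request as the curl command. Keyless access is limited to 100 requests per day, so caching the result locally is a sensible default.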