crawler
Date | Project Name | 🎉 | Language, Stars, Age | Tags |
---|---|---|---|---|
09/20 | [Browsertrix Crawler v1.8.0-beta.1](https://github.com/webrecorder/browsertrix-crawler) – Standalone browser-based high-fidelity crawling system running customizable crawls in a single Docker container. | 4 | TypeScript, 876 ⭐, 1784 days old | typescript crawler crawling wacz warc web-archiving |
09/19 | [Firecrawl v2.3.0](https://github.com/firecrawl/firecrawl) – API service that crawls URLs and converts web content into clean markdown or structured data without requiring sitemaps. | 9 | TypeScript, 58620 ⭐, 523 days old | typescript data ai markdown scraper crawler |
09/12 | [Browsertrix Crawler v1.7.1](https://github.com/webrecorder/browsertrix-crawler) – Standalone browser-based high-fidelity crawling system running customizable crawls in a single Docker container. | 6 | TypeScript, 876 ⭐, 1784 days old | typescript crawler crawling wacz warc web-archiving |
09/12 | [Firecrawl v2.2.0](https://github.com/firecrawl/firecrawl) – API service that crawls URLs and converts web content into clean markdown or structured data without requiring sitemaps. | 9 | TypeScript, 58620 ⭐, 523 days old | typescript data ai markdown scraper crawler |