The Apache Software Foundation
The Apache Software Foundation Incubator
    

Apache StormCrawler (Incubating)

StormCrawler is a collection of resources for building low-latency, customisable and scalable web crawlers on Apache Storm.

  • Resources: stormcrawler

  • Started: 2024-03-19; Last Status Update: 2024-03-29

  • Reporting: Monthly

  • Committers: 9

  • All Committers are PPMC members

  • Mentors: Dave Fisher (wave), Lewis John McGibbney (lewismc), Ayush Saxena (ayushsaxena), PJ Fanning (fanningpj)

News

  • 2024-03-19 Project enters incubation.

Resources

Repositories

1: incubator-stormcrawler

Gitbox | GitHub | A scalable, mature and versatile web crawler based on Apache … (Updated: 05/01/2024)

2: incubator-stormcrawler-site

Gitbox | GitHub | Source for the Apache StormCrawler (Incubating) web site (Updated: 04/30/2024)

Releases

Current

It is essential that you verify the integrity of release downloads. See the linked verification instructions.

StormCrawler has published signing keys, but it has either no releases or no valid releases.
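Once a release is available, the integrity check could be scripted along the following lines. This is a minimal sketch, not the project's official procedure. It assumes the release artifact ships with a companion `.sha512` file that holds the hex digest, optionally followed by the file name. The function names and the file names in the usage note are hypothetical. A checksum only detects corruption; confirming authenticity also requires checking the detached `.asc` signature against the project's signing keys with a tool such as GnuPG, which this sketch does not cover.

```python
import hashlib
import hmac
from pathlib import Path


def sha512_of(path, chunk_size=1 << 20):
    """Return the hex SHA-512 digest of a file, reading it in chunks."""
    digest = hashlib.sha512()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def parse_checksum(text):
    """Extract a 128-character hex digest from checksum file contents.

    Assumption: the digest appears as one unbroken token, either alone
    or in the 'digest  filename' layout. Digests split into spaced
    groups are not handled by this sketch.
    """
    hex_chars = set("0123456789abcdefABCDEF")
    for token in text.split():
        if len(token) == 128 and all(c in hex_chars for c in token):
            return token.lower()
    raise ValueError("no SHA-512 digest found in checksum file")


def verify_sha512(artifact, checksum_file):
    """Return True if the artifact's digest matches the published one."""
    expected = parse_checksum(Path(checksum_file).read_text())
    return hmac.compare_digest(sha512_of(artifact), expected)
```

With hypothetical file names, a call such as `verify_sha512("stormcrawler-src.tar.gz", "stormcrawler-src.tar.gz.sha512")` returns `True` only when the downloaded file matches the published digest.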

Errata

Please investigate the following potential issues

The podling website scan is a best-effort check. Details are available on the linked scan results page.