The Apache Software Foundation
The Apache Software Foundation Incubator
    

Apache StormCrawler (Incubating)

StormCrawler is a collection of resources for building low-latency, customisable and scalable web crawlers on Apache Storm.

  • Resources: stormcrawler

  • Started: 2024-03-19; Last Status Update: 2024-03-29

  • Reporting: Monthly

  • Committers: 9

  • All Committers are PPMC members

  • Mentors: Dave Fisher (wave), Lewis John McGibbney (lewismc), Ayush Saxena (ayushsaxena), PJ Fanning (fanningpj)

News

  • 2024-03-19 Project enters incubation.

Resources

Repositories

1: incubator-stormcrawler

| Gitbox | Github | A scalable, mature and versatile web crawler based on Apache …​ — Updated: 05/21/2024

2: incubator-stormcrawler-site

| Gitbox | Github | Source for the Apache StormCrawler (Incubating) web site — Updated: 05/21/2024

Releases

Current

It is essential that you verify the integrity of release downloads. See instructions here

1: apache-stormcrawler-incubating-3.0-source-release.zip

| Download | Signature | Hash | apache-stormcrawler-incubating-3.0-source-release.zip — Filesize: 2.75 MB — Released: 05/07/2024 by rzo1 in r68995

Total size of all downloads = 2.75 MB

Errata

Please investigate the following potential issues

The podling website scan does the best it can. Details are found here