The Apache Software Foundation
The Apache Software Foundation Incubator
    

Apache StormCrawler (Incubating)

StormCrawler is a collection of resources for building low-latency, customisable and scalable web crawlers on Apache Storm.

  • Resources: stormcrawler

  • Started: 2024-03-19; Last Status Update: 2024-03-29

  • Reporting: Monthly

  • Committers: 10

  • All Committers are PPMC members

  • Mentors: Dave Fisher (wave), Lewis John McGibbney (lewismc), Ayush Saxena (ayushsaxena), PJ Fanning (fanningpj)

News

  • 2024-03-19 Project enters incubation.

Resources

Repositories

1: incubator-stormcrawler

| Gitbox | Github | A scalable, mature and versatile web crawler based on Apache …​ — Updated: 11/29/2024

2: incubator-stormcrawler-site

| Gitbox | Github | Source for the Apache StormCrawler (Incubating) web site — Updated: 11/15/2024

Releases

Current

It is essential that you verify the integrity of release downloads. See instructions here

1: apache-stormcrawler-3.1.0-incubating-source-release.tar.gz

| Download | Signature | Hash | apache-stormcrawler-3.1.0-incubating-source-release.tar.gz — Filesize: 2.47 MB — Released: 09/13/2024 by rzo1 in r71543

Total size of all downloads = 2.47 MB

Errata

Please investigate the following potential issues

The podling website scan does the best it can. Details are found here