The Apache Software Foundation
The Apache Software Foundation Incubator
    

Apache StormCrawler (Incubating)

StormCrawler is a collection of resources for building low-latency, customisable and scalable web crawlers on Apache Storm.

  • Resources: stormcrawler

  • Started: 2024-03-19; Last Status Update: 2024-03-29

  • Reporting: January, April, July, October

  • Committers: 10

  • All Committers are PPMC members

  • Mentors: Dave Fisher (wave), Lewis John McGibbney (lewismc), Ayush Saxena (ayushsaxena), PJ Fanning (fanningpj)

News

  • 2024-03-19 Project enters incubation.

Resources

Repositories

1: incubator-stormcrawler

| Gitbox | Github | A scalable, mature and versatile web crawler based on Apache …​ — Updated: 02/21/2025

2: incubator-stormcrawler-site

| Gitbox | Github | Source for the Apache StormCrawler (Incubating) web site — Updated: 12/10/2024

Releases

Current

It is essential that you verify the integrity of release downloads. See instructions here

1: apache-stormcrawler-3.2.0-incubating-source-release.tar.gz

| Download | Signature | Hash | apache-stormcrawler-3.2.0-incubating-source-release.tar.gz — Filesize: 714.67 KB — Released: 12/03/2024 by tallison in r73465

2: apache-stormcrawler-3.2.0-incubating-source-release.zip

| Download | Signature | Hash | apache-stormcrawler-3.2.0-incubating-source-release.zip — Filesize: 1.04 MB — Released: 12/03/2024 by tallison in r73465

Total size of all downloads = 1.74 MB

Errata

Please investigate the following potential issues

The podling website scan does the best it can. Details are found here