heritrix3

Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.

Main metrics

Overview

Name With Owner: internetarchive/heritrix3
Primary Language: Java
Program language: Java (Language Count: 11)
Platform
License: Other
Release Count: 26
Last Release Name: 3.12.0 (Posted on 2025-10-30 09:25:13)
First Release Name: 3.0.0 (Posted on 2009-12-05 17:28:19)
Created At: 2011-10-22 06:00:17
Pushed At: 2025-11-02 07:26:31
Last Commit At: 2025-10-30 09:48:07
Stargazers Count: 3084
Watchers Count: 185
Fork Count: 774
Commits Count: 2853
Has Issues Enabled
Issues Count: 166
Issue Open Count: 31
Pull Requests Count: 410
Pull Requests Open Count: 5
Pull Requests Close Count: 53
Has Wiki Enabled
Is Archived
Is Fork
Is Locked
Is Mirror
Is Private