heritrix3

Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.

Main metrics

Overview

Name With Owner: internetarchive/heritrix3
Primary Language: Java
Program language: Java (Language Count: 11)
Platform
License: Other
Release Count: 22
Last Release Name: 3.10.0 (Posted on 2025-06-12 22:01:13)
First Release Name: 3.0.0 (Posted on 2009-12-05 09:28:19)
Created At: 2011-10-21 22:00:17
Pushed At: 2025-06-12 15:44:46
Last Commit At: 2025-06-13 00:44:24
Stargazers Count: 2984
Watchers Count: 187
Fork Count: 760
Commits Count: 2763
Has Issues Enabled
Issues Count: 164
Issue Open Count: 32
Pull Requests Count: 390
Pull Requests Open Count: 4
Pull Requests Close Count: 50
Has Wiki Enabled
Is Archived
Is Fork
Is Locked
Is Mirror
Is Private