antch

Antch, a fast, powerful and extensible web crawling & scraping framework for Go

Github stars Tracking Chart

Antch

Build Status
Coverage Status
Go Report Card
GoDoc

Antch, inspired by Scrapy. If you're familiar with scrapy,
you can quickly get started.

Antch is a fast, powerful and extensible web crawling & scraping framework for Go, used
to crawl websites and extract structured data from their pages.

Get Started

Getting Started

Follow the Getting Started instructions to start your first spider.

Features

  • Polite, highly concurrent web crawler.
  • Powerful and customizable HTTP middleware.
  • Item data pipeline for the web spider.
  • Built-in proxy support (HTTP, HTTPS, SOCKS5).
  • Built-in XPath query support for HTML/XML documents.
  • Easy to use and integrate with your project.

Examples

BingWallpaper - Bing daily wallpaper.

Documentation

See https://github.com/antchfx/antch/wiki

Main metrics

Overview
Name With Ownerantchfx/antch
Primary LanguageGo
Program languageGo (Language Count: 1)
Platform
License:MIT License
所有者活动
Created At2017-09-28 05:44:17
Pushed At2020-05-31 15:12:21
Last Commit At2020-05-31 23:12:08
Release Count0
用户参与
Stargazers Count262
Watchers Count15
Fork Count41
Commits Count40
Has Issues Enabled
Issues Count5
Issue Open Count4
Pull Requests Count6
Pull Requests Open Count0
Pull Requests Close Count1
项目设置
Has Wiki Enabled
Is Archived
Is Fork
Is Locked
Is Mirror
Is Private