portia

Visual scraping for Scrapy

  • 所有者: scrapinghub/portia
  • 平台:
  • 许可证: BSD 3-Clause "New" or "Revised" License
  • 分类:
  • 主题:
  • 喜欢:
    0
      比较:

Github星跟踪图

Portia

Portia is a tool that allows you to visually scrape websites without any programming knowledge required. With Portia you can annotate a web page to identify the data you wish to extract, and Portia will understand based on these annotations how to scrape data from similar pages.

Running Portia

The easiest way to run Portia is using Docker:

You can run Portia using Docker & official Portia-image by running:

docker run -v ~/portia_projects:/app/data/projects:rw -p 9001:9001 scrapinghub/portia

You can also set up a local instance with Docker-compose by cloning this repo & running from the root of the folder:

docker-compose up

For more detailed instructions, and alternatives to using Docker, see the Installation docs.

Documentation

Documentation can be found from Read the docs. Source files can be found in the docs directory.

主要指标

概览
名称与所有者scrapinghub/portia
主编程语言Python
编程语言Shell (语言数: 8)
平台
许可证BSD 3-Clause "New" or "Revised" License
所有者活动
创建于2014-03-21 14:24:31
推送于2024-06-26 19:43:46
最后一次提交2019-07-10 13:43:34
发布数40
最新版本名称slybot-0.13.3 (发布于 )
第一版名称slybot (发布于 )
用户参与
星数9.4k
关注者数497
派生数1.4k
提交数2.7k
已启用问题?
问题数451
打开的问题数111
拉请求数408
打开的拉请求数19
关闭的拉请求数55
项目设置
已启用Wiki?
已存档?
是复刻?
已锁定?
是镜像?
是私有?