portia

Visual scraping for Scrapy

  • 所有者: scrapinghub/portia
  • 平台:
  • 許可證: BSD 3-Clause "New" or "Revised" License
  • 分類:
  • 主題:
  • 喜歡:
    0
      比較:

Github星跟蹤圖

Portia

Portia is a tool that allows you to visually scrape websites without any programming knowledge required. With Portia you can annotate a web page to identify the data you wish to extract, and Portia will understand based on these annotations how to scrape data from similar pages.

Running Portia

The easiest way to run Portia is using Docker:

You can run Portia using Docker & official Portia-image by running:

docker run -v ~/portia_projects:/app/data/projects:rw -p 9001:9001 scrapinghub/portia

You can also set up a local instance with Docker-compose by cloning this repo & running from the root of the folder:

docker-compose up

For more detailed instructions, and alternatives to using Docker, see the Installation docs.

Documentation

Documentation can be found from Read the docs. Source files can be found in the docs directory.

主要指標

概覽
名稱與所有者scrapinghub/portia
主編程語言Python
編程語言Shell (語言數: 8)
平台
許可證BSD 3-Clause "New" or "Revised" License
所有者活动
創建於2014-03-21 14:24:31
推送於2024-06-26 19:43:46
最后一次提交2019-07-10 13:43:34
發布數40
最新版本名稱slybot-0.13.3 (發布於 )
第一版名稱slybot (發布於 )
用户参与
星數9.4k
關注者數497
派生數1.4k
提交數2.7k
已啟用問題?
問題數451
打開的問題數111
拉請求數408
打開的拉請求數19
關閉的拉請求數55
项目设置
已啟用Wiki?
已存檔?
是復刻?
已鎖定?
是鏡像?
是私有?