nutella-scrape

:chocolate_bar: learn to scrape the web with Node.js -- it tastes like chocolate

  • Owner: okdistribute/nutella-scrape

  1. Run sudo npm install nutella-scrape -g
  2. Run nutella-scrape
  3. ???
  4. LEARN!!

In this tutorial, we will work through how to scrape websites using Node.js, primarily so that the scraped data can be used in other programs: in servers, in frontends (yes, Node modules can run in the browser!), or simply written to disk as a table for analysis elsewhere.
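
As a rough illustration of the "table on disk" use case, here is a minimal sketch that turns a few records into CSV and writes them to a file. The `rows` data, the `csvField` and `toCsv` helpers, and the `spreads.csv` file name are all made up for this example; they are not part of nutella-scrape, and the rows stand in for whatever a scraper would collect.

```javascript
const fs = require('fs');
const os = require('os');
const path = require('path');

// Hypothetical rows, standing in for whatever a scraper collected.
const rows = [
  { name: 'Nutella', origin: 'Italy' },
  { name: 'Speculoos, crunchy', origin: 'Belgium' }
];

// Quote a value for CSV: wrap it in double quotes and double any embedded quotes.
function csvField(value) {
  return '"' + String(value).replace(/"/g, '""') + '"';
}

// Turn a non-empty array of flat objects into CSV text.
// The keys of the first record become the header row.
function toCsv(records) {
  const columns = Object.keys(records[0]);
  const lines = [columns.map(csvField).join(',')];
  records.forEach(function (record) {
    lines.push(columns.map(function (c) { return csvField(record[c]); }).join(','));
  });
  return lines.join('\n') + '\n';
}

const csv = toCsv(rows);

// Write to the system temp directory so the sketch does not clutter the project.
const outFile = path.join(os.tmpdir(), 'spreads.csv');
fs.writeFileSync(outFile, csv);
```

Quoting every field keeps the sketch short and copes with commas inside values; a real project would more likely reach for a dedicated CSV module.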

The DOM (Document Object Model) is a tree-shaped model of an HTML document that describes how programs can read and change it. JavaScript is GREAT for traversing the DOM because it was made to work with HTML in the first place.
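
To make the tree idea concrete, here is a toy sketch that models a small piece of HTML as nested JavaScript objects and walks it. This is not a real DOM and not how nutella-scrape works internally; the `tree` shape and the `findByTag` and `textOf` helpers are invented for illustration. A real scraper would have an HTML parser build the tree.

```javascript
// A toy stand-in for the DOM of:
//   <ul><li>Nutella</li><li><a href="/jam">Jam</a></li></ul>
// Each element has a tag name, attributes, and children; text is a plain string.
const tree = {
  tag: 'ul', attrs: {}, children: [
    { tag: 'li', attrs: {}, children: ['Nutella'] },
    { tag: 'li', attrs: {}, children: [
      { tag: 'a', attrs: { href: '/jam' }, children: ['Jam'] }
    ] }
  ]
};

// Depth-first search for every element with the given tag name.
function findByTag(node, tag, found) {
  found = found || [];
  if (typeof node === 'string') return found; // text nodes have no tag
  if (node.tag === tag) found.push(node);
  node.children.forEach(function (child) { findByTag(child, tag, found); });
  return found;
}

// Concatenate all text beneath a node, in document order.
function textOf(node) {
  if (typeof node === 'string') return node;
  return node.children.map(textOf).join('');
}

const items = findByTag(tree, 'li').map(textOf);
const links = findByTag(tree, 'a').map(function (a) { return a.attrs.href; });
console.log(items, links);
```

Scraping mostly comes down to these two moves: find the nodes you care about, then pull out their text or attributes.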

TODO

  • parallel
  • spoofing
  • cookies/login walls
  • electron-microscope

Main metrics

Overview

  • Name with owner: okdistribute/nutella-scrape
  • Primary language: JavaScript
  • Programming languages: JavaScript (language count: 1)

Owner activity

  • Created at: 2015-08-14 07:03:54
  • Pushed at: 2016-05-19 11:45:03
  • Last commit at: 2015-09-11 11:10:37
  • Release count: 0

User engagement

  • Stargazers count: 205
  • Watchers count: 10
  • Fork count: 8
  • Commits count: 20
  • Issues count: 2
  • Open issues count: 1
  • Pull requests count: 1
  • Open pull requests count: 0
  • Closed pull requests count: 2