uber_data

Uber web interface crawler / scraper - Convert the trips table into a CSV file


DISCONTINUED -- PROJECT NOT MAINTAINED

Uber Crawler / Usage Analytics

@version alpha-0.2.2

| Branch | Build Status | Coverage |
|--------|--------------|----------|
| master | [![Build Status](https://travis-ci.org/mena-devs/slack_data_collector.svg?branch=master)](https://travis-ci.org/mena-devs/slack_data_collector) | [![Coverage Status](https://coveralls.io/repos/github/Link-/uber_data/badge.svg?branch=master)](https://coveralls.io/github/Link-/uber_data?branch=master) |
| alpha-0.2.3 | [![Build Status](https://travis-ci.org/Link-/uber_data.svg?branch=alpha-0.2.3)](https://travis-ci.org/Link-/uber_data) | [![Coverage Status](https://coveralls.io/repos/github/Link-/uber_data/badge.svg?branch=alpha-0.2.3)](https://coveralls.io/github/Link-/uber_data?branch=alpha-0.2.3) |

Synopsis

Uber web interface crawler - Convert the trips table into a CSV file

Installation & Configuration

Minimum Requirements

- PHP (5.6+)
- XDebug is a requirement for running the unit tests

Installation is straightforward:

  1. Clone this repository into any directory:

    git clone https://github.com/Link-/uber_data.git
    
  2. Install dependencies and build the autoload file:

    composer install
    
  3. Build your App.php configuration file:

Using CLI

This repository ships with a command-line companion named uberc, located at ./bin/uberc.

  1. Add ./bin to your PATH:

    export PATH="$PATH:<project path>/bin"
    
  2. Configure (this only has to be done once):

    uberc config
    
  3. Analyze: generates the analytics files in the directories specified during the config step:

    uberc analyze
    

Sample Output

2016-06-03,Logan,$7.73,uberX,Los Angeles,N.A
2016-06-03,John,$14.45,uberX,Los Angeles,N.A
2016-06-02,Mark,$4.70,uberX,Los Angeles,N.A
2016-06-02,Logan,Canceled,uberX,Los Angeles,N.A
2016-06-02,Morgan,$13.23,uberX,Los Angeles,N.A
2016-06-01,Sleimann,$4.79,uberX,Los Angeles,N.A
2016-06-01,George,$14.36,uberX,Los Angeles,N.A
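
Rows like the ones above can be read with Python's standard csv module. The following is a minimal sketch, not part of the project: the column names in COLUMNS are hypothetical labels chosen for illustration (the sample output has no header row), and the helper names parse_fare and read_trips are likewise assumptions. It shows one way to handle the fare column, which mixes dollar amounts with the text "Canceled".

```python
import csv
import io

# Hypothetical column labels: the sample output carries no header row,
# so these names are assumptions made for this example only.
COLUMNS = ["date", "driver", "fare", "car_type", "city", "extra"]

# Three rows copied from the sample output above.
SAMPLE = """\
2016-06-03,Logan,$7.73,uberX,Los Angeles,N.A
2016-06-02,Logan,Canceled,uberX,Los Angeles,N.A
2016-06-01,George,$14.36,uberX,Los Angeles,N.A
"""


def parse_fare(raw):
    """Return the fare as a float, or None for non-numeric values such as 'Canceled'."""
    text = raw.strip().lstrip("$").replace(",", "")
    try:
        return float(text)
    except ValueError:
        return None


def read_trips(handle):
    """Read CSV rows from an open text handle into a list of dicts."""
    trips = []
    for row in csv.reader(handle):
        if not row:  # skip blank lines
            continue
        trip = dict(zip(COLUMNS, row))
        trip["fare"] = parse_fare(trip["fare"])
        trips.append(trip)
    return trips


trips = read_trips(io.StringIO(SAMPLE))
# Canceled trips have no fare, so leave them out of the total.
total = sum(t["fare"] for t in trips if t["fare"] is not None)
print(len(trips), "trips, total fare:", round(total, 2))
```

To read a real file instead of the inline sample, pass an open file handle (opened with newline="") to read_trips.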

Jupyter Notebook

Installation & Configuration

Minimum Requirements

python3 (3.4.3)
pip3 (1.5.4)
jupyter (4.1.0)
pandas (0.18.1)
matplotlib (1.5.1)

Review the installation requirements and steps for each dependency by following the reference links provided below.

  1. Install python3, along with a C compiler, the Python headers, and pip3:

    sudo apt-get install python3 build-essential python3-dev python3-setuptools python3-pip
    
  2. Verify that python3 and pip3 have been downloaded / installed:

    pip3 -V
    pip 1.5.4 from /usr/lib/python3/dist-packages (python 3.4)
    
    python3 -V
    Python 3.4.3
    
  3. Install Jupyter

    sudo pip3 install jupyter
    
  4. Install pandas. numpy usually gets bundled with pandas, but just in case, install it separately (link to the installation guide below):

    sudo pip3 install pandas
    
  5. Install matplotlib

    sudo apt-get install python3-matplotlib
    # Upgrade to v.1.5.1
    
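
Once the steps above are done, a quick way to confirm that Python can locate the packages is to ask the import system for each one. This is a minimal sketch rather than an official check: the function name check_dependencies is hypothetical, and the module name notebook is assumed to be the package behind the `jupyter notebook` command. It only tests whether a module can be found, not which version is installed.

```python
import importlib.util


def check_dependencies(names):
    """Map each top-level module name to True if Python can locate it, else False."""
    return {name: importlib.util.find_spec(name) is not None for name in names}


# "notebook" is assumed here to be the module that provides `jupyter notebook`.
for name, found in check_dependencies(["notebook", "pandas", "matplotlib"]).items():
    print(name, "found" if found else "MISSING")
```

Any module reported as MISSING points to the corresponding install step above.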

Installation Guides

Execution

  1. Run jupyter notebook:

    jupyter notebook
    
  2. Open Uber-Data_Analysis-0.1.ipynb, found in uber_data/analysis/

  3. In the 3rd row, change the value of file_location as follows:

    # FROM
    file_location = r'<path to uber_data>/_sample_data/sample_data.csv'
    
    # TO
    file_location = r'<path to uber_data>/data/<the file created by the crawler>.csv'
    
  4. Select Cell, then Run All from the menu bar

  5. Voila, you should get the output shown in the Sample Analysis Output
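
Before pointing file_location at a crawler-generated file, it can help to confirm that every row has the expected shape. The sketch below is an illustration under stated assumptions, not part of the notebook: it assumes six comma-separated values per row, as in the Sample Output, and the name find_malformed_rows is hypothetical. If your file carries a header row with six fields, it passes the check unchanged.

```python
import csv
import os
import tempfile

# Assumption: six values per row, matching the rows shown in Sample Output.
EXPECTED_FIELDS = 6


def find_malformed_rows(path):
    """Return the 1-based row numbers whose field count differs from EXPECTED_FIELDS."""
    bad = []
    with open(path, newline="", encoding="utf-8") as handle:
        for number, row in enumerate(csv.reader(handle), start=1):
            if row and len(row) != EXPECTED_FIELDS:
                bad.append(number)
    return bad


# Demonstration on a throwaway file: the second row is deliberately truncated.
with tempfile.NamedTemporaryFile(
    "w", suffix=".csv", delete=False, encoding="utf-8"
) as tmp:
    tmp.write("2016-06-03,Logan,$7.73,uberX,Los Angeles,N.A\n")
    tmp.write("2016-06-02,Mark,$4.70,uberX\n")
    demo_path = tmp.name

bad_rows = find_malformed_rows(demo_path)
print("Malformed rows:", bad_rows)
os.remove(demo_path)
```

In practice, call find_malformed_rows with the same path you assign to file_location; an empty list means every row has six fields.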

Sample Analysis Output

Uber Data Analysis v0.1 Notebook: Uber-Data_Analysis-0.1.ipynb


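Key Metrics

Overview
Name and owner: Link-/uber_data
Main programming language: HTML
Programming languages: PHP (number of languages: 3)
License: GNU General Public License v3.0

Owner Activity
Created: 2016-06-02 18:49:42
Last push: 2018-10-28 19:43:35
Last commit: 2018-10-28 21:43:34
Releases: 3
Latest release: v0.2.2-alpha (released 2016-06-22 23:01:34)
First release: v0.2-alpha (released 2016-06-12 16:36:36)

User Engagement
Stars: 41
Watchers: 4
Forks: 11
Commits: 102
Issues: 9
Open issues: 6
Pull requests: 2
Open pull requests: 0
Closed pull requests: 2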