uber_data

Uber web interface crawler / scraper - Convert the trips table into a CSV file

Github stars Tracking Chart

DISCONTINUED -- PROJECT NOT MAINTAINED

Uber Crawler / Usage Analytics

@version alpha-0.2.2, Branch, Build Status, Coverage, ------, ------------, --------, master, [![Build Status](https://travis-ci.org/mena-devs/slack_data_collector.svg?branch=master)](https://travis-ci.org/mena-devs/slack_data_collector), [![Coverage Status](https://coveralls.io/repos/github/Link-/uber_data/badge.svg?branch=master)](https://coveralls.io/github/Link-/uber_data?branch=master), alpha-0.2.3, [![Build Status](https://travis-ci.org/Link-/uber_data.svg?branch=alpha-0.2.3)](https://travis-ci.org/Link-/uber_data), [![Coverage Status](https://coveralls.io/repos/github/Link-/uber_data/badge.svg?branch=alpha-0.2.3)](https://coveralls.io/github/Link-/uber_data?branch=alpha-0.2.3), ### Synopsis

Uber web interface crawler - Convert the trips table into a CSV file

Installation & Configuration

Minimum Requirements

- PHP (5.6+)
- XDebug is a requirement for running the unit tests

Installation is very basic, just:

  1. Clone this repository into any directory:

    git clone https://github.com/Link-/uber_data.git
    
  2. Install dependencies and build the autoload file:

    composer install
    
  3. Build your App.php configuration file:

Using CLI

This repository ships with a handy command-line interface companion named uberc - located at ./bin/uberc

  1. Add ./bin to your path with

    export PATH="$PATH:<project path>/bin"
    
  2. Configure (this has to be done only once)

    uberc config
    
  3. Analyze: Will generate the analytics files in the desired directories specified at the config step

    uberc analyze
    

Sample Output

2016-06-03,Logan,$7.73,uberX,Los Angeles,N.A
2016-06-03,John,$14.45,uberX,Los Angeles,N.A
2016-06-02,Mark,$4.70,uberX,Los Angeles,N.A
2016-06-02,Logan,Canceled,uberX,Los Angeles,N.A
2016-06-02,Morgan,$13.23,uberX,Los Angeles,N.A
2016-06-01,Sleimann,$4.79,uberX,Los Angeles,N.A
2016-06-01,George,$14.36,uberX,Los Angeles,N.A

Jupyter Notebook

Installation & Configuration

Minimum Requirements

python3 (3.4.3)
pip3 (1.5.4)
jupyter (4.1.0)
pandas (0.18.1)
matplotlib (1.5.1)

Review the installation requirements / steps per depedency by following the reference links provided below.

  1. Install python3, you will need a C compiler and the Python headers and finally pip3:

    sudo apt-get install python3 build-essential python3-dev python3-setuptools python3-pip
    
  2. Verify that python3 and pip3 have been downloaded / installed:

    pip3 -V
    pip 1.5.4 from /usr/lib/python3/dist-packages (python 3.4)
    
    python3 -V
    Python 3.4.3
    
  3. Install Jupyter

    sudo pip3 install jupyter
    
  4. Install pandas -- usually numpy gets bundled with pandas but just in case, install it separately (link to the installation guide below)

    sudo pip3 install pandas
    
  5. Install matplotlib

    sudo apt-get install python3-matplotlib
    # Upgrade to v.1.5.1
    

Installation Guides

Execution

  1. Run jupyter notebook:

    jupyter notebook
    
  2. Open the Uber-Data_Analysis-0.1.ipynb found in uber_data/analysis/

  3. In the 3rd row, change the value of file_location as per the below:

    # FROM
    file_location = r'<path to uber_data>/_sample_data/sample_data.csv'
    
    # TO
    file_location = r'<path to uber data>/data/<the file created by the crawler>.csv'
    
  4. Press Cell then Run All from the menubar

  5. Voila, you should game the output as shown in the Sample Analysis Output

Sample Analysis Output

Uber Data Anlysis v0.1 Notebook: Uber-Data_Analysis-0.1.ipynb

image

image

image

image

Main metrics

Overview
Name With OwnerLink-/uber_data
Primary LanguageHTML
Program languagePHP (Language Count: 3)
Platform
License:GNU General Public License v3.0
所有者活动
Created At2016-06-02 18:49:42
Pushed At2018-10-28 19:43:35
Last Commit At2018-10-28 21:43:34
Release Count3
Last Release Namev0.2.2-alpha (Posted on 2016-06-22 23:01:34)
First Release Namev0.2-alpha (Posted on 2016-06-12 16:36:36)
用户参与
Stargazers Count41
Watchers Count4
Fork Count11
Commits Count102
Has Issues Enabled
Issues Count9
Issue Open Count6
Pull Requests Count2
Pull Requests Open Count0
Pull Requests Close Count2
项目设置
Has Wiki Enabled
Is Archived
Is Fork
Is Locked
Is Mirror
Is Private