hustle

A column oriented, embarrassingly distributed relational event database.

  • Owner: tspurway/hustle
  • Platform:
  • License:: Other
  • Category::
  • Topic:
  • Like:
    0
      Compare:

Github stars Tracking Chart

Hustle

A column oriented, embarrassingly distributed, relational event database.

Features

  • column oriented - super fast queries
  • events - write only semantics
  • distributed insert - designed for petabyte scale distributed datasets with massive write loads
  • compressed - bitmap indexes, lz4, and prefix trie compression
  • relational - join gigantic data sets
  • partitioned - smart shards
  • embarrassingly distributed (based on Disco)
  • embarrassingly fast (uses LMDB)
  • NoSQL - Python DSL
  • bulk append only semantics
  • highly available, horizontally scalable
  • REPL/CLI query interface

Example Query

select(impressions.ad_id, impressions.date, h_sum(pix.amount), h_count(),
       where=((impressions.date < '2014-01-13') & (impressions.ad_id == 30010),
               pix.date < '2014-01-13'),
       join=(impressions.site_id, pix.site_id),
       order_by=impressions.date)

Installation

After cloning this repo, here are some considerations:

  • you will need Python 2.7 or higher - note that it probably won't work on 2.6 (has to do with pickling lambdas...)
  • you need to install Disco 0.5 and its dependencies - get that working first
  • you need to install Hustle and its 'deps' thusly:
cd hustle
sudo ./bootstrap.sh

Please refer to the Installation Guide for more details

Documentation

Hustle User Guide

Hustle Mailing List

Credits

Special thanks to following open-source projects:

Build Status

Main metrics

Overview
Name With Ownertspurway/hustle
Primary LanguagePython
Program languagePython (Language Count: 2)
Platform
License:Other
所有者活动
Created At2014-02-19 02:13:45
Pushed At2018-04-14 02:03:05
Last Commit At2018-04-13 22:03:04
Release Count12
Last Release Name0.2.7 (Posted on )
First Release Name0.1 (Posted on )
用户参与
Stargazers Count239
Watchers Count29
Fork Count32
Commits Count324
Has Issues Enabled
Issues Count44
Issue Open Count12
Pull Requests Count18
Pull Requests Open Count0
Pull Requests Close Count2
项目设置
Has Wiki Enabled
Is Archived
Is Fork
Is Locked
Is Mirror
Is Private