Dryad
This is a research prototype of the Dryad and DryadLINQ data-parallel
processing frameworks running on Hadoop YARN. Dryad utilizes cluster
services provided as part of Hadoop YARN to reliably execute
distributed computations on a cluster of computers. DryadLINQ provides
the LINQ programming model for distributed data processing and leverages
Dryad for reliable execution.
If you just want to run a sample DryadLINQ program, the simplest way to get started
is to follow the instructions at
http://microsoftresearch.github.io/Dryad/?topic=html/e992fd94-c956-481d-82e6-dbdf45daa722.htm
Requirements
Visual Studio 2013
An Azure HDInsight 3.1 cluster or a Windows YARN cluster composed of x64 machines
Building Dryad
- Clone the Dryad git repository.
- Use Visual Studio to open The Dryad solution file (Dryad.sln) located
in the root of the repository and build the solution. The dependencies
will be fetched from NuGet.