orchest

Orchest is a tool for creating data science pipelines.

Website
Docs
Quickstart
Slack


Orchest is a web-based data science tool that works on top of your filesystem, so you can keep
using your editor of choice. With Orchest you can focus on visually building and iterating on your
pipeline ideas. Under the hood, Orchest runs a collection of containers to provide a scalable
platform that can run on your laptop as well as on a large-scale cloud cluster.

Orchest lets you

  • Interactively build data science pipelines through its visual interface.
  • Automatically run your pipelines in parallel.
  • Develop your code in your favorite editor. Everything is filesystem based.
  • Tag the notebook cells you want to skip when running a pipeline. Perfect for prototyping, as you
    do not have to maintain a perfectly clean notebook.
  • Run experiments by parametrizing your pipeline. Easily try out all of your modeling ideas.
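To make the idea of parallel pipeline runs concrete, here is a minimal sketch in plain shell. It is not how Orchest schedules work internally; the step names (`load`, `clean`, `features`, `train`) and the `run_step` helper are hypothetical and only illustrate that steps without a dependency on each other can run at the same time.

```shell
#!/bin/sh
# Hypothetical step: stands in for a notebook or script in a pipeline.
# It only writes a marker file so the run order can be inspected.
run_step() {
  echo "step $1 done" > "$2/$1.out"
}

workdir=$(mktemp -d)

# "load" has no dependencies, so it runs first.
run_step load "$workdir"

# "clean" and "features" both depend only on "load",
# so they can run at the same time as background jobs.
run_step clean "$workdir" &
run_step features "$workdir" &
wait  # block until both background steps have finished

# "train" depends on both previous steps, so it runs last.
run_step train "$workdir"

ls "$workdir"
```

In Orchest the dependency structure comes from the pipeline you draw in the visual editor, so you do not have to write this kind of orchestration by hand.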

Installation

Requirements

  • Docker (tested on 19.03.9)

If you do not yet have Docker installed, please visit https://docs.docker.com/get-docker/.
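If you are unsure whether your Docker installation meets the requirement, a small check like the one below may help. It is a sketch, not part of Orchest: it treats the "tested on" version as a minimum (an assumption), and the helper names `version_ge` and `check_docker` are made up for this example.

```shell
#!/bin/sh
# Assumption: the "tested on" version is treated as the minimum here.
TESTED_VERSION="19.03.9"

# Return 0 when version $1 is greater than or equal to version $2.
# Compares the first three dot-separated fields numerically,
# so a field such as "03" is read as 3.
version_ge() {
  awk -v have="$1" -v want="$2" 'BEGIN {
    split(have, h, "."); split(want, w, ".")
    for (i = 1; i <= 3; i++) {
      if (h[i] + 0 > w[i] + 0) exit 0
      if (h[i] + 0 < w[i] + 0) exit 1
    }
    exit 0
  }'
}

# Report whether Docker is installed and how its version compares.
check_docker() {
  if ! command -v docker >/dev/null 2>&1; then
    echo "Docker not found; see https://docs.docker.com/get-docker/"
    return 1
  fi
  # "docker --version" prints e.g. "Docker version 19.03.9, build ...";
  # keep only the first dotted number.
  docker_version=$(docker --version | grep -oE '[0-9]+(\.[0-9]+)+' | head -n 1)
  if version_ge "$docker_version" "$TESTED_VERSION"; then
    echo "Docker $docker_version found (Orchest was tested on $TESTED_VERSION)"
  else
    echo "Docker $docker_version is older than the tested version $TESTED_VERSION"
    return 1
  fi
}

check_docker || echo "Docker check did not pass"
```

Newer Docker releases are likely to work as well; the version above is simply the one the authors report testing against.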

Linux, macOS and Windows

Simply follow the steps below to install Orchest. For Windows, please read the note at the bottom first.

git clone https://github.com/orchest/orchest.git
cd orchest

# The start command will automatically install Orchest if it is not
# yet installed. Once installation has finished, Orchest is started.
./orchest.sh start

Note! On Windows, Docker has to be configured to use WSL 2. Make sure to clone Orchest inside
the Linux environment. For more info about Docker with WSL 2, please visit
https://docs.docker.com/docker-for-windows/wsl/.
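On Windows, a quick sanity check before cloning could look like the sketch below. It relies on two heuristics that are assumptions rather than guarantees: WSL 2 kernels usually report a release string containing `WSL2`, and Windows drives are mounted under `/mnt/<letter>` by default. The helper names are hypothetical.

```shell
#!/bin/sh
# Heuristic (assumption): WSL 2 kernels usually report a release string
# containing "WSL2", for example "5.10.16.3-microsoft-standard-WSL2".
is_wsl2_release() {
  case "$1" in
    *WSL2*) return 0 ;;
    *) return 1 ;;
  esac
}

# Return 0 when the path lies under a Windows drive mount such as /mnt/c.
# Assumes the default WSL automount location (/mnt).
is_windows_mount() {
  case "$1" in
    /mnt/[a-z]|/mnt/[a-z]/*) return 0 ;;
    *) return 1 ;;
  esac
}

if is_wsl2_release "$(uname -r)"; then
  echo "Kernel release looks like WSL 2"
else
  echo "Kernel release does not look like WSL 2 (expected on native Linux and macOS)"
fi

if is_windows_mount "$(pwd)"; then
  echo "Warning: $(pwd) is on a Windows drive; clone Orchest inside the Linux filesystem instead"
fi
```

Run it from the directory where you plan to clone Orchest. A custom WSL kernel or a non-default mount configuration can defeat both checks, so treat the output as a hint only.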

Preview

In our docs you can find a comprehensive
quickstart tutorial!

A preview of running pipelines in the pipeline editor of Orchest

License

The software in this repository is licensed as follows:

  • All content residing under the "orchest-sdk/" directory of this repository is licensed under the
    "Apache-2.0" license as defined in "orchest-sdk/LICENSE".
  • Content outside of the above-mentioned directory is available under the "AGPL-3.0" license.

Contributing

Contributions are more than welcome! Please see our contributor guides for more details.

We love your feedback

We would love to hear what you think and potentially add features based on your ideas. Come chat
with us on Slack.

Main metrics

Overview

  • Name With Owner: orchest/orchest
  • Primary Language: TypeScript
  • Program Language: Python (Language Count: 12)
  • License: Apache License 2.0

Owner activity

  • Created At: 2020-05-21 17:33:05
  • Pushed At: 2023-06-06 09:48:26
  • Last Commit At: 2023-06-06 11:48:26
  • Release Count: 204
  • Last Release Name: v2023.04.2
  • First Release Name: v0.2.1-alpha

User engagement

  • Stargazers Count: 4.1k
  • Watchers Count: 43
  • Fork Count: 262
  • Commits Count: 9.7k
  • Issues Count: 481
  • Issue Open Count: 114
  • Pull Requests Count: 977
  • Pull Requests Open Count: 16
  • Pull Requests Close Count: 60