orchest

Orchest is a tool for creating data science pipelines.

Website
Docs
Quickstart
Slack


Orchest is a web-based data science tool that works on top of your filesystem, allowing you to use
your editor of choice. With Orchest you get to focus on visually building and iterating on your
pipeline ideas. Under the hood, Orchest runs a collection of containers to provide a scalable
platform that can run on your laptop as well as on a large-scale cloud cluster.

Orchest lets you

  • Interactively build data science pipelines through its visual interface.
  • Automatically run your pipelines in parallel.
  • Develop your code in your favorite editor. Everything is filesystem based.
  • Tag the notebook cells you want to skip when running a pipeline. Perfect for prototyping, as you
    do not have to maintain a perfectly clean notebook.
  • Run experiments by parametrizing your pipeline. Easily try out all of your modeling ideas.

Installation

Requirements

  • Docker (tested on 19.03.9)

If you do not yet have Docker installed, please visit https://docs.docker.com/get-docker/.
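
Before installing, it can help to confirm that Docker is on the PATH and that its daemon is reachable. The following is a minimal sketch and not part of Orchest: the helper names `have_cmd` and `check_docker` are hypothetical, and the script relies only on the standard `docker --version` and `docker info` commands.

```shell
#!/bin/sh
# Hypothetical helper: succeeds if the given command is on the PATH.
have_cmd() {
    command -v "$1" >/dev/null 2>&1
}

# Hypothetical helper: reports whether Docker is installed and whether
# its daemon can be reached. Returns non-zero if either check fails.
check_docker() {
    if ! have_cmd docker; then
        echo "Docker not found; see https://docs.docker.com/get-docker/"
        return 1
    fi
    # Print the installed version so it can be compared with the tested one.
    docker --version
    if ! docker info >/dev/null 2>&1; then
        echo "Docker is installed, but the daemon does not seem to be running."
        return 1
    fi
    echo "Docker looks ready."
}
```

After sourcing these functions, running `check_docker` prints the installed version, which you can compare by eye against the tested version listed above.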

Linux, macOS and Windows

Simply follow the steps below to install Orchest. For Windows, please read the note at the bottom first.

git clone https://github.com/orchest/orchest.git
cd orchest

# The start command will automatically install Orchest if it is not
# yet installed. Once installation has finished, Orchest is started.
./orchest.sh start

Note! On Windows, Docker has to be configured to use WSL 2. Make sure to clone Orchest inside
the Linux environment. For more info about Docker with WSL 2, please visit
https://docs.docker.com/docker-for-windows/wsl/.
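
To check that a shell is actually running inside the Linux environment before cloning, one common heuristic is to look for "microsoft" in the kernel version string. This is a sketch under that assumption and not an Orchest feature. The function name `is_wsl` is hypothetical, and the check does not distinguish WSL 1 from WSL 2.

```shell
#!/bin/sh
# Hypothetical helper: succeeds if the kernel version string mentions
# Microsoft, a common (not guaranteed) sign of running under WSL.
# The optional argument exists only to make the check testable;
# it defaults to /proc/version.
is_wsl() {
    version_file="${1:-/proc/version}"
    [ -r "$version_file" ] && grep -qi microsoft "$version_file"
}
```

For example, `is_wsl && echo "Running inside WSL"` prints a message only when the heuristic matches.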

Preview

In our docs you can find a comprehensive
quickstart tutorial!

A preview of running pipelines in the pipeline editor of Orchest

License

The software in this repository is licensed as follows:

  • All content residing under the "orchest-sdk/" directory of this repository is licensed under the
    "Apache-2.0" license as defined in "orchest-sdk/LICENSE".
  • Content outside of the above-mentioned directory is available under the "AGPL-3.0" license.

Contributing

Contributions are more than welcome! Please see our
contributor guides
for more details.

We love your feedback

We would love to hear what you think and potentially add features based on your ideas. Come chat
with us on Slack.

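Key metrics

Overview

  • Name and owner: orchest/orchest
  • Primary programming language: TypeScript
  • Programming languages: Python (number of languages: 12)
  • License: Apache License 2.0

Owner activity

  • Created: 2020-05-21 17:33:05
  • Pushed: 2023-06-06 09:48:26
  • Last commit: 2023-06-06 11:48:26
  • Number of releases: 204
  • Latest release name: v2023.04.2
  • First release name: v0.2.1-alpha

User engagement

  • Stars: 4.1k
  • Watchers: 43
  • Forks: 262
  • Commits: 9.7k
  • Issues: 481
  • Open issues: 114
  • Pull requests: 977
  • Open pull requests: 16
  • Closed pull requests: 60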