Apache SystemDS

用于端到端数据科学生命周期的开源 ML 系统。「An open source ML system for the end-to-end data science lifecycle」

Github星跟蹤圖

Apache SystemDS

Overview: SystemDS is an open source ML system for the end-to-end data science lifecycle from data integration, cleaning,
and feature engineering, over efficient, local and distributed ML model training, to deployment and serving. To this
end, we aim to provide a stack of declarative languages with R-like syntax for (1) the different tasks of the data-science
lifecycle, and (2) users with different expertise. These high-level scripts are compiled into hybrid execution plans of
local, in-memory CPU and GPU operations, as well as distributed operations on Apache Spark. In contrast to existing
systems - that either provide homogeneous tensors or 2D Datasets - and in order to serve the entire data science lifecycle,
the underlying data model are DataTensors, i.e., tensors (multi-dimensional arrays) whose first dimension may have a
heterogeneous and nested schema.

Quick Start Install, Quick Start and Hello World

Documentation: SystemDS Documentation

Python Documentation Python SystemDS Documentation

Issue Tracker Jira Dashboard

Status and Build: SystemDS is renamed from SystemML which is an Apache Top Level Project.
To build from source visit SystemDS Install from source

Build
Documentation
LicenseCheck
Java Tests
Python Test

主要指標

概覽
名稱與所有者apache/systemds
主編程語言Java
編程語言Shell (語言數: 17)
平台
許可證Apache License 2.0
所有者活动
創建於2015-11-10 08:00:06
推送於2025-06-13 14:52:11
最后一次提交
發布數43
最新版本名稱3.3.0-rc1 (發布於 2025-04-09 15:48:43)
第一版名稱v0.9.0-rc1 (發布於 2016-01-19 21:19:39)
用户参与
星數1k
關注者數85
派生數496
提交數9k
已啟用問題?
問題數0
打開的問題數0
拉請求數234
打開的拉請求數47
關閉的拉請求數1990
项目设置
已啟用Wiki?
已存檔?
是復刻?
已鎖定?
是鏡像?
是私有?