Apache SystemDS

用于端到端数据科学生命周期的开源 ML 系统。「An open source ML system for the end-to-end data science lifecycle」

Github星跟踪图

Apache SystemDS

Overview: SystemDS is an open source ML system for the end-to-end data science lifecycle from data integration, cleaning,
and feature engineering, over efficient, local and distributed ML model training, to deployment and serving. To this
end, we aim to provide a stack of declarative languages with R-like syntax for (1) the different tasks of the data-science
lifecycle, and (2) users with different expertise. These high-level scripts are compiled into hybrid execution plans of
local, in-memory CPU and GPU operations, as well as distributed operations on Apache Spark. In contrast to existing
systems - that either provide homogeneous tensors or 2D Datasets - and in order to serve the entire data science lifecycle,
the underlying data model are DataTensors, i.e., tensors (multi-dimensional arrays) whose first dimension may have a
heterogeneous and nested schema.

Quick Start Install, Quick Start and Hello World

Documentation: SystemDS Documentation

Python Documentation Python SystemDS Documentation

Issue Tracker Jira Dashboard

Status and Build: SystemDS is renamed from SystemML which is an Apache Top Level Project.
To build from source visit SystemDS Install from source

Build
Documentation
LicenseCheck
Java Tests
Python Test

主要指标

概览
名称与所有者apache/systemds
主编程语言Java
编程语言Shell (语言数: 17)
平台
许可证Apache License 2.0
所有者活动
创建于2015-11-10 08:00:06
推送于2025-06-13 14:52:11
最后一次提交
发布数43
最新版本名称3.3.0-rc1 (发布于 2025-04-09 15:48:43)
第一版名称v0.9.0-rc1 (发布于 2016-01-19 21:19:39)
用户参与
星数1k
关注者数85
派生数496
提交数9k
已启用问题?
问题数0
打开的问题数0
拉请求数234
打开的拉请求数47
关闭的拉请求数1990
项目设置
已启用Wiki?
已存档?
是复刻?
已锁定?
是镜像?
是私有?