Apache SystemDS

用于端到端数据科学生命周期的开源 ML 系统。「An open source ML system for the end-to-end data science lifecycle」

Github stars Tracking Chart

Apache SystemDS

Overview: SystemDS is an open source ML system for the end-to-end data science lifecycle from data integration, cleaning,
and feature engineering, over efficient, local and distributed ML model training, to deployment and serving. To this
end, we aim to provide a stack of declarative languages with R-like syntax for (1) the different tasks of the data-science
lifecycle, and (2) users with different expertise. These high-level scripts are compiled into hybrid execution plans of
local, in-memory CPU and GPU operations, as well as distributed operations on Apache Spark. In contrast to existing
systems - that either provide homogeneous tensors or 2D Datasets - and in order to serve the entire data science lifecycle,
the underlying data model are DataTensors, i.e., tensors (multi-dimensional arrays) whose first dimension may have a
heterogeneous and nested schema.

Quick Start Install, Quick Start and Hello World

Documentation: SystemDS Documentation

Python Documentation Python SystemDS Documentation

Issue Tracker Jira Dashboard

Status and Build: SystemDS is renamed from SystemML which is an Apache Top Level Project.
To build from source visit SystemDS Install from source

Build
Documentation
LicenseCheck
Java Tests
Python Test

Main metrics

Overview
Name With Ownerapache/systemds
Primary LanguageJava
Program languageShell (Language Count: 17)
Platform
License:Apache License 2.0
所有者活动
Created At2015-11-10 08:00:06
Pushed At2025-06-13 14:52:11
Last Commit At
Release Count43
Last Release Name3.3.0-rc1 (Posted on 2025-04-09 15:48:43)
First Release Namev0.9.0-rc1 (Posted on 2016-01-19 21:19:39)
用户参与
Stargazers Count1k
Watchers Count85
Fork Count496
Commits Count9k
Has Issues Enabled
Issues Count0
Issue Open Count0
Pull Requests Count234
Pull Requests Open Count47
Pull Requests Close Count1990
项目设置
Has Wiki Enabled
Is Archived
Is Fork
Is Locked
Is Mirror
Is Private