spline

Data Lineage Tracking and Visualization tool for Apache Spark ™

Github星跟蹤圖

Spline (from Spark lineage) project helps people get insight into data processing performed by Apache Spark ™

Maven Central
TeamCity build (develop)
Codacy Badge
Sonarcloud Status
SonarCloud Maintainability
SonarCloud Reliability
SonarCloud Security

The project consists of three main parts:

  • Spark Agent that sits on drivers, capturing the data lineage from Spark jobs being executed by analyzing the execution plans

  • Rest Gateway, that receive the lineage data from agent and stores it in the database

  • Web UI application that visualizes the stored data lineages

Spline diagram

Spline is aimed to be used with Spark 2.3+ but also provides limited support for Spark 2.2.

For documentation and examples please visit Spline GitHub Pages.


Copyright 2019 ABSA Group Limited

you may not use this file except in compliance with the License.
You may obtain a copy of the License at

    http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.

主要指標

概覽
名稱與所有者AbsaOSS/spline
主編程語言Scala
編程語言Scala (語言數: 6)
平台
許可證Apache License 2.0
所有者活动
創建於2017-05-30 08:38:00
推送於2025-08-05 10:37:09
最后一次提交
發布數43
最新版本名稱release/0.8.1 (發布於 2025-07-14 08:55:31)
第一版名稱release/0.2.0 (發布於 2017-08-09 11:53:06)
用户参与
星數638
關注者數38
派生數159
提交數1.6k
已啟用問題?
問題數611
打開的問題數45
拉請求數562
打開的拉請求數0
關閉的拉請求數98
项目设置
已啟用Wiki?
已存檔?
是復刻?
已鎖定?
是鏡像?
是私有?