spline

Data Lineage Tracking and Visualization tool for Apache Spark ™


The Spline project (the name comes from "Spark lineage") helps people gain insight into the data processing performed by Apache Spark™.

The project consists of three main parts:

  • Spark Agent, which sits on the drivers and captures data lineage from running Spark jobs by analyzing their execution plans

  • REST Gateway, which receives the lineage data from the agent and stores it in the database

  • Web UI application, which visualizes the stored data lineage
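
The flow between the three parts can be illustrated with a small, self-contained sketch. It is not Spline's actual API: `SplineFlowSketch`, `Lineage`, `Gateway`, `Agent`, and `Ui` are hypothetical names invented here, the "database" is an in-memory buffer, and the agent is simply handed the inputs and output that the real agent would derive from a Spark execution plan.

```scala
// Hypothetical, simplified model of the three parts described above.
// None of these names belong to Spline's real API.
object SplineFlowSketch {

  // One captured lineage record: which sources a job read and what it wrote.
  final case class Lineage(jobName: String, inputs: Seq[String], output: String)

  // Stand-in for the REST Gateway: receives lineage and stores it.
  // An in-memory buffer plays the role of the database (an assumption).
  final class Gateway {
    private val store = scala.collection.mutable.ArrayBuffer.empty[Lineage]
    def receive(lineage: Lineage): Unit = store += lineage
    def all: Seq[Lineage] = store.toList
  }

  // Stand-in for the Spark Agent: the real agent analyzes execution plans;
  // here the reads and writes are passed in directly and forwarded.
  final class Agent(gateway: Gateway) {
    def capture(jobName: String, reads: Seq[String], writes: String): Unit =
      gateway.receive(Lineage(jobName, reads, writes))
  }

  // Stand-in for the Web UI: renders stored lineage as plain text.
  object Ui {
    def render(lineages: Seq[Lineage]): String =
      lineages.map { l =>
        val ins = l.inputs.mkString(", ")
        s"$ins -> [${l.jobName}] -> ${l.output}"
      }.mkString("\n")
  }

  def main(args: Array[String]): Unit = {
    val gateway = new Gateway
    val agent = new Agent(gateway)
    agent.capture("daily-report", Seq("orders.csv", "customers.csv"), "report.parquet")
    // prints: orders.csv, customers.csv -> [daily-report] -> report.parquet
    println(Ui.render(gateway.all))
  }
}
```

The sketch only mirrors the direction of the data flow (agent to gateway to UI); for the real agent setup and configuration, refer to the Spline documentation.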

Spline diagram

Spline is intended for use with Spark 2.3+ but also provides limited support for Spark 2.2.

For documentation and examples please visit Spline GitHub Pages.


Copyright 2019 ABSA Group Limited

Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

    http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.

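Key metrics

Overview
Name and owner: AbsaOSS/spline
Primary language: Scala
Languages: Scala (number of languages: 6)
Platform:
License: Apache License 2.0
Owner activity
Created: 2017-05-30 08:38:00
Last pushed: 2025-06-09 21:39:50
Last commit:
Releases: 41
Latest release: release/0.7.10 (published 2025-06-08 23:21:36)
First release: release/0.2.0 (published 2017-08-09 11:53:06)
User engagement
Stars: 628
Watchers: 38
Forks: 159
Commits: 1.6k
Issues enabled?
Issues: 606
Open issues: 44
Pull requests: 551
Open pull requests: 0
Closed pull requests: 98
Project settings
Wiki enabled?
Archived?
Is fork?
Locked?
Is mirror?
Is private?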