spline

Data Lineage Tracking and Visualization tool for Apache Spark ™

The Spline project (the name comes from "Spark lineage") helps people gain insight into the data processing performed by Apache Spark ™

The project consists of three main parts:

  • Spark Agent that sits on drivers, capturing the data lineage from Spark jobs being executed by analyzing the execution plans

  • REST Gateway that receives the lineage data from the agent and stores it in the database

  • Web UI application that visualizes the stored data lineages
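
The flow between the three parts can be pictured with a small toy model. This is only a conceptual sketch in Python, not Spline's actual API: the names `Lineage`, `capture_lineage`, `Gateway`, and `upstream_of`, the nested-dict "plan", and the dataset paths are all hypothetical, chosen purely to illustrate capture, storage, and querying.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Lineage:
    """Hypothetical lineage record: which inputs produced which output."""
    job: str
    inputs: tuple
    output: str


def capture_lineage(job, plan):
    """Toy 'agent': walk a nested plan dict and collect the sources it reads."""
    inputs = []

    def walk(node):
        if node["op"] == "read":
            inputs.append(node["path"])
        for child in node.get("children", []):
            walk(child)

    walk(plan)
    return Lineage(job=job, inputs=tuple(sorted(inputs)), output=plan["path"])


@dataclass
class Gateway:
    """Toy 'gateway': receives lineage records and keeps them in a store."""
    store: list = field(default_factory=list)

    def submit(self, lineage):
        self.store.append(lineage)


def upstream_of(gateway, dataset):
    """Toy 'UI' query: which datasets feed directly into the given one?"""
    return sorted(
        {src for rec in gateway.store if rec.output == dataset for src in rec.inputs}
    )


# A made-up execution plan: write a report built by joining two inputs.
plan = {
    "op": "write", "path": "/out/report",
    "children": [{
        "op": "join",
        "children": [
            {"op": "read", "path": "/in/orders"},
            {"op": "read", "path": "/in/customers"},
        ],
    }],
}

gateway = Gateway()
gateway.submit(capture_lineage("daily-report", plan))
print(upstream_of(gateway, "/out/report"))
```

In the real project the agent analyzes Spark's execution plans and the gateway persists lineage in a database; the in-memory list above merely stands in for that storage.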

[Figure: Spline diagram]

Spline is intended for use with Spark 2.3+ but also provides limited support for Spark 2.2.

For documentation and examples, please visit the Spline GitHub Pages.


Copyright 2019 ABSA Group Limited

Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

    http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.

Main metrics

Overview
Name With Owner: AbsaOSS/spline
Primary Language: Scala
Program language: Scala (Language Count: 6)
Platform
License: Apache License 2.0

Owner activity
Created At: 2017-05-30 08:38:00
Pushed At: 2025-06-09 21:39:50
Last Commit At
Release Count: 41
Last Release Name: release/0.7.10 (Posted on 2025-06-08 23:21:36)
First Release Name: release/0.2.0 (Posted on 2017-08-09 11:53:06)

User engagement
Stargazers Count: 628
Watchers Count: 38
Fork Count: 159
Commits Count: 1.6k
Has Issues Enabled
Issues Count: 606
Issue Open Count: 44
Pull Requests Count: 551
Pull Requests Open Count: 0
Pull Requests Close Count: 98

Project settings
Has Wiki Enabled
Is Archived
Is Fork
Is Locked
Is Mirror
Is Private