dabl

Data Analysis Baseline Library

Github stars Tracking Chart

dabl

The data analysis baseline library.

  • "Mr Sanchez, are you a data scientist?"
  • "I dabl, Mr president."

Find more information on the website.

State of the library

Right now, this library is still a prototype. API might change, and you shouldn't rely on it in any critical settings.

Try it out

pip install dabl

or Binder

Current scope and upcoming features

This library is very much still under development. Current code focuses mostly on exploratory visualization and preprocessing.
There are also drop-in replacements for GridSearchCV and RandomizedSearchCV using successive halfing.
There are preliminary portfolios in the style of
POSH
auto-sklearn

to find strong models quickly. In essence that boils down to a quick search
over different gradient boosting models and other tree ensembles and
potentially kernel methods.

Stay Tuned!

Main metrics

Overview
Name With Owneramueller/dabl
Primary LanguageJupyter Notebook
Program languageShell (Language Count: 3)
Platform
License:BSD 3-Clause "New" or "Revised" License
所有者活动
Created At2020-01-30 18:26:49
Pushed At2024-10-23 21:48:40
Last Commit At
Release Count12
Last Release Name0.2.5.1 (Posted on )
First Release Name0.1.1 (Posted on )
用户参与
Stargazers Count132
Watchers Count4
Fork Count9
Commits Count314
Has Issues Enabled
Issues Count0
Issue Open Count0
Pull Requests Count0
Pull Requests Open Count1
Pull Requests Close Count2
项目设置
Has Wiki Enabled
Is Archived
Is Fork
Is Locked
Is Mirror
Is Private