petastorm

The Petastorm library enables single-machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as TensorFlow, PyTorch, and PySpark, and can also be used from pure Python code.
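As a rough illustration of the pure Python usage mentioned above, the sketch below iterates over a Parquet dataset through a Petastorm reader. It assumes `make_batch_reader`, Petastorm's entry point for plain Parquet stores (datasets written with Petastorm's own schema would use `make_reader` instead). The helper name `iter_batches` and its `reader_factory` parameter are hypothetical, introduced here only so the reader can be swapped out; they are not part of the library.

```python
def iter_batches(dataset_url, reader_factory=None):
    """Yield batches from a Parquet dataset through a Petastorm-style reader.

    `reader_factory` is a hypothetical injection point for this sketch; when
    it is omitted, Petastorm's `make_batch_reader` is used.
    """
    if reader_factory is None:
        # Lazy import: Petastorm is only required when it is actually used.
        from petastorm import make_batch_reader
        reader_factory = make_batch_reader

    # The reader is used as a context manager so it is closed when iteration ends.
    with reader_factory(dataset_url) as reader:
        for batch in reader:
            yield batch
```

A call such as `iter_batches("file:///tmp/example_parquet")` (a placeholder URL) would then yield one batch of columns at a time, which can be passed on to a training loop or inspected directly.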

Main metrics

Overview

Name With Owner: uber/petastorm
Primary Language: Python
Programming Languages: Python (Language Count: 4)
Platform
License: Apache License 2.0
Release Count: 115
Last Release Name: v0.13.0rc0
First Release Name: v0.1.1 (Posted on 2018-07-19 13:32:09)
Created At: 2018-06-15 23:15:29
Pushed At: 2025-09-15 19:17:24
Last Commit At: 2025-09-15 12:17:24
Stargazers Count: 1863
Watchers Count: 36
Fork Count: 285
Commits Count: 697
Has Issues Enabled
Issues Count: 308
Issue Open Count: 157
Pull Requests Count: 405
Pull Requests Open Count: 24
Pull Requests Closed Count: 76
Has Wiki Enabled
Is Archived
Is Fork
Is Locked
Is Mirror
Is Private