cloudpickle

Extended pickling support for Python objects

  • Owner: cloudpipe/cloudpickle
  • Platform:
  • License:: Other
  • Category::
  • Topic:
  • Like:
    0
      Compare:

Github stars Tracking Chart

cloudpickle

Automated Tests
codecov.io

cloudpickle makes it possible to serialize Python constructs not supported
by the default pickle module from the Python standard library.

cloudpickle is especially useful for cluster computing where Python
code is shipped over the network to execute on remote hosts, possibly close
to the data.

Among other things, cloudpickle supports pickling for lambda functions
along with functions and classes defined interactively in the
__main__ module (for instance in a script, a shell or a Jupyter notebook).

Cloudpickle can only be used to send objects between the exact same version
of Python
.

Using cloudpickle for long-term object storage is not supported and
strongly discouraged.

Security notice: one should only load pickle data from trusted sources as
otherwise pickle.load can lead to arbitrary code execution resulting in a critical
security vulnerability.

Installation

The latest release of cloudpickle is available from
pypi:

pip install cloudpickle

Examples

Pickling a lambda expression:

>>> import cloudpickle
>>> squared = lambda x: x ** 2
>>> pickled_lambda = cloudpickle.dumps(squared)

>>> import pickle
>>> new_squared = pickle.loads(pickled_lambda)
>>> new_squared(2)
4

Pickling a function interactively defined in a Python shell session
(in the __main__ module):

>>> CONSTANT = 42
>>> def my_function(data):
...    return data + CONSTANT
...
>>> pickled_function = cloudpickle.dumps(my_function)
>>> pickle.loads(pickled_function)(43)
85

Running the tests

  • With tox, to test run the tests for all the supported versions of
    Python and PyPy:

    pip install tox
    tox
    

    or alternatively for a specific environment:

    tox -e py37
    
  • With py.test to only run the tests for your current version of
    Python:

    pip install -r dev-requirements.txt
    PYTHONPATH='.:tests' py.test
    

Note about function Annotations

Note that because of design issues Python's typing module, cloudpickle
supports pickling type annotations of dynamic functions for Python 3.7 and
later. On Python 3.4, 3.5 and 3.6, those type annotations will be dropped
silently during pickling (example below):

>>> import typing
>>> import cloudpickle
>>> def f(x: typing.Union[list, int]):
...     return x
>>> f
<function __main__.f(x:Union[list, int])>
>>> cloudpickle.loads(cloudpickle.dumps(f))  # drops f's annotations
<function __main__.f(x)>

History

cloudpickle was initially developed by picloud.com and shipped as part of
the client SDK.

A copy of cloudpickle.py was included as part of PySpark, the Python
interface to Apache Spark. Davies Liu, Josh
Rosen, Thom Neale and other Apache Spark developers improved it significantly,
most notably to add support for PyPy and Python 3.

The aim of the cloudpickle project is to make that work available to a wider
audience outside of the Spark ecosystem and to make it easier to improve it
further notably with the help of a dedicated non-regression test suite.

Main metrics

Overview
Name With Ownercloudpipe/cloudpickle
Primary LanguagePython
Program languagePython (Language Count: 2)
Platform
License:Other
所有者活动
Created At2015-04-13 16:33:00
Pushed At2025-03-25 09:32:53
Last Commit At2025-03-25 10:32:53
Release Count45
Last Release Namev3.1.1 (Posted on )
First Release Name0.1.0 (Posted on 2015-04-16 16:31:02)
用户参与
Stargazers Count1.8k
Watchers Count29
Fork Count181
Commits Count457
Has Issues Enabled
Issues Count261
Issue Open Count88
Pull Requests Count239
Pull Requests Open Count13
Pull Requests Close Count57
项目设置
Has Wiki Enabled
Is Archived
Is Fork
Is Locked
Is Mirror
Is Private