Fast serialization of Tensors?

stsievert · December 6, 2017, 8:34pm

I’d like a fast serialization for a tensor. Right now, I’m using

x = torch.Tensor(...)
serialized = pickle.dumps(x)

The tensor is not guaranteed to live on the CPU and I want to preserve CPU–GPU bandwidth, meaning I can not use

x.numpy().tobytes()

Is there a fast serialization method for torch Tensors?

Notes on research I’ve done so far:

the source for torch.serialization and it relies on pickle.dumps (plus it seems oriented towards files, not speed).
blosc has compress_ptr which could be useful with x.data_ptr
I’ve looked at pyarrow too, but decided to ask here first

stsievert · December 19, 2017, 7:55pm

I should look more into CuPy and their serializers: https://chainer.readthedocs.io/en/v2-docs-cupy/reference/serializers.html

smth · December 19, 2017, 9:49pm

pytorch defines a custom pickler, so pickle.dump with the custom pickler is actually very fast (we go into C for serializing the storage)

stsievert · December 19, 2017, 11:15pm

Right, I should clarify.

I’m comparing with NumPy serialization. The picture below times serialization for NumPy and PyTorch with pickle.dumps on a Macbook Pro 2015.

The core of my code was

def stat(x, serialize=pickle.dumps):
    start = time.time()
    msg = serialize(x)
    return {'time': time.time() - start, 'bytes': len(msg)}

# ... other functions, for-loops, etc
x = np.random.randn(n).astype('float32')
y = torch.Tensor(x)

This is with torch.__version__ == 0.3.0.post4.

mrshenli · April 18, 2019, 4:01pm

@stsievert Is the fix in #9184 sufficient?

stsievert · April 18, 2019, 8:59pm

Yup. Well, I think so. That PR was actually motivated by some of the work I did over the summer. I wouldn’t be surprised if the NumPy and PyTorch timings are equally fast now (but I’d like to see the graph).

stsievert · April 18, 2019, 9:11pm

Here’s the same graph with PyTorch 1.0.0 and NumPy 1.16.2:

tastyminerals · October 24, 2019, 10:07am

It would be super nice to add pyarrow here.