How to extract CNN features from video frames using pre-trained models? No splitting of data in train and test is recommended

Check this post on how to extract 2D CNN features from a video CNN LSTM implementation for video classification

To save them in binary format I think you can use this function torch.save and io.BytesIO()

https://pytorch.org/docs/stable/torch.html#torch.save