Torchvision.io inconsistent with the rest of the module

I am trying to get images directly from a video using torchvision.io.read_video. I noticed this function reads images in the form [H,W,C] while everything else in torchvision expects [C,H,W]. Why the inconsistency?

Is there an inexpensive way to get the frames from torchvision.io.read_video in [C,H,W] form?