You can get TimeDistributed-style behaviour by just iterating over the modules of an nn.ModuleList in the forward pass:
import torch
import torch.nn as nn

# 24 time-distributed fully connected layers, one per timestep
num = 24
fc = nn.ModuleList([nn.Linear(8, 1) for _ in range(num)])

# forward pass
x = torch.zeros(64, 24, 8)                       # (batch, timesteps, features)
outs = []
for i in range(x.shape[1]):
    outs.append(fc[i](x[:, i, :].unsqueeze(1)))  # (batch, 1, 1) for each timestep
outs = torch.cat(outs, dim=1)                    # (batch, timesteps, 1)
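For the shapes above, the concatenated output comes out as (64, 24, 1). Note that this creates num separate Linear layers, one per timestep, so weights are not shared across time; the reshape approach discussed below applies a single module to every timestep instead.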
Awesome! But I think that when the original input X has shape (samples, timesteps, ...) and we need Y reshaped to (timesteps, samples, output_size), we should transpose y after the view; without the transpose, the timesteps get mixed up:
y = y.contiguous().view(x.size(0), -1, y.size(-1))  # (samples, timesteps, output_size)
# if timesteps-first output is needed, transpose rather than reshaping again
if not self.batch_first:
    y = y.transpose(0, 1).contiguous()               # (timesteps, samples, output_size)
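Putting the pieces together, a minimal sketch of such a wrapper could look like this (the class name TimeDistributed and the batch_first flag are my assumptions following the snippet above, not the exact code from the thread):

import torch
import torch.nn as nn

class TimeDistributed(nn.Module):
    """Apply the same module to every timestep by folding time into the batch dim."""
    def __init__(self, module, batch_first=True):
        super().__init__()
        self.module = module
        self.batch_first = batch_first

    def forward(self, x):                                    # x: (samples, timesteps, in_features)
        y = self.module(x.contiguous().view(-1, x.size(-1))) # (samples * timesteps, out_features)
        y = y.contiguous().view(x.size(0), -1, y.size(-1))   # (samples, timesteps, out_features)
        if not self.batch_first:
            y = y.transpose(0, 1).contiguous()               # (timesteps, samples, out_features)
        return y

# usage: one Linear shared across all 24 timesteps
td = TimeDistributed(nn.Linear(8, 1), batch_first=True)
out = td(torch.zeros(64, 24, 8))                             # (64, 24, 1)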
If your model contains BatchNorm, the output may differ: the running mean and variance will be computed over a batch of batch_size * time_steps samples instead of the actual batch_size.
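A quick way to see this (a hypothetical check, not from the thread): push a BatchNorm1d layer through the same flattened view and note how many rows the statistics are computed from.

import torch
import torch.nn as nn

bn = nn.BatchNorm1d(8)
x = torch.randn(64, 24, 8)           # (samples, timesteps, features)
flat = x.contiguous().view(-1, 8)    # (1536, 8): statistics come from 64 * 24 rows
_ = bn(flat)                         # running_mean / running_var are updated from 1536 rows,
                                     # not from the 64 actual samples in the batch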