How to program y = x^2 using an MLP?

Hello, I am new to PyTorch and I need help. How can I program a multilayer perceptron whose output is the function y = x^2, starting from x = [… -2, -1, 0, 1, 2 …]?
I have tried, but I have only been able to produce linear functions like y = a * x + b.

I’m not sure I understand the use case correctly, but your operation doesn’t seem to contain trainable parameters.

You could create the input tensor and just call x**2 to get the output:

import torch

x = torch.linspace(-10, 10, 21)
y = x**2

However, since no parameter is used, this won’t be trainable.

In the case of y = 3x, for example, it could be written as y = w1 * x, where w1 must be trained until it reaches the value of 3. But for y = x^2 I can't find which parameter to train, and I need help.
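
For reference, this is a minimal sketch of what I mean by training w1 towards 3 (the values and hyperparameters are just for illustration):

import torch
import torch.nn as nn

# single trainable weight, starting from a random value
w1 = nn.Parameter(torch.randn(1))

x = torch.linspace(-2, 2, 21)
target = 3 * x  # the function y = 3x we want to recover

optimizer = torch.optim.SGD([w1], lr=1e-2)
criterion = nn.MSELoss()

for epoch in range(500):
    optimizer.zero_grad()
    out = w1 * x
    loss = criterion(out, target)
    loss.backward()
    optimizer.step()

print(w1.item())  # ends up close to 3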

That’s exactly what I mean. :slight_smile:
If your target function is x**2, then there is nothing to train. Or would you like to make the exponent trainable?

The objective is to enter any value into the NN and have it return the square of that number as output.

So should the exponent be trained?
If so, you could define it as a parameter and try to optimize it.
However, you would have to take care of negative input values (taking the root of them yields NaN), and using the log will probably be more stable:

import torch
import torch.nn as nn

# trainable exponent, randomly initialized in [0, 1)
e = nn.Parameter(torch.empty(1).uniform_(0, 1))

# positive inputs only, so that the log is defined
data = torch.linspace(1, 10, 9)
target = torch.log(data**2)  # log(x^2) = 2 * log(x)

optimizer = torch.optim.SGD([e], lr=1e-3)
criterion = nn.L1Loss()

for epoch in range(1000):
    optimizer.zero_grad()
    out = e * torch.log(data)  # log(x^e) = e * log(x)
    loss = criterion(out, target)
    loss.backward()
    optimizer.step()
    print('epoch {}, loss {}, e {}'.format(epoch, loss.item(), e.item()))
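
If the training converges, e should approach 2, since the target is log(x**2) = 2 * log(x). To get back to y itself you would exponentiate the output again, e.g. torch.exp(out).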

I thought about training the exponent, or rather optimizing it, but I didn't get anywhere, and besides, I wasn't using an MLP. I will test the code you posted, thanks.

Hello Miguel!

Let me speculate a little about what you are asking.

As a learning exercise you might be asking how to train a neural
network to reproduce the function x^2 without building any
knowledge of that specific function into it by hand.

One of the interesting and important features of neural networks
is that their linear layers plus non-linear activations can be used
to reproduce / approximate many interesting functions. See, for
example, the “Universal approximation theorem” wikipedia article.

I haven’t experimented with this in particular, but you might try
training a network like this (just making something up):

model = torch.nn.Sequential(
    torch.nn.Linear(1, 50),
    torch.nn.Tanh(),
    torch.nn.Linear(50, 50),
    torch.nn.Tanh(),
    torch.nn.Linear(50, 1)
)

torch.nn.MSELoss would be the appropriate loss function.

(Your inputs would be your various values of x, e.g., your
[… -2, -1, 0, 1, 2 …]. Your targets would be the corresponding x^2.)
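
Here is a minimal sketch of how such a training loop might look
(the input range, learning rate, and number of epochs are just
made up, so you would want to tune them):

import torch

model = torch.nn.Sequential(
    torch.nn.Linear(1, 50),
    torch.nn.Tanh(),
    torch.nn.Linear(50, 50),
    torch.nn.Tanh(),
    torch.nn.Linear(50, 1)
)

# inputs of shape [N, 1] and targets x^2 of the same shape
x = torch.linspace(-2, 2, 201).unsqueeze(1)
y = x**2

criterion = torch.nn.MSELoss()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

for epoch in range(2000):
    optimizer.zero_grad()
    pred = model(x)
    loss = criterion(pred, y)
    loss.backward()
    optimizer.step()
    if epoch % 200 == 0:
        print('epoch {}, loss {:.6f}'.format(epoch, loss.item()))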

You could also try other activations, e.g., torch.nn.ReLU, and
wider / narrower or more / fewer hidden layers.

If you get such a network working for x^2, it would be informative
to retrain it (from scratch) on something like abs(x)^3.

Good luck.

K. Frank


I am facing a similar issue. I am not able to train an MLP model on y = x**2 data.
Here is my code. Please help.

import numpy as np
import torch
import torch.nn as nn
from sklearn.model_selection import train_test_split

def train(epoch=1000):
    t = np.arange(0, 1, 0.001)  # 1000 points in [0, 1)
    sq = np.square(t)
    X_t, X_v, Y_t, Y_v = train_test_split(t, sq, test_size=0.2, random_state=1)
    print(X_t.shape)
    model = nn.Sequential(
        nn.Linear(1, 8), nn.ReLU(),
        nn.Linear(8, 32), nn.ReLU(),
        nn.Linear(32, 64), nn.ReLU(),
        nn.Linear(64, 32), nn.ReLU(),
        nn.Linear(32, 4), nn.ReLU(),
        nn.Linear(4, 1),
    ).type(torch.DoubleTensor)
    loss = nn.MSELoss().type(torch.DoubleTensor)
    optim = torch.optim.Adam(model.parameters(), lr=0.01)

    for i in range(epoch):
        model.train()
        pred = model(torch.tensor(X_t).unsqueeze(1))
        l = loss(pred.squeeze(), torch.tensor(Y_t))
        l.backward()
        optim.step()
        optim.zero_grad()
        if i % 100 == 0:
            print(f'loss-{l} at epoch:{i}')

train()