I want to optimize [the number of hidden layers, hidden neurons and learning rate]
Since the searching space is both continuous and discrete,
What surrogate model do I have to use?
I know Gaussian process is widely used for Bayesian optimization, but would it be possible to apply Gaussian process in this case?
Thanks in advance!