Bayesian optimization with TPE or Gaussian process?

Hi all,

I want to optimize [the number of hidden layers, hidden neurons and learning rate]

Since the searching space is both continuous and discrete,
What surrogate model do I have to use?
I know Gaussian process is widely used for Bayesian optimization, but would it be possible to apply Gaussian process in this case?

Thanks in advance!