CPU and GPU memory

Yes

Yes, the outputs will use the same device as the inputs of an operation by default.

Also yes. You can use CPU offloading as described here.