Best model to handle multi labeled data?

I have some data that is segmented, and some data that has bbox labels. I want to be able to leverage both label types, and my goal is to create a really strong deep classifier. what model can I use?

The Object Detection tutorial might be a good starter which uses Mask R-CNN for both target types.