Detecting the number of objects in an image

I’m training a mushroom image classifier, and need to identify images with more than one mushroom. Would this require full object detection or can it be done in a simpler way?

Btw, I’m looking into the VGG Image Annotation Tool for bounding box annotation. Is this a good choice or is there something simpler out there for this specific task?