Batch non-maximum suppression on the GPU

This comment is helpful.