How to set QNNPack parallelism?

For our use case we need to do the job on one thread (preferably the main one). For all other workflows we just call torch.set_num_threads(1) and that gets the job done :slightly_smiling_face:.
But QNNPack (more specifically, the quantized convolution) doesn’t respect this setting, since it uses caffe2::pthreadpool_().
In our tests we get good results by making that function return nullptr.

Is there a way to set the number of threads for that pool without recompiling PyTorch?
If not, what’s the minimal acceptable refactoring to make it configurable or to make it respect torch.set_num_threads?
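
For concreteness, here is a minimal sketch of the setup in question (the module construction and quantization parameters are illustrative assumptions, not our exact model):

```python
import torch

torch.backends.quantized.engine = "qnnpack"  # select the QNNPACK kernels
torch.set_num_threads(1)                     # respected by most ATen ops...

# ...but the quantized convolution still fans out over the workers of
# caffe2::pthreadpool_(), which the setting above does not touch.
qconv = torch.nn.quantized.Conv2d(3, 8, kernel_size=3)
x = torch.quantize_per_tensor(
    torch.randn(1, 3, 32, 32), scale=0.1, zero_point=0, dtype=torch.quint8
)
y = qconv(x)
```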

This is related to mobile; can you add a ‘mobile’ tag?

Our use case is not for mobile. We plan to use QNNPACK if the machine we’re running on doesn’t support AVX2.

Should I add the mobile tag despite that?

I’m not sure how to add the mobile tag.

Changed the category for you. :wink:


@ptrblck thanks (:
@jerryzh168 any updates on this?

@dbalchev, you can try caffe2::pthreadpool()->set_thread_count(1).
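
Roughly, that call could be wired up from an inline C++ extension. This is only a sketch: the header path caffe2/utils/threadpool/pthreadpool-cpp.h and the availability of the caffe2::pthreadpool() symbol are assumptions about what the installed wheel exposes, and (as the follow-up below notes) it may not build against a stock wheel:

```python
from torch.utils.cpp_extension import load_inline

cpp_source = """
#include <caffe2/utils/threadpool/pthreadpool-cpp.h>

void set_pthreadpool_thread_count(int64_t n) {
  // The singleton pool that the QNNPACK kernels dispatch onto.
  caffe2::pthreadpool()->set_thread_count(n);
}
"""

shim = load_inline(
    name="pthreadpool_shim",
    cpp_sources=cpp_source,
    functions=["set_pthreadpool_thread_count"],
)
shim.set_pthreadpool_thread_count(1)
```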

@kimishpatel We’re using the Python wheel for inference. I’ve tried recompiling PyTorch so that torch.set_num_threads calls pthreadpool()->set_thread_count as well (PR with the rebased commit). It works, but then we have to ship a custom PyTorch wheel. I didn’t find a way to do this through the Python API or even with a C++ extension.