Problem installing torch audio on Windows 10 conda

In Anaconda Python 3.6.7 with PyTorch installed, on Windows 10, I do this sequence:

conda install -c conda-forge librosa
conda install -c groakat sox

then in a fresh download from I do

python install

and it runs for a while and ends like this:

torchaudio/torch_sox.cpp(3): fatal error C1083: Cannot open include file: 'sox.h': No such file or directory
error: command 'C:\\Program Files (x86)\\Microsoft Visual Studio\\2017\\Community\\VC\\Tools\\MSVC\\14.15.26726\\bin\\HostX86\\x64\\cl.exe' failed with exit status 2

I think I’m almost there and this is doable. Please help!

Note this is also an open GitHub issue of long standing:

Not a windows user. But I think the problem is with the environment setup. It is not able to locate the sox.h file, which is installed in your conda environment. You are facing this problem as you are currently not in the folder in which you installed conda. Try moving to the folder of your environment and run the command again.

Kushaj, I also cross-posted on StackOverflow and I got an answer over there which looks like a firm and definitive negative for Windows:

Can you do a check for me. Open terminal and import numpy or any conda package. Tell me if it works or not.

It works fine:

Python 3.6.7 (default, Feb 24 2019, 05:34:16) [MSC v.1900 64 bit (AMD64)] on win32
Type "help", "copyright", "credits" or "license" for more information.
>>> import numpy

This is an open, logged issue for Windows. I’m hoping a PyTorch guru will take pity on Windows folks and fix it.

I checked the problem in some detail, I think for now you will not be able to do so on Windows. But if you want a trick, you can run an Ubuntu VM with the libraries installed and try to communicate between Windows and the VM.

Other people have suggested WSL Linux on Windows but there are still problems accessing GPU with that:

For my purposes (training speech recognition in Swahili), I am giving up on PyTorch in favor of this Tensorflow-based project done for Udacity which has no missing pieces on Windows:

For the record, I was trying to reproduce this OpenNMT-py speech training demo on Windows: