I really want to know which algorithm Pytorch use to optimize L1 penalty.
Like scikit-learn clearly mentions that - “We use the truncated gradient algorithm proposed by Tsuruoka et al. 2009 for L1 regularization (and the Elastic Net).” (https://scikit-learn.org/stable/modules/sgd.html#implementation-details).
I really searched a lot but I couldn’t find the answer.