Hi! I tried pytorch-PCGrad in my project. My network is a MobileNetV2 backbone followed by two task heads, and each head has its own parameters. When I ran pytorch-PCGrad, it failed. It seems that the separate backward passes (one per task loss) produce two gradient vectors of different lengths, because each loss only reaches the backbone and its own head, and this causes the error in the function _project_conflicting.
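
A minimal sketch of the suspected mismatch and one possible workaround, not the library's actual code. Everything here is a hypothetical stand-in: small nn.Linear layers replace MobileNetV2 and the two heads, and the helper names flat_grad_skipping_unused and flat_grad_zero_filled are made up for illustration. The assumption is that the gradient of each task loss is flattened only over the parameters that loss actually touches, so the two vectors cannot be compared element-wise. Filling zeros for untouched parameters gives both vectors the same length.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Hypothetical stand-ins: a tiny shared trunk instead of MobileNetV2,
# plus two task heads with different parameter counts.
backbone = nn.Linear(8, 4)   # 8*4 + 4 = 36 parameters
head_a = nn.Linear(4, 3)     # 4*3 + 3 = 15 parameters
head_b = nn.Linear(4, 5)     # 4*5 + 5 = 25 parameters
params = (list(backbone.parameters())
          + list(head_a.parameters())
          + list(head_b.parameters()))

x = torch.randn(2, 8)
feat = backbone(x)
losses = [head_a(feat).sum(), head_b(feat).sum()]


def flat_grad_skipping_unused(loss):
    """Flatten only the gradients that exist (suspected cause of the mismatch)."""
    grads = torch.autograd.grad(loss, params, retain_graph=True, allow_unused=True)
    return torch.cat([g.flatten() for g in grads if g is not None])


def flat_grad_zero_filled(loss):
    """Flatten over ALL parameters, using zeros where the loss has no gradient."""
    grads = torch.autograd.grad(loss, params, retain_graph=True, allow_unused=True)
    return torch.cat([
        (g if g is not None else torch.zeros_like(p)).flatten()
        for g, p in zip(grads, params)
    ])


skipped = [flat_grad_skipping_unused(loss) for loss in losses]
filled = [flat_grad_zero_filled(loss) for loss in losses]

print([g.numel() for g in skipped])  # [51, 61]: lengths differ
print([g.numel() for g in filled])   # [76, 76]: lengths match

# With equal lengths, the dot product used to detect conflicting gradients is defined.
conflict = torch.dot(filled[0], filled[1])
```

With the zero-filled vectors, only the shared backbone entries contribute to the dot product, since each head's entries are zero in the other task's gradient.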