Gradient Descent Optimizes Normalization-Free ResNets | IEEE Conference Publication | IEEE Xplore