Table of Content

Why going deep is difficult?

Screenshot 2024-03-26 at 7.18.50 PM.png

VGG

ResNet

Understanding Residual Learning

x_l is the input from the previous layer, and F_l(x_l) represents the residual function learned by the l-th layer.

x_l is the input from the previous layer, and F_l(x_l) represents the residual function learned by the l-th layer.

Identity Mappings in Residual Networks (ResNet V2)

(left) The difference between ResNet V1 and ResNet V2. The proposal is to do only the identity addition in the residual connection without any activation. (right) the proposed method lowers the loss significantly.

(left) The difference between ResNet V1 and ResNet V2. The proposal is to do only the identity addition in the residual connection without any activation. (right) the proposed method lowers the loss significantly.

Screenshot 2024-03-27 at 10.23.57 AM.png