( 참고 : 패스트 캠퍼스 , 한번에 끝내는 컴퓨터비전 초격차 패키지 )

MobileNet & ShuffleNet

[1] MobileNet

decouples “channel-wise” feature extractor & “spatial” feature extractor

Standard convolution

Depthwise-Separable Convolution

Computational Gain :

\(\frac{\operatorname{cost}_{d w}}{\operatorname{cost}_{f u l l}}=\frac{1}{C_{i}}+\frac{1}{K_{h} K_{w}}\).

mobilenet_v2 = models.mobilenet_v2()
mobilenet_v3_large = models.mobilenet_v3_large()
mobilenet_v3_small = models.mobilenet_v3_small()

As shown above, 1x1 conv takes most of computation

\(\rightarrow\) how to minimize this overhead?

“G” grouped convolution = “G” seperable convolutions

What if we use this with depth-wise convolution in MobileNet?

\(\rightarrow\) no information exchange between channels!

let’s shuffle information among channels!

Architecture