Model Compression Competition

Perspectives on compression

The ShuffleNetv2 paper proposed the following guidelines, considering factors beyond FLOPs that affect speed:

Memory access cost is lowest when input and output sizes are equal
Large group convolutions increase memory cost
Structures with multiple branching paths — that is, models configured in parallel — cause speed degradation
Element-wise operations have a non-negligible impact, so be careful with them