“Stacked Hourglass Networks for Human Pose Estimation” paper review

paper link (submitted in 2016). I’m only interested in the stacked hourglass architecture, not in pose estimation performance, so the points listed below relate only to that architecture. The “encoding-decoding” or “conv-deconv” structure had already been introduced. This paper goes one step further and stacks multiple “hourglass” structures. While stacking, Read more…
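To make the "stack several encoder-decoder modules" idea concrete, here is a minimal NumPy sketch. It is not the paper's network: there are no learned layers, the average pooling, nearest-neighbour upsampling and additive skip connections are assumptions chosen for illustration, and the names `downsample`, `upsample`, `hourglass` and `stacked_hourglass` are hypothetical. It only shows the shape flow, where each module goes down in resolution and back up, and the output of one module feeds the next.

```python
import numpy as np

def downsample(x):
    """Halve the resolution with 2x2 average pooling (H and W must be even)."""
    h, w = x.shape
    return x.reshape(h // 2, 2, w // 2, 2).mean(axis=(1, 3))

def upsample(x):
    """Double the resolution by repeating each pixel (nearest neighbour)."""
    return x.repeat(2, axis=0).repeat(2, axis=1)

def hourglass(x, depth):
    """One hourglass: go down `depth` levels, then back up.

    Assumption: the skip connection at each level is a plain addition.
    """
    if depth == 0:
        return x
    inner = hourglass(downsample(x), depth - 1)
    return x + upsample(inner)

def stacked_hourglass(x, num_stacks, depth):
    """Chain `num_stacks` hourglass modules; keep every module's output."""
    outputs = []
    for _ in range(num_stacks):
        x = hourglass(x, depth)
        outputs.append(x)
    return outputs

outs = stacked_hourglass(np.ones((16, 16)), num_stacks=2, depth=3)
print([o.shape for o in outs])
```

Every module returns a map at the input resolution, which is what allows the modules to be chained end to end.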

EfficientDet paper review

paper link: https://arxiv.org/pdf/1911.09070.pdf. BiFPN: multiple BiFPN layers are stacked for scaling; depth-wise convolution layers are used; bidirectional cross-scale connections are combined with weighted feature fusion. Weighted feature fusion: features at each resolution get a different, learnable weight. The paper summarizes three approaches to weighted feature fusion. Unbounded fusion: because it is unbounded, can Read more…
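As a rough illustration of the unbounded variant only (the excerpt is cut off before the other two approaches), here is a minimal NumPy sketch: each input feature map is multiplied by its own scalar weight and the results are summed, O = Σ wᵢ · Iᵢ. The function name `unbounded_fusion` is hypothetical, the weights are fixed numbers rather than trained parameters, and the inputs are assumed to have already been resized to a common resolution.

```python
import numpy as np

def unbounded_fusion(features, weights):
    """Weighted sum of same-shaped feature maps: O = sum_i w_i * I_i.

    The weights are unconstrained scalars, one per input (no normalization).
    """
    stacked = np.stack([np.asarray(f, dtype=float) for f in features])  # (n, H, W)
    w = np.asarray(weights, dtype=float)                                 # (n,)
    return np.tensordot(w, stacked, axes=1)                              # (H, W)

a = np.ones((2, 2))
b = 2.0 * np.ones((2, 2))
print(unbounded_fusion([a, b], [0.5, 3.0]))
```

Nothing keeps the weights in a fixed range here, which is the "unbounded" property the excerpt refers to.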