paper review: “MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications”

arxiv:

key points

  • focus on optimizing for latency and on small networks
  • use depthwise separable convolutions to reduce computation as much as possible
  • further reduce model size with the width and resolution multipliers, at the cost of accuracy

depthwise separable convolution

This is a combination of a depthwise convolution and a pointwise convolution.
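To make the savings from a depthwise separable convolution concrete, here is a minimal sketch that counts multiply-adds for one layer, using the cost formulas from the MobileNets paper. The function names and the example layer shape are assumptions chosen for illustration, and rounding the thinned channel counts with int() is a simplification.

```python
def standard_conv_mult_adds(d_k, m, n, d_f):
    """Mult-adds of a standard conv: d_k x d_k kernel, m input channels,
    n output channels, d_f x d_f output feature map (stride 1, padded)."""
    return d_k * d_k * m * n * d_f * d_f


def separable_conv_mult_adds(d_k, m, n, d_f, alpha=1.0, rho=1.0):
    """Mult-adds of a depthwise separable conv.

    alpha is the width multiplier (thins the channels) and rho is the
    resolution multiplier (shrinks the feature map).
    """
    m_thin = int(alpha * m)
    n_thin = int(alpha * n)
    d_f_small = int(rho * d_f)
    # Depthwise step: one d_k x d_k filter per input channel.
    depthwise = d_k * d_k * m_thin * d_f_small * d_f_small
    # Pointwise step: a 1x1 conv that mixes channels.
    pointwise = m_thin * n_thin * d_f_small * d_f_small
    return depthwise + pointwise


# Hypothetical layer: 3x3 kernel, 512 -> 512 channels, 14x14 feature map.
standard = standard_conv_mult_adds(3, 512, 512, 14)
separable = separable_conv_mult_adds(3, 512, 512, 14)
ratio = separable / standard  # equals 1/n + 1/d_k**2
```

For a 3x3 kernel the ratio is close to 1/9, which matches the paper's claim of roughly 8 to 9 times less computation. Passing alpha or rho below 1 lowers the count further, which is the accuracy trade-off mentioned in the key points.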

paper review: “FastDepth: Fast Monocular Depth Estimation on Embedded Systems”


key points

  • model to predict depth map
  • maximize speed by making it as light as possible
  • focus not only on encoder network but also on decoder network for speed improvement
  • MobileNet for the encoder; nearest-neighbor interpolation + NNConv5 for the decoder; skip connections; depthwise separable convolution wherever possible; network pruning; the TVM compiler stack to optimize depthwise separable convolution, which is not optimized in popular DL frameworks.
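The nearest-neighbor interpolation used in the decoder is cheap because it only copies values and has no learned weights. Below is a minimal pure-Python sketch of it on a single-channel feature map. The function name and the default scale factor of 2 are assumptions for illustration; a real decoder would apply this to multi-channel tensors inside a DL framework.

```python
def upsample_nearest(feature_map, scale=2):
    """Nearest-neighbor upsampling of a 2D list by an integer scale factor.

    Each input value is repeated `scale` times along both axes.
    """
    out = []
    for row in feature_map:
        # Repeat each value horizontally.
        wide = [value for value in row for _ in range(scale)]
        # Repeat the widened row vertically.
        for _ in range(scale):
            out.append(list(wide))
    return out


small = [[1, 2],
         [3, 4]]
large = upsample_nearest(small)
```

Here a 2x2 map becomes a 4x4 map in which each input value fills a 2x2 block.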