Paper Notes: Optical Flow Estimation using a Spatial Pyramid Network

Date: 2017-06-21 13:54:19


　　Optical Flow Estimation using a Spatial Pyramid Network


　　This paper combines the classical spatial-pyramid formulation with deep learning to compute optical flow in a coarse-to-fine manner. The method estimates large motions by warping one image of a pair at each pyramid level by the current flow estimate and computing an update to the flow.
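　　The warping step can be sketched as follows. This is a minimal NumPy illustration, not the paper's implementation: the function name `warp`, the (dx, dy) channel order, the border clamping, and the use of bilinear interpolation are all assumptions made for the example.

```python
import numpy as np

def warp(image, flow):
    """Backward-warp a 2-D grayscale image by a dense flow field.

    image: (H, W) float array.
    flow:  (H, W, 2) array, flow[y, x] = (dx, dy) in pixels (assumed layout).
    Output pixel (y, x) is sampled from `image` at (y + dy, x + dx)
    using bilinear interpolation.
    """
    h, w = image.shape
    ys, xs = np.mgrid[0:h, 0:w].astype(np.float64)
    # Sampling positions, clamped to the image border.
    sx = np.clip(xs + flow[..., 0], 0, w - 1)
    sy = np.clip(ys + flow[..., 1], 0, h - 1)
    # Integer corners surrounding each sampling position.
    x0 = np.floor(sx).astype(int)
    y0 = np.floor(sy).astype(int)
    x1 = np.minimum(x0 + 1, w - 1)
    y1 = np.minimum(y0 + 1, h - 1)
    # Fractional offsets act as interpolation weights.
    wx = sx - x0
    wy = sy - y0
    top = (1 - wx) * image[y0, x0] + wx * image[y0, x1]
    bottom = (1 - wx) * image[y1, x0] + wx * image[y1, x1]
    return (1 - wy) * top + wy * bottom

# Usage: a zero flow leaves the image unchanged.
frame = np.arange(12, dtype=float).reshape(3, 4)
unchanged = warp(frame, np.zeros((3, 4, 2)))
```

　　If the current flow estimate is close to the true motion, the warped second image nearly matches the first, so only a small correction remains to be estimated at that level.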

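　　A CNN updates the flow at each pyramid level, replacing the objective-function minimization used in traditional methods. Unlike FlowNet, the network here does not need to handle large motions; the pyramid already takes care of them. The main advantages of the method are: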
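　　The per-level update can be written compactly. The notation below is an assumption introduced for these notes rather than something they define: $k$ indexes the pyramid level (0 is the coarsest), $I^1_k$ and $I^2_k$ are the two images at level $k$, $u(\cdot)$ upsamples a flow field by a factor of 2, $w(I, V)$ warps image $I$ by flow $V$, and $G_k$ is the CNN at level $k$.

```latex
\begin{aligned}
v_k &= G_k\bigl(I^1_k,\; w\bigl(I^2_k,\, u(V_{k-1})\bigr),\; u(V_{k-1})\bigr) \\
V_k &= u(V_{k-1}) + v_k
\end{aligned}
```

　　Here $v_k$ is the residual flow predicted at level $k$ and $V_k$ is the accumulated flow. At the coarsest level the initial flow is taken to be zero, so $V_0 = v_0$.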

　　1. our Spatial Pyramid Network is much simpler and 96% smaller than FlowNet in terms of model parameters.

　　2. since the flow at each pyramid level is small (< 1 pixel), a convolutional approach applied to pairs of warped images is appropriate.

　　3. unlike FlowNet, the learned convolution filters appear similar to classical spatio-temporal filters, giving insight into the method and how to improve it.
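　　The second point follows from simple arithmetic, sketched below. The 16-pixel motion, the 5-level pyramid, and both function names are hypothetical values chosen for illustration, and a downsampling factor of 2 per level is assumed.

```python
def motion_per_level(full_res_motion, num_levels):
    """Displacement magnitude (in pixels) at each level, coarsest first.

    Level k (0 = coarsest) is downsampled by 2 ** (num_levels - 1 - k).
    """
    return [full_res_motion / 2 ** (num_levels - 1 - k)
            for k in range(num_levels)]

def residual_after_upsampling(prev_level_error):
    """Residual left for the next finer level, in that level's pixels.

    Upsampling a flow field by 2 also doubles whatever error it carried.
    """
    return 2.0 * prev_level_error

print(motion_per_level(16.0, 5))       # [1.0, 2.0, 4.0, 8.0, 16.0]
print(residual_after_upsampling(0.4))  # 0.8
```

　　A 16-pixel motion shrinks to 1 pixel at the coarsest level. If each level then leaves less than half a pixel of error, the residual the next level has to estimate stays below 1 pixel.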

　　The main problem with existing methods:

　　Existing methods stack the two images directly and feed them into a CNN. When the motion between the two frames is larger than one or a few pixels, the spatio-temporal convolutional filters do not receive a meaningful response. In other words, if a convolutional window in one image does not overlap with related image pixels at the next time instant, no meaningful temporal filter can be learned.
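　　A toy 1-D example makes the overlap argument concrete. It is a simplification assumed for illustration only: a single point feature, a sliding window of `kernel_size` pixels, and the hypothetical helper `windows_covering_both`, which counts the window positions that contain the feature in both frames.

```python
def windows_covering_both(width, kernel_size, pos, displacement):
    """Count 1-D windows that contain the feature in frame 1 (at `pos`)
    and in frame 2 (at `pos + displacement`)."""
    count = 0
    for start in range(width - kernel_size + 1):
        window = range(start, start + kernel_size)
        if pos in window and (pos + displacement) in window:
            count += 1
    return count

# A 3-pixel window on a 32-pixel row, feature at position 10.
print(windows_covering_both(32, 3, 10, 1))  # 2
print(windows_covering_both(32, 3, 10, 2))  # 1
print(windows_covering_both(32, 3, 10, 5))  # 0
```

　　Once the displacement reaches the window size, no window sees the feature in both frames, so a filter of that size has nothing to correlate.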

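　　Two key problems need to be solved here: 1. long-range dependencies; 2. detailed, sub-pixel optical flow and precise motion boundaries. FlowNet tries to solve both within a single network, whereas this method uses a CNN for the second problem and relies on existing methods for the first.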

　　Approach:

　　The paper uses a spatial pyramid, working from coarse to fine, to handle large motions.
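　　The coarse-to-fine procedure can be sketched as a loop over pyramid levels. This is a hedged NumPy outline, not the paper's code: all function names are hypothetical, 2x2 averaging and pixel repetition stand in for whatever resampling the authors use, and `warp_fn` and `residual_fn` are placeholders for the warping operator and the per-level CNN. Image height and width are assumed to be divisible by 2 ** (num_levels - 1).

```python
import numpy as np

def downsample(image):
    """Halve the resolution by averaging 2x2 blocks (H and W must be even)."""
    h, w = image.shape
    return image.reshape(h // 2, 2, w // 2, 2).mean(axis=(1, 3))

def upsample_flow(flow):
    """Double the resolution of an (H, W, 2) flow by pixel repetition and
    double its values, since displacements scale with resolution."""
    return 2.0 * flow.repeat(2, axis=0).repeat(2, axis=1)

def build_pyramid(image, num_levels):
    """Return [coarsest, ..., finest] versions of `image`."""
    levels = [image]
    for _ in range(num_levels - 1):
        levels.append(downsample(levels[-1]))
    return levels[::-1]

def coarse_to_fine_flow(img1, img2, num_levels, warp_fn, residual_fn):
    """Accumulate flow from the coarsest level to the finest.

    warp_fn(image, flow) -> warped image (placeholder for the warp).
    residual_fn(img1, warped_img2, flow) -> (H, W, 2) flow update
    (placeholder for the per-level network).
    """
    pyr1 = build_pyramid(img1, num_levels)
    pyr2 = build_pyramid(img2, num_levels)
    h, w = pyr1[0].shape
    flow = np.zeros((h, w, 2))  # start from zero flow at the coarsest level
    for level, (a, b) in enumerate(zip(pyr1, pyr2)):
        if level > 0:
            flow = upsample_flow(flow)
        warped = warp_fn(b, flow)                   # align img2 to img1
        flow = flow + residual_fn(a, warped, flow)  # add the small update
    return flow

# Usage with dummy placeholders: identity warp, zero update.
a = np.zeros((8, 8))
flow = coarse_to_fine_flow(
    a, a, 3,
    warp_fn=lambda img, f: img,
    residual_fn=lambda i1, i2w, f: np.zeros(f.shape),
)
```

　　Passing the warp and the update as arguments keeps the pyramid logic separate from the learned part, which mirrors the division of labor described above: the pyramid handles large motions and the network only refines.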


Original post: http://www.cnblogs.com/wangxiaocvpr/p/7058617.html
