Feature extraction using convolution

时间：2014-09-19 17:02:25 阅读：238 评论：0 收藏：0 [点我收藏+]

Fully Connected Networks

In the sparse autoencoder, one design choice that we had made was to "fully connect" all the hidden units to all the input units. On the relatively small images that we were working with (e.g., 8x8 patches for the sparse autoencoder assignment, 28x28 images for the MNIST dataset), it was computationally feasible to learn features on the entire image. However, with larger images (e.g., 96x96 images) learning features that span the entire image (fully connected networks) is very computationally expensive--you would have about 10⁴ input units, and assuming you want to learn 100 features, you would have on the order of 10⁶ parameters to learn. The feedforward and backpropagation computations would also be about 10² times slower, compared to 28x28 images.

对高分辨率图像，输入结点和隐藏结点全连通会带来非常大的计算量

Locally Connected Networks

One simple solution to this problem is to restrict the connections between the hidden units and the input units, allowing each hidden unit to connect to only a small subset of the input units. Specifically, each hidden unit will connect to only a small contiguous region of pixels in the input.

This idea of having locally connected networks also draws inspiration from how the early visual system is wired up in biology. Specifically, neurons in the visual cortex have localized receptive fields (i.e., they respond only to stimuli in a certain location).

局部连接，生物学依据

Convolutions

Natural images have the property of being stationary, meaning that the statistics of one part of the image are the same as any other part. This suggests that the features that we learn at one part of the image can also be applied to other parts of the image, and we can use the same features at all locations.

More precisely, having learned features over small (say 8x8) patches sampled randomly from the larger image, we can then apply this learned 8x8 feature detector anywhere in the image. Specifically, we can take the learned 8x8 features andconvolve them with the larger image, thus obtaining a different feature activation value at each location in the image.

To give a concrete example, suppose you have learned features on 8x8 patches sampled from a 96x96 image. Suppose further this was done with an autoencoder that has 100 hidden units. To get the convolved features, for every 8x8 region of the 96x96 image, that is, the 8x8 regions starting at $bubuko.com,布布扣$ , you would extract the 8x8 patch, and run it through your trained sparse autoencoder to get the feature activations. This would result in 100 sets 89x89 convolved features.

Formally, given some large $bubuko.com,布布扣$ images x_large, we first train a sparse autoencoder on small $bubuko.com,布布扣$ patches x_small sampled from these images, learning k features f = σ(W⁽¹⁾x_small + b⁽¹⁾) (where σ is the sigmoid function), given by the weights W⁽¹⁾and biases b⁽¹⁾ from the visible units to the hidden units. For every $bubuko.com,布布扣$ patch x_s in the large image, we compute f_s = σ(W⁽¹⁾x_s + b⁽¹⁾), giving us f_convolved, a $bubuko.com,布布扣$ array of convolved features.

Feature extraction using convolution

标签：des http io os ar strong for art sp

原文地址：http://www.cnblogs.com/sprint1989/p/3981615.html

踩

(0)

评论一句话评论（0）

分享档案

更多>

2021年07月29日 (22)
2021年07月28日 (40)
2021年07月27日 (32)
2021年07月26日 (79)
2021年07月23日 (29)
2021年07月22日 (30)
2021年07月21日 (42)
2021年07月20日 (16)
2021年07月19日 (90)
2021年07月16日 (35)

周排行