[CVPR 2005] Histograms of Oriented Gradients for Human Detection

Date: 2014-07-09 11:02:21

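HOG has much in common with edge orientation histograms, the scale-invariant feature transform (SIFT) and shape contexts. It differs from them in that the HOG descriptor is computed on a dense grid of uniformly sized cells and uses overlapping local contrast normalization to improve performance. Because HOG operates on local cells of the image, it stays largely invariant to geometric and photometric transformations of the image.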


Algorithm step 1: Gamma/Colour Normalization

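The authors applied colour and gamma normalization to the image in grayscale, RGB and LAB colour spaces, but their experiments showed that this preprocessing has only a modest effect on the final result. A likely reason is that the normalization performed in later steps achieves much the same thing. In practice, this step can therefore be omitted.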
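Gamma normalization is a per-pixel power law, so it is cheap to try before deciding whether to keep it. The following is a minimal sketch in plain Python; the function name `gamma_normalize`, the nested-list image layout and the default exponent of 0.5 (square-root compression) are assumptions made for illustration, not details taken from this post.

```python
def gamma_normalize(image, gamma=0.5, max_value=255.0):
    """Apply power-law (gamma) compression to every channel value.

    `image` is a nested list: rows -> pixels -> channel values in
    [0, max_value]. The result holds values rescaled to [0, 1].
    gamma=0.5 is an illustrative default (square-root compression).
    """
    return [
        [[(c / max_value) ** gamma for c in pixel] for pixel in row]
        for row in image
    ]


# One row with a single RGB pixel.
normalized = gamma_normalize([[[0.0, 255.0, 63.75]]])
```

Calling it with `gamma=1.0` only rescales the values, which makes it easy to compare a detector with and without the compression.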

Algorithm step 2: Gradient Computation

Several smoothing scales were tested, including σ = 0 (none). The derivative masks compared were:

1-D point derivatives: uncentred [−1, 1], centred [−1, 0, 1] and cubic-corrected [1, −8, 0, 8, −1]

2×2 diagonal masks

3×3 Sobel masks

Simple 1-D [−1, 0, 1] masks at σ = 0 work best.

For colour images, we calculate separate gradients for each colour channel, and take the one with the largest norm as the pixel’s gradient vector.
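The gradient step can be sketched as follows, using the centred 1-D [−1, 0, 1] mask with no smoothing and picking, per pixel, the channel whose gradient has the largest norm. This is an illustrative sketch in plain Python rather than the authors' code: the function names are hypothetical, each channel is assumed to be a list of rows, and clamping coordinates at the image border is an assumption, since the text above does not say how borders are handled.

```python
import math


def centred_gradient(channel, x, y):
    """Gradient of one channel at (x, y) using the 1-D [-1, 0, 1] mask.

    Coordinates are clamped at the border (an assumption).
    """
    h, w = len(channel), len(channel[0])
    left = channel[y][max(x - 1, 0)]
    right = channel[y][min(x + 1, w - 1)]
    up = channel[max(y - 1, 0)][x]
    down = channel[min(y + 1, h - 1)][x]
    return right - left, down - up


def pixel_gradient(channels, x, y):
    """Return the per-channel gradient (gx, gy) with the largest norm."""
    return max(
        (centred_gradient(ch, x, y) for ch in channels),
        key=lambda g: math.hypot(g[0], g[1]),
    )


def magnitude_and_orientation(gx, gy):
    """Gradient magnitude and unsigned orientation in degrees (mod 180)."""
    magnitude = math.hypot(gx, gy)
    angle = math.degrees(math.atan2(gy, gx)) % 180.0
    return magnitude, angle


# Three 3x3 channels: a horizontal ramp, a flat channel, a steeper vertical ramp.
r = [[0, 1, 2], [0, 1, 2], [0, 1, 2]]
g = [[0, 0, 0], [0, 0, 0], [0, 0, 0]]
b = [[0, 0, 0], [5, 5, 5], [10, 10, 10]]
gx, gy = pixel_gradient([r, g, b], 1, 1)  # the vertical ramp in b wins
```

The magnitude later serves as the vote weight and the orientation selects the histogram bins.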

Algorithm step 3: Spatial / Orientation Binning

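The orientation bins span 0°–180° (“unsigned” gradient) or 0°–360° (“signed” gradient). The authors found that unsigned gradients with 9 histogram channels gave the best results in their pedestrian detection experiments.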

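A histogram of oriented gradients is accumulated for each cell, giving a 9-dimensional vector, and votes are cast using trilinear interpolation. Why three interpolations? [Two are for the spatial offsets in x and y, and one is for the angle. For example, if my angle is 20°, it sits on the boundary between the 0°–20° and 20°–40° bins, so the vote has to be shared between those two bins.]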
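The voting scheme above can be sketched as three independent linear splits (x, y and angle) whose weights are multiplied together. This is a hedged illustration, not the authors' implementation: the helper names, the 8-pixel cell size, the `hist[cell_y][cell_x][bin]` layout and the choice to drop votes that fall outside the cell grid are all assumptions. Bins are taken to be 20° wide with centres at 10°, 30°, and so on, wrapping around at 180°.

```python
import math


def linear_split(pos, n, wrap):
    """Split a unit weight between the two slots nearest to `pos`.

    `pos` is in slot units measured from slot centres. With wrap=True the
    indices wrap modulo n (orientation); otherwise out-of-range slots are
    dropped (spatial cells at the grid border).
    """
    lower = math.floor(pos)
    frac = pos - lower
    pairs = [(lower, 1.0 - frac), (lower + 1, frac)]
    if wrap:
        return [(i % n, w) for i, w in pairs]
    return [(i, w) for i, w in pairs if 0 <= i < n]


def trilinear_vote(hist, x, y, angle_deg, magnitude,
                   cell=8, n_bins=9, span=180.0):
    """Add one pixel's vote to hist[cell_y][cell_x][bin]."""
    ys = linear_split((y + 0.5) / cell - 0.5, len(hist), wrap=False)
    xs = linear_split((x + 0.5) / cell - 0.5, len(hist[0]), wrap=False)
    bin_pos = (angle_deg % span) / (span / n_bins) - 0.5
    bs = linear_split(bin_pos, n_bins, wrap=True)
    for cy, wy in ys:
        for cx, wx in xs:
            for b, wb in bs:
                hist[cy][cx][b] += magnitude * wy * wx * wb


# A 2x2 grid of cells, each with 9 orientation bins.
hist = [[[0.0] * 9 for _ in range(2)] for _ in range(2)]
# A point at the centre of cell (0, 0) with angle 20 degrees and magnitude 2:
# the whole vote stays in that cell and is shared equally by bins 0 and 1.
trilinear_vote(hist, 3.5, 3.5, 20.0, 2.0)
```

A pixel that lies between cell centres would spread the same vote over up to four cells as well as two orientation bins.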

It is useful to downweight pixels near the edges of the block by applying a Gaussian spatial window to each pixel before accumulating orientation votes into cells (σ = 0.5 × block width).
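The Gaussian window can be precomputed once per block size. The sketch below assumes the window is centred on the block and is left unnormalized. The function name `gaussian_window` and the 16-pixel block width in the example are illustrative assumptions; only σ = 0.5 × block width comes from the text.

```python
import math


def gaussian_window(block_width, sigma_scale=0.5):
    """Return a block_width x block_width grid of Gaussian weights.

    The window is centred on the block, with sigma = sigma_scale * block_width.
    """
    sigma = sigma_scale * block_width
    centre = (block_width - 1) / 2.0
    return [
        [
            math.exp(-((x - centre) ** 2 + (y - centre) ** 2)
                     / (2.0 * sigma ** 2))
            for x in range(block_width)
        ]
        for y in range(block_width)
    ]


# Illustrative block of 16x16 pixels (the block size is an assumption).
window = gaussian_window(16)
```

Each pixel's gradient magnitude would then be multiplied by `window[y][x]`, with (x, y) measured inside the block, before it votes into the cell histograms.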

Algorithm step 4: Grouping the cells together into larger blocks