
Log-Sum-Exp Trick to Prevent Numerical Underflow


Adapted from: http://wittawat.com/log-sum_exp_underflow.html

 

Multiplying a series of terms $p_1 p_2 \cdots p_n$ where $0 \le p_i \le 1$ can easily result in numerical underflow. This is especially the case for probabilistic models, which involve such products all the time. One commonly used solution is to work with the log of the probabilities instead:

$\log (p_1 \cdots p_n) = \sum_i \log p_i,$

which turns the product into a summation. Since a sum of log-probabilities only grows linearly in magnitude rather than shrinking toward zero, the underflow problem is avoided.
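
To see the problem concretely, here is a minimal sketch (plain Python; the numbers are arbitrary) showing the product of many small probabilities underflowing to zero while the sum of their logs stays perfectly representable:

from math import log

probs = [1e-5] * 100                     # 100 small probabilities
product = 1.0
for p in probs:
    product *= p                         # drops below ~1e-308 and underflows
log_prob = sum(log(p) for p in probs)    # stays finite

print(product)    # 0.0  (underflow)
print(log_prob)   # -1151.29...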

Unfortunately, this little trick alone does not work when we have a sum of products of probabilities, as in, for example, the backward algorithm for HMMs:

$\beta_k(z_k) = \sum_{z_{k+1}} \beta_{k+1}(z_{k+1})\, p(x_{k+1} \mid z_{k+1})\, p(z_{k+1} \mid z_k).$

The exact meaning of each term is irrelevant. What matters here is that as we successively apply this recurrence relation (i.e., as $k$ decreases), $\beta_k(z_k)$ will underflow. The solution is the same as before: work with $\log \beta_k(z_k)$ instead. Taking the log of both sides and inserting $\exp \log$ inside the sum, we have:

$\log \beta_k(z_k) = \log \sum_{z_{k+1}} \exp\left[\log \beta_{k+1}(z_{k+1})\, p(x_{k+1} \mid z_{k+1})\, p(z_{k+1} \mid z_k)\right],$

which has the general form $\log \sum_i \exp a_i$.

The reason for applying $\exp \log$ on the RHS is to obtain the log version of $\beta_{k+1}$, so that the recurrence relation can be reused. We have defined $a_i$ to absorb the log term on the RHS. Note that typically $a_i \ll 0$, so we still face underflow when computing $\exp a_i$. The remedy this time is to define

$b = \max_i a_i, \qquad a_i = \log\left[\beta_{k+1}(z_{k+1})\, p(x_{k+1} \mid z_{k+1})\, p(z_{k+1} \mid z_k)\right],$

and calculate $\log\left[\exp(b) \sum_i \exp(a_i - b)\right]$ instead:

$\log \sum_i \exp a_i = \log\left[\exp(b) \sum_i \exp(a_i - b)\right] = b + \log \sum_i \exp(a_i - b).$

Since $b = \max_i a_i$ and each $a_i$ is the log of a product of probabilities (so $b < 0$), every exponent $a_i - b$ is at most $0$, and the largest is exactly $0$. Hence each $\exp(a_i - b) \le 1$, and the sum contains at least one term equal to $1$, so it cannot underflow to zero. (Terms far below the maximum may still underflow individually, but they are negligible anyway.) We have avoided the underflow problem by computing the RHS of the last line.
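
A quick numerical check (a minimal Python sketch; the values of $a_i$ are arbitrary) shows the difference: the naive computation loses everything to underflow, while the shifted version recovers the answer:

from math import log, exp, inf

a = [-1000.0, -1001.0, -1002.0]

# Naive: every exp(a_i) underflows to 0.0, so the log is undefined.
s = sum(exp(ai) for ai in a)            # 0.0
naive = log(s) if s > 0.0 else -inf     # -inf

# Shifted: subtract b = max(a) before exponentiating.
b = max(a)
stable = b + log(sum(exp(ai - b) for ai in a))

print(naive)    # -inf
print(stable)   # about -999.59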

 

Compare this with the following code:

from math import log, exp

def logsumexp(A):
    # log(sum(exp(a) for a in A)), shifted by the max to avoid underflow.
    b = max(A)
    return log(sum(exp(a - b) for a in A)) + b

def dot(feats, w):
    # Sparse dot product between a feature dict and a weight dict.
    return sum(v * w.get(k, 0.0) for k, v in feats.items())

def calc_feat(x, i, l, r):
    # Transition feature (tag l -> tag r) and emission feature (tag r emits x[i]).
    return {("T", l, r): 1, ("E", r, x[i]): 1}

def calc_e(x, i, l, r, w, e_prob):
    # Memoized log edge score for moving from tag l to tag r at position i.
    if (i, l, r) not in e_prob:
        e_prob[i, l, r] = dot(calc_feat(x, i, l, r), w)
    return e_prob[i, l, r]

def calc_f(x, i, l, w, e, f):   # forward recursion, entirely in log space
    # `tagids` is assumed to be a global list of tag ids, with id 0 the start tag.
    if (i, l) not in f:
        if i == 0:
            f[i, 0] = 0.0   # log 1: the start state
        else:
            prev_states = range(1, len(tagids)) if i != 1 else [0]
            f[i, l] = logsumexp([
                calc_f(x, i - 1, k, w, e, f) + calc_e(x, i, k, l, w, e)
                for k in prev_states])
    return f[i, l]   # reuse the memoized log value
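
For completeness, a hypothetical usage sketch (the tag set tagids, the sentence x, and the empty weight vector w below are illustrative assumptions, not part of the original post):

tagids = ["<s>", "NOUN", "VERB"]   # tag id 0 plays the role of the start tag
x = ["<s>", "dogs", "bark"]        # x[0] is a dummy token for the start position
w, e, f = {}, {}, {}               # untrained weights: dot() falls back to 0.0

# Log-space forward score of reaching tag 1 at position 2.
print(calc_f(x, 2, 1, w, e, f))    # log 2, since both previous tags contribute 0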

 

Soft-Max Function

Just a side note: $\log \sum_i \exp a_i$ (the log of the softmax denominator) is called a soft max function because

$\log \sum_i \exp a_i \approx \max_i a_i.$

The approximation error tends to get larger as the magnitude of the $a_i$ gets smaller, however.
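
A quick sketch (reusing the logsumexp function defined above; the input values are arbitrary) illustrates both the approximation and how it degrades for small-magnitude inputs:

print(logsumexp([10.0, 1.0, 0.5]))       # ~10.0002, very close to max = 10.0
print(logsumexp([0.10, 0.01, 0.005]))    # ~1.14, far from max = 0.10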

Source: (ML 14.10) Underflow and the log-sum-exp trick by mathematicalmonk


Original post: http://www.cnblogs.com/Dminer/p/4340716.html
