码迷,mamicode.com
首页 > 其他好文 > 详细

增广拉格朗日乘子法(Augmented Lagrange Method)

时间:2017-06-07 12:44:39      阅读:345      评论:0      收藏:0      [点我收藏+]

标签:ems   end   arm   rest   guess   pos   media   range   because   

转载自:增广拉格朗日乘子法(Augmented Lagrange Method)

增广拉格朗日乘子法的作用是用来解决等式约束下的优化问题,

 

假定需要求解的问题如下:

    minimize   f(X)

    s.t.:     h(X)=0

其中,f:Rn->R; h:Rn->Rm

 

朴素拉格朗日乘子法的解决方案是:

    L(X,λ)=f(X)+μh(X);  μ:Rm

    此时,求解L对X和μ的偏导同时为零就可以得到最优解了。

 

增广拉格朗日乘子法的解决方案是:

    Lc(x,λ)=f(X)+μh(X)+1/2c|h(X)|2

    每次求出一个xi,然后按照梯度更新参数μ,c每次迭代逐渐增大(使用ALM方法好像还有一些假设条件)

    整个流程只需要几步就可以完成了,一直迭代就可得到最优解了。

    技术分享

 

 

参考文献:

  [1]Multiplier and Gradient Methods,1969

  [2]constrained optimization and lagrange multiplier methods(page 104),1982

 

wiki:https://en.wikipedia.org/wiki/Augmented_Lagrangian_method

Let us say we are solving the following constrained problem:

技术分享

subject to

技术分享

This problem can be solved as a series of unconstrained minimization problems. For reference, we first list the penalty method approach:

技术分享

The penalty method solves this problem, then at the next iteration it re-solves the problem using a larger value of 技术分享 (and using the old solution as the initial guess or "warm-start").

The augmented Lagrangian method uses the following unconstrained objective:

技术分享

and after each iteration, in addition to updating技术分享, the variable 技术分享 is also updated according to the rule

技术分享

where 技术分享 is the solution to the unconstrained problem at the kth step, i.e. 技术分享

The variable 技术分享 is an estimate of the Lagrange multiplier, and the accuracy of this estimate improves at every step. The major advantage of the method is that unlike the penalty method, it is not necessary to take 技术分享 in order to solve the original constrained problem. Instead, because of the presence of the Lagrange multiplier term, 技术分享 can stay much smaller.

The method can be extended to handle inequality constraints. For a discussion of practical improvements, see.[4]

 

增广拉格朗日乘子法(Augmented Lagrange Method)

标签:ems   end   arm   rest   guess   pos   media   range   because   

原文地址:http://www.cnblogs.com/hellowooorld/p/6955921.html

(0)
(0)
   
举报
评论 一句话评论(0
登录后才能评论!
© 2014 mamicode.com 版权所有  联系我们:gaon5@hotmail.com
迷上了代码!