Newton's method for minimisation returns a critical point

Dussan Radonich on 10 Nov 2020
Edited: Bruno Luong on 11 Nov 2020
I am trying to implement Newton's method to find the minima of the Himmelblau function.
The code works most of the time, but in cases like this one, where my initial guess is (0.5, 1), it returns a critical point of the function instead of a minimum. I understand this is because the gradient becomes 0 there, so no new points are generated.
Now my question would be: is this normal with this method? Is there a way of getting around this problem?
Thanks for any help
close all; clear; clc
% Initialisation of variables to use
x0 = [0.5;1];
tol = 1e-4;
maxits = 50;
% Himmelblau function
him = @(x,y) (x.^2 + y - 11).^2 + (x + y.^2 - 7).^2;
% Gradient of the Himmelblau
grad_him = @(x,y) [[4*x.^3 + 4*x.*y - 42*x + 2*y.^2 - 14];[4*y.^3 + 4*x.*y - 26*y + 2*x.^2 - 22]];
% Hessian matrix of the Himmelblau
hessian_him = @(x,y) [[ 12*x.^2 + 4*y - 42 , 4*x + 4*y ];[ 4*x + 4*y , 12*y.^2 + 4*x - 26 ]];
% Call to newton's function and displaying our results accordingly
[r, iters, flag] = newton_min(grad_him,hessian_him,x0,tol,maxits);
fprintf ("<strong>Newton's method</strong>\n\n");
switch (flag)
    case 0
        fprintf("There was a convergence on f\n\n");
        fprintf("The minimum found is: \n");
        disp(r);
        fprintf("It took %d iterations.\n\n", iters);
    case 1
        fprintf("There was a convergence on x\n\n");
        fprintf("The minimum found is: \n");
        disp(r);
        fprintf("It took %d iterations.\n\n", iters);
    otherwise
        fprintf("There was no convergence\n\n");
end
function [r, iters, flag] = newton_min(dg,ddg,x0,tol,maxits)
    x = x0(1); y = x0(2);
    r = NaN;
    flag = -1;
    for iters = 1 : maxits
        x_old = [x;y];
        x_new = x_old - (ddg(x,y)\dg(x,y));
        if norm(dg(x,y)) < tol
            flag = 0;
            r = x_new;
            return;
        end
        if norm(x_new - x_old) <= (tol + eps*norm(x_new))
            flag = 1;
            r = x_new;
            return;
        end
        x = x_new(1);
        y = x_new(2);
    end
end
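For reference, one way to see what kind of critical point the code has returned is to look at the eigenvalues of the Hessian at r. This is only an illustrative check, assuming the script above has just been run and the run converged (flag >= 0):
% Classify the returned point r by the signs of the Hessian eigenvalues there
if flag >= 0
    lambda = eig(hessian_him(r(1), r(2)));
    if all(lambda > 0)
        disp("All Hessian eigenvalues positive: r is a local minimum.");
    elseif all(lambda < 0)
        disp("All Hessian eigenvalues negative: r is a local maximum.");
    else
        disp("Mixed-sign eigenvalues: r is a saddle point, not a minimum.");
    end
end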

Accepted Answer

Matt J on 10 Nov 2020
Yes, it's normal.
30 Comments
Matt J on 11 Nov 2020
"But that is not the point where the gradient would be zero, it is the critical point (-0.1280, -1.9537)."
Yes, but as long as the algorithm goes downhill from (0.5, 1) at every iteration, it can never approach the saddle point (-0.1280, -1.9537). That saddle point lies uphill from your initial point:
>> him(0.5,1)
ans =
125.3125
>> him(-0.1280, -1.9537)
ans =
178.3372
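For completeness, plugging the quoted coordinates into grad_him and hessian_him from the question shows that (-0.1280, -1.9537) is indeed (to within the rounding of the coordinates) a stationary point, and a saddle rather than a minimum:
grad_him(-0.1280, -1.9537)            % roughly [0.0018; 0.0006], essentially zero
det(hessian_him(-0.1280, -1.9537))    % negative, so the Hessian is indefinite: a saddle point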
Dussan Radonich on 11 Nov 2020
Great guys, I got it! Thank you so much


More Answers (2)

J. Alex Lee on 10 Nov 2020
Yes, this looks normal: you are only asking to zero the gradient of the function, so that naturally includes non-optimal points where the gradient is the zero vector.
You can use a non-gradient minimizer, like fminsearch, to seek local minima.
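A minimal sketch of that suggestion, assuming the handle him from the question script is available in the workspace (fminsearch wants a function of a single vector argument, hence the small wrapper):
f = @(p) him(p(1), p(2));                 % wrap him(x,y) as a function of one vector
[p_min, fval] = fminsearch(f, [0.5; 1])   % same initial guess as the Newton run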
1 Comment
Dussan Radonich on 10 Nov 2020
Thank you, the idea is not to use fminsearch, as I am trying to compare Newton's method against fminsearch.



Bruno Luong on 10 Nov 2020
Edited: Bruno Luong on 10 Nov 2020
"Now my question would be, is this normal with this method?"
Your code just shows it: yes, it is normal.
In practice it is very rare to fall on a stationary point that is not a local minimum: as soon as you work with a non-academic objective function, you will never get a gradient that is exactly 0.
"Is there a way of getting around this problem?"
None of the optimization books I have read cares about this specific problem, since, as I said, it only happens in academic examples. However, many methods compute an approximation of the Hessian at each iteration, and the positive definiteness of that Hessian is either enforced or monitored. A Hessian with negative eigenvalues, like yours at (0.5, 1), automatically gets special treatment so the iteration escapes from a non-minimum.
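One standard way to enforce that (a generic modified-Newton / Levenberg-style safeguard, sketched here with the handles from the question rather than any particular textbook's recipe) is to shift the Hessian by a multiple of the identity until it is positive definite before solving for the step:
% Inside the iteration, at the current point (x, y):
g = grad_him(x, y);
H = hessian_him(x, y);
mu = 0;
lambda_min = min(eig(H));
if lambda_min <= 0
    mu = -lambda_min + 1e-3;      % shift so that H + mu*eye(2) is positive definite
end
step = -(H + mu*eye(2)) \ g;      % a descent direction whenever g is nonzero
x_new = [x; y] + step;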
