fminunc gets first-order optimality of zero on iteration 0, does not find optimum

Question

Koen Franse 2020 年 8 月 11 日

0
リンク

この質問への直接リンク

https://jp.mathworks.com/matlabcentral/answers/578170-fminunc-gets-first-order-optimality-of-zero-on-iteration-0-does-not-find-optimum

コメント済み: Koen Franse 2020 年 8 月 17 日

Hi everyone,

I am trying to optimize a parameter of a Finite Element model by using fminunc. However, the optimization algorithm finishes on iteration 0, with a first-order optimality measure of 0, and therefore does not find the optimum value for the parameter:

I have run my Finite element model with a few values for the parameter that I want to optimize:

As it looks to me, there is a clear global minimum, but for some reason the fminunc doesn't find it now. Does anyone know how to fix this problem?

Thanks in advance, Koen

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

サインインしてコメントする。

サインインしてこの質問に回答する。

Answer 1

John D'Errico 2020 年 8 月 11 日

0
リンク

この回答への直接リンク

https://jp.mathworks.com/matlabcentral/answers/578170-fminunc-gets-first-order-optimality-of-zero-on-iteration-0-does-not-find-optimum#answer_478168

編集済み: John D'Errico 2020 年 8 月 11 日

MATLAB Online で開く

First, this appears to be a ONE parameter optimization. Do not use fminunc for that. Instead, us fminbnd. It will be more robust. It may even be more efficient.

Second almost always when someone says what you have, this means the function is coded incorrectly. Before performing an optiimization, test to see if the objective function changes for different inputs. The response you got is the response you would see if a function is just a constant.

For example, see what happens here:

>> testfun = @(x) 1;
>> testfun(1)
ans =
     1
>> testfun(pi)
ans =
     1

Anything I send into testfun, I get 1 out.

Now, what happens when I try fminunc here?

>> fminunc(testfun,pi)
Initial point is a local minimum.
Optimization completed because the size of the gradient at the initial point 
is less than the value of the optimality tolerance.
<stopping criteria details>
ans =
                    3.1416

I get exactly the same result as you got.

So first, verify that if you send different input parametrs into your function, that you get something differnt.

Next, verify that your function is differentiable. fminunc REQUIRES this. If you do something inside that rounds the inputs or the output, then fminunc cannot be used. Again, an example should suffice:

>> testfun = @(x) round(x);
>> fminunc(testfun,1.1)
Initial point is a local minimum.
Optimization completed because the size of the gradient at the initial point 
is less than the value of the optimality tolerance.
<stopping criteria details>
ans =
                       1.1

So for ANY x in the vicinty of an integer, you get the same integer out. NO change. As far as fminunc is concerrned, this is a constant function, and it will terminate immediately.

testfun(.5:.1:1.4)
ans =
     1     1     1     1     1     1     1     1     1     1

Why try to optimize something that does not change?

The odds are therefore good, that if you think your function should produce some non-constant response, then you have a bug in your code. We cannot diagnose that without seeing the code of course, and my MATLAB crystal ball is always on the fritz. :)

5 件のコメント
3 件の古いコメントを表示3 件の古いコメントを非表示

Koen Franse 2020 年 8 月 12 日

MATLAB Online で開く

Hi John,

Thanks for your detailed explanation! Later on I want to use my model for multiple parameters, therefore I decided to use fminunc. My function does not return constant values for different input values, the output for different input values is plotted in the figure of my post.

However, I question if my function is differentiable. Because I'm running a finite element (FE) model outside of matlab in this function I cannot share a working code here. However, what I'm essentially trying to do is optimize the value of the FE model so that the displacement results correspond to a reference state. So minimizing the difference in displacement between two results. My function code looks something like this:

function MSE = obj_function(YM, displ_ref) % computes mean square error between computed and reference state
    displ = calc_displacement(YM); % This function runs the FE model computing the displacement
    
    m = length(displ_ref)
    MSE = (1/(2*m))*sum( (displ_ref - displ ).^2 );
end

Is it possible that this is not differentiable because the function is not solely 'matlab-defined'? Furthermore I tried fminsearch, and that seems to work for me. What would be your advice on that?

Thanks!

Alan Weiss 2020 年 8 月 14 日

I believe that you need to take huge finite difference steps because your problem has poor scaling. I think that internally your objective function should multiply the parameter YM by 1e4 AND you should set the FiniteDifferenceStepSize option to 1e-2 or even 1e-1. Or perhaps internally take exp(YM) as the parameter; I think that is a better idea, now that I think of it. And still take a reasonably large step size, maybe 1e-3 even with an exponential scaling.

Alan Weiss

MATLAB mathematical toolbox documentation

Koen Franse 2020 年 8 月 17 日

MATLAB Online で開く

Thanks for the tip Alan, scaling was indeed the problem. I now have the optimization working correctly.

For other people that might run into similar problems in the future; what I did is take

YM_scaled = log(YM)

as the input parameter for fminunc, and inside the objective function used the 'real'

YM = exp(YM_scaled)

as an input for my FE model. Finally, as the output of my objective function, I took

MSE_scaled = log( (1/(2*m))*sum( (displ_ref - displ ).^2 ) )

So now fminunc uses the log-values for both the input and the output, and in fact is optimizing the function I plotted above without the logarithmic axes. In this way fminunc can find an optimal value.

サインインしてコメントする。

Answer 2

Bruno Luong 2020 年 8 月 14 日

0
リンク

この回答への直接リンク

https://jp.mathworks.com/matlabcentral/answers/578170-fminunc-gets-first-order-optimality-of-zero-on-iteration-0-does-not-find-optimum#answer_479817

編集済み: Bruno Luong 2020 年 8 月 14 日

If your gradient is 0, then your FEM returns the exact same result for two different values of YM.

This can be created by many things, such as truncation of the interface between MATLAB and your FEM SW, some thresholing in the equation you are trying to solve, mesh generator, etc... No one can tell since you did not disclose to us the details of FEM part.

3 件のコメント
1 件の古いコメントを表示1 件の古いコメントを非表示

Bruno Luong 2020 年 8 月 14 日

編集済み: Bruno Luong 2020 年 8 月 14 日

Because when you plot you "control" the step.

FMINUNC uses a step-size that might or might not be larger than the threshold to evaluate the gradient, and it requires the function to be differentiable. The FEM obviously is piecewise constant, meaning not differentiable wrt YM parameter.

FMINSEARH is gradient-less method a,nd 1D. It's very poor optimization method IMO.

"the problem is not inside my FEM solver". Let me rephrase itmore accurately: The problem is you select a wrong optimization method because the FEM is not differentiable wrt YM parameter so it does not meet the requirement of "smooth" (C1) objective function.

Koen Franse 2020 年 8 月 14 日

Okay, that explains a little more to me. However there is still one thing I don't completely understand: if I understand the FMINUNC/FMINCON solvers use finite differencing to compute the gradient for each model evaluation. I tried some subtle differences in my YM parameter, which resulted in differences in the value computed by my objective function. In fact that is what the finite differencing method does to approximate the gradient right? Therefore I still do not understand why the FMINUNC solver finds gradients of exactly zero.

You say I selected the wrong optimization method for this problem; do you maybe have a suggestion for a better alternative?

サインインしてコメントする。

Answer 3

Bruno Luong 2020 年 8 月 14 日

0
リンク

この回答への直接リンク

https://jp.mathworks.com/matlabcentral/answers/578170-fminunc-gets-first-order-optimality-of-zero-on-iteration-0-does-not-find-optimum#answer_480033

Again the step you tried might not be the step FMINUNC selects. It has a bunch of decision tree behind FMINUNC. I don't know why the FEM returns the same value for 2 different YM, but obviously that happens. You might be able to track the step FMINUNC used by adding some instrumental code in your objective function.

Again I don't know how your FEM works, I can't recommend you the method. Your function looks also having a very narrows valley where each side ithat look like a concave shape. This also not something gradient method like.

But again the problem seems that the FEM have some thresholding calculation on YM and it makes your objective function non-smooth thus you can't use any gradient method.

I start to repeat myself a lot.

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

サインインしてコメントする。

fminunc gets first-order optimality of zero on iteration 0, does not find optimum

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

回答 (3 件)

5 件のコメント
3 件の古いコメントを表示3 件の古いコメントを非表示

3 件のコメント
1 件の古いコメントを表示1 件の古いコメントを非表示

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

参考

カテゴリ

タグ

製品

リリース

Community Treasure Hunt

fminunc gets first-order optimality of zero on iteration 0, does not find optimum

0 件のコメント -2 件の古いコメントを表示-2 件の古いコメントを非表示

回答 (3 件)

5 件のコメント 3 件の古いコメントを表示3 件の古いコメントを非表示

3 件のコメント 1 件の古いコメントを表示1 件の古いコメントを非表示

0 件のコメント -2 件の古いコメントを表示-2 件の古いコメントを非表示

参考

カテゴリ

タグ

製品

リリース

Community Treasure Hunt

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

5 件のコメント
3 件の古いコメントを表示3 件の古いコメントを非表示

3 件のコメント
1 件の古いコメントを表示1 件の古いコメントを非表示

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示