How to estimate probabilities of an arbitrary range, based on the probability distribution of a given a data set of numbers?

Question

Clarisha Nijman 2018 年 10 月 22 日

0
リンク

この質問への直接リンク

https://jp.mathworks.com/matlabcentral/answers/425279-how-to-estimate-probabilities-of-an-arbitrary-range-based-on-the-probability-distribution-of-a-give

コメント済み: Clarisha Nijman 2018 年 10 月 23 日

Hello,

Given a series of values x, I want to estimate the probabilities of a range of numbers U, in(using) the probability distribution of the given series x. My code works for one value, but I need probabilities of a range, Can somebody give me some feedback please?

Thank you in advance.

This is the code:

%%Generate some data/series
x=randi([-2 50],25,1);
%Values/ranges of interest
U=[-100:100];
%define histogram and probability distribution of x
h = histogram(x);
h.Normalization = 'probability';%Changing count in probabilities
h.Values(U); %finding probabilities of range U

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

サインインしてコメントする。

サインインしてこの質問に回答する。

Answer 1

Bruno Luong 2018 年 10 月 22 日

0
リンク

この回答への直接リンク

https://jp.mathworks.com/matlabcentral/answers/425279-how-to-estimate-probabilities-of-an-arbitrary-range-based-on-the-probability-distribution-of-a-give#answer_342725

編集済み: Bruno Luong 2018 年 10 月 22 日

MATLAB Online で開く

Use HISTCOUNTS then

N = histcounts(x, [-Inf, U, Inf]);
P = N(2:end) / sum(N)

4 件のコメント
2 件の古いコメントを表示2 件の古いコメントを非表示

Clarisha Nijman 2018 年 10 月 22 日

Ok, that is a good idea to study this topic again in Matlab, with this new insight you gave me today!

Thank a lot!

Clarisha Nijman 2018 年 10 月 23 日

x=randi([-3 3],10,1); U=[-5:5];

N = histcounts(x, [-Inf, U, Inf ]) prob = N(2:end) / sum(N)

%alternative code f=hist(x,U); prob=f/sum(f);

Now I fully understand your answer. With this small example it is clear. With the tails you are getting 2 extra intervals. An arbitrary value for U, let's say 2 is associated with interval <1,2] Such that we have eleven intervals, and since the left tail does not live in U, it is excluded, and that's why use (2:end) in the code. Thanks a lot!

サインインしてコメントする。

Answer 2

Torsten 2018 年 10 月 22 日

0
リンク

この回答への直接リンク

https://jp.mathworks.com/matlabcentral/answers/425279-how-to-estimate-probabilities-of-an-arbitrary-range-based-on-the-probability-distribution-of-a-give#answer_342651

MATLAB Online で開く

%%Generate some data/series
X=randi([-2 50],25,1);
%Values/ranges of interest
U=[-100:100];
X = sort(X)
[countsX, binsX] = hist(X)
cdfX = cumsum(countsX) / sum(countsX)
extrap_left = (min(U) > max(X));
extrap_right = (max(U) > max(X));
p_U_left = interp1(binsX,cdfX,min(U),'linear',extrap_left)
p_U_right = interp1(binsX,cdfX,max(U),'linear',extrap_right)
p_U = p_U_right - p_U_left

4 件のコメント
2 件の古いコメントを表示2 件の古いコメントを非表示

Clarisha Nijman 2018 年 10 月 22 日

If you want to use data you can not do that, that would be excluding situations that possibly might occur. That is why the frequency polygon is a smooth line. To estimate values in between.

Torsten 2018 年 10 月 22 日

編集済み: Torsten 2018 年 10 月 22 日

If you get discrete values from a random variable, say [ 1 2 4 5 6 ], how should it be possible to tell p({3}) ? (Hint: It's impossible).

In my opinion, the most reasonable estimate would be p=0 since it does not appear in the list.

If you know the distribution the values stem from, you can get a Maximum Likelihood Estimate (MLE) of the parameters describing the distribution. Having calculated these parameters, you can give estimates of probabilities for elements of your choice.

サインインしてコメントする。

Answer 3

Bruno Luong 2018 年 10 月 22 日

0
リンク

この回答への直接リンク

https://jp.mathworks.com/matlabcentral/answers/425279-how-to-estimate-probabilities-of-an-arbitrary-range-based-on-the-probability-distribution-of-a-give#answer_342718

編集済み: Bruno Luong 2018 年 10 月 22 日

MATLAB Online で開く

not sure, is it what you want?

x=randi([-2 50],10000,1);
U=[-100:100];
h = histogram(x, U);

1 件のコメント
-1 件の古いコメントを表示-1 件の古いコメントを非表示

Clarisha Nijman 2018 年 10 月 22 日

Let's say x is the profit of a shop observed 20 times. and the values are: 2,5,7,2,20,25,35,15,6,-2,15,27,2,20,15,5,7,2,20,25

This can be associated with a probability distribution. And you can plot it.

Now it is asked to estimate the probability of the values in between, and also in the tails. U=-[5 -4 -3 -2 -1 0 1 2 .... 40]

サインインしてコメントする。

How to estimate probabilities of an arbitrary range, based on the probability distribution of a given a data set of numbers?

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

採用された回答

4 件のコメント
2 件の古いコメントを表示2 件の古いコメントを非表示

その他の回答 (2 件)

4 件のコメント
2 件の古いコメントを表示2 件の古いコメントを非表示

1 件のコメント
-1 件の古いコメントを表示-1 件の古いコメントを非表示

参考

カテゴリ

タグ

Community Treasure Hunt

How to estimate probabilities of an arbitrary range, based on the probability distribution of a given a data set of numbers?

0 件のコメント -2 件の古いコメントを表示-2 件の古いコメントを非表示

採用された回答

4 件のコメント 2 件の古いコメントを表示2 件の古いコメントを非表示

その他の回答 (2 件)

4 件のコメント 2 件の古いコメントを表示2 件の古いコメントを非表示

1 件のコメント -1 件の古いコメントを表示-1 件の古いコメントを非表示

参考

カテゴリ

タグ

Community Treasure Hunt

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

4 件のコメント
2 件の古いコメントを表示2 件の古いコメントを非表示

4 件のコメント
2 件の古いコメントを表示2 件の古いコメントを非表示

1 件のコメント
-1 件の古いコメントを表示-1 件の古いコメントを非表示