Extracting data from a histogram

Question

0 投票

Hello all, on the x-axis I have the gray levels, and on the y-axis I have the number of pixels, I think? I'm still trying to understand histograms, but I think what its telling me here is that there is a lot of dark pixels in my image, comparatively to histogram 2, which seems to have a lot less darker pixels. I want to make some sort of if statement, where: if there are around 200+ pixels of gray levels 50-60, then the image probably falls into category 1, else, it belongs in category 2. My roundabout solution to this problem is simply to create a threshold that only excepts pixels between 50-60 and then counting the number of objects (which would be 0 ideally in category 2's case), but this can't be the best way of accomplishing that. Any ideas would be appreciated!

12 件のコメント
10 件の古いコメントを表示 10 件の古いコメントを非表示

Adam Danz 2018 年 6 月 30 日

編集済み: Adam Danz 2018 年 6 月 30 日

MATLAB Online で開く

Hi Kimo, It would be helpful to have axis labels which can be added like this

    xlabel('Gray level (units)'); 
    ylabel('frequency')

I'm not sure what function was used to create these plots but there is a technical difference between a histogram and a bar chart. Histograms are typically used with continuous data (ie, time) grouped into bins where each bin gets its own bar and there's usually no space between the bars. Bar charts are typically used with categorical data where each bar represents a categorie (ie, gender) and there's usually space between the bars (as in your images). So I'm not sure if your data are categorical (ie, distinct shades of gray) or continuous (a smooth transition of gray grouped into bins). I'll assume the prior and call it a bar chart.

Histograms usually plot frequencies or densities along the y axis. So if a bar that extends between x=0.5 and x=1 terminates at y=100 that means there are 100 data points in your population that exist between 0.5 and 1. Bar charts can represent a wide variety of variables along the y axis. You mentioned that your bar charts represent number of pixels along the y axis but you sound uncertain so you'd have to look into the data that was used to create the plots. Your x axis is 'gray levels' -- does that mean smaller values are closer to white while larger values are closer to black?

To compare the two distributions, note that neither distribution is normal so I'd start with medians and quantiles. Eye-balling it, the range of gray values differ but slightly -- you have a larger range which means you have darker and lighter grays than in hist2. It looks like your median is around 70 while the hist2 median is around 67 but those estimates are close so you'd have to calculate it from the data. The most salient difference is the number of pixels. Your data has almost an order of magnitude greater number of pixels than the hist2 data. For example, the hist2 data has ~70 pixels with a gray value of 60 or less. Your data has ~1000 pixels within that same range of gray values.

Now, about your idea to categorize the distributions... I initially thought you wanted to categorize based on darkness but your suggestion is actually to categorize based on number of pixels. You want to sample the 50-60 range and count the number of pixels. If two cameras take identical photos but camera B has a lower resolution than camera A, camera A will win even though there's identical darkness in the photo. So, if you could state more clearly what your goal is, the constraints, etc, it would be helpful.

Kimo Kalip 2018 年 7 月 3 日

MATLAB Online で開く

@Adam Danz

Thank you for the thorough explanation! Sorry for late response, didn't get a notification on activity apparently.

I called it a histogram because the function I was using was:

[pixelCountBG, grayLevelsBG] = imhist(flattenedImage);

So I just sort of assumed it was a histogram, but what you said makes sense, its probably more of a bar graph, no?

From my understanding, the higher values are closer to white and the lower values are darker. (When I threshold based off of these values, it seems like I get darker things when I do < and lighter when I do >). But yes, i am not entirely sure if the Y axis is pixel count, it just seems reasonable to assume based off of what I've observed so far.

As for what I'm going for: In imageA I have one object, and in imageB I have a second object layered ontop of the first object, in such a way that makes it darker. My thought process is: there seems to be a pattern in which if my histogram detects a noteable amoount of dark pixels around the 50-60 range, It'll be considered categoryB, whereas if the range is higher up (60+), it'll be categoryA, if that makes sense?

Also how do you extract the data from the histogram exactly?

Thanks again!

Adam Danz 2018 年 7 月 3 日

編集済み: Adam Danz 2018 年 7 月 3 日

If I understand your plots correctly (which I might not), each bin along the x axis is a range of grays (except maybe for the first and last which are white and black). The y axis is some frequency, let's say number of pixels. For simplicity, let's say you only have 9 bins which are summarized in the image below -- these are your x values. You are proposing to use only the 5th bin (50-60 bin labeled in the image) to judge if picture B is darker than picture A. That's where you lose me. That bin only represents one shade of gray and if the y value for picB is greater than picA that just means picB has more pixels with that shade of gray. Pic B could have 9000 pixels there and picture A could have 1 pixel there but picture A also has 9-million black pixels making it much darker than picA.

If I'm correct that your x data are shades of gray and your y data are frequencies, I'd suggest using means (if ~normally distributed) or medians (otherwise) of the entire distributions. You could also calculate confidence intervals or perform t-tests to determine if the difference is statistically significant.

Feel free to ask follow up questions or correct my assumptions.

Kimo Kalip 2018 年 7 月 3 日

編集済み: Image Analyst 2018 年 7 月 4 日

While I have the originals here, might I ask another separate question?

Is there any way to use the edge of the object to determine whether or not its shaped "properly"? (I says "properly" because the metric seems almost arbitrary, and I can't conceptualize how a computer is supposed to recognize abnormalities in a shape that is always different). I was thinking I'd take the slope of the edge, and that a steeper slope is more desirable, but the edge can also be shaped like ) or (, and it would still be "ok". However, a definite bulge like in image Two is something the computer is supposed to be able to recognize. Do you have any tips on where to start with a question like that?

Image Analyst 2018 年 7 月 4 日

MATLAB Online で開く

Sounds very ad hoc. Sure you can find the boundary and then take slopes perpendicular to the edge in a few places and see if the slopes are as expected. Or you can take the x-y path of the boundary curve and see if that is "straight enough" or "too bulged" whatever that means. Fit a section to a quadratic or whatever and look at the residuals of the actual to the fit. Pretty simple with polyfit() and polyval():

coefficients = polyfit(xActual, yActual, 2);
yFit = polyval(coefficients, xActual);
meanResidual = mean(abs(yFit-yActual))

or something like that. Basically you can do whatever you want - it just depends on how you define a "normal" looking curve.

サインインしてコメントする。

サインインしてこの質問に回答する。

Follow Question

Answer 1

Image Analyst 2018 年 6 月 30 日

MATLAB Online で開く

0 投票

moments_spatial_central.m

What KALYAN was saying is

binaryImage = grayImage <= 60;
if sum(binaryImage(:)) > 200
    % It's the dark image type
else
    % It's the lighter image type
end

However I like Adam's solution of simply comparing mean and standard deviation.

If either of these fail for some images, then we can try some more sophisticated image comparison methods like ssim() or image moments others.

1 件のコメント
-1 件の古いコメントを表示 -1 件の古いコメントを非表示

Kimo Kalip 2018 年 7 月 3 日

編集済み: Kimo Kalip 2018 年 7 月 3 日

Ahh, this makes a lot of sense, thanks! I'll have to try it out before I do the accepted answer thing, but it seems like what I was going for - also clarified my original post in case anyone has any more input.

サインインしてコメントする。

Extracting data from a histogram

12 件のコメント
10 件の古いコメントを表示 10 件の古いコメントを非表示

採用された回答

1 件のコメント
-1 件の古いコメントを表示 -1 件の古いコメントを非表示

その他の回答 (0 件)

カテゴリ

製品

リリース

タグ

Community Treasure Hunt

Extracting data from a histogram

12 件のコメント 10 件の古いコメントを表示 10 件の古いコメントを非表示

採用された回答

1 件のコメント -1 件の古いコメントを表示 -1 件の古いコメントを非表示

その他の回答 (0 件)

カテゴリ

製品

リリース

タグ

参考

Community Treasure Hunt

12 件のコメント
10 件の古いコメントを表示 10 件の古いコメントを非表示

1 件のコメント
-1 件の古いコメントを表示 -1 件の古いコメントを非表示