How do I determine query points?

Question

0 投票

Hi,

I've recently been introduced to Shapley values and I'm trying to follow the Matlab tutorial on these but I've unsure about the section on query Points.

In my instance, I am trying to determine the importance of different predictors from a classification tree based on a table of 19 different possible predictors. The output from this tree will give me a category of either A, B, C or D, with A being the preferential category.

In the tutorial on matlab, when determining query points, the code written is:

queryPoint = tbl(end,:)

By this, it is taking the last line of the tbl in question. This may be a stupid question, but how do I adapt this to my work? Do I replace it with something along the lines of:

queryPoint = tbl(x,:)

where x is the line number of a datapoint that is categorised as A

Apolgies if it sounds dumb, I'm very new to the world of machine learning techniques but thank you for any help?

0 件のコメント
-2 件の古いコメントを表示 -2 件の古いコメントを非表示

サインインしてコメントする。

サインインしてこの質問に回答する。

Follow Question

Answer 1

Dheeraj 2024 年 9 月 4 日

編集済み: Dheeraj 2024 年 9 月 4 日

MATLAB Online で開く

1 投票

Hi Mark,

To select a query point for Shapley value analysis in your classification tree model, you can adapt the MATLAB code by choosing a specific data point (row) from your dataset that has been classified into the category of interest, such as A. For example, if you want to analyze why a particular instance was classified as A, you would replace the query point selection with the row number corresponding to that instance, like

queryPoint = tbl(x,:)

where x is the row number of the desired data point. Alternatively, you can automatically select a query point by identifying rows where the category is A using a command like

find(tbl.Category == 'A')

and then choosing one of those rows for further analysis. This allows you to tailor the Shapley value analysis to the specific cases that are most relevant to your study.

1 件のコメント
-1 件の古いコメントを表示 -1 件の古いコメントを非表示

Mark 2024 年 9 月 4 日

Thanks Dheeraj!

I had a feeling I'd have to do it effectively manually per instance of category "A", but I suspect a combination of laziness or hope that there would be some way of scanning all the data make the query point line effectively redundant and having it done automatically.

サインインしてコメントする。

Answer 2

Sahas 2024 年 9 月 4 日

MATLAB Online で開く

1 投票

Hi @Mark,

As per my understanding, you would like to analyze a specific data point associated to a specific category. You are using “queryPoint” property to understand the contribution of each feature in the model.

Assuming you are using MATALB R2023b version, specify the query point by using the “QueryPoint” name-value argument instead of “QueryPoints”. This is because before MATLAB R2024a, you can only specify one query point for calculating the “Shapley values”.

Refer to the following MathWorks documentation of MATLAB R2023b version to know how to compute “Shapley values” for a query point: https://www.mathworks.com/help/releases/R2023b/stats/shapley.html

In the above cited example, you can replace the “end” keyword with the index of that particular row to analyze a specific data point in category “A”. Take a look at the following code snippet for a better understanding:

% Assume "Category" is the name of the column
% Find the index of the first occurrence of category "A"
x = find(tbl.Category == 'A', 1); 
% Select that row to analyze
queryPoint = tbl(x,:);

Follow the steps in the given in the documentation to compute ”Shapley values” with a single or multiple query points according to the requirements and the MATLAB version being used.

Refer to the following MathWorks documentation of MATLAB R2024a version to know how to compute “Shapley values” for multiple query points: https://www.mathworks.com/help/stats/shapley.html

Hope this is beneficial!

0 件のコメント
-2 件の古いコメントを表示 -2 件の古いコメントを非表示

サインインしてコメントする。

How do I determine query points?

0 件のコメント
-2 件の古いコメントを表示 -2 件の古いコメントを非表示

採用された回答

1 件のコメント
-1 件の古いコメントを表示 -1 件の古いコメントを非表示

その他の回答 (1 件)

0 件のコメント
-2 件の古いコメントを表示 -2 件の古いコメントを非表示

カテゴリ

製品

リリース

タグ

Community Treasure Hunt

How do I determine query points?

0 件のコメント -2 件の古いコメントを表示 -2 件の古いコメントを非表示

採用された回答

1 件のコメント -1 件の古いコメントを表示 -1 件の古いコメントを非表示

その他の回答 (1 件)

0 件のコメント -2 件の古いコメントを表示 -2 件の古いコメントを非表示

カテゴリ

製品

リリース

タグ

参考

Community Treasure Hunt

0 件のコメント
-2 件の古いコメントを表示 -2 件の古いコメントを非表示

1 件のコメント
-1 件の古いコメントを表示 -1 件の古いコメントを非表示

0 件のコメント
-2 件の古いコメントを表示 -2 件の古いコメントを非表示