Scatter Plot Matrix Chart Giving Wrong Answer?

03-12-2020 02:34 PM
Elijah
by
Occasional Contributor II

I assume the results of the scatter plot matrix chart in ArcGIS Pro represent the correlation coefficient between two variables. If this is true, I noticed two things that I would like someone else to verify:

1. The R2 (coefficient) values returned by the chart are not consistent with the values obtained from the Correlation tool in Microsoft Excel. Can someone try this with any data to verify that I am not doing something wrong?

2. Is there any way to show the direction of the relationship in the scatter plot matrix? I noticed that all the values are positive even when some of the relationships are actually negative.

You may wish to try with this data as pasted below.

LONGLONGABCD
8.806429.5468124527.67
8.806429.54686.9487235
8.806429.54687.6561.545
8.806429.546816.41234.9648
8.806429.546821.89332.82.220.8
8.806429.546819.3322.531.0231.75
8.806429.546816.7492.180.9720.13
8.806429.546810.33376.970.3412.42
8.806429.54685.15534.670.5917.5
8.806429.546823.92926.330.8724.42
8.806429.546825.04748.992.1611.06
8.806429.546821.461161.610.6212.07
8.806429.546846.63767.541.8326.86
8.806429.546834.49379.981.0224.69
8.806429.546835.71722.071.2516.65
8.806429.546849.06626.863.2220.13
8.806429.546823.68512.883.2970.2
5 Replies
DanPatterson_Retired
MVP Emeritus

It could be that the coefficients you are seeing are adjusted correlation coefficients. Is that what Excel is using?

Regression analysis basics—ArcGIS Pro | Documentation 

Pearson correlation coefficient - Wikipedia 

Elijah
by
Occasional Contributor II

Hi Dan,

Thanks for your response. The Excel values are not adjusted, and neither are those from the scatter plot matrix, so I assume they should be the same when rounded to the same number of decimals. I ran the same values again, just to double-check, and I still get these differences. The Excel values do seem to confirm the perceptible trend of the variables, though. Nevertheless, this discrepancy shouldn't exist, and I am a little curious about what's going on. Both processes (Excel and ArcGIS) are fairly simple to run! What could make things go wrong in ArcGIS when using the scatter plot chart? Must the data be projected, for example? I would love to know.

Thanks.

DanPatterson_Retired
MVP Emeritus

Andy, I didn't look in detail, but I think Pro is reporting adjusted values. Also, are you doing simple correlations or multivariate ones?
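For reference, here is a minimal sketch of the usual textbook definition of adjusted R2, so the size of the adjustment can be judged. Whether Pro applies this in the scatter plot matrix is not confirmed here; the function name and the example R2 value are hypothetical, `n = 17` matches the number of rows pasted above, and `p = 1` assumes a single predictor per pair of variables.

```python
def adjusted_r_squared(r_squared, n, p):
    """Textbook adjusted R^2 for n observations and p predictors."""
    if n - p - 1 <= 0:
        raise ValueError("need more observations than predictors + 1")
    return 1.0 - (1.0 - r_squared) * (n - 1) / (n - p - 1)

# Hypothetical R^2 value; n = 17 rows, p = 1 predictor (assumption).
plain = 0.5
adjusted = adjusted_r_squared(plain, n=17, p=1)
print(f"R^2 = {plain:.3f}, adjusted R^2 = {adjusted:.3f}")
```

With these sample sizes the adjustment only lowers the value slightly, so on its own it would not account for large differences between two tools.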

Elijah
by
Occasional Contributor II

Thanks again Dan.

Good to know that Pro's values are adjusted R2. However, that doesn't seem to be the issue. See the attached comparison of the values from Excel and Pro. They are far apart.

Yes, I am considering multiple variables at the same time (multivariate). Does that have any impact, or is it handled differently? I am aware that the chart needs a minimum of 3 variables. Meanwhile, can we get the direction of the relationship (a minus sign, etc.)?

Please look at the attachment and advise if possible.

Thanks

ChristopherAllen
Esri Contributor

Hi @Elijah ,

Thanks for the question! It appears that the results you’ve posted from Excel might be the Pearson’s r correlation coefficient values, while the results you’ve posted from Pro are the r-squared values. This is based on the observation that squaring the values from Excel yields the values that are displayed in Pro.
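As a quick way to check this outside both tools, here is a minimal sketch in plain Python. The `pearson_r` helper and the sample values are hypothetical (they are not the data from this thread); the point is only that r carries the sign of the relationship while r-squared is always non-negative.

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation coefficient r, sign included."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of cross-products and squared deviations from the means.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)

# Hypothetical values with a decreasing trend, not the thread's data.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [9.0, 8.5, 6.0, 5.5, 2.0]

r = pearson_r(x, y)      # negative for a decreasing relationship
r_squared = r ** 2       # always between 0 and 1, sign is lost
print(f"r = {r:.3f}, r^2 = {r_squared:.3f}")
```

If squaring each Excel value reproduces the corresponding Pro value, the two tools agree and only the reported statistic differs.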

In Pro version 2.7 and above, you can view the Pearson’s r values in a scatter plot matrix by selecting the appropriate option in the Data tab of the Chart Properties pane:

[Image: ChristopherAllen_0-1684024475820.png]

Thanks!

Chris
