Add sample or population weights to Statistics toolbox

01-10-2023 10:39 AM
Status: Open
New Contributor II

Add the option to use sample or population weights in Summarize Within, Summary Statistics, Multivariate Clustering, and other statistical tools.


My team has been tasked with producing a number of accessibility metrics for client governments. We've done that using custom friction surfaces, and we've extracted those to population and survey points.  The intent is to aggregate these to an admin level because that's where decisions will be made, and budgets allocated. The statistics toolbox (Summarize within and summary stats) and spatial statistics tools do not include the option to apply population or sample weights.  This means that we'll have to do spatial joins/intersect to tack on the admin boundary data to the points, then export to CSVs, ship it to economists to collapse in STATA, then they ship it back to us to link up with the Admin boundaries to display and merge with other variables at the ADM level to continue analysis or display.  ArcGIS is first and foremost a spatial software, but it does include many straight statistics functions, hence all the lovely options in the Summary Stats and summarize within tools.  The use of weights in statistics is a critical function thereof and it would be great to be able to do that directly in ArcPro so we can better leverage ArcPy and ArcNotebooks.  I can see a number of business clients also seeking to use weights in their analytics (Business Analyst does this a little bit) and the lack of weighted statistical function is a pretty sizeable hole in the statistical modernization/census context.  If this functionality were available, I can certainly see the need (and in fact we're bouncing around it) for the use of both spatial and population weights simultaneously.  We just have to do those things separately at the moment in a very clunky manner

1 Comment

So essentially you want to add an optional parameter to the tools to weigh the summary output values of each record by a field in the data?