<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Which statistical library in Python Questions</title>
    <link>https://community.esri.com/t5/python-questions/which-statistical-library/m-p/651111#M50685</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;If you are using ArcGIS 10.3 and have pandas and scipy, then someone has already installed additional libraries because those were not bundled with ArcGIS until ArcGIS 10.4.x.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;If "getting into one or more of these will be a time investment" for you, then I suspect you are new to Python.&amp;nbsp; You could use this as an opportunity to learn some Python packages like scipy and pandas (after you upgrade); but given your requirements, I think sticking with ArcGIS geoprocessing tools (e.g., Statistics) is likely your best bet until you get more familiar with Python.&amp;nbsp;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Thu, 09 Aug 2018 13:34:09 GMT</pubDate>
    <dc:creator>JoshuaBixby</dc:creator>
    <dc:date>2018-08-09T13:34:09Z</dc:date>
    <item>
      <title>Which statistical library</title>
      <link>https://community.esri.com/t5/python-questions/which-statistical-library/m-p/651109#M50683</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;I'd like opinions, no code necessary, on&amp;nbsp;libraries for performing basic descriptive stats using the python tools that ship in the box with an ArcGIS 10.3 standard licence (so Python 2.7).&amp;nbsp; I'm behind a corporate firewall with &lt;SPAN style="text-decoration: underline;"&gt;no&lt;/SPAN&gt; ability to install any additional libraries.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I'm not planning in doing ML or predictive analytics just yet.&amp;nbsp;I have a simple requirement to :&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;get simple aggregates (counts, count not null, count grouping by)&amp;nbsp;&lt;/LI&gt;&lt;LI&gt;measures of central tendency (mean, mode, median, range, variance &amp;amp; standard deviation)&lt;/LI&gt;&lt;LI&gt;box plots, histograms and pie charts&lt;/LI&gt;&lt;LI&gt;output results to XLS&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Inputs would be file GDB feature classes ranging in volume from 5 records to 5 million records, with most&amp;nbsp; under 1 million records.&amp;nbsp; &amp;nbsp;Execution would be infrequent, maybe once or twice a day to assist with generic data profiling.&amp;nbsp; I'm not hooking this up to a high volume web service, so speed is not critical.&amp;nbsp; The ability to write clear code is.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I suspect I could achieve my requirements with one or more of:&amp;nbsp;&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;arcpy.da.SearchCursor (yeah, the long way 'round)&lt;/LI&gt;&lt;LI&gt;arcpy.Statistics_analysis&lt;/LI&gt;&lt;LI&gt;scipy.stats&lt;/LI&gt;&lt;LI&gt;numpy&lt;/LI&gt;&lt;LI&gt;pandas&lt;/LI&gt;&lt;LI&gt;sqllite&amp;nbsp;&lt;/LI&gt;&lt;LI&gt;mapplotlib&lt;/LI&gt;&lt;LI&gt;xlwt&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Getting into one or more of these will be a time investment that I'd hope to still be useful when I (hope) I&amp;nbsp;will be able to upgrade to ArcGIS 10.6 and 'Pro with Python 3.6 in the next year.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;So, which do you think are worth learning for this purpose?&amp;nbsp;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 09 Aug 2018 12:40:54 GMT</pubDate>
      <guid>https://community.esri.com/t5/python-questions/which-statistical-library/m-p/651109#M50683</guid>
      <dc:creator>Anonymous User</dc:creator>
      <dc:date>2018-08-09T12:40:54Z</dc:date>
    </item>
    <item>
      <title>Re: Which statistical library</title>
      <link>https://community.esri.com/t5/python-questions/which-statistical-library/m-p/651110#M50684</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;arcmap 10.5/6 ship with various python modules including matplotlib, scipy and numpy.&amp;nbsp; They contain various stats functionality and matplotlib for graphing&lt;/P&gt;&lt;P&gt;&lt;A class="link-titled" href="https://docs.scipy.org/doc/scipy/reference/stats.html" title="https://docs.scipy.org/doc/scipy/reference/stats.html"&gt;Statistical functions (scipy.stats) — SciPy v1.1.0 Reference Guide&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;A class="link-titled" href="https://matplotlib.org/examples/statistics/index.html" title="https://matplotlib.org/examples/statistics/index.html"&gt;statistics Examples — Matplotlib 2.0.2 documentation&lt;/A&gt;&amp;nbsp;&lt;A class="link-titled" href="https://docs.scipy.org/doc/scipy/reference/stats.html" title="https://docs.scipy.org/doc/scipy/reference/stats.html"&gt;https://docs.scipy.org/doc/scipy/reference/stats.html&lt;/A&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;A class="link-titled" href="https://docs.scipy.org/doc/numpy-1.14.0/reference/routines.statistics.html" title="https://docs.scipy.org/doc/numpy-1.14.0/reference/routines.statistics.html"&gt;Statistics — NumPy v1.14 Manual&lt;/A&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;my blog has several stats and graphing examples&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 09 Aug 2018 13:30:01 GMT</pubDate>
      <guid>https://community.esri.com/t5/python-questions/which-statistical-library/m-p/651110#M50684</guid>
      <dc:creator>DanPatterson_Retired</dc:creator>
      <dc:date>2018-08-09T13:30:01Z</dc:date>
    </item>
    <item>
      <title>Re: Which statistical library</title>
      <link>https://community.esri.com/t5/python-questions/which-statistical-library/m-p/651111#M50685</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;If you are using ArcGIS 10.3 and have pandas and scipy, then someone has already installed additional libraries because those were not bundled with ArcGIS until ArcGIS 10.4.x.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;If "getting into one or more of these will be a time investment" for you, then I suspect you are new to Python.&amp;nbsp; You could use this as an opportunity to learn some Python packages like scipy and pandas (after you upgrade); but given your requirements, I think sticking with ArcGIS geoprocessing tools (e.g., Statistics) is likely your best bet until you get more familiar with Python.&amp;nbsp;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 09 Aug 2018 13:34:09 GMT</pubDate>
      <guid>https://community.esri.com/t5/python-questions/which-statistical-library/m-p/651111#M50685</guid>
      <dc:creator>JoshuaBixby</dc:creator>
      <dc:date>2018-08-09T13:34:09Z</dc:date>
    </item>
    <item>
      <title>Re: Which statistical library</title>
      <link>https://community.esri.com/t5/python-questions/which-statistical-library/m-p/651112#M50686</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Thanks for responding Dan. Its good to know that SciPy, Matplotlib and NumPy will be available in a version of ArcGIS that I may have in the future.&amp;nbsp; &amp;nbsp;If you were working in this problem now as described above in 10.3, would you start in SciPy, or NumPy?&amp;nbsp; &amp;nbsp;And what are your thoughts on Pandas.&amp;nbsp; I see it did not make your short-list, is that for a reason?&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 09 Aug 2018 20:22:43 GMT</pubDate>
      <guid>https://community.esri.com/t5/python-questions/which-statistical-library/m-p/651112#M50686</guid>
      <dc:creator>Anonymous User</dc:creator>
      <dc:date>2018-08-09T20:22:43Z</dc:date>
    </item>
    <item>
      <title>Re: Which statistical library</title>
      <link>https://community.esri.com/t5/python-questions/which-statistical-library/m-p/651113#M50687</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Thanks Joshua.&amp;nbsp; Perhaps I should have been clearer in saying &lt;STRONG style="text-decoration: underline;"&gt;I&lt;/STRONG&gt; do not have the ability to install additional packages&amp;nbsp;&lt;IMG src="https://community.esri.com/legacyfs/online/emoticons/happy.png" /&gt;&lt;/P&gt;&lt;P&gt;I am relatively new to Python.&amp;nbsp; But I have some experience in Java and .net, so I'm not new to code.&amp;nbsp; I am using this as an opportunity to dive into Python.&amp;nbsp; I'm not so much looking for an easy path, as&amp;nbsp;a better path (if that makes sense?).&amp;nbsp; Do you have experience with any of those packages?&amp;nbsp; Did you find one more flexible or functional than another?&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 09 Aug 2018 20:28:58 GMT</pubDate>
      <guid>https://community.esri.com/t5/python-questions/which-statistical-library/m-p/651113#M50687</guid>
      <dc:creator>Anonymous User</dc:creator>
      <dc:date>2018-08-09T20:28:58Z</dc:date>
    </item>
    <item>
      <title>Re: Which statistical library</title>
      <link>https://community.esri.com/t5/python-questions/which-statistical-library/m-p/651114#M50688</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;The libraries you've listed are good at certain things, and don't do some things at all. At the beginning at least,&amp;nbsp;I'd suggest being a jack of all trades and master of none. Learn that numpy is good at handling numbers (e.g. rasters), pandas is good at handling tables, arcpy handles ArcGIS-style data (but not as fast as numpy or pandas), and matplotlib draws your graphs. Do a lot of googling for how to do specific things when you need to do them. Honestly, I wouldn't try to learn everything about any of these libraries.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 09 Aug 2018 23:02:06 GMT</pubDate>
      <guid>https://community.esri.com/t5/python-questions/which-statistical-library/m-p/651114#M50688</guid>
      <dc:creator>DarrenWiens2</dc:creator>
      <dc:date>2018-08-09T23:02:06Z</dc:date>
    </item>
    <item>
      <title>Re: Which statistical library</title>
      <link>https://community.esri.com/t5/python-questions/which-statistical-library/m-p/651115#M50689</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;You should invest your time in python 3... ArcGIS Pro uses python 3.5/6 and there is a fundamental statistics package now built into python.&lt;/P&gt;&lt;P&gt;&lt;A class="link-titled" href="https://docs.python.org/3/library/statistics.html" title="https://docs.python.org/3/library/statistics.html" rel="nofollow noopener noreferrer" target="_blank"&gt;9.7. statistics — Mathematical statistics functions — Python 3.7.0 documentation&lt;/A&gt;&amp;nbsp;&lt;/P&gt;&lt;PRE class="lia-code-sample line-numbers language-none"&gt;&lt;CODE&gt;&lt;SPAN class="keyword token"&gt;import&lt;/SPAN&gt; sys

sys&lt;SPAN class="punctuation token"&gt;.&lt;/SPAN&gt;version
&lt;SPAN class="string token"&gt;'3.6.6 |Anaconda, Inc.| (default, Jun 28 2018, 11:27:44) [MSC v.1900 64 bit (AMD64)]'&lt;/SPAN&gt;

&lt;SPAN class="keyword token"&gt;import&lt;/SPAN&gt; statistics

dir&lt;SPAN class="punctuation token"&gt;(&lt;/SPAN&gt;statistics&lt;SPAN class="punctuation token"&gt;)&lt;/SPAN&gt;
 &lt;SPAN class="punctuation token"&gt;[&lt;/SPAN&gt;&lt;SPAN class="string token"&gt;'Decimal'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;,&lt;/SPAN&gt; &lt;SPAN class="string token"&gt;'Fraction'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;,&lt;/SPAN&gt; &lt;SPAN class="string token"&gt;'StatisticsError'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;,&lt;/SPAN&gt; &lt;SPAN class="string token"&gt;'__all__'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;,&lt;/SPAN&gt; &lt;SPAN class="string token"&gt;'__builtins__'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;,&lt;/SPAN&gt; &lt;SPAN class="string token"&gt;'__cached__'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;,&lt;/SPAN&gt; &lt;SPAN class="string token"&gt;'__doc__'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;,&lt;/SPAN&gt; &lt;SPAN class="string token"&gt;'__file__'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;,&lt;/SPAN&gt; &lt;SPAN class="string token"&gt;'__loader__'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;,&lt;/SPAN&gt; &lt;SPAN class="string token"&gt;'__name__'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;,&lt;/SPAN&gt; &lt;SPAN class="string token"&gt;'__package__'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;,&lt;/SPAN&gt; &lt;SPAN class="string token"&gt;'__spec__'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;,&lt;/SPAN&gt; &lt;SPAN class="string token"&gt;'_coerce'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;,&lt;/SPAN&gt; &lt;SPAN class="string token"&gt;'_convert'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;,&lt;/SPAN&gt; &lt;SPAN class="string token"&gt;'_counts'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;,&lt;/SPAN&gt; &lt;SPAN class="string token"&gt;'_exact_ratio'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;,&lt;/SPAN&gt; &lt;SPAN class="string token"&gt;'_fail_neg'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;,&lt;/SPAN&gt; &lt;SPAN class="string token"&gt;'_find_lteq'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;,&lt;/SPAN&gt; &lt;SPAN class="string token"&gt;'_find_rteq'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;,&lt;/SPAN&gt; &lt;SPAN class="string token"&gt;'_isfinite'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;,&lt;/SPAN&gt; &lt;SPAN class="string token"&gt;'_ss'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;,&lt;/SPAN&gt; &lt;SPAN class="string token"&gt;'_sum'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;,&lt;/SPAN&gt; &lt;SPAN class="string token"&gt;'bisect_left'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;,&lt;/SPAN&gt; &lt;SPAN class="string token"&gt;'bisect_right'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;,&lt;/SPAN&gt; &lt;SPAN class="string token"&gt;'chain'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;,&lt;/SPAN&gt; &lt;SPAN class="string token"&gt;'collections'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;,&lt;/SPAN&gt; &lt;SPAN class="string token"&gt;'decimal'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;,&lt;/SPAN&gt; &lt;SPAN class="string token"&gt;'groupby'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;,&lt;/SPAN&gt; &lt;SPAN class="string token"&gt;'harmonic_mean'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;,&lt;/SPAN&gt; &lt;SPAN class="string token"&gt;'math'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;,&lt;/SPAN&gt; &lt;SPAN class="string token"&gt;'mean'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;,&lt;/SPAN&gt; &lt;SPAN class="string token"&gt;'median'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;,&lt;/SPAN&gt; &lt;SPAN class="string token"&gt;'median_grouped'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;,&lt;/SPAN&gt; &lt;SPAN class="string token"&gt;'median_high'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;,&lt;/SPAN&gt; &lt;SPAN class="string token"&gt;'median_low'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;,&lt;/SPAN&gt; &lt;SPAN class="string token"&gt;'mode'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;,&lt;/SPAN&gt; &lt;SPAN class="string token"&gt;'numbers'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;,&lt;/SPAN&gt; &lt;SPAN class="string token"&gt;'pstdev'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;,&lt;/SPAN&gt; &lt;SPAN class="string token"&gt;'pvariance'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;,&lt;/SPAN&gt; &lt;SPAN class="string token"&gt;'stdev'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;,&lt;/SPAN&gt; &lt;SPAN class="string token"&gt;'variance'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;]&lt;/SPAN&gt;
&lt;SPAN class="line-numbers-rows"&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/CODE&gt;&lt;/PRE&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Sun, 12 Dec 2021 03:36:50 GMT</pubDate>
      <guid>https://community.esri.com/t5/python-questions/which-statistical-library/m-p/651115#M50689</guid>
      <dc:creator>DanPatterson_Retired</dc:creator>
      <dc:date>2021-12-12T03:36:50Z</dc:date>
    </item>
    <item>
      <title>Re: Which statistical library</title>
      <link>https://community.esri.com/t5/python-questions/which-statistical-library/m-p/651116#M50690</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;I agree, in principal.&amp;nbsp; But for now that is not an option.&amp;nbsp;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 10 Aug 2018 02:04:20 GMT</pubDate>
      <guid>https://community.esri.com/t5/python-questions/which-statistical-library/m-p/651116#M50690</guid>
      <dc:creator>Anonymous User</dc:creator>
      <dc:date>2018-08-10T02:04:20Z</dc:date>
    </item>
    <item>
      <title>Re: Which statistical library</title>
      <link>https://community.esri.com/t5/python-questions/which-statistical-library/m-p/651117#M50691</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Michael... missed this&lt;/P&gt;&lt;P&gt;Pandas is largely a fancier jacket over numpy (over simplified).&lt;/P&gt;&lt;P&gt;Everything in the SciPy 'stack' depends on numpy, so if you are working with numbers, then that is way to go.&lt;/P&gt;&lt;P&gt;You can treat 'missing' data using 'masked arrays' and/or use what is called 'nan' functions (ie nan is Not a Number).&lt;/P&gt;&lt;P&gt;It is blazingly fast and can benefit from a variety of other modules that speed things up even further.&lt;/P&gt;&lt;P&gt;Pandas is pretty good for a person looking at working with array/tabular data, but at some stage, you will probably step back, particularly if you want to work with some of the graphics libraries like matplotlib, seaborn etc.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 15 Aug 2018 01:17:28 GMT</pubDate>
      <guid>https://community.esri.com/t5/python-questions/which-statistical-library/m-p/651117#M50691</guid>
      <dc:creator>DanPatterson_Retired</dc:creator>
      <dc:date>2018-08-15T01:17:28Z</dc:date>
    </item>
    <item>
      <title>Re: Which statistical library</title>
      <link>https://community.esri.com/t5/python-questions/which-statistical-library/m-p/651118#M50692</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Thanks Dan.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 15 Aug 2018 22:01:18 GMT</pubDate>
      <guid>https://community.esri.com/t5/python-questions/which-statistical-library/m-p/651118#M50692</guid>
      <dc:creator>Anonymous User</dc:creator>
      <dc:date>2018-08-15T22:01:18Z</dc:date>
    </item>
    <item>
      <title>Re: Which statistical library</title>
      <link>https://community.esri.com/t5/python-questions/which-statistical-library/m-p/651119#M50693</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;I really like working with Pandas, I find it very intuitive to learn and powerful. I also use numpy as required but it is has a much steeper learning curve. Pandas has great grouping and aggregating functions that should make it easy to do aggregations and summary statistics. The pandas dataframe has some built in basic plotting but matplotlib allows for a lot more customization.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I use a search cursor to pull all the data into a pandas data frame. I don't think you will run into any memory limits based on the size of file you described.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;PRE class="lia-code-sample line-numbers language-none"&gt;&lt;CODE&gt;&lt;SPAN class="keyword token"&gt;import&lt;/SPAN&gt; arcpy
&lt;SPAN class="keyword token"&gt;import&lt;/SPAN&gt; pandas &lt;SPAN class="keyword token"&gt;as&lt;/SPAN&gt; pd

fields_list &lt;SPAN class="operator token"&gt;=&lt;/SPAN&gt; &lt;SPAN class="punctuation token"&gt;[&lt;/SPAN&gt;&lt;SPAN class="string token"&gt;'field1'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;,&lt;/SPAN&gt; &lt;SPAN class="string token"&gt;'field2'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;]&lt;/SPAN&gt;
input_fc &lt;SPAN class="operator token"&gt;=&lt;/SPAN&gt; r&lt;SPAN class="string token"&gt;'c:\temp\temp.gdb\input_fc'&lt;/SPAN&gt;

feature_data &lt;SPAN class="operator token"&gt;=&lt;/SPAN&gt; &lt;SPAN class="punctuation token"&gt;[&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;]&lt;/SPAN&gt;

&lt;SPAN class="keyword token"&gt;with&lt;/SPAN&gt; arcpy&lt;SPAN class="punctuation token"&gt;.&lt;/SPAN&gt;da&lt;SPAN class="punctuation token"&gt;.&lt;/SPAN&gt;SearchCursor&lt;SPAN class="punctuation token"&gt;(&lt;/SPAN&gt;input_fc&lt;SPAN class="punctuation token"&gt;,&lt;/SPAN&gt; fields_list&lt;SPAN class="punctuation token"&gt;)&lt;/SPAN&gt; &lt;SPAN class="keyword token"&gt;as&lt;/SPAN&gt; cursor&lt;SPAN class="punctuation token"&gt;:&lt;/SPAN&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;SPAN class="keyword token"&gt;for&lt;/SPAN&gt; row &lt;SPAN class="keyword token"&gt;in&lt;/SPAN&gt; cursor&lt;SPAN class="punctuation token"&gt;:&lt;/SPAN&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; feature_data&lt;SPAN class="punctuation token"&gt;.&lt;/SPAN&gt;append&lt;SPAN class="punctuation token"&gt;(&lt;/SPAN&gt;list&lt;SPAN class="punctuation token"&gt;(&lt;/SPAN&gt;row&lt;SPAN class="punctuation token"&gt;)&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;)&lt;/SPAN&gt;

df &lt;SPAN class="operator token"&gt;=&lt;/SPAN&gt; pd&lt;SPAN class="punctuation token"&gt;.&lt;/SPAN&gt;DataFrame&lt;SPAN class="punctuation token"&gt;(&lt;/SPAN&gt;feature_data&lt;SPAN class="punctuation token"&gt;,&lt;/SPAN&gt; columns&lt;SPAN class="operator token"&gt;=&lt;/SPAN&gt;fields_list&lt;SPAN class="punctuation token"&gt;)&lt;/SPAN&gt;‍‍‍‍‍‍‍‍‍‍‍‍‍&lt;SPAN class="line-numbers-rows"&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/CODE&gt;&lt;/PRE&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Sun, 12 Dec 2021 03:36:53 GMT</pubDate>
      <guid>https://community.esri.com/t5/python-questions/which-statistical-library/m-p/651119#M50693</guid>
      <dc:creator>RyanRuthart1</dc:creator>
      <dc:date>2021-12-12T03:36:53Z</dc:date>
    </item>
    <item>
      <title>Re: Which statistical library</title>
      <link>https://community.esri.com/t5/python-questions/which-statistical-library/m-p/651120#M50694</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P style="background-color: transparent; color: #3d3d3d; font-family: Helvetica Neue,Helvetica,Arial,Lucida Grande,sans-serif; font-size: 15px; font-style: normal; font-variant: normal; font-weight: 400; letter-spacing: normal; orphans: 2; text-align: left; text-decoration: none; text-indent: 0px; text-transform: none; -webkit-text-stroke-width: 0px; white-space: normal; word-spacing: 0px; word-wrap: break-word; padding: 0px; margin: 0px;"&gt;Of course there are similar shortcuts in numpy like skipping the searchcursor and pulling out the data directly from a featureclass.&lt;/P&gt;&lt;P style="background-color: transparent; color: #3d3d3d; font-family: Helvetica Neue,Helvetica,Arial,Lucida Grande,sans-serif; font-size: 15px; font-style: normal; font-variant: normal; font-weight: 400; letter-spacing: normal; orphans: 2; text-align: left; text-decoration: none; text-indent: 0px; text-transform: none; -webkit-text-stroke-width: 0px; white-space: normal; word-spacing: 0px; word-wrap: break-word; padding: 0px; margin: 0px;"&gt;It is pretty speedy and there are functions (nan functions) that account for nodata/null values in the data.&lt;/P&gt;&lt;P style="background-color: transparent; color: #3d3d3d; font-family: Helvetica Neue,Helvetica,Arial,Lucida Grande,sans-serif; font-size: 15px; font-style: normal; font-variant: normal; font-weight: 400; letter-spacing: normal; orphans: 2; text-align: left; text-decoration: none; text-indent: 0px; text-transform: none; -webkit-text-stroke-width: 0px; white-space: normal; word-spacing: 0px; word-wrap: break-word; padding: 0px; margin: 0px;"&gt;For example&lt;/P&gt;&lt;PRE class="lia-code-sample line-numbers language-none"&gt;&lt;CODE&gt;&lt;SPAN class="keyword token"&gt;import&lt;/SPAN&gt; numpy

&lt;SPAN class="keyword token"&gt;import&lt;/SPAN&gt; arcpy

fc &lt;SPAN class="operator token"&gt;=&lt;/SPAN&gt; r&lt;SPAN class="string token"&gt;"C:\GIS\A_Tools_scripts\Table_tools\Table_tools.gdb\pnts_2K_normal"&lt;/SPAN&gt;

fields &lt;SPAN class="operator token"&gt;=&lt;/SPAN&gt; &lt;SPAN class="punctuation token"&gt;[&lt;/SPAN&gt;&lt;SPAN class="string token"&gt;'C_0'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;,&lt;/SPAN&gt; &lt;SPAN class="string token"&gt;'C_1'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;]&lt;/SPAN&gt;

a &lt;SPAN class="operator token"&gt;=&lt;/SPAN&gt; arcpy&lt;SPAN class="punctuation token"&gt;.&lt;/SPAN&gt;da&lt;SPAN class="punctuation token"&gt;.&lt;/SPAN&gt;TableToNumPyArray&lt;SPAN class="punctuation token"&gt;(&lt;/SPAN&gt;fc&lt;SPAN class="punctuation token"&gt;,&lt;/SPAN&gt; fields&lt;SPAN class="punctuation token"&gt;)&lt;/SPAN&gt;

a&lt;SPAN class="punctuation token"&gt;.&lt;/SPAN&gt;size
&lt;SPAN class="number token"&gt;2000&lt;/SPAN&gt;

np&lt;SPAN class="punctuation token"&gt;.&lt;/SPAN&gt;nanmean&lt;SPAN class="punctuation token"&gt;(&lt;/SPAN&gt;a&lt;SPAN class="punctuation token"&gt;[&lt;/SPAN&gt;&lt;SPAN class="string token"&gt;'C_0'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;]&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;)&lt;/SPAN&gt;&amp;nbsp; &lt;SPAN class="comment token"&gt;# account for nulls aka missing data&lt;/SPAN&gt;
&lt;SPAN class="number token"&gt;10.520978947368421&lt;/SPAN&gt;&lt;SPAN class="line-numbers-rows"&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;Then you can graph (simple ones using matplotlib)&lt;/P&gt;&lt;PRE class="lia-code-sample line-numbers language-none"&gt;&lt;CODE&gt;&lt;SPAN class="keyword token"&gt;import&lt;/SPAN&gt; matplotlib&lt;SPAN class="punctuation token"&gt;.&lt;/SPAN&gt;pyplot &lt;SPAN class="keyword token"&gt;as&lt;/SPAN&gt; plt

plt&lt;SPAN class="punctuation token"&gt;.&lt;/SPAN&gt;scatter&lt;SPAN class="punctuation token"&gt;(&lt;/SPAN&gt;a&lt;SPAN class="punctuation token"&gt;[&lt;/SPAN&gt;&lt;SPAN class="string token"&gt;'C_0'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;]&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;,&lt;/SPAN&gt; a&lt;SPAN class="punctuation token"&gt;[&lt;/SPAN&gt;&lt;SPAN class="string token"&gt;'C_1'&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;]&lt;/SPAN&gt;&lt;SPAN class="punctuation token"&gt;)&lt;/SPAN&gt;
&lt;SPAN class="operator token"&gt;&amp;lt;&lt;/SPAN&gt;matplotlib&lt;SPAN class="punctuation token"&gt;.&lt;/SPAN&gt;collections&lt;SPAN class="punctuation token"&gt;.&lt;/SPAN&gt;PathCollection object at &lt;SPAN class="number token"&gt;0x0000025D3C94C2E8&lt;/SPAN&gt;&lt;SPAN class="operator token"&gt;&amp;gt;&lt;/SPAN&gt;&lt;SPAN class="line-numbers-rows"&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;If you are using Spyder (a python IDE... see my blogs), you can do interactive graphing once you have the basic plot and you don't even need to code since you can fancy the plots directly&lt;/P&gt;&lt;P&gt;&lt;IMG alt="" class="image-1 jive-image j-img-original" src="https://community.esri.com/legacyfs/online/418847_matplotlib_01.png" /&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;So numpy and matplotlib are pretty simple to use... I have tons of examples on my blog (just check my user name on GeoNet)&lt;/P&gt;&lt;P&gt;Have fun&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Sun, 12 Dec 2021 03:36:56 GMT</pubDate>
      <guid>https://community.esri.com/t5/python-questions/which-statistical-library/m-p/651120#M50694</guid>
      <dc:creator>DanPatterson_Retired</dc:creator>
      <dc:date>2021-12-12T03:36:56Z</dc:date>
    </item>
  </channel>
</rss>

