<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Which type of log transformation is most appropriate? in Spatial Statistics Questions</title>
    <link>https://community.esri.com/t5/spatial-statistics-questions/which-type-of-log-transformation-is-most/m-p/452367#M1368</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Thank you for your reply, Dan.&lt;/P&gt;&lt;P&gt;I ran a hot spot analysis on each of the 20 variables, and I can see clusters of high and low values (especially for my dependent variable). Note: I did this on the raw data, without applying any log transformation. However, the descriptive statistics show that the skewness of most of the variables falls outside the range of +/- 1.&lt;/P&gt;&lt;P&gt;The only reason for performing OLS is that the research is about defining “district effectiveness”, meaning which district-level factors related to resource availability and allocation can explain the district performance rating.&lt;/P&gt;&lt;P&gt;Yes, zero is a valid observation (not null): for example, a zero teacher attrition rate or a zero student chronic absenteeism rate.&lt;/P&gt;&lt;P&gt;Sajjid&lt;/P&gt;&lt;P&gt;Sajjid Budhwani | ELPS Department Graduate Assistant&lt;/P&gt;&lt;P&gt;Ph.D. Candidate, COESA Executive Board President, &amp;amp; UCEA Jackson Scholar 2018-2020&lt;/P&gt;&lt;P&gt;Morgridge College of Education&lt;/P&gt;&lt;P&gt;1999 E. Evans Avenue | Denver, CO 80210&lt;/P&gt;&lt;P&gt;(c) 720.410.4674 | sajjid.budhwani@du.edu&lt;/P&gt;&lt;P&gt;www.morgridge.du.edu/&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Sun, 03 Mar 2019 02:38:23 GMT</pubDate>
    <dc:creator>sajjidbudhwani</dc:creator>
    <dc:date>2019-03-03T02:38:23Z</dc:date>
    <item>
      <title>Which type of log transformation is most appropriate?</title>
      <link>https://community.esri.com/t5/spatial-statistics-questions/which-type-of-log-transformation-is-most/m-p/452365#M1366</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hello Esri community,&lt;/P&gt;&lt;P&gt;I am a doctoral student at DU, and I am working on a research project that includes data on Colorado's 178 school districts. Guided by theory, I have compiled a list of variables to run OLS and analyze how these variables interact across neighboring school districts and predict the school district performance score.&lt;/P&gt;&lt;P&gt;Unfortunately, I found that some of these variables are moderately to substantially skewed (not normally distributed) and hence need transformation. I am struggling because these variables vary in their degree of skewness (moderate to substantial), are either positively or negatively skewed, include some zero values, and come in different formats: percentages, ratios, dollar amounts, counts, and sum totals. Given this variability, I am uncertain which log transformation would be most appropriate for each of these data types.&lt;/P&gt;&lt;P&gt;Any guidance would be very helpful.&lt;/P&gt;&lt;P&gt;Thanks.&lt;/P&gt;&lt;P&gt;Saj&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Sat, 02 Mar 2019 23:54:01 GMT</pubDate>
      <guid>https://community.esri.com/t5/spatial-statistics-questions/which-type-of-log-transformation-is-most/m-p/452365#M1366</guid>
      <dc:creator>sajjidbudhwani</dc:creator>
      <dc:date>2019-03-02T23:54:01Z</dc:date>
    </item>
    <item>
      <title>Re: Which type of log transformation is most appropriate?</title>
      <link>https://community.esri.com/t5/spatial-statistics-questions/which-type-of-log-transformation-is-most/m-p/452366#M1367</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;More questions than answers:&lt;/P&gt;&lt;P&gt;- What do the descriptive statistics and/or the spatial patterns show?&lt;/P&gt;&lt;P&gt;- Why do you need to use OLS when there are non-parametric alternatives?&lt;/P&gt;&lt;P&gt;- Are you just doing univariate analysis, or are you looking at multivariate descriptors?&lt;/P&gt;&lt;P&gt;- If zero is a valid observation, then that will limit your transformations (assuming that transformations make sense).&lt;/P&gt;&lt;P&gt;- Ratios, percentages and the like can be problematic (e.g. spurious correlation and the fallacy of the ratio standard revisited).&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Sun, 03 Mar 2019 01:22:35 GMT</pubDate>
      <guid>https://community.esri.com/t5/spatial-statistics-questions/which-type-of-log-transformation-is-most/m-p/452366#M1367</guid>
      <dc:creator>DanPatterson_Retired</dc:creator>
      <dc:date>2019-03-03T01:22:35Z</dc:date>
    </item>
    <item>
      <title>Re: Which type of log transformation is most appropriate?</title>
      <link>https://community.esri.com/t5/spatial-statistics-questions/which-type-of-log-transformation-is-most/m-p/452367#M1368</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Thank you for your reply, Dan.&lt;/P&gt;&lt;P&gt;I ran a hot spot analysis on each of the 20 variables, and I can see clusters of high and low values (especially for my dependent variable). Note: I did this on the raw data, without applying any log transformation. However, the descriptive statistics show that the skewness of most of the variables falls outside the range of +/- 1.&lt;/P&gt;&lt;P&gt;The only reason for performing OLS is that the research is about defining “district effectiveness”, meaning which district-level factors related to resource availability and allocation can explain the district performance rating.&lt;/P&gt;&lt;P&gt;Yes, zero is a valid observation (not null): for example, a zero teacher attrition rate or a zero student chronic absenteeism rate.&lt;/P&gt;&lt;P&gt;Sajjid&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Sun, 03 Mar 2019 02:38:23 GMT</pubDate>
      <guid>https://community.esri.com/t5/spatial-statistics-questions/which-type-of-log-transformation-is-most/m-p/452367#M1368</guid>
      <dc:creator>sajjidbudhwani</dc:creator>
      <dc:date>2019-03-03T02:38:23Z</dc:date>
    </item>
    <item>
      <title>Re: Which type of log transformation is most appropriate?</title>
      <link>https://community.esri.com/t5/spatial-statistics-questions/which-type-of-log-transformation-is-most/m-p/452368#M1369</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Perhaps something like Spearman's rank correlation would be useful, since what matters is probably not the actual attrition rate (again, it is a rate) versus student absenteeism (a rate as well) but maybe the ranks themselves. The skewness may reflect a threshold. Does the histogram show any bimodality? (You may be dealing with two 'populations' in the broadest sense of the term.)&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Sun, 03 Mar 2019 02:53:41 GMT</pubDate>
      <guid>https://community.esri.com/t5/spatial-statistics-questions/which-type-of-log-transformation-is-most/m-p/452368#M1369</guid>
      <dc:creator>DanPatterson_Retired</dc:creator>
      <dc:date>2019-03-03T02:53:41Z</dc:date>
    </item>
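    <!--
    Editorial sketch, not part of the thread: the Spearman rank correlation suggested in the reply above can be computed with the Python standard library alone. The function names and the sample rates are hypothetical, chosen only for illustration. Tied values (such as repeated zeros) receive the average of their ranks, and the coefficient is then the Pearson correlation of the two rank lists.

```python
from statistics import mean


def average_ranks(values):
    """Rank values from 1 to n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the run of tied values starting at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        tied_rank = (i + j) / 2 + 1  # mean of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = tied_rank
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Pearson correlation of the average ranks of x and y.

    Raises ZeroDivisionError if either input is constant.
    """
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = mean(rx), mean(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / (var_x * var_y) ** 0.5


# Hypothetical district rates (percent); zeros are valid observations.
attrition = [0.0, 0.0, 2.5, 4.0, 9.0]
absenteeism = [1.0, 3.0, 2.0, 6.0, 30.0]
print(spearman_rho(attrition, absenteeism))
```

    Because only the ranks enter the calculation, the result does not change under any strictly increasing transformation of either variable, so no log transformation is needed for this measure.
    -->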
    <item>
      <title>Re: Which type of log transformation is most appropriate?</title>
      <link>https://community.esri.com/t5/spatial-statistics-questions/which-type-of-log-transformation-is-most/m-p/452369#M1370</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;I don’t see bimodality in any of my variables.&lt;/P&gt;&lt;P&gt;Just want to clarify that not all my variables are ratio/percentage. I have 8 variables in percentage form, 1 is ratio, 4 are in dollar amount, 7 are sum total of some kinds of population.&lt;/P&gt;&lt;P&gt;Sajjid&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Sun, 03 Mar 2019 03:11:22 GMT</pubDate>
      <guid>https://community.esri.com/t5/spatial-statistics-questions/which-type-of-log-transformation-is-most/m-p/452369#M1370</guid>
      <dc:creator>sajjidbudhwani</dc:creator>
      <dc:date>2019-03-03T03:11:22Z</dc:date>
    </item>
    <item>
      <title>Re: Which type of log transformation is most appropriate?</title>
      <link>https://community.esri.com/t5/spatial-statistics-questions/which-type-of-log-transformation-is-most/m-p/452370#M1371</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;These are categorical data... "7 are sum total of some kinds of population." (old school nonparametric data)&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Sun, 03 Mar 2019 04:03:26 GMT</pubDate>
      <guid>https://community.esri.com/t5/spatial-statistics-questions/which-type-of-log-transformation-is-most/m-p/452370#M1371</guid>
      <dc:creator>DanPatterson_Retired</dc:creator>
      <dc:date>2019-03-03T04:03:26Z</dc:date>
    </item>
    <item>
      <title>Re: Which type of log transformation is most appropriate?</title>
      <link>https://community.esri.com/t5/spatial-statistics-questions/which-type-of-log-transformation-is-most/m-p/452371#M1372</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;No. They are discrete variables and not sub-types of a parent category. E.g. total population of teachers employed within a school district, total student population, free-and-reduced lunch population, etc.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Sun, 03 Mar 2019 04:13:10 GMT</pubDate>
      <guid>https://community.esri.com/t5/spatial-statistics-questions/which-type-of-log-transformation-is-most/m-p/452371#M1372</guid>
      <dc:creator>sajjidbudhwani</dc:creator>
      <dc:date>2019-03-03T04:13:10Z</dc:date>
    </item>
    <item>
      <title>Re: Which type of log transformation is most appropriate?</title>
      <link>https://community.esri.com/t5/spatial-statistics-questions/which-type-of-log-transformation-is-most/m-p/452372#M1373</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;I'm specifically keen on knowing whether there is any way I can transform the ratio/percentage variables and use them along with the other variables in one OLS model. Is that possible using ArcGIS Pro?&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Sun, 03 Mar 2019 04:16:46 GMT</pubDate>
      <guid>https://community.esri.com/t5/spatial-statistics-questions/which-type-of-log-transformation-is-most/m-p/452372#M1373</guid>
      <dc:creator>sajjidbudhwani</dc:creator>
      <dc:date>2019-03-03T04:16:46Z</dc:date>
    </item>
    <item>
      <title>Re: Which type of log transformation is most appropriate?</title>
      <link>https://community.esri.com/t5/spatial-statistics-questions/which-type-of-log-transformation-is-most/m-p/452373#M1374</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;That is different, then. As for your second question, you can transform the values using whatever transformation you want by adding fields to your tables and using the field calculator. There is a whole load of 'math' module functions built in. For the zeros, there are workarounds, like using different distributions or assigning ridiculously small nonzero numbers (there is lots of discussion in the literature; the first Dr Google hit is &lt;A class="link-titled" href="https://www.researchgate.net/post/Log_transformation_of_values_that_include_0_zero_for_statistical_analyses2" title="https://www.researchgate.net/post/Log_transformation_of_values_that_include_0_zero_for_statistical_analyses2"&gt;Log transformation of values that include 0 (zero) for statistical analyses?&lt;/A&gt;).&lt;/P&gt;&lt;P&gt;I will leave aside whether doing it is appropriate in the first place, given that there are alternatives. That is a discussion between you and your advisor.&lt;/P&gt;&lt;P&gt;Good luck.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Sun, 03 Mar 2019 04:26:45 GMT</pubDate>
      <guid>https://community.esri.com/t5/spatial-statistics-questions/which-type-of-log-transformation-is-most/m-p/452373#M1374</guid>
      <dc:creator>DanPatterson_Retired</dc:creator>
      <dc:date>2019-03-03T04:26:45Z</dc:date>
    </item>
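    <!--
    Editorial sketch, not part of the thread: two common ways to take logarithms when zero is a valid observation, written with Python's math module (the module the reply above mentions in connection with the field calculator). math.log1p and math.log are real functions; the helper names, the shift of 0.001, and the sample rates are assumptions made only for illustration. A simple moment-based skewness is included so the effect on a right-skewed variable can be compared before and after.

```python
import math


def log1p_transform(values):
    """log(1 + x): defined at x = 0, where it returns 0."""
    return [math.log1p(v) for v in values]


def shifted_log_transform(values, shift=0.001):
    """log(x + shift); the shift is an arbitrary illustrative constant."""
    return [math.log(v + shift) for v in values]


def moment_skewness(values):
    """Population (biased) skewness: third central moment / variance ** 1.5."""
    n = len(values)
    m = sum(values) / n
    m2 = sum((v - m) ** 2 for v in values) / n
    m3 = sum((v - m) ** 3 for v in values) / n
    return m3 / m2 ** 1.5


# Hypothetical right-skewed percentages, including valid zeros.
rates = [0.0, 0.0, 1.2, 2.5, 3.1, 4.8, 7.9, 15.0, 42.0]

print("raw skewness:        ", moment_skewness(rates))
print("log1p skewness:      ", moment_skewness(log1p_transform(rates)))
print("shifted-log skewness:", moment_skewness(shifted_log_transform(rates)))
```

    The shifted-log result depends on the constant chosen, which pushes the zeros far into the left tail as the shift shrinks, so it is worth trying more than one value and checking the histogram each time. Neither transform applies directly to a negatively skewed variable, and whether any of this is appropriate remains the question the reply leaves to the advisor.
    -->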
    <item>
      <title>Re: Which type of log transformation is most appropriate?</title>
      <link>https://community.esri.com/t5/spatial-statistics-questions/which-type-of-log-transformation-is-most/m-p/452374#M1375</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Almost forgot: if you use R for stats, you might be interested in the tools available through the R bridge. There is a space here that may have resource information: &lt;A href="https://community.esri.com/groups/rstats"&gt;https://community.esri.com/groups/rstats&lt;/A&gt;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Sun, 03 Mar 2019 09:47:06 GMT</pubDate>
      <guid>https://community.esri.com/t5/spatial-statistics-questions/which-type-of-log-transformation-is-most/m-p/452374#M1375</guid>
      <dc:creator>DanPatterson_Retired</dc:creator>
      <dc:date>2019-03-03T09:47:06Z</dc:date>
    </item>
  </channel>
</rss>

