<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Forest Based Classification and Regression in Spatial Statistics Questions</title>
    <link>https://community.esri.com/t5/spatial-statistics-questions/forest-based-classification-and-regression/m-p/1558104#M2682</link>
    <description>&lt;P&gt;I am using 5 rasters in my model to predict a soil property point data. I am wondering what it means when you have a low variance explained 22% (model bag of errors), however in my training data regresssion diagnostics, the R2 is 89% and the SMSE and SE are also low.&amp;nbsp; Also, when I look at my residuals the model appears to have predicted very well.&amp;nbsp; &amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Wed, 13 Nov 2024 16:22:27 GMT</pubDate>
    <dc:creator>ArnieWaddell1</dc:creator>
    <dc:date>2024-11-13T16:22:27Z</dc:date>
    <item>
      <title>Forest Based Classification and Regression</title>
      <link>https://community.esri.com/t5/spatial-statistics-questions/forest-based-classification-and-regression/m-p/1558104#M2682</link>
      <description>&lt;P&gt;I am using 5 rasters in my model to predict a soil property point data. I am wondering what it means when you have a low variance explained 22% (model bag of errors), however in my training data regresssion diagnostics, the R2 is 89% and the SMSE and SE are also low.&amp;nbsp; Also, when I look at my residuals the model appears to have predicted very well.&amp;nbsp; &amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 13 Nov 2024 16:22:27 GMT</pubDate>
      <guid>https://community.esri.com/t5/spatial-statistics-questions/forest-based-classification-and-regression/m-p/1558104#M2682</guid>
      <dc:creator>ArnieWaddell1</dc:creator>
      <dc:date>2024-11-13T16:22:27Z</dc:date>
    </item>
    <item>
      <title>Re: Forest Based Classification and Regression</title>
      <link>https://community.esri.com/t5/spatial-statistics-questions/forest-based-classification-and-regression/m-p/1559005#M2683</link>
      <description>&lt;P&gt;Hi, it sounds like your model might be overfitting to the training data. So it looks like it's doing a really good job on the data the model was trained on, but then the model won't perform well when predicting to new data.&lt;/P&gt;
&lt;P&gt;There are a couple of ways to avoid this. Start by looking at Validation Options accordion in the forest-based and boosted classification and regression tool, and make sure there is some data set aside for evaluation (Training data excluded for validation %). Then, in the output, you can evaluate your R^2, errors, etc. on both the training and the validation data. If the metrics are much better for training than for testing, your model is overfitting.&lt;/P&gt;
&lt;P&gt;There is a checkbox in the tool to Optimize Parameters. This will choose the parameters (such as tree depth, etc.) that gives you the highest, say, R^2 specifically for your testing data. See more here:&amp;nbsp;&lt;A href="https://pro.arcgis.com/en/pro-app/latest/tool-reference/spatial-statistics/how-forest-works.htm#:~:text=To%20use%20parameter%20optimization" target="_blank"&gt;https://pro.arcgis.com/en/pro-app/latest/tool-reference/spatial-statistics/how-forest-works.htm#:~:text=To%20use%20parameter%20optimization&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;--Catherine McSorley&lt;/P&gt;</description>
      <pubDate>Fri, 15 Nov 2024 00:33:08 GMT</pubDate>
      <guid>https://community.esri.com/t5/spatial-statistics-questions/forest-based-classification-and-regression/m-p/1559005#M2683</guid>
      <dc:creator>CatherineMcSorley</dc:creator>
      <dc:date>2024-11-15T00:33:08Z</dc:date>
    </item>
    <item>
      <title>Re: Forest Based Classification and Regression</title>
      <link>https://community.esri.com/t5/spatial-statistics-questions/forest-based-classification-and-regression/m-p/1559187#M2684</link>
      <description>&lt;P&gt;Hi Catherine,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks for helping out.&amp;nbsp;&amp;nbsp;&lt;/P&gt;&lt;P&gt;My next questions are:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;If my training data and&amp;nbsp; validation show R2 values that vary greatly as shown below for Nb soil property, does this mean that&amp;nbsp; Forest Based Classification and Regression does not work for this data and I should just use interpolation such as Kriging?&lt;/LI&gt;&lt;LI&gt;Does the low % variation explained (8.7% and 9.8%) in the Model Out bag of errors indicate that my rasters have very little relationship in explaining Nb and again therefore the model should not be used?&lt;/LI&gt;&lt;LI&gt;What does the values of importance indicate in the Top Variable Importance Table?&amp;nbsp; They range from .02-.03.&amp;nbsp; The values are very low when compared to &lt;STRONG&gt;Ca &lt;/STRONG&gt;soil property which has a range from 17-57.&amp;nbsp;&amp;nbsp;&lt;/LI&gt;&lt;LI&gt;&lt;STRONG&gt;Ca &lt;/STRONG&gt;also has a high variance explained in the Model Out Bag of Errors, therefore I am assuming that this models rasters have a strong relationship with Ca?&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Nb&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="ArnieWaddell1_0-1731683325759.png" style="width: 400px;"&gt;&lt;img src="https://community.esri.com/t5/image/serverpage/image-id/119653i2A6293F7B6DBC963/image-size/medium?v=v2&amp;amp;px=400" role="button" title="ArnieWaddell1_0-1731683325759.png" alt="ArnieWaddell1_0-1731683325759.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="ArnieWaddell1_1-1731683325780.png" style="width: 400px;"&gt;&lt;img src="https://community.esri.com/t5/image/serverpage/image-id/119654i24EBF123805D9E55/image-size/medium?v=v2&amp;amp;px=400" role="button" title="ArnieWaddell1_1-1731683325780.png" alt="ArnieWaddell1_1-1731683325780.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Arnie Waddell, M.A.&lt;/P&gt;&lt;P&gt;GIS Specialist&lt;/P&gt;&lt;P&gt;2nd Floor 303 Main St.&lt;/P&gt;&lt;P&gt;Winnipeg, Manitoba&lt;/P&gt;&lt;P&gt;Agriculture and Agri-Food Canada / Government of Canada&lt;/P&gt;&lt;P&gt;&lt;A href="mailto:arnie.waddell@agr.gc.ca" target="_blank" rel="noopener"&gt;arnie.waddell@agr.gc.ca&lt;/A&gt; / Tel 431-275-4867&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="ArnieWaddell1_2-1731683325782.png" style="width: 400px;"&gt;&lt;img src="https://community.esri.com/t5/image/serverpage/image-id/119652i9514A0B5072C93E3/image-size/medium?v=v2&amp;amp;px=400" role="button" title="ArnieWaddell1_2-1731683325782.png" alt="ArnieWaddell1_2-1731683325782.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;From:&lt;/STRONG&gt; Esri Community &amp;lt;&lt;A href="mailto:esricommunity@esri.com" target="_blank" rel="noopener"&gt;esricommunity@esri.com&lt;/A&gt;&amp;gt;&lt;BR /&gt;&lt;STRONG&gt;Sent:&lt;/STRONG&gt; Thursday, November 14, 2024 6:33 PM&lt;BR /&gt;&lt;STRONG&gt;To:&lt;/STRONG&gt; &lt;A href="mailto:arnie.waddell@canada.ca" target="_blank" rel="noopener"&gt;arnie.waddell@canada.ca&lt;/A&gt;&lt;BR /&gt;&lt;STRONG&gt;Subject:&lt;/STRONG&gt; Re: Forest Based Classification and Regression (Subscription Update)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;CAUTION: This email originated from outside of the organization. Do not click links or open attachments unless you recognize the sender and know the content is safe.&lt;BR /&gt;ATTENTION: Ce courriel provient de l’extérieur de l’organisation. Ne cliquez pas sur les liens et n’ouvrez pas les pièces jointes à moins que vous ne reconnaissiez l’expéditeur et que vous sachiez que le contenu est sûr.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;*** DO NOT REPLY TO THIS E-MAIL ***&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;To respond, use the hyperlinked Response Options at the bottom of your notification below OR visit the Esri Community post directly and reply from there.&lt;/P&gt;&lt;P&gt;Hi ArnieWaddell1,&lt;/P&gt;&lt;P&gt;CatherineMcSorley (Esri Contributor) posted a new reply in &lt;A href="https://can01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fcommunity.esri.com%2Ft5%2Fspatial-statistics-questions%2Fbd-p%2Fspatial-statistics-questions&amp;amp;data=05%7C02%7Carnie.waddell%40AGR.GC.CA%7C0ae9e5425c7344d8a0c408dd050d1767%7C9da98bb118574cc387519a49e35d24cd%7C0%7C0%7C638672276012579199%7CUnknown%7CTWFpbGZsb3d8eyJFbXB0eU1hcGkiOnRydWUsIlYiOiIwLjAuMDAwMCIsIlAiOiJXaW4zMiIsIkFOIjoiTWFpbCIsIldUIjoyfQ%3D%3D%7C0%7C%7C%7C&amp;amp;sdata=9OVS%2F2uqJ0e4WTedpNNtlrI7jETTZD%2BNMHuawAWopDU%3D&amp;amp;reserved=0" target="_blank" rel="noopener"&gt;Spatial Statistics Questions&lt;/A&gt; on 11-14-2024 04:33 PM:&lt;/P&gt;&lt;H3&gt;&lt;A href="https://can01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fcommunity.esri.com%2Ft5%2Fspatial-statistics-questions%2Fforest-based-classification-and-regression%2Fm-p%2F1559005%23M2683&amp;amp;data=05%7C02%7Carnie.waddell%40AGR.GC.CA%7C0ae9e5425c7344d8a0c408dd050d1767%7C9da98bb118574cc387519a49e35d24cd%7C0%7C0%7C638672276012597257%7CUnknown%7CTWFpbGZsb3d8eyJFbXB0eU1hcGkiOnRydWUsIlYiOiIwLjAuMDAwMCIsIlAiOiJXaW4zMiIsIkFOIjoiTWFpbCIsIldUIjoyfQ%3D%3D%7C0%7C%7C%7C&amp;amp;sdata=EDCEMEOdXIZdc50Jsxktr%2BWkgUCgXTgDS2cNATHq3Vw%3D&amp;amp;reserved=0" target="_blank" rel="noopener"&gt;Re: Forest Based Classification and Regression&lt;/A&gt;&lt;/H3&gt;&lt;P&gt;Hi, it sounds like your model might be overfitting to the training data. So it looks like it's doing a really good job on the data the model was trained on, but then the model won't perform well when predicting to new data.&lt;/P&gt;&lt;P&gt;There are a couple of ways to avoid this. Start by looking at Validation Options accordion in the forest-based and boosted classification and regression tool, and make sure there is some data set aside for evaluation (Training data excluded for validation %). Then, in the output, you can evaluate your R^2, errors, etc. on both the training and the validation data. If the metrics are much better for training than for testing, your model is overfitting.&lt;/P&gt;&lt;P&gt;There is a checkbox in the tool to Optimize Parameters. This will choose the parameters (such as tree depth, etc.) that gives you the highest, say, R^2 specifically for your testing data. See more here:&amp;nbsp;&lt;A href="https://can01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fpro.arcgis.com%2Fen%2Fpro-app%2Flatest%2Ftool-reference%2Fspatial-statistics%2Fhow-forest-works.htm%23%3A~%3Atext%3DTo%2520use%2520parameter%2520optimization&amp;amp;data=05%7C02%7Carnie.waddell%40AGR.GC.CA%7C0ae9e5425c7344d8a0c408dd050d1767%7C9da98bb118574cc387519a49e35d24cd%7C0%7C0%7C638672276012606543%7CUnknown%7CTWFpbGZsb3d8eyJFbXB0eU1hcGkiOnRydWUsIlYiOiIwLjAuMDAwMCIsIlAiOiJXaW4zMiIsIkFOIjoiTWFpbCIsIldUIjoyfQ%3D%3D%7C0%7C%7C%7C&amp;amp;sdata=0qfsa07qVpNZ3wCuYqBTQImGX4E3nplB17euf4k1eBU%3D&amp;amp;reserved=0" target="_blank" rel="noopener"&gt;https://pro.arcgis.com/en/pro-app/latest/tool-reference/spatial-statistics/how-forest-works.htm#:~:text=To%20use%20parameter%20optimization&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;--Catherine McSorley&lt;/P&gt;&lt;P&gt;&lt;A href="mailto:esri.prod|179a0d17|5ae2f6a5-b435-4e55-a887-f800e75fd81b@replybyemail.usw2.prod.hosted.lithcloud.com?subject=Re:%20Forest%20Based%20Classification%20and%20Regression&amp;amp;body=%0A%0A%23%23-%20Please%20type%20your%20reply%20above%20this%20line%20-%23%23" target="_blank" rel="noopener"&gt;Reply&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Esri Community sent this message to &lt;A href="mailto:arnie.waddell@canada.ca" target="_blank" rel="noopener"&gt;arnie.waddell@canada.ca&lt;/A&gt;.&lt;BR /&gt;You are receiving this email because a new message matches your subscription to a topic.&lt;/P&gt;&lt;P&gt;If you do not want to receive notification for this message, &lt;A href="https://can01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fcommunity.esri.com%2Ft5%2Faction%2Freplybyemailactionpage%3Freplybyemail.action%3Desri.prod%257C188c0d5e%257C0a231c5c-9bc2-437b-89bd-a14b3f39a7cb%26dest_url%3Dhttps%253A%252F%252Fcommunity.esri.com%252Ft5%252Fspatial-statistics-questions%252Fforest-based-classification-and-regression%252Ftd-p%252F1558104%26ticket%3DpuXuE7_V-AGD_184833&amp;amp;data=05%7C02%7Carnie.waddell%40AGR.GC.CA%7C0ae9e5425c7344d8a0c408dd050d1767%7C9da98bb118574cc387519a49e35d24cd%7C0%7C0%7C638672276012615678%7CUnknown%7CTWFpbGZsb3d8eyJFbXB0eU1hcGkiOnRydWUsIlYiOiIwLjAuMDAwMCIsIlAiOiJXaW4zMiIsIkFOIjoiTWFpbCIsIldUIjoyfQ%3D%3D%7C0%7C%7C%7C&amp;amp;sdata=KgqLwB3ljmD0jJaCeI68RtvaBnZH6SLWdQXB6H0NQdg%3D&amp;amp;reserved=0" target="_blank" rel="noopener"&gt;unsubscribe the topic&lt;/A&gt; or &lt;A href="https://can01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fcommunity.esri.com%2Ft5%2Faction%2Freplybyemailactionpage%3Freplybyemail.action%3Desri.prod%257C19670d14%257Cd2df0232-b7c4-41c4-9bdb-95e68bc57559%26dest_url%3Dhttps%253A%252F%252Fcommunity.esri.com%252Ft5%252Fspatial-statistics-questions%252Fforest-based-classification-and-regression%252Fm-p%252F1558104%2523M2682%26ticket%3DpuXuE7_V-AGD_184833&amp;amp;data=05%7C02%7Carnie.waddell%40AGR.GC.CA%7C0ae9e5425c7344d8a0c408dd050d1767%7C9da98bb118574cc387519a49e35d24cd%7C0%7C0%7C638672276012625057%7CUnknown%7CTWFpbGZsb3d8eyJFbXB0eU1hcGkiOnRydWUsIlYiOiIwLjAuMDAwMCIsIlAiOiJXaW4zMiIsIkFOIjoiTWFpbCIsIldUIjoyfQ%3D%3D%7C0%7C%7C%7C&amp;amp;sdata=cll7tNA3wOpBghoBPZGNSLoaNd0GcPYkFGk3NxIRThI%3D&amp;amp;reserved=0" target="_blank" rel="noopener"&gt;mute the message&lt;/A&gt;.&lt;BR /&gt;To manage your email notifications, &lt;A href="https://can01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fcommunity.esri.com%2Fccqpr47374%2Fuser_subscriptions&amp;amp;data=05%7C02%7Carnie.waddell%40AGR.GC.CA%7C0ae9e5425c7344d8a0c408dd050d1767%7C9da98bb118574cc387519a49e35d24cd%7C0%7C0%7C638672276012634301%7CUnknown%7CTWFpbGZsb3d8eyJFbXB0eU1hcGkiOnRydWUsIlYiOiIwLjAuMDAwMCIsIlAiOiJXaW4zMiIsIkFOIjoiTWFpbCIsIldUIjoyfQ%3D%3D%7C0%7C%7C%7C&amp;amp;sdata=U7VR6qG5npkulVMr23c%2FHolAQPRd5I7UJxHPJoWURus%3D&amp;amp;reserved=0" target="_blank" rel="noopener"&gt;go to your settings in the community&lt;/A&gt;.&lt;/P&gt;</description>
      <pubDate>Fri, 15 Nov 2024 15:09:21 GMT</pubDate>
      <guid>https://community.esri.com/t5/spatial-statistics-questions/forest-based-classification-and-regression/m-p/1559187#M2684</guid>
      <dc:creator>ArnieWaddell1</dc:creator>
      <dc:date>2024-11-15T15:09:21Z</dc:date>
    </item>
  </channel>
</rss>

