<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Training Deep Learning Model Workflow misunderstanding in ArcGIS Image Analyst Questions</title>
    <link>https://community.esri.com/t5/arcgis-image-analyst-questions/training-deep-learning-model-workflow/m-p/1353000#M509</link>
    <description>&lt;P&gt;I am trying to see if I can train a model and detect objects within an aerial image.&lt;/P&gt;&lt;P&gt;I have followed some of the tutorials and went through some of the deep learning courses, and wanted to try on my own using Ohio imagery and I figured ponds would be an easy/fun thing to do.&lt;/P&gt;&lt;P&gt;I can't get it to work correctly, and I believe it is due to me not understanding the requirements or the parameters fully, but I can’t find further information, or I’m missing a simple step.&lt;/P&gt;&lt;P&gt;My PC specs:&lt;/P&gt;&lt;P&gt;CPU i9 13900KS 3.2 GHz&lt;/P&gt;&lt;P&gt;128 ram ddr5 4.2 GHz&lt;/P&gt;&lt;P&gt;RTX 4090 24GB&lt;/P&gt;&lt;P&gt;ArcPro 3.2 and appropriate deep learning libraries, got help from tech support setting it up correctly.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Objective&lt;/STRONG&gt;: create polygons around all ponds in imagery sample.&lt;/P&gt;&lt;P&gt;Imagery: Lorain County OSIP III 6IN MrSID RGB 20x - 2017 (13.2 GB) from&amp;nbsp;&lt;A href="https://gis1.oit.ohio.gov/geodatadownload/" target="_blank"&gt;OGRIP Data Downloads (ohio.gov)&lt;/A&gt;&lt;/P&gt;&lt;P&gt;What I've done:&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Label Objects for Deep Learning&lt;/STRONG&gt;-&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I've digitized 291 features. (maybe this is not enough?)&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Molality_0-1700674488074.png" style="width: 400px;"&gt;&lt;img src="https://community.esri.com/t5/image/serverpage/image-id/86899i9A6036AEA73B6ADD/image-size/medium?v=v2&amp;amp;px=400" role="button" title="Molality_0-1700674488074.png" alt="Molality_0-1700674488074.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Export Training Data:&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;Input raster: I’ve tried using the .sid, now I’ve converted the .sid to a .tif&lt;/P&gt;&lt;P&gt;No additional input raster&lt;/P&gt;&lt;P&gt;Output folder: default&lt;/P&gt;&lt;P&gt;Input feature: shapefile from the labeled objects&lt;/P&gt;&lt;P&gt;Class Value Field: blank&lt;/P&gt;&lt;P&gt;Buffer Radius: 0&lt;/P&gt;&lt;P&gt;Input Mask Poly: blank&lt;/P&gt;&lt;P&gt;Image Format: Tiff&lt;/P&gt;&lt;P&gt;Tile Size X/Y: I’ve tried 256, 512, and 576&amp;nbsp;(from my understanding, the tile size should be large enough to contain the object within it, but it is okay if a few objects are bigger than the tile size. I also couldn’t find any literature on tile size limitations, is 1024 okay if your pc can run it?&lt;/P&gt;&lt;P&gt;Stride: I’ve tried 0, 8, 288, my understanding is that stride helps with limited training and that the stride value of half the tile size will have a 50% overlap.&lt;/P&gt;&lt;P&gt;Rotation Angle: I’ve tried 45, and zero.&lt;/P&gt;&lt;P&gt;Reference System: Map space&lt;/P&gt;&lt;P&gt;Metadata Format: I’ve tried PASCAL Visual Object Classes and RCNN Masks&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Train Deep Learning Model&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;Input Training Data: the exported training data&lt;/P&gt;&lt;P&gt;Output Folder: Default&lt;/P&gt;&lt;P&gt;Max Epochs: 50&lt;/P&gt;&lt;P&gt;Pre-trained Model: Blank&lt;/P&gt;&lt;P&gt;Model Type: MaskRCNN&lt;/P&gt;&lt;P&gt;Batch size: 64&lt;/P&gt;&lt;P&gt;Validation %: 10&lt;/P&gt;&lt;P&gt;Backbone Model: Blank&lt;/P&gt;&lt;P&gt;Monitor Metric: Validation Loss&lt;/P&gt;&lt;P&gt;Stop when Model stops improving: tried unchecked and checked&lt;/P&gt;&lt;P&gt;Freeze Model: tried unchecked and checked.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Detect Objects using deep learning:&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;Never got one model to give proper results.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I’ve ran through other tutorials and got them to work, but I’m really struggling to understand what I’m doing wrong when making my own model. Happy Friendsgiving! and Thank you for your time.&lt;/P&gt;</description>
    <pubDate>Thu, 23 Nov 2023 16:34:46 GMT</pubDate>
    <dc:creator>Molality</dc:creator>
    <dc:date>2023-11-23T16:34:46Z</dc:date>
    <item>
      <title>Training Deep Learning Model Workflow misunderstanding</title>
      <link>https://community.esri.com/t5/arcgis-image-analyst-questions/training-deep-learning-model-workflow/m-p/1353000#M509</link>
      <description>&lt;P&gt;I am trying to see if I can train a model and detect objects within an aerial image.&lt;/P&gt;&lt;P&gt;I have followed some of the tutorials and went through some of the deep learning courses, and wanted to try on my own using Ohio imagery and I figured ponds would be an easy/fun thing to do.&lt;/P&gt;&lt;P&gt;I can't get it to work correctly, and I believe it is due to me not understanding the requirements or the parameters fully, but I can’t find further information, or I’m missing a simple step.&lt;/P&gt;&lt;P&gt;My PC specs:&lt;/P&gt;&lt;P&gt;CPU i9 13900KS 3.2 GHz&lt;/P&gt;&lt;P&gt;128 ram ddr5 4.2 GHz&lt;/P&gt;&lt;P&gt;RTX 4090 24GB&lt;/P&gt;&lt;P&gt;ArcPro 3.2 and appropriate deep learning libraries, got help from tech support setting it up correctly.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Objective&lt;/STRONG&gt;: create polygons around all ponds in imagery sample.&lt;/P&gt;&lt;P&gt;Imagery: Lorain County OSIP III 6IN MrSID RGB 20x - 2017 (13.2 GB) from&amp;nbsp;&lt;A href="https://gis1.oit.ohio.gov/geodatadownload/" target="_blank"&gt;OGRIP Data Downloads (ohio.gov)&lt;/A&gt;&lt;/P&gt;&lt;P&gt;What I've done:&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Label Objects for Deep Learning&lt;/STRONG&gt;-&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I've digitized 291 features. (maybe this is not enough?)&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Molality_0-1700674488074.png" style="width: 400px;"&gt;&lt;img src="https://community.esri.com/t5/image/serverpage/image-id/86899i9A6036AEA73B6ADD/image-size/medium?v=v2&amp;amp;px=400" role="button" title="Molality_0-1700674488074.png" alt="Molality_0-1700674488074.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Export Training Data:&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;Input raster: I’ve tried using the .sid, now I’ve converted the .sid to a .tif&lt;/P&gt;&lt;P&gt;No additional input raster&lt;/P&gt;&lt;P&gt;Output folder: default&lt;/P&gt;&lt;P&gt;Input feature: shapefile from the labeled objects&lt;/P&gt;&lt;P&gt;Class Value Field: blank&lt;/P&gt;&lt;P&gt;Buffer Radius: 0&lt;/P&gt;&lt;P&gt;Input Mask Poly: blank&lt;/P&gt;&lt;P&gt;Image Format: Tiff&lt;/P&gt;&lt;P&gt;Tile Size X/Y: I’ve tried 256, 512, and 576&amp;nbsp;(from my understanding, the tile size should be large enough to contain the object within it, but it is okay if a few objects are bigger than the tile size. I also couldn’t find any literature on tile size limitations, is 1024 okay if your pc can run it?&lt;/P&gt;&lt;P&gt;Stride: I’ve tried 0, 8, 288, my understanding is that stride helps with limited training and that the stride value of half the tile size will have a 50% overlap.&lt;/P&gt;&lt;P&gt;Rotation Angle: I’ve tried 45, and zero.&lt;/P&gt;&lt;P&gt;Reference System: Map space&lt;/P&gt;&lt;P&gt;Metadata Format: I’ve tried PASCAL Visual Object Classes and RCNN Masks&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Train Deep Learning Model&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;Input Training Data: the exported training data&lt;/P&gt;&lt;P&gt;Output Folder: Default&lt;/P&gt;&lt;P&gt;Max Epochs: 50&lt;/P&gt;&lt;P&gt;Pre-trained Model: Blank&lt;/P&gt;&lt;P&gt;Model Type: MaskRCNN&lt;/P&gt;&lt;P&gt;Batch size: 64&lt;/P&gt;&lt;P&gt;Validation %: 10&lt;/P&gt;&lt;P&gt;Backbone Model: Blank&lt;/P&gt;&lt;P&gt;Monitor Metric: Validation Loss&lt;/P&gt;&lt;P&gt;Stop when Model stops improving: tried unchecked and checked&lt;/P&gt;&lt;P&gt;Freeze Model: tried unchecked and checked.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Detect Objects using deep learning:&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;Never got one model to give proper results.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I’ve ran through other tutorials and got them to work, but I’m really struggling to understand what I’m doing wrong when making my own model. Happy Friendsgiving! and Thank you for your time.&lt;/P&gt;</description>
      <pubDate>Thu, 23 Nov 2023 16:34:46 GMT</pubDate>
      <guid>https://community.esri.com/t5/arcgis-image-analyst-questions/training-deep-learning-model-workflow/m-p/1353000#M509</guid>
      <dc:creator>Molality</dc:creator>
      <dc:date>2023-11-23T16:34:46Z</dc:date>
    </item>
    <item>
      <title>Re: Training Deep Learning Model Workflow misunderstanding</title>
      <link>https://community.esri.com/t5/arcgis-image-analyst-questions/training-deep-learning-model-workflow/m-p/1353596#M516</link>
      <description>&lt;P&gt;&lt;a href="https://community.esri.com/t5/user/viewprofilepage/user-id/319812"&gt;@Molality&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Your workflow appears right to me.&amp;nbsp;&lt;/P&gt;&lt;P&gt;How many samples do you have in your training data? Also, in the Train Deep Learning tool, what is the Chip Size set to? By default it's set to 224. Please match it with your Tile Size.&amp;nbsp;&lt;/P&gt;&lt;P&gt;Common practice is to choose a tile size that is large enough to capture the entire object of interest, while also providing enough context for accurate detection. When objects size varies a lot in your samples, one of the approaches can be to use three times the average object size. I have read some research papers on tile size and they appear to be specific to different use cases. I believe you can use 1024 size if it's too big for your GPU, you can try smaller Batch Size.&amp;nbsp;&lt;/P&gt;&lt;P&gt;Cheers!&lt;/P&gt;&lt;P&gt;Pavan&lt;/P&gt;</description>
      <pubDate>Mon, 27 Nov 2023 16:58:36 GMT</pubDate>
      <guid>https://community.esri.com/t5/arcgis-image-analyst-questions/training-deep-learning-model-workflow/m-p/1353596#M516</guid>
      <dc:creator>PavanYadav</dc:creator>
      <dc:date>2023-11-27T16:58:36Z</dc:date>
    </item>
  </channel>
</rss>

