Select to view content in your preferred language

Fine-Tune deep learning detect objects - swimming pool

184
3
02-26-2025 05:47 AM
Labels (1)
Letíciad_Agosto_Miguel_Fonseca
Esri Contributor

Hello everyone, 

I’m currently working on fine-tuning the pool detection model for use in southern Brazil. I followed the steps outlined in this post: https://doc.arcgis.com/en/pretrained-models/latest/imagery/finetuning-the-pool-detection-usa.htm

 

Raster information:

Letciad_Agosto_Miguel_Fonseca_0-1740576450690.png


Training Samples:

Letciad_Agosto_Miguel_Fonseca_1-1740576515748.png

Parameters:

Letciad_Agosto_Miguel_Fonseca_3-1740576720521.png

Image training sample (0.3m):

Letciad_Agosto_Miguel_Fonseca_2-1740576674307.png

 

However, the results I'm getting seem a bit unusual:

Letciad_Agosto_Miguel_Fonseca_4-1740577329200.png

Does anyone have any suggestions on how I can improve these results?

Thanks in advance!

 

0 Kudos
3 Replies
PavanYadav
Esri Regular Contributor

Hi @Letíciad_Agosto_Miguel_Fonseca it does look concerning. The model is trained with 8-bit, 3-band high resolution (5-30 centimeters) imagery. What kind of imagery did you use to create the training samples. Can you share a screen capture of its stats.txt file?.it should look something like this example 

PavanYadav_0-1741799925651.png

 

 

Pavan Yadav
Product Engineer at Esri
AI for Imagery
Connect with me on LinkedIn!
Contact Esri Support Services
Letíciad_Agosto_Miguel_Fonseca
Esri Contributor

Hello @PavanYadav , first of all, thank you for your attention. Indeed, I noticed some inconsistencies.

Here is the stats.txt from my training samples:

Letciad_Agosto_Miguel_Fonseca_0-1742390209313.png

The .TIFF image I exported from the .ECW file is showing 4 bands, although there are only 3.

I re-exported it using the Raster to Other Format geoprocessing tool and now it is right:

Letciad_Agosto_Miguel_Fonseca_1-1742390310633.png

Now, the statistics for my training samples (62 swimming pools) are as follows:

 

Letciad_Agosto_Miguel_Fonseca_3-1742390384331.png

However, I believe I need more training data and better resources to fine-tune the model. When I tried running the tool, my ArcGIS Pro crashed, and it's not feasible to use 100 epochs and a batch size of 64.

I’ll try running it on another server and see if I can successfully fine-tune the model.

If you have any more suggestions on how to improve, please let me know.

 

0 Kudos
PavanYadav
Esri Regular Contributor

@Letíciad_Agosto_Miguel_FonsecaSorry for the late response. ArcGIS Pro shouldn't crash even if the batch size exceeds GPU memory. Can you try using a smaller batch size, such as 2 or 4? More training samples should also help. Another thing to try is building a new model solely with your training data (don't use the pre-trained model as a starting point) to see if you get any meaningful results.

Cheers!

Pavan Yadav
Product Engineer at Esri
AI for Imagery
Connect with me on LinkedIn!
Contact Esri Support Services