Hi,
We have installed Image Server 11.1 & Raster Analytics on an NVIDIA DGX machine with 4 GPU cards.
How can we leverage all 4 GPUs while training the model in Deep Learning Studio?
Thanks & regards,
Saranya
Hi Saranya,
At 11.1, when it comes to training a model using a Raster Analytics (RA) site in an enterprise environment, each image server node within the RA site can still use one and only one GPU. Deep Learning Studio carries the same limitation, since it leverages the RA site for processing. We are looking to remove this limitation in a future release.
-Jay
Hi Saranya,
We are working with the Python API team on this, starting with support for multiple GPUs on a single image server node for some of the model types. Unfortunately, we don't yet have a clear timeline for when this will be available for most model types. We do understand that this is essential for some customers, especially for model training. Once it becomes available, Deep Learning Studio will naturally support it too. Thank you for your patience.
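For context, below is a minimal sketch of what single-node, multi-GPU training typically looks like in plain PyTorch using nn.DataParallel. This is not the arcgis.learn or Deep Learning Studio API; the model, data, and device handling here are generic placeholders to illustrate the underlying technique that multi-GPU support would build on.

```python
# Illustrative only: single-node, multi-GPU training with PyTorch nn.DataParallel.
# The arcgis.learn / Deep Learning Studio integration may expose this differently
# once multi-GPU support lands; everything below is a generic placeholder.
import torch
import torch.nn as nn

class TinyNet(nn.Module):
    def __init__(self):
        super().__init__()
        self.layers = nn.Sequential(nn.Linear(64, 128), nn.ReLU(), nn.Linear(128, 2))

    def forward(self, x):
        return self.layers(x)

model = TinyNet()
if torch.cuda.device_count() > 1:
    # Replicates the model on every visible GPU and splits each batch across them.
    model = nn.DataParallel(model)
device = "cuda" if torch.cuda.is_available() else "cpu"
model = model.to(device)

optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.CrossEntropyLoss()

# Dummy batch standing in for exported training chips.
x = torch.randn(32, 64, device=device)
y = torch.randint(0, 2, (32,), device=device)

optimizer.zero_grad()
loss = loss_fn(model(x), y)
loss.backward()
optimizer.step()
```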
-Jay
Thanks for the update Jay!
Hi,
I have a follow-up question.
On this page, it says that if I have multiple GPUs, I can run multiple instances. Does this mean that if I have 16 GPUs, I can run inference on 16 images in parallel, one image per GPU?
Using multiple GPUs on a raster analytics node within your raster analytics site is not yet fully supported for all models. `Utilization of multiple GPUs per server node is applicable to some deep learning model configurations predefined in ArcGIS. They include Tensorflow (ObjectDetectionAPI and DeepLab), Keras (MaskRCNN), and PyTorch.`
For supported models, yes, the job can be distributed across multiple GPUs on your RA node, provided multiple GPUs are available and Max instances per machine for the RA services is set to more than 1.
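As a rough mental model, spreading N concurrent worker processes across N GPUs usually means pinning each worker to a single device before any CUDA context is created. The RA site handles this assignment internally; the sketch below is only an illustration of that general pattern, and the instance ID and GPU count are hypothetical values.

```python
# Illustrative only: pin each worker process to one GPU so that concurrent
# service instances do not contend for the same device. The Raster Analytics
# site performs this kind of assignment internally; instance_id and gpu_count
# here are hypothetical.
import os

def pin_worker_to_gpu(instance_id: int, gpu_count: int) -> None:
    # Must run before any CUDA context is created in this process.
    os.environ["CUDA_VISIBLE_DEVICES"] = str(instance_id % gpu_count)

pin_worker_to_gpu(instance_id=3, gpu_count=16)

import torch
# Inside this process, the selected physical GPU now appears as cuda:0.
device = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")
print(device)
```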
Hi JayChen,
Thanks for your reply. Does the same apply to inferencing as well?
Yes, this applies to inferencing as well.