We have a multi-machine highly-available ArcGIS Enterprise deployment, deployed on premises. When our infrastructure team does server patching and needs to restart all of the machines, what is the proper shutdown/restart order to avoid issues?
For example, patching occurred last night and this morning the portal site was not working. I had to restart both portal nodes to get it back up. From the logs, it looks like the portal servers were restarted before the file server (see table below), so the portal threw severe errors while the file server was down:
Cannot write to directory path '\\<fileserver machine>\esri\ArcGIS_Portal\arcgisportal'. Please check that the location is valid and that the Portal service account has permissions to the location.
HA: Error in HA plugin. Cannot read from directory path '\\<fileserver machine>\esri\ArcGIS_Portal\arcgisportal'. Please check that the location is valid and that the Portal service account has permissions to the location.
It seems the portal never recovered from this error until I restarted both nodes.
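To make the failure mode concrete, here is a minimal sketch of a pre-start check that waits until the shared content directory is readable and writable before anything that depends on it is started. This is not an Esri tool and not a recommended procedure; the function names, timeout, and polling interval are my own assumptions, and the path is passed in as a parameter rather than hard-coded.

```python
import os
import time
import uuid


def share_is_usable(path):
    """Return True if `path` can be listed and a small probe file can be
    written to it and removed again (hypothetical helper)."""
    try:
        os.listdir(path)  # read check
        probe = os.path.join(path, ".probe-" + uuid.uuid4().hex)
        with open(probe, "w") as fh:  # write check
            fh.write("ok")
        os.remove(probe)
        return True
    except OSError:
        # Covers a missing path, an unreachable share, and permission errors.
        return False


def wait_for_share(path, timeout_s=600, interval_s=10, sleep=time.sleep):
    """Poll until the share is usable or the timeout expires.

    Returns True if the share became usable, False on timeout.
    The timeout and interval defaults are arbitrary assumptions.
    """
    deadline = time.monotonic() + timeout_s
    while True:
        if share_is_usable(path):
            return True
        if time.monotonic() >= deadline:
            return False
        sleep(interval_s)
```

A startup script could call `wait_for_share()` with the portal content directory and only start the portal service when it returns `True`. The check has to run under the portal service account, since the errors above are about that account's access to the location.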
Our architecture looks like this (all components are on Windows Server 2016 machines running ArcGIS 10.8.1):
| Machines | Role |
| --- | --- |
| portal-01, portal-02 | Portal |
| hosting-01, hosting-02 | ArcGIS Server - hosting site |
| federated-01, federated-02 | ArcGIS Server - federated |
| image-01 | ArcGIS Server - federated (image server) |
| raster-01 | ArcGIS Server - federated (raster) |
| datastore-01, datastore-02 | ArcGIS Data Store (relational) |
| fileserv-01 | Windows file share (used for all shared directories for portal and server sites) |
| geoanalysis-01, geoanalysis-02, geoanalysis-03 | ArcGIS Server - federated (GeoAnalytics Server) |
| bigdata-01, bigdata-02, bigdata-03 | ArcGIS Data Store (spatiotemporal big data store) |
| geoevent-01 | ArcGIS Server - federated (GeoEvent Server) |
Great, thanks for the info!