
ArcGIS Server 11.3 Upgrade Failed due to Corrupted Tomcat server.xml

09-01-2024 09:15 PM
cle444
by
Regular Contributor

Hi ESRI community,

Our ArcGIS Enterprise 11.3 upgrade failed because the hosting server encountered a Tomcat server.xml issue. To understand why it happens, here are some specific questions (refer to the Question section below). It would be much appreciated if someone could shed some light.

 

Background

We are upgrading our existing ArcGIS Enterprise production environment from 10.9.1 to 11.3.

  • Standard multiple machine deployment, e.g. one server for Web Adaptor, one server for ArcGIS Portal, one server for ArcGIS Hosting GIS Server, one server for Configuration Store, etc.
  • Windows Server 2019.
  • All machines meet the system requirements, e.g., there is more than enough disk space on all servers, the ArcGIS Server machine has the correct .NET Desktop Runtime installed, etc.
  • The plan was to manually upgrade each component following the official documentation and the recommended upgrade order (screenshot: cle444_0-1725249622189.png).
  • After Portal was successfully upgraded, we continued by upgrading the hosting ArcGIS Server, running the Windows installation file (screenshots: cle444_1-1725249622190.png, cle444_8-1725249907690.png).
  • The above ArcGIS Server 11.3 installation took about 1 hour to complete. There were no errors, and the installation was successful.

 

Issue

After the above steps, a browser session opened automatically and tried to connect to ArcGIS Server Manager to complete the upgrade, but the page would not load. We tried both https://localhost:6443/arcgis/manager/ and https://servername.xyzxy.local:6443/arcgis/manager, and both showed the same browser error, shown below. It looks like the Tomcat server inside ArcGIS Server is down.

cle444_3-1725249622192.png

 

Some facts to help diagnose

 1. Windows Service shows ArcGIS Server Windows Service is running under Local System.

cle444_4-1725249622193.png

2. In the Task Manager, ArcGIS.exe is running with no ArcSOC.exe processes running.

3. The Tomcat server.xml appears to have been corrupted by the upgrade. The two screenshots below show the <Connector> section at the end of the file before and after the upgrade.

cle444_5-1725249622195.png

cle444_6-1725249622197.png

4. The certificates folder “<ArcGIS Server Root>\Server\framework\etc\certificates” should contain three files, but it contained six after the upgrade: the three expected files plus one lock file for each of them, as shown below.

cle444_7-1725249622198.png

5. An ESRI technical document, https://support.esri.com/en-us/knowledge-base/how-to-fix-an-arcgis-keystore-or-server-xml-corruption..., provides a ‘workaround’ for the problem, and it worked. However, I had to copy the keystore and certificates from the DEV environment to production, which concerns me for future upgrades.

6. MS Defender was very active during the installation. For about 15 minutes, the installer showed ‘not responding’. Our IT team checked the Defender report and found nothing abnormal recorded for the failed server.
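
For fact 3, one quick way to tell whether server.xml is actually malformed, rather than just different from before, is to try parsing it. Below is a minimal Python sketch, not an ESRI tool. `check_server_xml` is a hypothetical helper name, and the file location mentioned in the comment is an assumption that should be adjusted to the actual install.

```python
import xml.etree.ElementTree as ET


def check_server_xml(path):
    """Try to parse a Tomcat server.xml file.

    Returns (True, connectors) when the file is well-formed XML, where
    connectors is a list of attribute dicts, one per <Connector> element.
    Returns (False, reason) when the file cannot be read or parsed.
    """
    try:
        tree = ET.parse(path)
    except ET.ParseError as exc:
        # A truncated or half-written file typically ends up here.
        return False, f"not well-formed XML: {exc}"
    except OSError as exc:
        return False, f"cannot read file: {exc}"

    connectors = tree.getroot().findall(".//Connector")
    return True, [dict(c.attrib) for c in connectors]


# Example call (assumed location, adjust to your install):
# check_server_xml(r"<ArcGIS Server Root>\Server\framework\runtime\tomcat\conf\server.xml")
```

Running the same check against a pre-upgrade backup of the file and comparing the returned connector attributes gives a text-based version of the before/after screenshots.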

 

Questions 

1. During the installation of the ArcGIS Server, a domain service account was nominated to run it. But the Windows Service shows it is running under the Local System after installation. Is this expected? The 10.9.1 ArcGIS Server Windows service is running under the specified domain account.

2. Are there any known conflicts between MS Defender, the antivirus software and the ArcGIS server installation process?

3. If we rule out the low disk space factor on both the server and the configuration store server, what may have caused a corrupted server.xml while upgrading?

4. To avoid the corrupted server.xml issue, is there anything extra I should check before the upgrade?

5. If it happens again, what is the recommended course of action? If we use this workaround, what are the implications of copying the DEV keystore and certificates to PROD, given that the certificates are issued to different FQDNs?

I would appreciate any information on any of the above questions. 

Thanks in advance.

Hua


Accepted Solutions
cle444
by
Regular Contributor

Thanks @DavidColey. I did some research and contacted local ESRI support. I can confirm that your comments are well aligned with what I have learned so far. We just upgraded successfully on the 2nd attempt.

To answer my questions above.

Question: During the installation of the ArcGIS Server, a domain service account was nominated to run it. But the Windows Service shows it runs under the Local System after installation. Is this expected? The 10.9.1 ArcGIS Server Windows service is running under the specified domain account.

A: As David mentioned, the ArcGIS Server Windows service should run under the service account nominated during installation. A silent fallback to Local System can indicate a permission issue. Make sure to run the installation file as an administrator, and make sure the nominated service account has full control over the ArcGIS-related folders, e.g., C:\arcgisserver and C:\Program Files\ArcGIS (these folders can differ in a non-default configuration).
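
To check the 'Log On As' account without opening the Services window, the output of the Windows `sc qc` command can be read, since it includes a `SERVICE_START_NAME` line. The following is a rough Python sketch under a few assumptions: `parse_start_name` and `service_account` are hypothetical helper names, and the service name to pass in (for example "ArcGIS Server") is an assumption that should be verified on the machine.

```python
import subprocess


def parse_start_name(sc_qc_output):
    """Extract the SERVICE_START_NAME value from `sc qc` output text.

    Returns the account string (e.g. "LocalSystem" or "DOMAIN\\account"),
    or None if the line is not present.
    """
    for line in sc_qc_output.splitlines():
        key, sep, value = line.partition(":")
        if sep and key.strip() == "SERVICE_START_NAME":
            return value.strip()
    return None


def service_account(service_name):
    """Run `sc qc <service_name>` (Windows only) and return the logon account."""
    result = subprocess.run(
        ["sc", "qc", service_name],
        capture_output=True,
        text=True,
        check=True,  # raise if the service name is unknown
    )
    return parse_start_name(result.stdout)
```

If the returned value is LocalSystem when a domain account was nominated during installation, that matches the symptom described above and points towards checking the account's folder permissions.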

 

Question: Are there any known conflicts between MS Defender, the antivirus software and the ArcGIS server installation process?

A: ESRI recommends that you turn off anti-virus software or active scanning during installation. Anti-virus software will significantly slow down the installation and tends to resist the file changes the installer makes.

 

Question: If we rule out the low disk space factor on both the server and the configuration store server, what may have caused a corrupted server.xml while upgrading?

A: Not sure. I suspect a permissions problem, because we did find a permission issue.

 

Question: To avoid the corrupted server.xml issue, is there anything extra I should check before the upgrade?

A: At a minimum, back up the system, along with key files such as the Tomcat conf folder, the certificates folder, and the config store.
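
As an illustration of the key-file part of that backup, here is a small Python sketch that copies a list of files and folders into a timestamped folder before the upgrade. `backup_paths` is a hypothetical helper, and any paths passed to it are assumptions that depend on the install. It only copies files and is not a substitute for a full system backup.

```python
import shutil
from datetime import datetime
from pathlib import Path


def backup_paths(sources, backup_root):
    """Copy each existing file or folder in `sources` into a new
    timestamped folder under `backup_root`.

    Returns (target_folder, list_of_copied_destinations).
    Sources that do not exist are skipped. Sources that share the same
    final name would collide, so give them distinct names first.
    """
    stamp = datetime.now().strftime("%Y%m%d-%H%M%S")
    target = Path(backup_root) / f"pre-upgrade-{stamp}"
    target.mkdir(parents=True)

    copied = []
    for src in map(Path, sources):
        if not src.exists():
            continue  # nothing to copy for this entry
        dest = target / src.name
        if src.is_dir():
            shutil.copytree(src, dest)
        else:
            shutil.copy2(src, dest)  # copy2 keeps file timestamps
        copied.append(dest)
    return target, copied
```

For example, the Tomcat conf folder and the certificates folder mentioned in this thread could be passed as `sources`, with `backup_root` pointing at a drive outside the ArcGIS install location.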

 

Question: If it happens again, what is the recommended exercise? If we use this workaround, what is the implication of copying the key store and DEV certificates to PROD, as the certificates are issued to different FQDNs?

A: The above workaround is still valid for 11.3.

 

 

2 Replies
DavidColey
Honored Contributor

Hello Hua - I'm not sure that an underscore in a domain service account name is allowed and that may be why the 'Log On As' from the services window is showing as running under Local System.

- or the directory permissions for the domain account are such that the domain account could not fully access all of Server's directories

- or it did not have full write permissions to create any new directories that 11.3 may need.

If the domain account is set up properly, the ArcGIS Server windows process should be showing that it is running from the account:  e.g. domain\arcgisserviceaccount, not Local System or Local Service.

Documentation also recommends suspending any active virus scanning, if possible, just before the upgrade. This cuts the upgrade and post-upgrade times down considerably, and if you remove the Web Adaptor before upgrading, the chance of anyone accessing the site during the upgrade is minimized.
