Good to hear! I really hope that this resolves the instability problems that you are seeing.
Please report your findings so that everyone can benefit from them.
Did your server logs, as seen in ArcGIS Server Manager, have any errors in them? In particular, I am wondering about out of memory/heap type errors.
Here is an example of the full error information:
<Msg time="2013-05-22T15:21:39,778" type="SEVERE" code="9003" source="Rest" process="3308" thread="22" methodName="" machine="TERAARCSERV1.TERA1.COM" user="" elapsed="">Unable to process request. Error handling service request : Unknown Container Exception: org.apache.openejb.client.RemoteFailoverException: Cannot complete request. Retry attempted on 1 servers; nested exception is: java.io.IOException: Cannot deternmine server protocol version: Received null/0.0</Msg>
If you look at the source field, it is either Rest or Server. I think that there is a single Rest process (on each GIS Server) whose job it is to receive the incoming requests and dispatch them. The Server process is an instance of ArcSOC, which is the process that does the actual work, e.g. running a map service, and there are many of these.
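If it helps, here is a rough Python sketch for tallying log entries by machine, source and type, so you can see at a glance which machines are logging and whether the errors come from Rest or Server. It assumes the log text is a plain sequence of <Msg> elements like the one above, with no XML declaration and no enclosing root element; the function name tally_log_entries is just something I made up for illustration.

```python
import xml.etree.ElementTree as ET
from collections import Counter


def tally_log_entries(log_text):
    """Count <Msg> entries by (machine, source, type).

    Assumes log_text is a sequence of well-formed <Msg> elements
    with no XML declaration and no single root element.
    """
    # Wrap the fragments in a dummy root so the text parses as one document.
    root = ET.fromstring("<Log>" + log_text + "</Log>")
    counts = Counter()
    for msg in root.iter("Msg"):
        key = (msg.get("machine"), msg.get("source"), msg.get("type"))
        counts[key] += 1
    return counts


# Shortened copy of the entry quoted above, used only as sample input.
sample = (
    '<Msg time="2013-05-22T15:21:39,778" type="SEVERE" code="9003" '
    'source="Rest" process="3308" thread="22" methodName="" '
    'machine="TERAARCSERV1.TERA1.COM" user="" elapsed="">'
    "Unable to process request.</Msg>"
)

for (machine, source, level), n in tally_log_entries(sample).items():
    print(machine, source, level, n)
```

If one of the machines in the site never shows up in the tally, that in itself is worth investigating.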
The above error comes from Rest. I'm not a Java programmer, but my guess is that the Rest process is trying to communicate with another machine using Java and there is some type of mismatch between the machines. If you Google that error, the results suggest that a software version mismatch is one possible cause.
I do believe that the REST process can and does try to communicate with other nodes (i.e. GIS Servers) in the site. I am guessing that there is some type of problem here. I noticed that all the logs are about TERAARCSERV1. Where are the logs/entries for the other machine in the site? Your diagram indicates that you have 2 machines in the default cluster. I am wondering if there is some type of compatibility and/or configuration problem here. Do the machines have the exact same version of ArcGIS Server installed? It could also be firewall/security configuration related. Are all the required ports open?
http://resources.arcgis.com/en/help/main/10.1/index.html#//015400000537000000
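One quick way to test the ports question is to run something like the sketch below on each GIS Server, pointed at the other machine. This is only a basic TCP connect test, not an Esri tool; the function name check_ports and the host name in the usage comment are placeholders. As far as I know 6080 and 6443 are the default HTTP and HTTPS ports for ArcGIS Server, but please take the full list of required ports from the help topic linked above rather than from me.

```python
import socket


def check_ports(host, ports, timeout=2.0):
    """Return {port: True/False} for whether a TCP connection succeeds."""
    results = {}
    for port in ports:
        try:
            # create_connection raises OSError on refusal, timeout or DNS failure.
            with socket.create_connection((host, port), timeout=timeout):
                results[port] = True
        except OSError:
            results[port] = False
    return results


# Example usage (host name is a placeholder, port list is an assumption):
#   for port, is_open in check_ports("other-gis-server.example.com", [6080, 6443]).items():
#       print(port, "open" if is_open else "blocked or closed")
```

A port that reports as blocked or closed from one machine but open locally usually points at a firewall between the two servers.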