The docs say "When the maximum number of instances of a service is in use, a client requesting a service is queued until another client releases one of the services."
We have a service that is sometime very busy and sometime hang when there is many requests.
The response time is not very good but we think it include the wait time in the queue.
Is there a way to see the queue in real time and to understand how many requests are in it?
This will help us understand how to configured the client to send less requests (the client is a program sending requests)
We cannot just increase the number of instances - we do not have that many CPU's