When you run a caching job, one instance of the CachingTools service acts as the controller instance. This instance distributes assignments to one or more worker instances. It is the worker instances that actually modify the tiles. In order to ensure at least one controller and one worker instance are available, the maximum number of instances you allow for CachingTools should always be 2 or greater.
Tools such as Manage Map Server Cache Tiles allow you to choose how many instances of CachingTools will be designated as worker instances. If you're not sure what number to enter, take the number of GIS server CPU cores in your cluster and add one to that number. The ideal number of instances may vary based on the nature of the service, but this is an appropriate starting figure for testing.
To figure out the maximum number of instances you can enter, multiply the number of GIS server machines in the CachingTools cluster by the maximum number of instances of CachingTools allowed to run per machine. Then subtract one for the controller instance.
You can choose to divide the available instances of CachingTools among several running jobs. A job might not utilize its maximum number of instances of CachingTools if those instances are being used by other jobs. If a caching job is using all the CachingTools instances, other requested jobs are queued until the first job finishes.