There are a vast amount of variables here.
1st off, you say 'huge' I laugh, we have caches 10x that size even with small areas, mind you, we go down to 1:256 for some serious details.
Even with a tiny area, you are going down to the 1000 level so depending on the project area, size would be expected and by using Exploded method (which creates many folder directories as well as jpgs (though the advantage to this is 'merging' caches quite easily) the size problem is exaggerated.
Its again further exaggerated by using 128x128 tiles and again exaggerated using png without compression.
Try re-running the cache by selecting EXTENT of the map (after of course, getting the area you want) - I have had strange problems with using features as the area of interest (however, I think this was due to custom projections being used)
Use MIXED Tile format at 75 compression - you wont notice a difference!
Use 512x512 tile
Use Bundle format (compressed storage format)
as for the estimates shown, Ive yet to see one that is accurate or really close - its a big 'guess' done by samples and algorithms.
as for the 'caching never ends successful'
what does the geodatabase results show you? which failures, if any are in the attribute table?
Another thing you can do is create the service MANUALLY cache and send the GP request through Cache manager in Catalog to the map service when published.
Therefore, all users are complaining due to the very slow speed in panning and zooming the maps accommodated in the web application.
Jamal,
I noticed one thing in your screenshots that concerns me. You have a lot of labels visible, and those labels seem to be dynamic. They are not a fixed size nor position.
Dynamic labeling is very costly in most cases, and you probably want to avoid this in your web services.
What you should consider, is converting your labels to Annotation Feature Classes in your geodatabase, probably best as a feature-linked version, that updates with updates to your parcels layer. As the Help page I linked says:
"Geodatabase annotation is indexed spatially, meaning that it will draw and select much faster."
You would also do good to set a fixed reference scale for the Annotation and labels, e.g. 1:5000, so as to have the labels scale. And set a visible scale range. As the Help says:
"To maximize ArcMap display and query performance, always define a visible scale range for each annotation class so annotation features only draw when you're zoomed in enough to read their text."