AnsweredAssumed Answered

How does "Delete Identical" choose which duplicate to keep?

Question asked by Rock_Doc on Feb 19, 2018
Latest reply on Feb 21, 2018 by Rock_Doc

I've run Find Identical to flag duplicates in my dataset, that works fine. I've created a new field for duplicates, and now need to purge entries, but I'm not quite clear on how the Delete Identical tool decides which of the duplicates to delete?

 

I highlight an example in the following image:

duplicate example

Based on a number of attributes, the highlighted samples have been flagged as duplicates. However while I consider them to be duplicates, you can see they do not in fact contain identical values across all fields. 

 

This is ok as I know why it occurs, but when you set the field(s) to check on in Delete Identical how does the tool decide which one to keep and which one(s) to delete? Does it keep the first unique object it finds and delete any duplicates thereafter? For example, in the table snapshot above, would it keep the first highlighted object and delete the 2 beneath it? Does it iterate through the file based on FID/OID (so whichever had the lowest FID/OID value would be kept)?

 

I just want to clarify which order it selects the objects to delete?

 

Thanks in advance

Outcomes