I have an ID number field and some of the numbers are missing. I know I can identify duplicate numbers in a number field, but can I identify missing numbers. There are almost 5000 numbers and to search for them manually is painful. Thanks.
You can calculate your own sequential id field, then compare... the expression is seq_count( first_id_in_field)
Do note that geodatabases use 1 as first OBJECTID
cnt = 0 # ---- change to 1 if using a geodatabase, 0 for shapefiledef seq_count(val): global cnt if cnt >= val : cnt += 1 return cnt# __esri_field_calculator_splitter__seq_count(0) # ----- change to 1 if using a geodatabase
You can then query where the OBJECTID or FID isn't equal to the result you calculate. The big problem is that as soon as it hits a difference then you have to edit it and restart the process.
Another method is to calculate the difference between the previous and current. The following isn't tested so no guarantees... but I am sure someone will test and fix it
diff = 0def seq_diff(val): global diff if val - diff != 1 : diff = val return -999 else: diff += 1 return diffseq_diff(!OBJECTID!) # ---- or !FID! for shapefiles
Try using Python sets to your advantage:
fc = # path to feature classmin_oid, = min(arcpy.da.SearchCursor(fc, "OID@"))max_oid, = max(arcpy.da.SearchCursor(fc ,"OID@"))missing_oids = set(range(min_oid, max_oid + 1)) - set(oid for oid, in arcpy.da.SearchCursor(fc,"OID@"))
In terms of performance, the following will be better since it recycles the same cursor instead of recreating cursors:
fc = # path to feature classwith arcpy.da.SearchCursor(fc, "OID@") as cur: min_oid, = min(cur) cur.reset() max_oid, = max(cur) cur.reset() missing_oids = set(range(min_oid, max_oid + 1)) - set(oid for oid, in cur)
Retrieving data ...