unexpected truncation in my text field ("Table to Table" GP)

Felix10546 · ‎12-19-2021

I am using the "Table to table" GP to input the following csv to gdb for genetic data.

the sequence of virus1 has length about 31000

The default text length in the tool is 8000 and I amended it to 50000

The import ran successfully and I added a field for finding the maximum sequence length.

However, when I looked in the attribute table.

The import string is truncated to length of 8001 which originally should be ~31000.

Question: How do I fix it?

I have attached the required testing file.

Thank for anyone that can help with this issue.

DanPatterson · ‎12-20-2021

@Felix10546 if you are good with numpy, then you can just use

# -- set dtype to string rather than unicode
dt = np.dtype([('strain', 'S6'), ('sequence', 'S50000')])
ar2 = np.genfromtxt(f, dtype=dt, delimiter=",", names=True, autostrip=True, encoding='utf-8')
ar2.dtype
dtype([('strain', 'S6'), ('sequence', 'S50000')])
fc1 = r"C:\arcpro_npg\npg\Project_npg\tests.gdb\strain2"

NumPyArrayToTable(ar2, fc1)

If you use unicode strings it doubles your field widths, if you stick with string S, the field widths are retained

Here is the fields view of the data you posted.

BTW NumPyArrayToTable skips all the hoops to get a table if you are working with csv data

... sort of retired...