Are there tools or Python scripts that will extract text from Microsoft Word (2016) files (.docx )?
Issue - I have dozens or Word documents , all in the same format, that have data I would like to extract to put in a feature class. The Word documents have listings of geocoordinates, and text descriptions that would fit nicely into feature class fields.
Plan - I would like to be able to extract the data to a table in a FGDB (or into a CSV file) that I can then convert to a feature class.
Are there Python modules, or Python code, or ESRI models that can do this?
The LocateXT extension for ArcGIS Pro can do this. It can work with non-structured data such as a Word Doc and extract coordinates and other attributes out into a geodatabase point feature class. You can even build create custom attributes to only pull out certain things from your unstructured data. Pretty interesting tool!