Select to view content in your preferred language

Auto-extract text from attached documents

56
2
yesterday
Shane_EU
New Contributor II

Hi, does anybody have any ideas how could this be implemented in ArcGIS Pro 3.3? Thanks!

Shane_EU_4-1720503638272.png

0 Kudos
2 Replies
jcarlson
MVP Esteemed Contributor

I think you'll want to look at the textract module: https://textract.readthedocs.io/en/stable/

I haven't used it in quite the way you've specified, but I imagine it could be fairly straightforward with some Python.

I'm more comfortable with the ArcGIS Python API than ArcPy. Is this process meant to edit local file-based resources? AGOL Layers? It sounds like you're editing a geodatabase directly, given the reference to the __ATTACH table.

The ArcGIS Python API has a helpful submodule for working with attachments, and you could probably do it in a couple of for loops.

- Josh Carlson
Kendall County GIS
0 Kudos
Shane_EU
New Contributor II

Thanks Josh. Yes it's for a local file-based GDB not AGOL or Enterprise, and should happen at the point of attaching the document. I'll look into textract 👍

0 Kudos