<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Auto-extract text from attached documents in ArcGIS Pro Questions</title>
    <link>https://community.esri.com/t5/arcgis-pro-questions/auto-extract-text-from-attached-documents/m-p/1585560#M93079</link>
    <description>&lt;P&gt;&lt;a href="https://community.esri.com/t5/user/viewprofilepage/user-id/781468"&gt;@Shane_EU&lt;/a&gt;&amp;nbsp;&lt;a href="https://community.esri.com/t5/user/viewprofilepage/user-id/224706"&gt;@AdamMartin&lt;/a&gt;&amp;nbsp;&lt;a href="https://community.esri.com/t5/user/viewprofilepage/user-id/1173"&gt;@BruceHarold&lt;/a&gt;&amp;nbsp; &amp;nbsp;This would be super easy to do with LocateXT.&amp;nbsp; So you have word docs attached to features?&amp;nbsp; How are the docs related to the features?&amp;nbsp; What are the features?&amp;nbsp; Do the documents identify a location in the text that is the feature?&amp;nbsp; &amp;nbsp;This would be super easy to do either way, but there might be a more impressive workflow if understood the basics of what you are trying to do.&lt;/P&gt;</description>
    <pubDate>Fri, 14 Feb 2025 17:19:18 GMT</pubDate>
    <dc:creator>JeffWilson7</dc:creator>
    <dc:date>2025-02-14T17:19:18Z</dc:date>
    <item>
      <title>Auto-extract text from attached documents</title>
      <link>https://community.esri.com/t5/arcgis-pro-questions/auto-extract-text-from-attached-documents/m-p/1502861#M85348</link>
      <description>&lt;P&gt;Hi, does anybody have any ideas how could this be implemented in ArcGIS Pro 3.3? Thanks&lt;EM&gt;!&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Shane_EU_4-1720503638272.png" style="width: 999px;"&gt;&lt;img src="https://community.esri.com/t5/image/serverpage/image-id/109167i5B0D4189F6B338C6/image-size/large?v=v2&amp;amp;px=999" role="button" title="Shane_EU_4-1720503638272.png" alt="Shane_EU_4-1720503638272.png" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 09 Jul 2024 06:12:15 GMT</pubDate>
      <guid>https://community.esri.com/t5/arcgis-pro-questions/auto-extract-text-from-attached-documents/m-p/1502861#M85348</guid>
      <dc:creator>Shane_EU</dc:creator>
      <dc:date>2024-07-09T06:12:15Z</dc:date>
    </item>
    <item>
      <title>Re: Auto-extract text from attached documents</title>
      <link>https://community.esri.com/t5/arcgis-pro-questions/auto-extract-text-from-attached-documents/m-p/1502943#M85358</link>
      <description>&lt;P&gt;I think you'll want to look at the &lt;STRONG&gt;textract &lt;/STRONG&gt;module: &lt;A href="https://textract.readthedocs.io/en/stable/" target="_blank"&gt;https://textract.readthedocs.io/en/stable/&lt;/A&gt;&lt;/P&gt;&lt;P&gt;I haven't used it in quite the way you've specified, but I imagine it could be fairly straightforward with some Python.&lt;/P&gt;&lt;P&gt;I'm more comfortable with the ArcGIS Python API than ArcPy. Is this process meant to edit local file-based resources? AGOL Layers? It sounds like you're editing a geodatabase directly, given the reference to the __ATTACH table.&lt;/P&gt;&lt;P&gt;The ArcGIS Python API has a helpful submodule for working with attachments, and you could probably do it in a couple of for loops.&lt;/P&gt;</description>
      <pubDate>Tue, 09 Jul 2024 12:47:23 GMT</pubDate>
      <guid>https://community.esri.com/t5/arcgis-pro-questions/auto-extract-text-from-attached-documents/m-p/1502943#M85358</guid>
      <dc:creator>jcarlson</dc:creator>
      <dc:date>2024-07-09T12:47:23Z</dc:date>
    </item>
    <item>
      <title>Re: Auto-extract text from attached documents</title>
      <link>https://community.esri.com/t5/arcgis-pro-questions/auto-extract-text-from-attached-documents/m-p/1502953#M85359</link>
      <description>&lt;P&gt;Thanks Josh. Yes it's for a local file-based GDB not AGOL or Enterprise, and should happen at the point of attaching the document. I'll look into textract&amp;nbsp;&lt;span class="lia-unicode-emoji" title=":thumbs_up:"&gt;👍&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 09 Jul 2024 13:03:16 GMT</pubDate>
      <guid>https://community.esri.com/t5/arcgis-pro-questions/auto-extract-text-from-attached-documents/m-p/1502953#M85359</guid>
      <dc:creator>Shane_EU</dc:creator>
      <dc:date>2024-07-09T13:03:16Z</dc:date>
    </item>
    <item>
      <title>Re: Auto-extract text from attached documents</title>
      <link>https://community.esri.com/t5/arcgis-pro-questions/auto-extract-text-from-attached-documents/m-p/1585461#M93072</link>
      <description>&lt;P&gt;&lt;a href="https://community.esri.com/t5/user/viewprofilepage/user-id/781468"&gt;@Shane_EU&lt;/a&gt;&amp;nbsp;I'm sure there's a way to do this in ModelBuilder but I do not have an example I can share. &lt;a href="https://community.esri.com/t5/user/viewprofilepage/user-id/1173"&gt;@BruceHarold&lt;/a&gt;&amp;nbsp;may have an example using the Data Interop Extension for Pro. I don't know the specific interactions these tools have with feature Attachments, but have you also explored extracting text objects as entities (rather than the full text) into properties? Have you explored the Text Classifications tools using AI (&lt;A href="https://pro.arcgis.com/en/pro-app/3.3/tool-reference/geoai/how-entity-recognition-works.htm)" target="_blank"&gt;https://pro.arcgis.com/en/pro-app/3.3/tool-reference/geoai/how-entity-recognition-works.htm)&lt;/A&gt; and LocateXT which can extract locations, but also text strings using regex patterns?&amp;nbsp;&lt;A href="https://pro.arcgis.com/en/pro-app/3.3/help/data/locatext/extract-locations.htm" target="_blank"&gt;https://pro.arcgis.com/en/pro-app/3.3/help/data/locatext/extract-locations.htm&lt;/A&gt;&amp;nbsp;&amp;nbsp;&lt;a href="https://community.esri.com/t5/user/viewprofilepage/user-id/326600"&gt;@JeffWilson4&lt;/a&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 14 Feb 2025 14:02:53 GMT</pubDate>
      <guid>https://community.esri.com/t5/arcgis-pro-questions/auto-extract-text-from-attached-documents/m-p/1585461#M93072</guid>
      <dc:creator>AdamMartin</dc:creator>
      <dc:date>2025-02-14T14:02:53Z</dc:date>
    </item>
    <item>
      <title>Re: Auto-extract text from attached documents</title>
      <link>https://community.esri.com/t5/arcgis-pro-questions/auto-extract-text-from-attached-documents/m-p/1585469#M93073</link>
      <description>&lt;P&gt;I think this could be modeled or scripted, but it might make more sense to use the LocateXT tools on the incoming Word documents &lt;EM&gt;&lt;STRONG&gt;before&lt;/STRONG&gt;&lt;/EM&gt; attaching them, to make accessory attachments that go for the ride.&amp;nbsp; Data Interoperability doesn't add anything in this situation, as described, but might if other complexities arise.&lt;/P&gt;</description>
      <pubDate>Fri, 14 Feb 2025 14:12:19 GMT</pubDate>
      <guid>https://community.esri.com/t5/arcgis-pro-questions/auto-extract-text-from-attached-documents/m-p/1585469#M93073</guid>
      <dc:creator>BruceHarold</dc:creator>
      <dc:date>2025-02-14T14:12:19Z</dc:date>
    </item>
    <item>
      <title>Re: Auto-extract text from attached documents</title>
      <link>https://community.esri.com/t5/arcgis-pro-questions/auto-extract-text-from-attached-documents/m-p/1585560#M93079</link>
      <description>&lt;P&gt;&lt;a href="https://community.esri.com/t5/user/viewprofilepage/user-id/781468"&gt;@Shane_EU&lt;/a&gt;&amp;nbsp;&lt;a href="https://community.esri.com/t5/user/viewprofilepage/user-id/224706"&gt;@AdamMartin&lt;/a&gt;&amp;nbsp;&lt;a href="https://community.esri.com/t5/user/viewprofilepage/user-id/1173"&gt;@BruceHarold&lt;/a&gt;&amp;nbsp; &amp;nbsp;This would be super easy to do with LocateXT.&amp;nbsp; So you have word docs attached to features?&amp;nbsp; How are the docs related to the features?&amp;nbsp; What are the features?&amp;nbsp; Do the documents identify a location in the text that is the feature?&amp;nbsp; &amp;nbsp;This would be super easy to do either way, but there might be a more impressive workflow if understood the basics of what you are trying to do.&lt;/P&gt;</description>
      <pubDate>Fri, 14 Feb 2025 17:19:18 GMT</pubDate>
      <guid>https://community.esri.com/t5/arcgis-pro-questions/auto-extract-text-from-attached-documents/m-p/1585560#M93079</guid>
      <dc:creator>JeffWilson7</dc:creator>
      <dc:date>2025-02-14T17:19:18Z</dc:date>
    </item>
  </channel>
</rss>

