<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Gather data from a regularly formatted webpage (Tax Parcels) in Python Questions</title>
    <link>https://community.esri.com/t5/python-questions/gather-data-from-a-regularly-formatted-webpage-tax/m-p/1198342#M65115</link>
    <description>&lt;P&gt;Since the URL contains the TMK # of the parcel you could use that with the requests library. Retrieve info from the page, parse the return, then repeat for each record of interest.&lt;/P&gt;</description>
    <pubDate>Tue, 02 Aug 2022 01:33:26 GMT</pubDate>
    <dc:creator>I_AM_ERROR</dc:creator>
    <dc:date>2022-08-02T01:33:26Z</dc:date>
    <item>
      <title>Gather data from a regularly formatted webpage (Tax Parcels)</title>
      <link>https://community.esri.com/t5/python-questions/gather-data-from-a-regularly-formatted-webpage-tax/m-p/1198332#M65113</link>
      <description>&lt;P&gt;Hi all,&lt;/P&gt;&lt;P&gt;We frequently use the &lt;A href="https://nps.maps.arcgis.com/home/item.html?id=1eb5fa03038d49cba930096ea67194e0" target="_blank" rel="noopener"&gt;tax parcel layer&lt;/A&gt; published by t&lt;A href="https://geoportal.hawaii.gov/search?q=tmk&amp;amp;source=hawaii%20statewide%20gis%20program&amp;amp;type=feature%20layer" target="_blank" rel="noopener"&gt;he State of Hawaii&lt;/A&gt; to help us plan projects.&lt;/P&gt;&lt;P&gt;In the last few years, the State stopped publishing the layer with the ownership information, opting instead to include a link to a webpage featuring ownership, taxes, etc. as an attribute&lt;/P&gt;&lt;P&gt;Example here, with Hawai'i Volcano National Park.&amp;nbsp;&lt;A href="https://qpublic.schneidercorp.com/Application.aspx?AppID=1048&amp;amp;LayerID=23618&amp;amp;PageTypeID=4&amp;amp;PageID=9878&amp;amp;KeyValue=980010010000" target="_blank"&gt;qPublic.net - Hawai'i County, HI - Report: 980010010000 (schneidercorp.com)&lt;/A&gt;&lt;/P&gt;&lt;P&gt;I'd like to be able to populate a copy of the layer (filtered to be relevant to us) with attributes from the webpage, mostly (especially) the ownership information.&lt;/P&gt;&lt;P&gt;Does anyone have any tips as to this might be done? Dynamic is not needed.&lt;/P&gt;&lt;P&gt;Thanks!&lt;/P&gt;</description>
      <pubDate>Mon, 01 Aug 2022 23:56:16 GMT</pubDate>
      <guid>https://community.esri.com/t5/python-questions/gather-data-from-a-regularly-formatted-webpage-tax/m-p/1198332#M65113</guid>
      <dc:creator>AlfredBaldenweck</dc:creator>
      <dc:date>2022-08-01T23:56:16Z</dc:date>
    </item>
    <item>
      <title>Re: Gather data from a regularly formatted webpage (Tax Parcels)</title>
      <link>https://community.esri.com/t5/python-questions/gather-data-from-a-regularly-formatted-webpage-tax/m-p/1198342#M65115</link>
      <description>&lt;P&gt;Since the URL contains the TMK # of the parcel you could use that with the requests library. Retrieve info from the page, parse the return, then repeat for each record of interest.&lt;/P&gt;</description>
      <pubDate>Tue, 02 Aug 2022 01:33:26 GMT</pubDate>
      <guid>https://community.esri.com/t5/python-questions/gather-data-from-a-regularly-formatted-webpage-tax/m-p/1198342#M65115</guid>
      <dc:creator>I_AM_ERROR</dc:creator>
      <dc:date>2022-08-02T01:33:26Z</dc:date>
    </item>
    <item>
      <title>Re: Gather data from a regularly formatted webpage (Tax Parcels)</title>
      <link>https://community.esri.com/t5/python-questions/gather-data-from-a-regularly-formatted-webpage-tax/m-p/1198358#M65116</link>
      <description>&lt;P&gt;Taking a look at the sites robots.txt file, it disallows all user agents (web crawlers/ automatic scraping) for /Application.aprx/ so be respectful/careful how you go about your data extraction.&amp;nbsp;&lt;/P&gt;&lt;P&gt;You can use the python package BeautifulSoup to extract items/text from webpages/urls- there are a ton of tutorials on the net for how it can be done.&lt;/P&gt;</description>
      <pubDate>Tue, 02 Aug 2022 03:56:57 GMT</pubDate>
      <guid>https://community.esri.com/t5/python-questions/gather-data-from-a-regularly-formatted-webpage-tax/m-p/1198358#M65116</guid>
      <dc:creator>Anonymous User</dc:creator>
      <dc:date>2022-08-02T03:56:57Z</dc:date>
    </item>
  </channel>
</rss>

