<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Unable to read parquet file with arcgis per docs? in ArcGIS API for Python Questions</title>
    <link>https://community.esri.com/t5/arcgis-api-for-python-questions/unable-to-read-parquet-file-with-arcgis-per-docs/m-p/1289792#M8694</link>
    <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.esri.com/t5/user/viewprofilepage/user-id/12167"&gt;@CarlSunderman&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;I read the &lt;A title="from_parquet" href="https://developers.arcgis.com/python/api-reference/arcgis.features.toc.html?#arcgis.features.GeoAccessor.from_parquet" target="_self"&gt;&lt;EM&gt;from_parquet&lt;/EM&gt; &lt;/A&gt;documentation and I believe it may need to be updated. It states:&lt;/P&gt;&lt;BLOCKQUOTE&gt;&lt;P&gt;if no geometry columns are read, this will raise a&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN class=""&gt;ValueError&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;- you should use the pandas&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;read_parquet&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;method instead.&lt;/P&gt;&lt;/BLOCKQUOTE&gt;&lt;P&gt;My reading of this is you should use the standard &lt;FONT face="courier new,courier"&gt;pandas.read_parquet&lt;/FONT&gt; method, completely separate from GeoAccessor. This makes sense because if you have no geometry column(s) in your data, then you are working with non-spatial data.&lt;/P&gt;&lt;P&gt;Of course, later on the User is told to do this:&lt;/P&gt;&lt;PRE&gt;&lt;SPAN class=""&gt;df&lt;/SPAN&gt; &lt;SPAN class=""&gt;=&lt;/SPAN&gt; &lt;SPAN class=""&gt;pd&lt;/SPAN&gt;&lt;SPAN class=""&gt;.&lt;/SPAN&gt;&lt;SPAN class=""&gt;DataFrame&lt;/SPAN&gt;&lt;SPAN class=""&gt;.&lt;/SPAN&gt;&lt;SPAN class=""&gt;spatial&lt;/SPAN&gt;&lt;SPAN class=""&gt;.&lt;/SPAN&gt;&lt;SPAN class=""&gt;read_parquet&lt;/SPAN&gt;&lt;SPAN class=""&gt;(&lt;/SPAN&gt;&lt;SPAN class=""&gt;"data.parquet"&lt;/SPAN&gt;&lt;SPAN class=""&gt;)&lt;/SPAN&gt;  &lt;/PRE&gt;&lt;P&gt;This seems to contradict the previous statement. If you have spatial data, then you should be using&amp;nbsp;&lt;FONT face="courier new,courier"&gt;&lt;SPAN class=""&gt;pd&lt;/SPAN&gt;&lt;SPAN class=""&gt;.&lt;/SPAN&gt;&lt;SPAN class=""&gt;DataFrame&lt;/SPAN&gt;&lt;SPAN class=""&gt;.&lt;/SPAN&gt;&lt;SPAN class=""&gt;spatial&lt;/SPAN&gt;&lt;SPAN class=""&gt;.from&lt;/SPAN&gt;&lt;/FONT&gt;&lt;SPAN class=""&gt;&lt;FONT face="courier new,courier"&gt;_parquet&lt;/FONT&gt;.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;So, to read non-spatial parquet data use:&lt;/P&gt;&lt;PRE&gt;pd&lt;SPAN class=""&gt;.&lt;/SPAN&gt;read_parquet&lt;SPAN class=""&gt;(&lt;/SPAN&gt;your_parquet_file_path&lt;SPAN class=""&gt;)&lt;/SPAN&gt;&lt;/PRE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Hope this helps clear up some confusion. I think submitting feedback on the documentation is in order.&lt;/P&gt;</description>
    <pubDate>Tue, 16 May 2023 20:56:44 GMT</pubDate>
    <dc:creator>EarlMedina</dc:creator>
    <dc:date>2023-05-16T20:56:44Z</dc:date>
    <item>
      <title>Unable to read parquet file with arcgis per docs?</title>
      <link>https://community.esri.com/t5/arcgis-api-for-python-questions/unable-to-read-parquet-file-with-arcgis-per-docs/m-p/1287888#M8680</link>
      <description>&lt;P&gt;I understand that I can read the parquet file in pandas, but why does the documentation claim you can read into a spatial dataframe when you cannot?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The docs claim, new in version 1.9 and I'm using arcgis version 2.0.1&lt;/P&gt;&lt;DIV class=""&gt;&lt;DIV class=""&gt;&lt;DIV class=""&gt;&lt;DIV class=""&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class=""&gt;This is from the docs:&lt;/DIV&gt;&lt;DIV class=""&gt;&lt;EM&gt;static&amp;nbsp;&lt;/EM&gt;from_parquet&lt;SPAN class=""&gt;(&lt;/SPAN&gt;&lt;EM&gt;&lt;SPAN class=""&gt;path&lt;/SPAN&gt;&lt;/EM&gt;&lt;SPAN&gt;,&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;EM&gt;&lt;SPAN class=""&gt;columns&lt;/SPAN&gt;&lt;SPAN class=""&gt;=&lt;/SPAN&gt;&lt;SPAN class=""&gt;None&lt;/SPAN&gt;&lt;/EM&gt;&lt;SPAN&gt;,&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;EM&gt;&lt;SPAN class=""&gt;**&lt;/SPAN&gt;&lt;SPAN class=""&gt;kwargs&lt;/SPAN&gt;&lt;/EM&gt;&lt;SPAN class=""&gt;)&lt;/SPAN&gt;&lt;A title="Permalink to this definition" href="https://developers.arcgis.com/python/api-reference/arcgis.features.toc.html?#arcgis.features.GeoAccessor.from_parquet" target="_blank" rel="noopener"&gt;¶&lt;/A&gt;&lt;/DIV&gt;&lt;DIV class=""&gt;&lt;P&gt;Load a Parquet object from the file path, returning a Spatially Enabled DataFrame.&lt;/P&gt;Returns&lt;P&gt;Spatially Enabled DataFrame&lt;/P&gt;&lt;DIV class=""&gt;&lt;DIV class=""&gt;&lt;PRE&gt;&lt;SPAN class=""&gt;&amp;gt;&amp;gt;&amp;gt; &lt;/SPAN&gt;&lt;SPAN class=""&gt;df&lt;/SPAN&gt; &lt;SPAN class=""&gt;=&lt;/SPAN&gt; &lt;SPAN class=""&gt;pd&lt;/SPAN&gt;&lt;SPAN class=""&gt;.&lt;/SPAN&gt;&lt;SPAN class=""&gt;DataFrame&lt;/SPAN&gt;&lt;SPAN class=""&gt;.&lt;/SPAN&gt;&lt;SPAN class=""&gt;spatial&lt;/SPAN&gt;&lt;SPAN class=""&gt;.&lt;/SPAN&gt;&lt;SPAN class=""&gt;read_parquet&lt;/SPAN&gt;&lt;SPAN class=""&gt;(&lt;/SPAN&gt;&lt;SPAN class=""&gt;"data.parquet"&lt;/SPAN&gt;&lt;SPAN class=""&gt;)&lt;/SPAN&gt;  &lt;/PRE&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;P&gt;&lt;A href="https://developers.arcgis.com/python/api-reference/arcgis.features.toc.html" target="_blank" rel="noopener"&gt;https://developers.arcgis.com/python/api-reference/arcgis.features.toc.html&lt;/A&gt;?&lt;/P&gt;&lt;P&gt;but every time I try to read a parquet file, i get the same error&lt;/P&gt;&lt;PRE&gt;&lt;SPAN class=""&gt;AttributeError&lt;/SPAN&gt;: type object 'GeoAccessor' has no attribute 'read_parquet'
&lt;SPAN class=""&gt;---------------------------------------------------------------------------&lt;/SPAN&gt;&lt;/PRE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Why not remove it from the docs if you cannot read a parquet file?&lt;/P&gt;&lt;P&gt;I&lt;/P&gt;</description>
      <pubDate>Wed, 10 May 2023 18:34:54 GMT</pubDate>
      <guid>https://community.esri.com/t5/arcgis-api-for-python-questions/unable-to-read-parquet-file-with-arcgis-per-docs/m-p/1287888#M8680</guid>
      <dc:creator>CarlSunderman</dc:creator>
      <dc:date>2023-05-10T18:34:54Z</dc:date>
    </item>
    <item>
      <title>Re: Unable to read parquet file with arcgis per docs?</title>
      <link>https://community.esri.com/t5/arcgis-api-for-python-questions/unable-to-read-parquet-file-with-arcgis-per-docs/m-p/1289792#M8694</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.esri.com/t5/user/viewprofilepage/user-id/12167"&gt;@CarlSunderman&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;I read the &lt;A title="from_parquet" href="https://developers.arcgis.com/python/api-reference/arcgis.features.toc.html?#arcgis.features.GeoAccessor.from_parquet" target="_self"&gt;&lt;EM&gt;from_parquet&lt;/EM&gt; &lt;/A&gt;documentation and I believe it may need to be updated. It states:&lt;/P&gt;&lt;BLOCKQUOTE&gt;&lt;P&gt;if no geometry columns are read, this will raise a&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN class=""&gt;ValueError&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;- you should use the pandas&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;read_parquet&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;method instead.&lt;/P&gt;&lt;/BLOCKQUOTE&gt;&lt;P&gt;My reading of this is you should use the standard &lt;FONT face="courier new,courier"&gt;pandas.read_parquet&lt;/FONT&gt; method, completely separate from GeoAccessor. This makes sense because if you have no geometry column(s) in your data, then you are working with non-spatial data.&lt;/P&gt;&lt;P&gt;Of course, later on the User is told to do this:&lt;/P&gt;&lt;PRE&gt;&lt;SPAN class=""&gt;df&lt;/SPAN&gt; &lt;SPAN class=""&gt;=&lt;/SPAN&gt; &lt;SPAN class=""&gt;pd&lt;/SPAN&gt;&lt;SPAN class=""&gt;.&lt;/SPAN&gt;&lt;SPAN class=""&gt;DataFrame&lt;/SPAN&gt;&lt;SPAN class=""&gt;.&lt;/SPAN&gt;&lt;SPAN class=""&gt;spatial&lt;/SPAN&gt;&lt;SPAN class=""&gt;.&lt;/SPAN&gt;&lt;SPAN class=""&gt;read_parquet&lt;/SPAN&gt;&lt;SPAN class=""&gt;(&lt;/SPAN&gt;&lt;SPAN class=""&gt;"data.parquet"&lt;/SPAN&gt;&lt;SPAN class=""&gt;)&lt;/SPAN&gt;  &lt;/PRE&gt;&lt;P&gt;This seems to contradict the previous statement. If you have spatial data, then you should be using&amp;nbsp;&lt;FONT face="courier new,courier"&gt;&lt;SPAN class=""&gt;pd&lt;/SPAN&gt;&lt;SPAN class=""&gt;.&lt;/SPAN&gt;&lt;SPAN class=""&gt;DataFrame&lt;/SPAN&gt;&lt;SPAN class=""&gt;.&lt;/SPAN&gt;&lt;SPAN class=""&gt;spatial&lt;/SPAN&gt;&lt;SPAN class=""&gt;.from&lt;/SPAN&gt;&lt;/FONT&gt;&lt;SPAN class=""&gt;&lt;FONT face="courier new,courier"&gt;_parquet&lt;/FONT&gt;.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;So, to read non-spatial parquet data use:&lt;/P&gt;&lt;PRE&gt;pd&lt;SPAN class=""&gt;.&lt;/SPAN&gt;read_parquet&lt;SPAN class=""&gt;(&lt;/SPAN&gt;your_parquet_file_path&lt;SPAN class=""&gt;)&lt;/SPAN&gt;&lt;/PRE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Hope this helps clear up some confusion. I think submitting feedback on the documentation is in order.&lt;/P&gt;</description>
      <pubDate>Tue, 16 May 2023 20:56:44 GMT</pubDate>
      <guid>https://community.esri.com/t5/arcgis-api-for-python-questions/unable-to-read-parquet-file-with-arcgis-per-docs/m-p/1289792#M8694</guid>
      <dc:creator>EarlMedina</dc:creator>
      <dc:date>2023-05-16T20:56:44Z</dc:date>
    </item>
  </channel>
</rss>

