ETL Patterns Blog

cancel
Showing results for 
Show  only  | Search instead for 
Did you mean: 

Other Boards in This Place


Latest Activity

(15 Posts)
BruceHarold
Esri Frequent Contributor

With the release of ArcGIS Pro 3.5, the stars align a little more when it comes to the use of GeoParquet.  You can now work with local GeoParquet files for your mapping and analysis needs, but it is also much easier to ingest big GeoParquet data from an S3-API-compliant object store!

This post is about how simple it is to bring remote GeoParquet data into your project.

The enabling technology is DuckDB, now included in the default Python environment in ArcGIS Pro 3.5 - no more package management just for this spectacularly useful client technology.

Here is an example, the entire Overture Maps Foundation divisions dataset accessed from their AWS S3 object store and written to my project home geodatabase.

Overture DivisionsOverture Divisions

Automation is key to GIS happiness, so to access this data I created a simple notebook which you can find in the post download.  You'll need ArcGIS Pro 3.5 to run it, or an earlier release with your Python environment extended with DuckDB 1.1+.

It takes me about 6 minutes to download the 1m+ features to my project home geodatabase, but a big chunk of that is taken up in a couple of best-practice steps, namely sorting the features on area (descending) and repairing any geometry issues.  The sort step is so small features display on top of large features, the geometry repair is commonly needed for point-rich data that "tiles the plain' like these divisions do.

The lift and shift itself is fast.

I'll let you inspect the notebook for yourselves, but note the option to apply an attribute or spatial filter on the features you download, for example within a bounding box in lat/long or the name of a country.  Instead of manually download a set of very large parquet files from S3 you now have a simple tool to go get what you want, any time you like!

more
5 3 1,783
BruceHarold
Esri Frequent Contributor

Now your data can get younger!

Read more...

more
3 0 1,051
ShareUser
Esri Community Manager

Automation has arrived!

Read more...

more
0 4 426
BruceHarold
Esri Frequent Contributor

It's easy!

Read more...

more
1 0 687
ShareUser
Esri Community Manager

Using portable workspaces, not streaming.

Read more...

more
0 2 428
ShareUser
Esri Community Manager

No-code B2G - Business to GIS!

Read more...

more
0 1 433
ShareUser
Esri Community Manager

Less transaction = more satisfaction!

Read more...

more
0 4 860
BruceHarold
Esri Frequent Contributor

They're supported now, by you!

Read more...

more
2 1 749
BruceHarold
Esri Frequent Contributor

Without the manual download and conversion steps...

Read more...

more
1 1 1,371
BruceHarold
Esri Frequent Contributor

Starting with the basics - spreadsheet files!

Read more...

more
2 0 828
103 Subscribers