ETL Patterns Blog

cancel
Showing results for 
Show  only  | Search instead for 
Did you mean: 

Other Boards in This Place


Latest Activity

(20 Posts)
BruceHarold
Esri Frequent Contributor

We'll look at three cases:

  • Infrequent bulk replacement
  • Frequent append and upsert
  • Regular insert, update and delete editing
Read more...

more
3 0 207
BruceHarold
Esri Frequent Contributor

Closing the loop by sharing the content

Read more...

more
2 0 540
BruceHarold
Esri Frequent Contributor

Make data on demand!

Read more...

more
3 0 497
BruceHarold
Esri Frequent Contributor

It's alpha release so stay tuned for updates!

Read more...

more
2 0 826
BruceHarold
Esri Frequent Contributor

OK I struggled with the title, there is too much goodness in it, just read it!

Read more...

more
3 6 2,089
BruceHarold
Esri Frequent Contributor

With the release of ArcGIS Pro 3.5, the stars align a little more when it comes to the use of GeoParquet.  You can now work with local GeoParquet files for your mapping and analysis needs, but it is also much easier to ingest big GeoParquet data from an S3-API-compliant object store!

This post is about how simple it is to bring remote GeoParquet data into your project.

The enabling technology is DuckDB, now included in the default Python environment in ArcGIS Pro 3.5 - no more package management just for this spectacularly useful client technology.

Here is an example, the entire Overture Maps Foundation divisions dataset accessed from their AWS S3 object store and written to my project home geodatabase.

Overture DivisionsOverture Divisions

Automation is key to GIS happiness, so to access this data I created a simple notebook which you can find in the post download.  You'll need ArcGIS Pro 3.5 to run it, or an earlier release with your Python environment extended with DuckDB 1.1+.

It takes me about 6 minutes to download the 1m+ features to my project home geodatabase, but a big chunk of that is taken up in a couple of best-practice steps, namely sorting the features on area (descending) and repairing any geometry issues.  The sort step is so small features display on top of large features, the geometry repair is commonly needed for point-rich data that "tiles the plain' like these divisions do.

The lift and shift itself is fast.

I'll let you inspect the notebook for yourselves, but note the option to apply an attribute or spatial filter on the features you download, for example within a bounding box in lat/long or the name of a country.  Instead of manually download a set of very large parquet files from S3 you now have a simple tool to go get what you want, any time you like!

more
5 3 3,267
BruceHarold
Esri Frequent Contributor

Now your data can get younger!

Read more...

more
3 0 1,358
ShareUser
Esri Community Manager

Automation has arrived!

Read more...

more
0 4 803
BruceHarold
Esri Frequent Contributor

It's easy!

Read more...

more
1 0 920
ShareUser
Esri Community Manager

Using portable workspaces, not streaming.

Read more...

more
0 2 735
113 Subscribers