
Support for Apache Iceberg in ArcGIS Notebooks for Large-Scale GIS

09-01-2025 09:52 AM
Status: Open
VenkataKondepati
Occasional Contributor

Description:
I would like to use Apache Iceberg together with ArcGIS Notebooks to perform GIS statistical and spatial analysis on large datasets. Iceberg provides powerful capabilities for handling big data, including:

  • Schema evolution without breaking existing queries.

  • Efficient handling of petabyte-scale datasets.

  • Open table format support for cloud object storage and modern data lakes.

Proposed Enhancement:
Extend ArcGIS tools (ArcGIS Notebooks, ArcGIS Pro, and ArcGIS Enterprise) to support Apache Iceberg as a data source. This could include:

  • Direct connections from ArcGIS Notebooks to Iceberg tables (similar to existing connectors for Spark, Snowflake, or BigQuery).

  • Ability to query Iceberg tables as feature layers.

  • Support for spatial joins and other spatial operations on Iceberg datasets.
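
To illustrate the first bullet, here is a minimal sketch of how a notebook might read an Iceberg table today with the open-source PyIceberg library. It is a workaround, not an existing ArcGIS feature. The catalog name, the table name, and the `lon`/`lat` column names are hypothetical, and I am assuming the catalog connection is already configured for PyIceberg and that its string filter syntax accepts these comparisons.

```python
def bbox_row_filter(xmin, ymin, xmax, ymax, x_col="lon", y_col="lat"):
    """Build a bounding-box filter string for an Iceberg table scan.

    The column names are assumptions; use whatever the table actually stores.
    """
    if xmin > xmax or ymin > ymax:
        raise ValueError("bounding box minimums must not exceed maximums")
    return (
        f"{x_col} >= {xmin} AND {x_col} <= {xmax} "
        f"AND {y_col} >= {ymin} AND {y_col} <= {ymax}"
    )


def load_points_in_bbox(catalog_name, table_name, bbox):
    """Read only the rows inside bbox into a pandas DataFrame."""
    # Imported lazily so the filter helper works without PyIceberg installed.
    from pyiceberg.catalog import load_catalog

    catalog = load_catalog(catalog_name)    # connection settings come from PyIceberg config
    table = catalog.load_table(table_name)  # e.g. "gis.sensor_points" (hypothetical)
    # The filter is handed to the scan so rows outside the box are not read.
    scan = table.scan(row_filter=bbox_row_filter(*bbox))
    return scan.to_pandas()


if __name__ == "__main__":
    # Only builds the filter; no catalog connection is needed for this part.
    print(bbox_row_filter(-122.5, 37.7, -122.3, 37.9))
```

The resulting DataFrame could then be turned into a spatially enabled DataFrame with the ArcGIS API for Python (for example via `GeoAccessor.from_xy`), but that still copies data into the notebook. A native connector would ideally avoid that step.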

Use Case:
Organizations working with cloud-native data lakes increasingly use Iceberg for analytics and data management. Integrating Iceberg with ArcGIS would allow GIS professionals to run statistical analysis and spatial queries directly on massive datasets without duplicating them into geodatabases.

Benefit:

  • Unlocks cloud-scale geospatial analytics.

  • Reduces data duplication between GIS and data science platforms.

  • Keeps ArcGIS aligned with modern data lakehouse ecosystems (Iceberg, Delta Lake, Hudi).

Question:
Are there plans to extend Esri tools with Apache Iceberg integration? I’d love to hear other users’ ideas and use cases as well.
