<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Spatial join with 50 million observations... in Geoprocessing Questions</title>
    <link>https://community.esri.com/t5/geoprocessing-questions/spatial-join-with-50-million-observations/m-p/43056#M1550</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Bruce,&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Thank you very much for your help; I am very grateful.&amp;nbsp; I will definitely take a look at the script.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Andrew Tschirhart&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Fri, 05 Aug 2011 14:27:18 GMT</pubDate>
    <dc:creator>AndrewTschirhart</dc:creator>
    <dc:date>2011-08-05T14:27:18Z</dc:date>
    <item>
      <title>Spatial join with 50 million observations...</title>
      <link>https://community.esri.com/t5/geoprocessing-questions/spatial-join-with-50-million-observations/m-p/43052#M1546</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Hello,&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;I am using ArcGIS 9.3.1, and I am trying to match 50 million addresses that I have already geocoded to a polygon feature class.&amp;nbsp; I am using the GP tool Spatial Join, but after working with a tiny subset of my data, I think that this could take a month on the whole dataset...&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Does anyone know of useful techniques I could use to speed up the process?&amp;nbsp; I have 8 processors and 16 GB of RAM on my machine, so I am thinking of splitting the two tables I'm joining into eight pieces each and running 8 simultaneous spatial joins in separate ArcMap windows, one for each thread.&amp;nbsp; Or can the "Add spatial index" geoprocessing tool help me here?&amp;nbsp; I am relatively new to ArcGIS and I only need to perform this process once, so I am trying to avoid writing a Python script, though if someone happened to have a premade solution for 9.3.1 that I could tweak, that would be very helpful.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Thank you very much for your time,&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Andrew Tschirhart&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 03 Aug 2011 21:24:45 GMT</pubDate>
      <guid>https://community.esri.com/t5/geoprocessing-questions/spatial-join-with-50-million-observations/m-p/43052#M1546</guid>
      <dc:creator>AndrewTschirhart</dc:creator>
      <dc:date>2011-08-03T21:24:45Z</dc:date>
    </item>
    <item>
      <title>Re: Spatial join with 50 million observations...</title>
      <link>https://community.esri.com/t5/geoprocessing-questions/spatial-join-with-50-million-observations/m-p/43053#M1547</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Hello Andrew&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;A framework for parallelizing a workflow like yours exists in an ArcGIS 10 sample script tool here:&lt;/SPAN&gt;&lt;BR /&gt;&lt;A href="http://resources.arcgis.com/gallery/file/geocoding/details?entryID=A284F7D9-1422-2418-7F50-BA718224C412"&gt;http://resources.arcgis.com/gallery/file/geocoding/details?entryID=A284F7D9-1422-2418-7F50-BA718224C412&lt;/A&gt;&lt;BR /&gt;&lt;SPAN&gt;If you upgrade to 10, it isn't a big job to alter the script to perform a Spatial Join.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;If you need to stick with 9.3.1, you'll need to create a Model with an input parameter that selects a subset of addresses (for example using modulo arithmetic: OBJECTID mod 8 = 0, then 1, 2, 3, 4, 5, 6, 7, giving 8 non-overlapping selections).&amp;nbsp; You then run this Model in 8 concurrent ArcGIS sessions and merge the results afterwards.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Regards&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 04 Aug 2011 14:08:10 GMT</pubDate>
      <guid>https://community.esri.com/t5/geoprocessing-questions/spatial-join-with-50-million-observations/m-p/43053#M1547</guid>
      <dc:creator>BruceHarold</dc:creator>
      <dc:date>2011-08-04T14:08:10Z</dc:date>
    </item>
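<!--
The modulo split described in the reply above can be sketched in plain Python.
This is only an illustration under stated assumptions, not the sample script
from the linked gallery entry: the function names are hypothetical, no ArcGIS
call is made, and the expression template "MOD(field, n) = r" is an assumed
syntax that varies by data source, so check it against your own workspace.

```python
def modulo_where_clauses(n_parts, id_field="OBJECTID",
                         template="MOD({field}, {n}) = {r}"):
    """Build one selection expression per partition (remainders 0..n_parts-1).

    The default template is an assumption; some data sources expect a
    different modulo syntax, so pass your own template if needed.
    """
    if n_parts < 1:
        raise ValueError("n_parts must be at least 1")
    return [template.format(field=id_field, n=n_parts, r=r)
            for r in range(n_parts)]


def partition_ids(ids, n_parts):
    """Group integer ids by remainder; each id lands in exactly one group."""
    groups = [[] for _ in range(n_parts)]
    for i in ids:
        groups[i % n_parts].append(i)
    return groups


# Eight expressions, one per concurrent session.
clauses = modulo_where_clauses(8)

# Toy check on ids 1..20: the groups are disjoint and cover every id.
groups = partition_ids(range(1, 21), 8)
```

Each expression would feed the selection parameter of one session's Model.
Because the remainders 0 through 7 never overlap, merging the eight outputs
should reproduce the full result without duplicates.
-->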
    <item>
      <title>Re: Spatial join with 50 million observations...</title>
      <link>https://community.esri.com/t5/geoprocessing-questions/spatial-join-with-50-million-observations/m-p/43054#M1548</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Bruce,&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Thank you very much for your help; I greatly appreciate it.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;I have just one follow-up question: do you know if using the framework for ArcGIS 10 is faster/more efficient than running 8 simultaneous sessions in ArcGIS 9.3.1?&amp;nbsp; I am trying to figure out whether I should wait for my IT department to install ArcGIS 10, which is planned for the next few months.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Thanks,&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;Andrew Tschirhart&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 04 Aug 2011 21:45:57 GMT</pubDate>
      <guid>https://community.esri.com/t5/geoprocessing-questions/spatial-join-with-50-million-observations/m-p/43054#M1548</guid>
      <dc:creator>AndrewTschirhart</dc:creator>
      <dc:date>2011-08-04T21:45:57Z</dc:date>
    </item>
    <item>
      <title>Re: Spatial join with 50 million observations...</title>
      <link>https://community.esri.com/t5/geoprocessing-questions/spatial-join-with-50-million-observations/m-p/43055#M1549</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;BLOCKQUOTE class="jive-quote"&gt;Bruce,&lt;BR /&gt;&lt;BR /&gt;Thank you very much for your help, I greatly appreciate it.&lt;BR /&gt;&lt;BR /&gt;I have just one follow-up question: do you know if using the framework for ArcGIS 10 is faster/more efficient than running 8 simultaneous sessions in ArcGIS 9.3.1?&amp;nbsp; I am trying to figure out whether I should wait for my IT to install ArcGIS 10, which is planned for the next few months.&lt;BR /&gt;&lt;BR /&gt;Thanks,&lt;BR /&gt;Andrew Tschirhart&lt;/BLOCKQUOTE&gt;&lt;BR /&gt;&lt;SPAN&gt; &lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;Andrew&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;The two approaches are the same in terms of how the geoprocessing functions execute; the 10-based script simply handles all the split/process/merge legwork for you.&amp;nbsp; The script will also help you with the modulo arithmetic details, so take a look anyway.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Regards&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 05 Aug 2011 14:22:00 GMT</pubDate>
      <guid>https://community.esri.com/t5/geoprocessing-questions/spatial-join-with-50-million-observations/m-p/43055#M1549</guid>
      <dc:creator>BruceHarold</dc:creator>
      <dc:date>2011-08-05T14:22:00Z</dc:date>
    </item>
    <item>
      <title>Re: Spatial join with 50 million observations...</title>
      <link>https://community.esri.com/t5/geoprocessing-questions/spatial-join-with-50-million-observations/m-p/43056#M1550</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Bruce,&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Thank you very much for your help; I am very grateful.&amp;nbsp; I will definitely take a look at the script.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Andrew Tschirhart&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 05 Aug 2011 14:27:18 GMT</pubDate>
      <guid>https://community.esri.com/t5/geoprocessing-questions/spatial-join-with-50-million-observations/m-p/43056#M1550</guid>
      <dc:creator>AndrewTschirhart</dc:creator>
      <dc:date>2011-08-05T14:27:18Z</dc:date>
    </item>
  </channel>
</rss>

