<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Speeding up a join in Python Questions</title>
    <link>https://community.esri.com/t5/python-questions/speeding-up-a-join/m-p/670474#M51948</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;How large is your data? How many records do you have and how many fields do you want to join?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;A href="https://community.esri.com/migrated-users/4811"&gt;Richard Fairhurst&lt;/A&gt;&amp;nbsp;has a very good blog post about joins in python using dictionaries, which I recommend you to read:&amp;nbsp;&lt;A href="https://community.esri.com/blogs/richard_fairhurst/2014/11/08/turbo-charging-data-manipulation-with-python-cursors-and-dictionaries"&gt;/blogs/richard_fairhurst/2014/11/08/turbo-charging-data-manipulation-with-python-cursors-and-dictionaries&lt;/A&gt;&amp;nbsp;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Sun, 04 Mar 2018 15:23:17 GMT</pubDate>
    <dc:creator>XanderBakker</dc:creator>
    <dc:date>2018-03-04T15:23:17Z</dc:date>
    <item>
      <title>Speeding up a join</title>
      <link>https://community.esri.com/t5/python-questions/speeding-up-a-join/m-p/670473#M51947</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;What is a way to speed up a join (based on a feature ID) of a feature class (in a file gdb) and a table of summary statistics about features in the feature class?&amp;nbsp; The table has a one-to-one relationship with the feature class.&amp;nbsp; Split the feature class and table and use multiprocess?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Has the package multiprocess superseded multiprocessing?&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Sun, 04 Mar 2018 14:38:12 GMT</pubDate>
      <guid>https://community.esri.com/t5/python-questions/speeding-up-a-join/m-p/670473#M51947</guid>
      <dc:creator>BrooksBreece</dc:creator>
      <dc:date>2018-03-04T14:38:12Z</dc:date>
    </item>
    <item>
      <title>Re: Speeding up a join</title>
      <link>https://community.esri.com/t5/python-questions/speeding-up-a-join/m-p/670474#M51948</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;How large is your data? How many records do you have and how many fields do you want to join?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;A href="https://community.esri.com/migrated-users/4811"&gt;Richard Fairhurst&lt;/A&gt;&amp;nbsp;has a very good blog post about joins in python using dictionaries, which I recommend you to read:&amp;nbsp;&lt;A href="https://community.esri.com/blogs/richard_fairhurst/2014/11/08/turbo-charging-data-manipulation-with-python-cursors-and-dictionaries"&gt;/blogs/richard_fairhurst/2014/11/08/turbo-charging-data-manipulation-with-python-cursors-and-dictionaries&lt;/A&gt;&amp;nbsp;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Sun, 04 Mar 2018 15:23:17 GMT</pubDate>
      <guid>https://community.esri.com/t5/python-questions/speeding-up-a-join/m-p/670474#M51948</guid>
      <dc:creator>XanderBakker</dc:creator>
      <dc:date>2018-03-04T15:23:17Z</dc:date>
    </item>
    <item>
      <title>Re: Speeding up a join</title>
      <link>https://community.esri.com/t5/python-questions/speeding-up-a-join/m-p/670475#M51949</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;I have 10,000ish records and am joining 5 to 7 fields.&amp;nbsp; The join takes 30 minutes, which is not horrible.&amp;nbsp; But, I want to make it faster.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Sun, 04 Mar 2018 16:52:28 GMT</pubDate>
      <guid>https://community.esri.com/t5/python-questions/speeding-up-a-join/m-p/670475#M51949</guid>
      <dc:creator>BrooksBreece</dc:creator>
      <dc:date>2018-03-04T16:52:28Z</dc:date>
    </item>
    <item>
      <title>Re: Speeding up a join</title>
      <link>https://community.esri.com/t5/python-questions/speeding-up-a-join/m-p/670476#M51950</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Just before you start coding there are some general recommendations to make things faster:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;Is you data on a network drive, make a local copy&lt;/LI&gt;&lt;LI&gt;What is the data format? Best to load the data into a file geodatabase&lt;/LI&gt;&lt;LI&gt;The fields you join on, do&amp;nbsp;you have attribute indexes defined on those fields?&lt;/LI&gt;&lt;LI&gt;obviously the computer&amp;nbsp;specs also have influence over the speed you will obtain.&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;30 minutes for 10.000 records (both featureclass and table) is very slow.&amp;nbsp;It should be possible to do this a lot faster.&lt;/P&gt;&lt;P&gt;Have a look at "&lt;EM&gt;Example 2 - Transfer of Multiple Field Values between Feature Classes where there is a 1:1 Match between Field Sets&lt;/EM&gt;" within the section on "&lt;STRONG&gt;Using a Python Dictionary Built using a da SearchCursor to Replace a Join Connecting Two Feature Classes&lt;/STRONG&gt;". That seems exactly what you are looking for.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;If you run into problems please postback the code you are using and a sample of your data and we will have a look.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Sun, 04 Mar 2018 23:31:50 GMT</pubDate>
      <guid>https://community.esri.com/t5/python-questions/speeding-up-a-join/m-p/670476#M51950</guid>
      <dc:creator>XanderBakker</dc:creator>
      <dc:date>2018-03-04T23:31:50Z</dc:date>
    </item>
    <item>
      <title>Re: Speeding up a join</title>
      <link>https://community.esri.com/t5/python-questions/speeding-up-a-join/m-p/670477#M51951</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Here is a pretty simple python join tool that ESRI created a couple years ago that&amp;nbsp;just uses&amp;nbsp;a cursor to do the joining. It only runs on geodatabase inputs, however.&amp;nbsp; But it's very fast (at least compared the the standard JoinField tool). I had to modify it slightly, since it didn't recognize field of type "SHORT".&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 05 Mar 2018 21:35:33 GMT</pubDate>
      <guid>https://community.esri.com/t5/python-questions/speeding-up-a-join/m-p/670477#M51951</guid>
      <dc:creator>ChrisRingo1</dc:creator>
      <dc:date>2018-03-05T21:35:33Z</dc:date>
    </item>
  </channel>
</rss>

