The only way to make Portal fail over is to stop the service. Through 10.6, this can take some time, as you have experienced. At 10.6.1, the failover time dropped to under 30 seconds thanks to some re-architecting of the logic.
Is there a reason why one should be the primary? It shouldn't matter whether portal1 is the primary or portal2 is the primary.
In our enterprise we have two physical servers, prd1 and prd2. Both host ArcGIS Enterprise server and, now, portal in an HA config. prd1 also hosts the portal and server web adaptors. We plan to deploy ArcGIS Enterprise Data Store to prd2, so I'd like to limit the load that portal places on prd2 by keeping prd1 as the primary portal as much as possible. If you can confirm that, in the HA portal config, the load on each server is the same whether its portal is functioning as primary or secondary, then I agree it doesn't matter which portal is primary and which is secondary.
Right, it doesn't matter which machine is primary and which is standby. Both receive requests through the web adaptor or load balancer in front of them, and both access the database on the primary. Data is automatically streamed to the standby, so in the event of a failure on the primary, the standby can be promoted with no data loss. When the failed primary comes back, it rejoins as the standby, and data is streamed again, this time from the new primary to it.
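To make the role swap concrete, here is a toy Python model of the promotion and rejoin behavior described above. It is only a sketch and not Esri code: the `PortalPair` class and its method names are made up for illustration, and it models nothing beyond which machine holds which role.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class PortalPair:
    """Hypothetical toy model of a two-machine HA portal pair."""
    primary: str
    standby: Optional[str]
    standby_in_sync: bool = True  # data is streamed from primary to standby

    def fail_primary(self) -> str:
        """The primary stops; the standby is promoted. Returns the failed machine."""
        failed = self.primary
        self.primary, self.standby = self.standby, None
        self.standby_in_sync = False  # no standby to stream to until one rejoins
        return failed

    def rejoin(self, machine: str) -> None:
        """A recovered machine comes back as standby, never as primary."""
        self.standby = machine
        self.standby_in_sync = True  # streaming resumes from the current primary


pair = PortalPair(primary="prd1", standby="prd2")
failed = pair.fail_primary()       # e.g. the portal service on prd1 is stopped
pair.rejoin(failed)                # prd1 returns, but only as standby
print(pair.primary, pair.standby)  # prd2 prd1
```

In this model, getting prd1 back into the primary role takes a second failover, that is, stopping the service on prd2, which is why a preferred primary cannot simply be pinned.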
Jonathan,
Thank you for the information.
Todd