It's a bit unusual (but not impossible) for multiple instances of the LOAD operator to make that much difference with tbuild.
You might also try allocating more sessions to the job while keeping the number of instances the same; e.g. QueryBandSessInfo='UtilityDataSize=Large;' is configured for this purpose by default.
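As a rough sketch of that option, the LOAD operator definition might look something like the fragment below. The operator name LOAD_OP, the job variables (@TdpId, @UserName, etc.) and the session count of 16 are placeholders I made up for illustration, not values from your job, so adjust them to your own script and system limits.

```
/* Hypothetical LOAD operator definition: names and values are placeholders. */
DEFINE OPERATOR LOAD_OP
TYPE LOAD
SCHEMA *
ATTRIBUTES
(
    VARCHAR TdpId        = @TdpId,
    VARCHAR UserName     = @UserName,
    VARCHAR UserPassword = @UserPassword,
    VARCHAR TargetTable  = @TargetTable,
    VARCHAR LogTable     = @LogTable,
    /* Upper bound on sessions for the job; 16 is only an example value. */
    INTEGER MaxSessions  = 16,
    /* Query band hint mentioned above. */
    VARCHAR QueryBandSessInfo = 'UtilityDataSize=Large;'
);
```

The instance count is set separately, in the APPLY statement (e.g. LOAD_OP[2]), so you can raise the session limit here without touching the number of instances. The sessions the job actually gets may still be capped by workload rules on the database side.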
Your feedback wasn't clear to me.
Which version took 4 hours, and which one took 8 hours?