|
Clearly this way of parallelizing the loop is much faster than if we parallelized the independent operations inside the loop. That would require communication in every iteration thru the loop, and unless the independent operations where really computationally hard, we would end up doing a lot of communication and little real computation.
The parallelization of a loop is a step-wise process, where we gather more and more information, and continually decide whether it is possible to continue the loop parallelization. The steps are described in detail in the following.