One of the problems of WPP parallelization is long lags for CTU rows at the end of picture, because (k+1)-th thread starts after the previous one has completed two CTUs:

 

In the figure above thread 8 starts after the first thread has completed the half of CTUs.

However, slicing reduces start lags and parallelization can be better exploited:

 

Leave a Reply

Your email address will not be published. Required fields are marked *