Feature: Support blocked ranges #551

samayala22 · 2024-02-07T00:53:03Z

This is a feature that is only really useful for those developping high performance kernels.

In TBB you have access to blocked_ranges, which gives you the begin and the end index of the current task partition. This is very practical when you have fully vectorized kernels that operate on a range rather than an index.

Example:

tbb::parallel_for(tbb::blocked_range<int>(0, 1000),[&](const tbb::blocked_range<int>& r) {
    some_vectorized_kernel(r.begin(), r.end());
});

In taskflow, the closest thing we can do right now is:

const int step = 16; // some hardcoded step size
taskflow.for_each_index(0, 1000, step, [&] (int i) {
    some_vectorized_kernel(i, i+step);
});

This is okay but we are hardcoding the step size (usually the vector width) which can affect the granularity of the parallelism. If the step is too big, irregular workloads cannot be properly balanced. If the step is too small (like here at 16), we are increasing function call overhead, decreasing ILP in our kernel and polluting cache between intermediate calls.

The solution is almost there. With something like a guided or static partitioner, we just need to modify the api to have some sort of access to the chunk size (or a start and begin index like TBB).

Example:

taskflow.for_each_index(0, 1000, step, [&] (const tf::Range<int>& r) {
    some_vectorized_kernel(r.begin(), r.end());
});

The text was updated successfully, but these errors were encountered:

longpractice · 2024-02-27T15:29:00Z

I also think this is kind of important since within a block I usual do a bit more initialization work before going into individual indices.

samayala22 mentioned this issue Feb 8, 2024

PoC: For Each (Range) Index #552

Draft

tsung-wei-huang added the enhancement New feature or request label Feb 17, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature: Support blocked ranges #551

Feature: Support blocked ranges #551

samayala22 commented Feb 7, 2024

longpractice commented Feb 27, 2024

Feature: Support blocked ranges #551

Feature: Support blocked ranges #551

Comments

samayala22 commented Feb 7, 2024

longpractice commented Feb 27, 2024