Locality optimisations for multicore backend #1695

athas · 2022-07-02T06:19:17Z

The multicore (and C) backends will sometimes generate inefficient code where arrays are sequentially traversed with a non-unit stride. This is bad for cache locality: with a large stride, consecutive accesses land on different cache lines, so most of each fetched line goes unused. I think a good solution would be to do something similar to the "kernel babysitter" used by the GPU pipelines, where we analyse traversal patterns and transpose the arrays in advance so that the eventual traversal is contiguous. This is not as good as tiling, but it is very general.
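To make the access pattern concrete, here is a minimal hand-written sketch in C. It is not output from the compiler, and the function names (`col_sums_strided`, `transpose`, `col_sums_transposed`) are hypothetical, chosen only for illustration. It assumes a row-major `n` by `m` array of doubles and sums each column, first with a stride of `m` elements in the inner loop, then with unit stride after transposing in advance.

```c
#include <assert.h>
#include <stddef.h>

/* Sum each column of a row-major n x m array.
   The inner loop walks down a column, so consecutive reads are
   m elements apart (stride m). */
void col_sums_strided(size_t n, size_t m, const double *a, double *out) {
  for (size_t j = 0; j < m; j++) {
    double acc = 0.0;
    for (size_t i = 0; i < n; i++) {
      acc += a[i * m + j];
    }
    out[j] = acc;
  }
}

/* Copy the row-major n x m array a into the row-major m x n array t,
   so that each column of a becomes a contiguous row of t.
   In-place transposition is not supported by this sketch. */
void transpose(size_t n, size_t m, const double *a, double *t) {
  assert(a != t);
  for (size_t i = 0; i < n; i++) {
    for (size_t j = 0; j < m; j++) {
      t[j * n + i] = a[i * m + j];
    }
  }
}

/* Same result as col_sums_strided, but reading the transposed copy t
   (m x n, row-major). The inner loop now has unit stride. */
void col_sums_transposed(size_t n, size_t m, const double *t, double *out) {
  for (size_t j = 0; j < m; j++) {
    double acc = 0.0;
    for (size_t i = 0; i < n; i++) {
      acc += t[j * n + i];
    }
    out[j] = acc;
  }
}
```

The transposition is itself a strided copy, so in this sketch it only pays off if the transposed array is traversed more than once, or if the copy can be done more cleverly (for example in blocks, or by having the producer write the transposed layout directly). Whether and how the compiler would do that is left open here.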

athas added the optimisation, compiler, and student-viable (Viable as a student project) labels on Jul 2, 2022
