Occupancy improvement for Hash table build #15700

tgujar · 2024-05-08T01:43:42Z

Description

Prototype implementation for: #15502

Checklist

I am familiar with the Contributing Guidelines.
New or existing tests cover these changes.
The documentation is up to date with these changes.

copy-pr-bot · 2024-05-08T01:43:47Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

tgujar · 2024-05-08T01:55:21Z

I think the approach of specializing the type dispatcher is very cumbersome and will lead to a lot of code replication. Currently, I have the conditional dispatch working for device_row_hasher but I am unsure if there is a better way to implement this. We could introduce a macro here to generate the code, what do you think?

PointKernel · 2024-05-08T19:04:21Z

/ok to test

PointKernel · 2024-05-14T19:45:57Z

/ok to test

PointKernel · 2024-05-14T19:49:36Z

@tgujar I've updated the docs to unblock CI. Have you noticed any performance regressions for other use cases? It seems that it improves the performance for mixed join but the performance drops significantly in other cases using row hasher.

ttnghia · 2024-05-14T20:23:44Z

cpp/src/join/mixed_join_common_utils.cuh

+                                                          id_to_type<type_id::DECIMAL128>,
+                                                          id_to_type<type_id::DECIMAL64>,
+                                                          id_to_type<type_id::DECIMAL32>,


I don't think decimal types are complex type. They are just a wrapper around some integer type.

Equality operator for Decimal will perform scaling which uses exponentiation.

cudf/cpp/include/cudf/fixed_point/fixed_point.hpp

Line 735 in 888e9d5

CUDF_HOST_DEVICE inline bool operator==(fixed_point<Rep1, Rad1> const& lhs,

I see a reduction in register usage if I comment out decimal types in #15502. I think we can still decide on the types excluded in the branches later on

PointKernel · 2024-05-16T02:56:52Z

/ok to test

PointKernel · 2024-05-16T14:55:44Z

@tgujar Could you take a look at the failing tests?

PointKernel · 2024-05-17T17:57:22Z

/ok to test

PointKernel · 2024-05-21T16:02:15Z

/ok to test

davidwendt · 2024-05-30T14:33:31Z

cpp/include/cudf/table/experimental/row_operators.cuh

-   * @throw cudf::logic_error if the input tables were preprocessed to transform any nested children
-   *        columns into integer columns but `PhysicalElementComparator` is not
+   * @throw cudf::logic_error if the input tables were preprocessed to transform any nested
+   * children columns into integer columns but `PhysicalElementComparator` is not


It appears that a significant number of changes to this file are due to reformatting comments.
Would it be possible to undo those changes? This particular change is certainly not desirable.

Makes sense, will fix. This was caused because of the clang-format extension on vscode

davidwendt · 2024-05-30T14:35:12Z

This PR needs to be rebased on branch-24.08.

tgujar · 2024-05-30T14:36:00Z

Specializing both the comparator and the hasher drops the register usage to 54 instead of the expected 46 for the mixed semi join case. Investigating why the register pressure is different from commenting out the code paths.
The current plan is to avoid using a macro(as mentioned here) and instead do dynamic dispatch on CPU side using std::variant and std::visit

tgujar added 2 commits May 6, 2024 12:01

nested template instantiation for hiding types

70124eb

hasher conditional type dispatch works

bae93a5

github-actions bot added the libcudf Affects libcudf (C++/CUDA) code. label May 8, 2024

delete dead comment block

6cf44b7

PointKernel added non-breaking Non-breaking change 3 - Ready for Review Ready for review by team improvement Improvement / enhancement to an existing function Performance Performance related issue labels May 8, 2024

PointKernel added 4 commits May 8, 2024 13:23

Merge remote-tracking branch 'upstream/branch-24.06' into hash-occupancy

790c106

Merge remote-tracking branch 'upstream/branch-24.06' into hash-occupancy

7183956

Merge remote-tracking branch 'upstream/branch-24.06' into hash-occupancy

52709f6

Fix docs

ee7a1e1

ttnghia reviewed May 14, 2024

View reviewed changes

PointKernel added 2 commits May 15, 2024 12:21

Merge branch 'branch-24.06' into hash-occupancy

9a62dcd

Merge branch 'branch-24.06' into hash-occupancy

eda24ce

tgujar and others added 2 commits May 17, 2024 08:48

fix type logic, minor refactor

0b32cf3

Merge branch 'branch-24.06' into hash-occupancy

df0bd6c

Merge branch 'branch-24.06' into hash-occupancy

044ad37

tgujar added 3 commits May 29, 2024 07:49

refactor

f8634d0

added template specialization for equality comparator

847b699

added template specialized calls to comparator

072b935

davidwendt reviewed May 30, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Occupancy improvement for Hash table build #15700

Occupancy improvement for Hash table build #15700

tgujar commented May 8, 2024 •

edited

copy-pr-bot bot commented May 8, 2024

tgujar commented May 8, 2024 •

edited

PointKernel commented May 8, 2024

PointKernel commented May 14, 2024

PointKernel commented May 14, 2024 •

edited

ttnghia May 14, 2024

tgujar May 16, 2024

PointKernel commented May 16, 2024

PointKernel commented May 16, 2024

PointKernel commented May 17, 2024

PointKernel commented May 21, 2024

davidwendt May 30, 2024

tgujar May 30, 2024

davidwendt commented May 30, 2024

tgujar commented May 30, 2024

Occupancy improvement for Hash table build #15700

Are you sure you want to change the base?

Occupancy improvement for Hash table build #15700

Conversation

tgujar commented May 8, 2024 • edited

Description

Checklist

copy-pr-bot bot commented May 8, 2024

tgujar commented May 8, 2024 • edited

PointKernel commented May 8, 2024

PointKernel commented May 14, 2024

PointKernel commented May 14, 2024 • edited

ttnghia May 14, 2024

Choose a reason for hiding this comment

tgujar May 16, 2024

Choose a reason for hiding this comment

PointKernel commented May 16, 2024

PointKernel commented May 16, 2024

PointKernel commented May 17, 2024

PointKernel commented May 21, 2024

davidwendt May 30, 2024

Choose a reason for hiding this comment

tgujar May 30, 2024

Choose a reason for hiding this comment

davidwendt commented May 30, 2024

tgujar commented May 30, 2024

tgujar commented May 8, 2024 •

edited

tgujar commented May 8, 2024 •

edited

PointKernel commented May 14, 2024 •

edited