Batch reprepare speculatively #531

wprzytula · 2022-08-23T15:08:34Z

Extracted repreparing as a method on Connection to avoid duplication. Also, after one statement in a batch is found to be unprepared, we speculatively prepare all prepared statements in the batch, as there is likelihood that once one of them becomes unprepared, the others do too.
Fixes: #529

Pre-review checklist

I have split my patch into logically separate commits.
All commit messages clearly explain what they change and why.
I added relevant tests for new features and bug fixes.
All commits compile, pass static checks and pass test.
PR description sums up the changes and reasons why they should be introduced.
I added appropriate Fixes: annotations to PR description.

As reprepare logic was duplicated (in prepared statements and in batches), I've extracted it as a separate function.

wprzytula · 2022-08-23T15:33:36Z

I've noticed I perform all prepares sequentially, effectively contradicting the purpose of this PR. Will change it into concurrent reprepare.

wprzytula · 2022-08-23T15:47:21Z

v2: made reprepares concurrent.

Before, each unprepared statement in a batch caused a full roundtrip to Scylla. We instead speculatively prepare them all, as there is strong likelihood that once one of them becomes unprepared, the others do too. This change may significantly reduce latency in such case.

psarna · 2022-08-25T09:02:17Z

On the other hand, this is pessimizing for situations where the number of invalidated statements is orders of magnitude smaller than the number of statements in the batch.

We currently implement a variant of LRU with two pools (ref: scylladb/scylladb@1a9c6d9fd3) and this speculative preparation may interact badly with the algorithm by pushing other prepared statements from the first, probationary buffer, even though lots of them might have already been prepared.

So, in order to judge how we should proceed with this, here's question 1: was this pull request based on an existing issue that somebody (or our test case) discovered, or was it a result of code inspection? If it's not based on a real issue, we at least need a couple of tests to prove that it makes sense, comparing what happens when a minority and majority of batch statements are not prepared, respectively.

psarna · 2022-08-25T09:02:50Z

(btw, the commit which deduplicates repreparing logic is good on its own anyway, so we can definitely apply that one regardless of the result of this discussion)

piodul · 2022-08-25T09:13:05Z

scylla/src/transport/connection.rs

                            _ => None,
+                        })
+                        .map(|prepared_statement| {


It's probable that a single batch will have multiple instances of the same prepared query, only with different arguments - for example imagine 100 inserts to the same table. It would be good to deduplicate them before repreparing.

I assume you mean deduplicating by prepared_id, am I right?

wprzytula · 2022-08-25T10:09:50Z

On the other hand, this is pessimizing for situations where the number of invalidated statements is orders of magnitude smaller than the number of statements in the batch.

We currently implement a variant of LRU with two pools (ref: scylladb/scylladb@1a9c6d9fd3) and this speculative preparation may interact badly with the algorithm by pushing other prepared statements from the first, probationary buffer, even though lots of them might have already been prepared.

So, in order to judge how we should proceed with this, here's question 1: was this pull request based on an existing issue that somebody (or our test case) discovered, or was it a result of code inspection? If it's not based on a real issue, we at least need a couple of tests to prove that it makes sense, comparing what happens when a minority and majority of batch statements are not prepared, respectively.

It was based solely on code inspection.

connection: Extracted repreparing as a method

525fdf1

As reprepare logic was duplicated (in prepared statements and in batches), I've extracted it as a separate function.

wprzytula force-pushed the batch-reprepare-speculatively branch from 6597b1c to 0041016 Compare August 23, 2022 15:47

piodul reviewed Aug 25, 2022

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Batch reprepare speculatively #531

Batch reprepare speculatively #531

wprzytula commented Aug 23, 2022 •

edited

wprzytula commented Aug 23, 2022

wprzytula commented Aug 23, 2022

psarna commented Aug 25, 2022

psarna commented Aug 25, 2022

piodul Aug 25, 2022

wprzytula Aug 29, 2022

piodul Sep 5, 2022

wprzytula commented Aug 25, 2022

Batch reprepare speculatively #531

Are you sure you want to change the base?

Batch reprepare speculatively #531

Conversation

wprzytula commented Aug 23, 2022 • edited

Pre-review checklist

wprzytula commented Aug 23, 2022

wprzytula commented Aug 23, 2022

psarna commented Aug 25, 2022

psarna commented Aug 25, 2022

piodul Aug 25, 2022

Choose a reason for hiding this comment

wprzytula Aug 29, 2022

Choose a reason for hiding this comment

piodul Sep 5, 2022

Choose a reason for hiding this comment

wprzytula commented Aug 25, 2022

wprzytula commented Aug 23, 2022 •

edited