add metric for search split affinity #4998

trinity-1686a · 2024-05-17T13:42:58Z

Description

How was this PR tested?

verified the metric exists by running a small cluster

quickwit/quickwit-search/src/search_job_placer.rs

quickwit/quickwit-search/src/metrics.rs

fulmicoton · 2024-05-20T09:20:46Z

I am surprised by the placement code. I thought Francois had updated it for something smarter a long time ago. I might be mistaking it for something else.

Currently the code looks like that.

 pub async fn assign_jobs<J: Job>(
        &self,
        mut jobs: Vec<J>,
        excluded_addrs: &HashSet<SocketAddr>,
    ) -> anyhow::Result<impl Iterator<Item = (SearchServiceClient, Vec<J>)>> {
        let num_nodes = self.searcher_pool.len();

        let mut candidate_nodes: Vec<CandidateNodes> = self
            .searcher_pool
            .pairs()
            .into_iter()
            .filter(|(grpc_addr, _)| {
                excluded_addrs.is_empty()
                    || excluded_addrs.len() == num_nodes
                    || !excluded_addrs.contains(grpc_addr)
            })
            .map(|(grpc_addr, client)| CandidateNodes {
                grpc_addr,
                client,
                load: 0,
            })
            .collect();

        if candidate_nodes.is_empty() {
            bail!(
                "failed to assign search jobs. there are no available searcher nodes in the pool"
            );
        }
        jobs.sort_unstable_by(Job::compare_cost);

        let mut job_assignments: HashMap<SocketAddr, (SearchServiceClient, Vec<J>)> =
            HashMap::with_capacity(num_nodes);

        for job in jobs {
            sort_by_rendez_vous_hash(&mut candidate_nodes, job.split_id());
            // Select the least loaded node.
            let chosen_node_idx = if candidate_nodes.len() >= 2 {
                usize::from(candidate_nodes[0].load > candidate_nodes[1].load)
            } else {
                0
            };
            let chosen_node = &mut candidate_nodes[chosen_node_idx];
            chosen_node.load += job.cost();

            job_assignments
                .entry(chosen_node.grpc_addr)
                .or_insert_with(|| (chosen_node.client.clone(), Vec::new()))
                .1
                .push(job);
        }
        Ok(job_assignments.into_values())
    }

Francois's algorithm was computing the perfect target load.
We would then allocate the node with the best affinity as long as we don't have exceeded the average.
We could even add a small margin.

@fmassot was it used in a different part of the code or has it never been merged?

and remove comment about job assignment. it was made to create a discussion, and the discussion now exists

quickwit/quickwit-search/src/metrics.rs

add metric for search split affinity

f1fd431

trinity-1686a requested a review from fulmicoton May 17, 2024 13:42

trinity-1686a commented May 17, 2024

View reviewed changes

quickwit/quickwit-search/src/search_job_placer.rs Outdated Show resolved Hide resolved

fulmicoton reviewed May 20, 2024

View reviewed changes

quickwit/quickwit-search/src/metrics.rs Outdated Show resolved Hide resolved

trinity-1686a added 2 commits May 27, 2024 15:10

rename metric

700fcfe

and remove comment about job assignment. it was made to create a discussion, and the discussion now exists

Merge branch 'main' into trinity/metric-split-affinity-ratio

6e56a9e

fulmicoton reviewed May 28, 2024

View reviewed changes

quickwit/quickwit-search/src/metrics.rs Outdated Show resolved Hide resolved

trinity-1686a added 2 commits May 29, 2024 19:01

use single metric with label

38f2a58

Merge branch 'main' into trinity/metric-split-affinity-ratio

58dad73

trinity-1686a requested a review from fulmicoton May 29, 2024 17:32

trinity-1686a mentioned this pull request May 30, 2024

improve placing algorithm #5051

Open

fulmicoton approved these changes Jun 3, 2024

View reviewed changes

Merge branch 'main' into trinity/metric-split-affinity-ratio

e2ef6b5

trinity-1686a enabled auto-merge (squash) June 3, 2024 07:43

trinity-1686a merged commit fc7638b into main Jun 3, 2024
4 of 5 checks passed

trinity-1686a deleted the trinity/metric-split-affinity-ratio branch June 3, 2024 07:53

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add metric for search split affinity #4998

add metric for search split affinity #4998

trinity-1686a commented May 17, 2024

fulmicoton commented May 20, 2024 •

edited

add metric for search split affinity #4998

add metric for search split affinity #4998

Conversation

trinity-1686a commented May 17, 2024

Description

How was this PR tested?

fulmicoton commented May 20, 2024 • edited

fulmicoton commented May 20, 2024 •

edited