Extracting traces in parallel to speed up get_noise_levels (or any other traces related functions) #2382

yger · 2024-01-02T21:49:25Z

We can extend the pipeline machinery, as explained in #2380, to get data chunks in parallel. The list of chunks passed to the ChunkRecordingExecutor can be customized accordingly, to avoid looping over unnecessary chunks.

yger · 2024-01-03T07:54:33Z

Is it normal that get_random_data_chunks, in main, can return overlapping chunks? This seems weird to me, and the tests are failing because of behavior I would not expect. Intuitively, when I want to select N random chunks in a small recording, I expect them to be non-overlapping (to avoid biasing the stats) @alejoe91 @samuelgarcia. But currently, this is not the case.

yger · 2024-01-03T08:04:40Z

Note that I'm well aware that the parallelism adds a rather large overhead, so this is mostly useful when I/O is slow, for example when getting data from a remote location. This needs to be discussed.
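To make the trade-off concrete, here is a minimal sketch, not the code in this PR. It assumes a hypothetical `slow_read` function that simulates I/O latency with `time.sleep`, and it uses threads as a stand-in for the workers. The names `slow_read`, `read_chunks` and `latency_s` are illustrative only. When each read mostly waits on I/O, the waits overlap across workers; when reads are fast, the scheduling overhead can dominate.

```python
import time
from concurrent.futures import ThreadPoolExecutor

import numpy as np


def slow_read(start, stop, latency_s=0.01):
    """Hypothetical chunk reader: sleeps to mimic slow or remote I/O."""
    time.sleep(latency_s)
    return np.arange(start, stop, dtype=np.float32)


def read_chunks(chunks, n_jobs=1, latency_s=0.01):
    """Read (start, stop) chunks serially or with a pool of threads."""
    if n_jobs == 1:
        return [slow_read(start, stop, latency_s) for start, stop in chunks]
    with ThreadPoolExecutor(max_workers=n_jobs) as pool:
        # pool.map preserves the input order of the chunks
        return list(pool.map(lambda c: slow_read(c[0], c[1], latency_s), chunks))
```

Both paths return the same data in the same order; only the wall-clock time differs, and the gain depends on how much of each read is spent waiting.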

alejoe91 · 2024-01-03T08:54:52Z

> Is it normal that get_random_data_chunks, in main, can return overlapping chunks? This seems weird to me, and the tests are failing because of behavior I would not expect. Intuitively, when I want to select N random chunks in a small recording, I expect them to be non-overlapping (to avoid biasing the stats) @alejoe91 @samuelgarcia. But currently, this is not the case.

I think that, when possible, the chunks should indeed be non-overlapping.
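One possible way to draw non-overlapping random chunks, as a sketch only and not what get_random_data_chunks does: distribute the unused frames ("slack") as random sorted offsets, then shift each chunk by the cumulative chunk length. The function name and parameters below are hypothetical.

```python
import numpy as np


def random_non_overlapping_chunks(num_frames, chunk_size, num_chunks, seed=None):
    """Return sorted (start, stop) pairs that never overlap.

    Hypothetical helper for illustration; raises if the chunks cannot fit.
    """
    slack = num_frames - num_chunks * chunk_size
    if slack < 0:
        raise ValueError("recording too short for the requested chunks")
    rng = np.random.default_rng(seed)
    # Sorted random offsets in [0, slack]; equal offsets give adjacent chunks
    offsets = np.sort(rng.integers(0, slack + 1, size=num_chunks))
    # Shifting chunk i by i * chunk_size guarantees start[i+1] >= stop[i]
    starts = offsets + np.arange(num_chunks) * chunk_size
    return [(int(s), int(s) + chunk_size) for s in starts]
```

When the recording is too short to hold all chunks, this sketch raises an error; falling back to overlapping chunks in that case would be a separate design decision.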

samuelgarcia · 2024-01-09T09:09:01Z

Hi Pierre.
I am OK with the idea, but I am not sure I like the implementation.
The new run_traces_pipeline is more or less a ChunkRecordingExecutor that returns traces.
This is not a very good idea because the traces are pickled and transmitted to the main process, which uses a lot of memory bandwidth.
I will try to make another implementation with shared memory and without the pipeline mechanism.
The pipeline module is for peaks or spikes; adding functionality for a random chunk getter makes the module fuzzier, no?
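To illustrate the shared-memory idea, here is a rough sketch using the standard library's `multiprocessing.shared_memory`. It is not the implementation discussed in this PR. `fake_read_traces`, `write_chunk` and `gather_chunks_shared` are hypothetical names, and the reader is a stand-in for a real recording. Each worker writes its chunk directly into a preallocated shared buffer, so only a small integer is returned to the main process instead of a pickled array.

```python
from concurrent.futures import ProcessPoolExecutor, ThreadPoolExecutor
from multiprocessing import shared_memory

import numpy as np

DTYPE = np.float32


def fake_read_traces(start, stop, num_channels):
    """Stand-in for a recording read: each sample holds its frame index."""
    frames = np.arange(start, stop, dtype=DTYPE)[:, None]
    return np.repeat(frames, num_channels, axis=1)


def write_chunk(shm_name, out_shape, slot, start, stop):
    """Worker: attach to the shared buffer and fill one slot in place."""
    shm = shared_memory.SharedMemory(name=shm_name)
    out = np.ndarray(out_shape, dtype=DTYPE, buffer=shm.buf)
    out[slot] = fake_read_traces(start, stop, out_shape[2])
    del out  # release the buffer view before closing the handle
    shm.close()
    return slot  # only this small integer travels back to the caller


def gather_chunks_shared(starts, chunk_size, num_channels,
                         max_workers=2, executor_cls=ProcessPoolExecutor):
    """Collect chunks into one (num_chunks, chunk_size, num_channels) array."""
    shape = (len(starts), chunk_size, num_channels)
    nbytes = int(np.prod(shape)) * np.dtype(DTYPE).itemsize
    shm = shared_memory.SharedMemory(create=True, size=nbytes)
    try:
        with executor_cls(max_workers=max_workers) as pool:
            futures = [
                pool.submit(write_chunk, shm.name, shape, i, s, s + chunk_size)
                for i, s in enumerate(starts)
            ]
            for future in futures:
                future.result()  # re-raise any worker error
        view = np.ndarray(shape, dtype=DTYPE, buffer=shm.buf)
        result = view.copy()  # copy out before the buffer is freed
        del view
    finally:
        shm.close()
        shm.unlink()
    return result
```

With the default `ProcessPoolExecutor`, the call should sit under an `if __name__ == "__main__":` guard on platforms that spawn workers. Passing `executor_cls=ThreadPoolExecutor` is a quick way to check the logic in a single process.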

yger · 2024-01-09T09:48:20Z

I am open to any suggestions. I just wanted to highlight the potential speedup here, especially for slow or remote I/O. Thanks a lot for looking into this!

yger and others added 3 commits January 2, 2024 22:46

Proposal

2a41a2a

[pre-commit.ci] auto fixes from pre-commit.com hooks

f87dfe4

for more information, see https://pre-commit.ci

Adding return_scaled option

6cbb6a7

yger and others added 2 commits January 3, 2024 08:56

WIP

12b8a50

[pre-commit.ci] auto fixes from pre-commit.com hooks

f67e58e

for more information, see https://pre-commit.ci

yger added 2 commits January 3, 2024 09:11

Merge branch 'main' into parallel_noise_levels

ab9ab12

WIP

f6a0cc9

alejoe91 reviewed Jan 3, 2024

src/spikeinterface/core/recording_tools.py

yger and others added 9 commits January 3, 2024 12:32

job_kwargs

8257f30

[pre-commit.ci] auto fixes from pre-commit.com hooks

030ee57

for more information, see https://pre-commit.ci

Former implementation for backward compatibility and tests

416356d

[pre-commit.ci] auto fixes from pre-commit.com hooks

c7521a5

for more information, see https://pre-commit.ci

Concatenated mode also working

a148a75

[pre-commit.ci] auto fixes from pre-commit.com hooks

1d18179

for more information, see https://pre-commit.ci

Nicer syntax

b5d91ec

Nicer syntax

1ec92eb

[pre-commit.ci] auto fixes from pre-commit.com hooks

160c439

for more information, see https://pre-commit.ci

yger marked this pull request as ready for review January 3, 2024 14:33

Merge branch 'main' into parallel_noise_levels

985c7b9

alejoe91 reviewed Jan 3, 2024

src/spikeinterface/core/recording_tools.py

yger and others added 4 commits January 3, 2024 15:51

Docs

921fae0

[pre-commit.ci] auto fixes from pre-commit.com hooks

50e5a96

for more information, see https://pre-commit.ci

Docsé

b898740

[pre-commit.ci] auto fixes from pre-commit.com hooks

5972fb5

for more information, see https://pre-commit.ci

yger added the enhancement and core labels Jan 4, 2024

yger added 13 commits January 10, 2024 14:53

Merge branch 'SpikeInterface:main' into parallel_noise_levels

be6a64a

Merge branch 'SpikeInterface:main' into parallel_noise_levels

45ca6d6

Merge branch 'main' into parallel_noise_levels

0de23dd

Merge branch 'SpikeInterface:main' into parallel_noise_levels

a760764

Merge branch 'SpikeInterface:main' into parallel_noise_levels

73cc05e

Merge branch 'SpikeInterface:main' into parallel_noise_levels

facd426

Merge branch 'SpikeInterface:main' into parallel_noise_levels

2b5d31e

Merge branch 'SpikeInterface:main' into parallel_noise_levels

68986d4

Merge branch 'SpikeInterface:main' into parallel_noise_levels

a6a877d

Merge branch 'SpikeInterface:main' into parallel_noise_levels

c86db57

Merge branch 'SpikeInterface:main' into parallel_noise_levels

0bccec8

Merge branch 'SpikeInterface:main' into parallel_noise_levels

ca07895

Merge branch 'SpikeInterface:main' into parallel_noise_levels

7b8f11c
