New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Terminology: n_jobs vs num_workers vs ncpu etc. #4876
Comments
I hate to say it, but there is a third contender: SciPy tends to use just |
I think I actually like just |
Right, So, who is for |
Jobbers ? :) |
To be more serious, About n_* and num_*, we have both in the lib (n_dim, n_bins, n_inliers, n_tiles) and (num_trials, num_shapes, num_peaks, num_channels, ). A preference must be added to #2616 |
I'd vote to maintain the consistency throughout the packages: maybe |
To be consistent with the ecosystem, this requires some careful deliberation. @thomasjpfan wrote up a good overview of the state of the ecosystem. I see he recommends
|
Thanks for the link. To quote from that page
This argument works just as well in favor of |
How can we move this along with regards to #7302? I'm happy to involve the ecosystem but how do I do that? According to New SPEC Proposals this might be a good fit for a SPEC. I'll make a post in https://discuss.scientific-python.org/c/specs/ideas/9 if there's no objection. |
This does seem like exactly the kind of thing we need to agree on across projects, so +1. |
Posted this as a SPEC idea in Terminology for parameters controlling parallel computation. |
Since we aim at accelerating some of our functions, one way to do this is to use parallel computing. We already have one function (
restoration.cycle_spin
) which is usingdask.delayed
and several threads, there is also the attempt to parallelizesegmentation.slic
in #3120 which I plan to revive, and probably other functions will have in the future the possibility to use one or more jobs/workers (threads or processes). Therefore, I'm opening this issue so that we can decide what is the best terminology for this parameter. Should it benum_workers
(as incycle_spin
, and as indask
)n_jobs
(as inscikit-learn
and injoblib
)I think both are fine, the choice should be more about consistency with the rest of the ecosystem: do we want to be more consistent with our backend, or with big brother scikit?
The text was updated successfully, but these errors were encountered: