`asap-docking` | Correctly save multipose docking results #1009

apayne97 · 2024-04-24T15:48:02Z

Description

Updates the POSIT docker to use the updates from #692 to handle writing multipose posit results.
Fixes #988

Todos

Notable points that this PR has either accomplished or will accomplish.

add tests and functionality for creating a new ligand object from multiple separate ligand objects with the same chemical identity
update the POSITDocker to use this and pass all the results to a single results obj
add pose id to DockingResultsCols
add method to split POSITDockingResults into single pose results

Status

Ready to go

Developers certificate of origin

I certify that this contribution is covered by the MIT license as defined in our LICENSE and adheres to the Developer Certificate of Origin.

hmacdope

Looks good, see comments.

hmacdope · 2024-04-29T02:05:39Z

asapdiscovery-docking/asapdiscovery/docking/openeye.py

                            provenance=self.provenance(),
+                            probability=(
+                                posed_ligands[0].tags[


Can't this vary by ligand? Or are they all going to be homogenous?

(sorry for the delay) what are you referring to by "this"? the poses are ordered by the POSIT score such that the first pose is the one with the best POSIT probability.

hmacdope · 2024-04-29T02:07:27Z

asapdiscovery-docking/asapdiscovery/docking/openeye.py

+                            input_pairs.append(set)
+
+                    # create hashable dict of input pairs
+                    input_pair_dict = {
+                        input_pair.unique_name: input_pair for input_pair in input_pairs
+                    }
+
+                    # split results by input pair
+                    from collections import defaultdict

+                    results_dict = defaultdict(list)
+                    for input_pair, posed_ligand in zip(input_pairs, posed_ligands):
+                        results_dict[input_pair.unique_name].append(posed_ligand)
+
+                    # return results split by input pair
+                    for input_pair_name, posed_ligands in results_dict.items():
                        docking_result = POSITDockingResults(
-                            input_pair=input_pair,
-                            posed_ligand=posed_ligand,
-                            probability=prob,
+                            input_pair=input_pair_dict[input_pair_name],
+                            posed_ligand=Ligand.from_single_conformers(posed_ligands),
                            provenance=self.provenance(),
+                            probability=(
+                                posed_ligands[0].tags[
+                                    DockingResultCols.DOCKING_CONFIDENCE_POSIT.value
+                                ]
+                                if len(posed_ligands) == 1
+                                else None
+                            ),
+                            pose_id=(
+                                posed_ligands[0].tags["Pose_ID"]
+                                if len(posed_ligands) == 1
+                                else None
+                            ),


Can we move this more complicated logic into the branch above to avoid performance penalty when not using a MultiStructure input and also to make the non-Multi case clearer?

I think I've fixed this sufficiently?

for more information, see https://pre-commit.ci

apayne97 and others added 11 commits April 24, 2024 08:37

add some multipose results ideas

52c2eef

add a ligand schema test, refactor the Ligand.set_SD_data method,

f70e2d6

add from_single_conformers method to Ligand object

3e89157

add working multipose results saving

ea3bafc

add better handling of potentially different results

39b4c61

remove unnecessary multipose docking results class

240d6f5

add pose_id to scoring df outputs

6d40e42

add pose_id to DockingResults object

e273f22

add results splitter code to cross_docking.py

878d056

fix bug where I was adding probability and pose_id as lists

3e6cde3

Merge remote-tracking branch 'upstream/main' into save-multipose-results

c61937d

hmacdope requested changes Apr 29, 2024

View reviewed changes

apayne97 and others added 5 commits May 24, 2024 14:20

Merge branch 'main' into save-multipose-results

7827d21

[pre-commit.ci] auto fixes from pre-commit.com hooks

10fcf9e

for more information, see https://pre-commit.ci

fix merge clash errors

e271afe

delineate multipose docking code path in openeye more clearly

a69a894

fix how results are appended to the initial list of loaded results

741f009

apayne97 requested a review from hmacdope May 27, 2024 13:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`asap-docking` | Correctly save multipose docking results #1009

`asap-docking` | Correctly save multipose docking results #1009

apayne97 commented Apr 24, 2024 •

edited

hmacdope left a comment

hmacdope Apr 29, 2024

apayne97 May 24, 2024 •

edited

hmacdope Apr 29, 2024

apayne97 May 27, 2024

asap-docking | Correctly save multipose docking results #1009

Are you sure you want to change the base?

asap-docking | Correctly save multipose docking results #1009

Conversation

apayne97 commented Apr 24, 2024 • edited

Description

Todos

Status

Developers certificate of origin

hmacdope left a comment

Choose a reason for hiding this comment

hmacdope Apr 29, 2024

Choose a reason for hiding this comment

apayne97 May 24, 2024 • edited

Choose a reason for hiding this comment

hmacdope Apr 29, 2024

Choose a reason for hiding this comment

apayne97 May 27, 2024

Choose a reason for hiding this comment

`asap-docking` | Correctly save multipose docking results #1009

`asap-docking` | Correctly save multipose docking results #1009

apayne97 commented Apr 24, 2024 •

edited

apayne97 May 24, 2024 •

edited