Replies: 11 comments 25 replies
-
Smiles has no knowledge of extended stereo groups, CXSMiles, however, does:
results in:
|
Beta Was this translation helpful? Give feedback.
-
This is the ChemAxon extended smiles (btw): https://docs.chemaxon.com/display/docs/chemaxon-extended-smiles-and-smarts-cxsmiles-and-cxsmarts.md |
Beta Was this translation helpful? Give feedback.
-
Right. I understand that SMILES doesn't understand extended stereo groups. My question is regarding how much "canonicalization" I should be expecting that RDKit does before/during |
Beta Was this translation helpful? Give feedback.
-
It really does sound like CXSmiles is what you want here, it does canonicalize the "&"s.
Results in
|
Beta Was this translation helpful? Give feedback.
-
Note, you might want to remove coordinates when outputting cxsmiles for canonicalization purposes. |
Beta Was this translation helpful? Give feedback.
-
Do you think there is a way to get a "spirit of the law" way to say these 3 molecules represent the same mixture? One option could be comparing the sets of SMILES strings that the enumerator produces. |
Beta Was this translation helpful? Give feedback.
-
I think that currently, generating the isomers is probably the best way, we do have. Registration Hash but it currently has different results for 0 and (1,2)
Currently we can't add 'missing' & stereo groups. That might be a nice feature, actually. |
Beta Was this translation helpful? Give feedback.
-
This works for singleton
|
Beta Was this translation helpful? Give feedback.
-
However, to answer your question, SMILES was never intended to solve this particular problem that extended stereo chemistry solves, so it should not consider it in canonicalization. While inchi does normalize some tautomers, it also has the same issue regarding extended stereochemistry. |
Beta Was this translation helpful? Give feedback.
-
Loosely I consider the smiles notation to indicate what a molecule IS and extended stereo chemistry to indicate what a molecule could be. |
Beta Was this translation helpful? Give feedback.
-
I'm trying to understand which string representation of a molecule throws or keeps information in what way. @coleb suggested I contribute the following two functions: |
Beta Was this translation helpful? Give feedback.
-
I'm asking about 1-atom
STERAC
groups, like the following:Chem.MolToSmiles
for the latter two shows a stereocenter.Chem.MolToInchi
also shows stereoblocks. Is this expected/desired?Beta Was this translation helpful? Give feedback.
All reactions