Skip to content

Array-like structure for MolBlock IO #7235

Answered by greglandrum
cwognum asked this question in Q&A
Discussion options

You must be logged in to vote

Hi @cwognum, if I were doing this I would have a single column in which I store the V3000 mol block for each molecule.
If you want to enable more efficient processing with the RDKit, you could also have a column with the output of Mol.ToBinary() (that can change from version to version, but we maintain backwards compatibility).

Storing arrays of atoms and bonds is limiting, adds complication, requires you to write your own code to serialize/deserialize the molecules into the array format, and delivers very little additional value - there aren't very many use cases where you'd want to query those features and if you need that, you can always add code that creates those arrays while still s…

Replies: 3 comments 5 replies

Comment options

You must be logged in to vote
2 replies
@greglandrum
Comment options

Answer selected by cwognum
@cwognum
Comment options

Comment options

You must be logged in to vote
1 reply
@cwognum
Comment options

Comment options

You must be logged in to vote
2 replies
@greglandrum
Comment options

@cwognum
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants