Preprocessing of amino acid sequences #3784
Unanswered
TrikiSadok
asked this question in
Q&A
Replies: 1 comment 3 replies
-
It does! You can convert a sequence into an RDKit molecule as follows:
And then all the standard RDKit descriptors can be used:
|
Beta Was this translation helpful? Give feedback.
3 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I am not that familiar with RDKit.
Does RDKIt allow transforming amino acid sequences into numerical features for machine learning purposes?
example of amino acid sequence I am trying to transform into numerical descriptor that is included in the data set I have:
"
MIGIDIVSIARVEKCVKRFEMRFLERFLSPSEIVLCKDKSSSIAGFFALKEACSKALQVGIGKELSFLDMRISKSPKNAPLITLSKEKMDYFNIQSLSASISHDAGFAIAVVMISP
"
Thanks in advance for any help.
Beta Was this translation helpful? Give feedback.
All reactions