Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

suggestions #11

Open
DenizBartsch opened this issue Apr 22, 2020 · 2 comments
Open

suggestions #11

DenizBartsch opened this issue Apr 22, 2020 · 2 comments

Comments

@DenizBartsch
Copy link

I really love this tool and the output it gives, even for users like me.

From the RBP researcher point of view I have small suggestions:

  • A normalization of overlaps like it is done in homer ( I think it is normalized to the total length of the feature)

  • motif and GO term detection based on features, so for example you can search for motifs only in 3'UTRs and introns and annotate only those GO terms related to this feature

  • maybe as a little nice add-on to motif discovery you could implement secondary structure prediction tools

However, this is a really nice tool and I hope my suggestions are useful!

Best,

Deniz

@borauyar
Copy link
Member

Hi Deniz. Thank you for the nice suggestions. I will add the 'normalisation' and 'secondary structure prediction' to my to-do list.

Your third request is already possible with the current RCAS.

For feature-specific motif analysis, see RCAS::discoverFeatureSpecificMotifs function's example section. Just type ?RCAS::discoverFeatureSpecificMotifs in your RStudio command-line and it will show you the example usage there.

Feature-specific GO term analysis is also possible. First you need to find the list of genes/transcripts you need based on which feature overlaps you need and pass that list to the GO term analysis functions.

You can get the list of genes like this:

dt <- getTargetedGenesTable(queryRegions, txdbFeatures)

Then you can subset this table for the features you are interested in:
Let's say we want to find genes targeted at 3'UTRs (overlapping with peaks at 3'UTRs) but don't overlap any promoters:

tx <- dt[promoters == 0][threeUTRs > 0]$tx_name

and pass this list of transcript ids to RCAS::findEnrichedFunctions, which runs gprofiler2 at the backend.

RCAS::findEnrichedFunctions(targetGenes = tx, species = 'hsapiens')

you could also copy this list and paste it over to functional enrichment webservices such as gprofiler or enrichr

@DenizBartsch
Copy link
Author

DenizBartsch commented May 12, 2020 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants