Skip to content

Extension to HCV-GLUE project, to analyse binding sites for HCV neutralising antibodies.

Notifications You must be signed in to change notification settings

giffordlabcvr/HCV-NABS

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

43 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

HCV-NABS

Description

HCV-NABS (Hepatitis C Virus -- Neutralising AntiBodieS) is a linked dataset and set of analysis scripts for analysing the variation of HCV virus sequences with respect to the known binding locations of neutralising antibodies.

HCV-NABS is an open source analysis resource which accompanies the article Predicting the Effectiveness of Hepatitis C Virus Neutralizing Antibodies by Analysis of Public Sequence Data, Cowton et al. 2018, Manuscript submitted for publication.

HCV-NABS is based on GLUE, an open source, data-centric bioinformatics environment specialised for the analysis of virus sequence data. HCV-GLUE is the GLUE project for analysing HCV sequence data generally, and HCV-NABS is an extension to HCV-GLUE.

Installation

You can install HCV-NABS on computers running Windows, MacOSX or Linux. Note: if you simply want to run the HCV-NABS analysis described in the accompanying article, it is not strictly necessary to install certain of GLUE's runtime dependencies such as BLAST+, MAFFT and RAxML. However you should install these if you are interested in using GLUE for general HCV sequence analysis.

  1. Install GLUE, based on the GLUE installation instructions.
  2. Once GLUE is installed and working, you should download Offline HCV-GLUE and load it in to GLUE. Follow the Offline HCV-GLUE installation instructions. You should select the ncbi_hcv_glue.tar.gz project build.
  3. Clone the HCV-NABS repository into your gluetools/projects directory.
  4. Within the gluetools/projects/HCV-NABS directory, start GLUE and build the HCV-NABS extension by issuing the following command in GLUE:
Mode path: /
GLUE> run file hcvNabsExtension.glue
  1. This should run to completion and produce the OK result.

Usage

There are three types of report which the extension can generate. In each case the report is generated by running a command in project mode (use project hcv to enter this mode).

  1. Characterise the geographical distribution and genotypes/subtypes of public sequence data in the E1 and E2 genome regions To produce the E1 report as an interactive table result run:
Mode path: /project/hcv
GLUE> module hcvNabsCharacterisePublicData invoke-function featureCoverageByCountryAndClade E1

For the E2 report:

Mode path: /project/hcv
GLUE> module hcvNabsCharacterisePublicData invoke-function featureCoverageByCountryAndClade E2
  1. Analyse the frequencies of amino acid residues for individual binding locations, by genotype. The following command overwrites the file tabular/bindingResiduesReport.txt within the HCV-NABS repository:
Mode path: /project/hcv
GLUE> module hcvNabsReports invoke-function bindingResiduesReport
  1. Analyse the frequencies of motifs combining all binding locations for each antibody. The following command overwrites the file tabular/bindingMotifsReport.txt within the HCV-NABS repository:
Mode path: /project/hcv
GLUE> module hcvNabsAnalyseBindingMotifs invoke-function analyse

Contributing

The project could be extended, for example to look at more antibodies as their binding patterns are identified. If you are interested in extending the project in this or any other way, please contact Josh Singer.

Credits

The code was written by Josh Singer based on a specification by Vanessa Cowton. Please cite our research article: Predicting the Effectiveness of Hepatitis C Virus Neutralizing Antibodies by Analysis of Public Sequence Data, Cowton et al. 2018, Manuscript submitted for publication.

License

The project is licensed under the GNU Affero General Public License v. 3.0