OncoReport is a user-friendly software system for the analysis of DNA-seq data coming either from solid or liquid biopsy samples. The aim of OncoReport to identify the patients' tumor mutations allowing the help in the definition of the most suitable therapy.
The software performs two types of analysis:
- Tumor vs Normal
- Tumor only
The user can choose the human reference genome to use between GRCh37 (hg19) and GRCh38 (hg38).
OncoReport is a cross-platform application backed by a customized docker image to minimize the dependencies. The supported operating systems are:
- Windows Professional
- macOS
- Linux
Internet Connection: internet connection is needed only for the setup process and the download of cosmic database and reference genomes. After the installation has been completed no connection is required.
For WGS analysis:
- Processor 8 core processor
- RAM 64GB
- Hard Drive 1Tb
For WES analysis:
- Processor ?
- RAM 32GB
- Hard Drive 500GB
For custom panel?
- Install Docker Desktop in your computer by downloading the installer at this link Docker Desktop. When prompted, ensure to enable the Hyper-V Windows Features or WSL2 (available only for Windows 10 2004 or later). For Windows 10, WSL2 option is suggested.
- Now that Docker Desktop is running, you can install OncoReport by downloading...
- First check Docker Desktop system requirements.
- Install Docker Desktop in your computer by downloading the installer at this link Docker Desktop for mac and following the instructions.
- Now that Docker Desktop is running, download OncoReport...
- First install Docker in your computer by following the procedure for your linux distribution.
- Add your user account to the docker group. In most distribution you can get your username with the whoami command, and add it to the docker group using the following command:
foo@bar:~$ sudo usermod -aG docker YOUR_USER_NAME
- After adding your user account to docker group, you will need to log out and log in to activate the changes.
- Download the OncoReport package most suited for your distribution from ...
The pipeline takes several inputs:
- fastq
- bam
- ubam
- vcf (it also possible to upload a varianttable sample in this section) OncoReport allows users to easily generate rich reports to support clinical interpretation of variants and their association to prognosis and therapy.
After installing the app, it is possible to start immediately the analysis. First of all, the user has to create a new patient using the "+" in the patients screen. You can see from the image the specific of the patients page.
The information needed to add a new patient are:
- The Patient's Code*
- Patient's First and Last Name*
- Patient's Age*
- Patient's Gender*
- Fiscal Number
- Email*
- Telephone
- City
- Primary Disease* The disease that the user wants to analyze
- Disease type
- Diagnosis Date
- T, the stage of the patient's tumor
- N, the lymph node number
- M, the presence or absence of metastasis
The information indicated with the asterisk are mandatory.
After the patient creation, it is also possible to add information about other diseases of the patient and about the drugs already taken. In this way OncoReport is able to discover possible drug-drug interactions.
For each patient it is possible to add more than one analysis. At the end of each analysis we will have as output the logs, the report and an archive of the raw annotated output.
The user needs to provide analysis sample code, analysis name, the type of analysis, the input type and the number of threads that are gonna be used by the machine. It is possible to upload fastq, bam, ubam or vcf files. The user has also the chance to choose between two human reference genomes: GRCh37 (hg19) and GRCh38 (hg38). The user has also to specify if the samples are paired end.
It is possible to make two types of analysis:
- The Tumor vs Normal analysis It takes two inputs, a tumor sample and a normal sample. This analysis compare the two samples to remove all the germline mutation of the tumor sample. In this way the report will annotate only the somatic mutations. It is necessary to specify the "Depth filter" of the analysis.
- The Tumor only analysis It takes only the tumor sample. This tumor sample can originate both from a solid biopsy or a liquid biopsy. Here is necessary to specify both the "Depth filter" for variant calling and the "Allele Fraction Filter". The latter it is needed to split the somatic mutations from the germline ones since we do not have the normal samples. We suggest to use Less and equal to 0.3 for liquid biopsy and Less and equal to 0.4 for solid biopsy.
OncoReport' goal is the automatic generation of a report describing all the possible annotations (both prognostic and therapeutic) associated to the patients' mutations. To reach this purpose, it employees several databases which will allow to synthetize a precision medicine therapy report for the patient.
When the analysis finish the user can both interactively consult the report from the app or download locally a compressed version of it. Once decompressed the report will be available within a folder and the user will be able to consult it by clicking on the "index" file.
The report is composed by four main sections. An additional section named "other details" provides further information on the mutation annotations.
This section contains two tables providing information on the mutations' clinical impact. The first table reports mutations yielding therapeutic evidence going from Validated association, to clinical evidence and FDA approved drugs. The second table is called other evidences which are not already used in the clinical context. It includes all the mutations that have been marked as "Case study" evidence level, "Preclinical evidence" and so on.
In both tables the user will find the name of the gene and the variants which have been found mutated in the patient and the drugs associated with these specific mutations. The presence of more than one drug per row indicates the usage of such drugs in combination. The evidence type, can be:
- predictive and therefore associated with a drug that can bring a response;
- Diagnostic, this underlines the impact, positive (e.g. higer incidence of the disease) or negative (e.g. lower incidence of the disease), of the mutation on the patient;
- Prognostic, the impact of a variant on the patient's disease;
- Predisposing, the possibility that the variant predisposes the patient to the illness;
- Oncogenic, a somatic variant that it is involved in tumor pathogenesis;
- Functional, if the variant causes or not alteration of the function of the protein produced by the gene. This information is taken from the two databases: CIViC and Cancer Genome Interpreter (CGI). The Clinical Significance changes when the evidence type changes. For our purpose the most interesting are:
- Sensitivity/Response which is associated with the possible response of the patient, having that specific mutation, to the treatment with that drug;
- Resistance, which is associated with the possible resistance of the patient, having that specific mutation, to the treatment with that drug;
- Adverse Response to the drug treatment;
- Reduced Sensitivity when the patients respond to the treatment, but not in the best way possible. Type of mutation which can be somatic or germline. Details, when details about that evidence exists the user can click on the plus and reach to the subsection Clinical Evidence Details where it is possible to find information about the evidence and the study of that evidence.
Trials Clicking on trials the users can reach the "ClinicalTrials.gov" page where they will find all the existing trials for the drug-variant combination. References Clicking on reference the users reach the Reference Section. Here, they will be able to consult the pubmed paper from which the evidence was taken clicking on the PMID.
Score The evidence score is calculated using variant's pathogenic information, the repetition of the drug for the same mutation and the evidence publication year.
AIFA, EMA, FDA Information about the approval of drugs by the AIFA, EMA and FDA agencies. When there are three ticks it means that the drug or the combination of drugs has been approved by all the agency.
When the "x" is found, it means that at least one of the drug has not been approved by that specific agency.
Publication year The year of the evidence publication.
Information, taken from drugbank, about the interactions between the drugs already taken by the patient and the drugs recommended by the system.
Information, taken from drugbank, about the interactions between the drugs.
This section allows to prevent unpleasant side-effects to the patients, understanding the role of a combination of drugs to the organism.
This section provides information about the interaction between drugs and food allowing to be aware about side-effects, as nausea, to the patients under a specific therapy.
These guidelines, commonly consulted by the oncologists, have been added to the report to allow an easy and fast consultation. The guidelines will bring information about the primary disease inserted by the user at the beginning of the analysis.
Evidence taken from PharmGKB database. The user can found information about the gene with the variant indicated with rs code and the drug associated with its clinical significance (toxicity, efficacy). For each evidence also details are provided.
##Mutations Annotations Annotation for each mutation which have been found in the analysis. For each mutation the user will find:
- Gene name
- Chromosome
- Position
- Ref Base
- Var Base
- The change that the mutation cause (e.g. Synonymous, Missense, intron, etc.)
- Clinical Significance by Clinvar (e.g. Benign, Pathogenic, etc.)
- Clinical Significance by Cosmic (e.g. Neutral, Pathogenic, etc.)
- Allele Frequency
- Depth of the mutation in variant calling step
- Genotype: 0/1 Heterozygote, 1/1 Homozygote
- Class: SNP or Indel
- Type of mutation (e.g. Somatic, Germline)
Therapeutic indication concerning a gene-variant couple in disease different from the one studied by the user. These information can be essential when no evidence is found in therapeutic indication.
Information taken from the COSMIC database about the resistance to a drug in specific cancer with that variant mutation.