With more than 40 peer-reviewed scientific publications, findings from the POG program are influencing precision oncology approaches around the world.
Peer-reviewed publications
POG publications
Single-arm trials are common in precision oncology. Owing to the lack of randomized counterfactual, resultant data are not amenable to comparative outcomes analyses. Difference-in-difference (DID) methods present an opportunity to generate causal estimates of time-varying treatment outcomes. Using DID, our study estimates within-cohort effects of genomics-informed treatment versus standard care on clinical and cost outcomes.
This is the first report of a NACC2-NTRK2 fusion in a histological glioblastoma. Oncogenomic analysis revealed this actionable fusion oncogene in a pediatric cerebellar glioblastoma, which would not have been identified through routine diagnostics, demonstrating the value of clinical genome profiling in cancer care.
Randomized controlled trials (RCTs) are uncommon in precision oncology. We provide an introduction and illustrative example of matching methods for evaluating precision oncology in the absence of RCTs. We focus on British Columbia's Personalized OncoGenomics (POG) program, which applies whole-genome and transcriptome analysis (WGTA) to inform advanced cancer care.
Purpose: Structural variants (SVs) may be an underestimated cause of hereditary cancer syndromes given the current limitations of short-read next-generation sequencing. Here we investigated the utility of long-read sequencing in resolving germline SVs in cancer susceptibility genes detected through short-read genome sequencing.
Methods: Known or suspected deleterious germline SVs were identified using Illumina genome sequencing across a cohort of 669 advanced cancer patients with paired tumor genome and transcriptome sequencing. Candidate SVs were subsequently assessed by Oxford Nanopore long-read sequencing.
Results: Nanopore sequencing confirmed eight simple pathogenic or likely pathogenic SVs, resolving three additional variants whose impact could not be fully elucidated through short-read sequencing. A recurrent sequencing artifact on chromosome 16p13 and one complex rearrangement on chromosome 5q35 were subsequently classified as likely benign, obviating the need for further clinical assessment. Variant configuration was further resolved in one case with a complex pathogenic rearrangement affecting TSC2.
Conclusion: Our findings demonstrate that long-read sequencing can improve the validation, resolution, and classification of germline SVs. This has important implications for return of results, cascade carrier testing, cancer screening, and prophylactic interventions.
Keywords: genome sequencing; hereditary cancer; long-read sequencing; structural variants; variant interpretation.
Read our News Story for this publication.
Purpose: Gene fusions are important oncogenic drivers and many are actionable. Whole-genome and transcriptome (WGS and RNA-seq, respectively) sequencing can discover novel clinically relevant fusions.
Experimental design: Using WGS and RNA-seq, we reviewed the prevalence of fusions in a cohort of 570 patients with cancer, and compared prevalence to that predicted with commercially available panels. Fusions were annotated using a consensus variant calling pipeline (MAVIS) and required that a contig of the breakpoint could be constructed and supported from ≥2 structural variant detection approaches.
Results: In 570 patients with advanced cancer, MAVIS identified 81 recurrent fusions by WGS and 111 by RNA-seq, of which 18 fusions by WGS and 19 by RNA-seq were noted in at least 3 separate patients. The most common fusions were EML4-ALK in thoracic malignancies (9/69, 13%), and CMTM8-CMTM7 in colorectal cancer (4/73, 5.5%). Combined genomic and transcriptomic analysis identified novel fusion partners for clinically relevant genes, such as NTRK2 (novel partners: SHC3, DAPK1), and NTRK3 (novel partners: POLG, PIBF1).
Conclusions: Utilizing WGS/RNA-seq facilitates identification of novel fusions in clinically relevant genes, and detected a greater proportion than commercially available panels are expected to find. A significant benefit of WGS and RNA-seq is the innate ability to retrospectively identify variants that becomes clinically relevant over time, without the need for additional testing, which is not possible with panel-based approaches.
Read our News Story for this publication.
Purpose: Immune checkpoint inhibitors (ICIs) have revolutionised the treatment of solid tumours with dramatic and durable responses seen across multiple tumour types. However, identifying patients who will respond to these drugs remains challenging, particularly in the context of advanced and previously treated cancers.
Experimental design: We characterised fresh tumour biopsies from a heterogeneous pan-cancer cohort of 98 patients with metastatic predominantly pre-treated disease through the Personalized OncoGenomics (POG) program at BC Cancer using whole genome and transcriptome analysis (WGTA). Baseline characteristics and follow up data were collected retrospectively.
Results: We found that tumour mutation burden (TMB), independent of mismatch repair status, was the most predictive marker of time to progression (TTP, p=0.007), but immune related CD8+ T cell and M1-M2 macrophage ratio scores were more predictive for overall survival (OS) (p=0.0014 and 0.0012 respectively). While CD274 (PD-L1) gene expression is comparable to protein levels detected by immunohistochemistry (IHC), we did not observe a clinical benefit for patients with this marker. We demonstrate that a combination of markers based on WGTA provides the best stratification of patients (p=0.00071, OS), and also present a case study of possible acquired resistance to pembrolizumab in a non-small cell lung cancer (NSCLC) patient.
Conclusions: Interpreting the tumour-immune interface to predict ICI efficacy remains challenging. WGTA allows for identification of multiple biomarkers simultaneously that in combination may help to identify responders, particularly in the context of a heterogeneous population of advanced and previously treated cancers, thus precluding tumour type-specific testing.
Read our News Story for this publication.
Introduction: Carcinogenesis is driven by an array of complex genomic patterns; these patterns can render an individual resistant or sensitive to certain chemotherapy agents. The Personalized Oncogenomics (POG) project at BC Cancer has performed integrative genomic analysis of whole tumour genomes and transcriptomes for over 700 patients with advanced cancers, with an aim to predict therapeutic sensitivities. The aim of this study was to utilize the POG genomic data to evaluate a discrete set of biomarkers associated with chemo-sensitivity or-resistance in advanced stage breast and colorectal cancer POG patients.
Methods: This was a retrospective multi-centre analysis across all BC CANCER sites. All breast and colorectal cancer patients enrolled in the POG program between July 1, 2012 and November 30, 2016 were eligible for inclusion. Within the breast cancer population, those treated with capecitabine, paclitaxel, and everolimus were analyzed, and for the colorectal cancer patients, those treated with capecitabine, bevacizumab, irinotecan, and oxaliplatin were analyzed. The expression levels of the selected biomarkers of interest (EPHB4, FIGF, CD133, DICER1, DPYD, TYMP, TYMS, TAP1, TOP1, CKDN1A, ERCC1, GSTP1, BRCA1, PTEN, ABCB1, TLE3, and TXNDC17) were reported as mRNA percentiles.
Results: For the breast cancer population, there were 32 patients in the capecitabine cohort, 15 in the everolimus cohort, and 12 in the paclitaxel cohort. For the colorectal cancer population, there were 29 patients in the bevacizumab cohort, 12 in the oxaliplatin cohort, 29 in the irinotecan cohort, and 6 in the capecitabine cohort. Of the biomarkers evaluated, the strongest associations were found between Bevacizumab-based therapy and DICER1 (P = 0.0445); and between capecitabine therapy and TYMP (P = 0.0553).
Conclusions: Among breast cancer patients, higher TYMP expression was associated with sensitivity to capecitabine. Among colorectal cancer patients, higher DICER1 expression was associated with sensitivity to bevacizumab-based therapy. This study supports further assessment of the potential predictive value of mRNA expression of these genomic biomarkers.
Read our News Story for this publication.
Inherited genetic variation has important implications for cancer screening, early diagnosis, and disease prognosis. A role for germline variation has also been described in shaping the molecular landscape, immune response, microenvironment, and treatment response of individual tumors. However, there is a lack of consensus on the handling and analysis of germline information that extends beyond known or suspected cancer susceptibility in large-scale cancer genomics initiatives. As part of the Personalized OncoGenomics program in British Columbia, we performed whole-genome and transcriptome sequencing in paired tumor and normal tissues from advanced cancer patients to characterize the molecular tumor landscape and identify putative targets for therapy. Overall, our experience supports a multidisciplinary and integrative approach to germline data management. This includes a need for broader definitions and standardized recommendations regarding primary and secondary germline findings in precision oncology. Here, we propose a framework for identifying, evaluating, and returning germline variants of potential clinical significance that may have indications for health management beyond cancer risk reduction or prevention in patients and their families.
Read our News Story for this publication.
Introduction Given the high level of uncertainty surrounding the outcomes of early phase clinical trials, whole genome and transcriptome analysis (WGTA) can be used to optimize patient selection and study assignment. In this retrospective analysis, we reviewed the impact of this approach on one such program. Methods Patients with advanced malignancies underwent fresh tumor biopsies as part of our personalized medicine program (NCT02155621). Tumour molecular data were reviewed for potentially clinically actionable findings and patients were referred to the developmental therapeutics program. Outcomes were reviewed in all patients, including those where trial selection was driven by molecular data (matched) and those where there was no clear molecular rationale (unmatched). Results From January 2014 to January 2018, 28 patients underwent WGTA and enrolled in clinical trials, including 2 patients enrolled in two trials. Fifteen patients were matched to a treatment based on a molecular target. Five patients were matched to a trial based upon single-gene DNA changes, all supported by RNA data. Ten cases were matched on the basis of genome-wide data (n = 4) or RNA gene expression only (n = 6). With a median follow-up of 6.7 months, the median time on treatment was 8.2 weeks. Discussion When compared to single-gene DNA-based data alone, WGTA led to a 3-fold increase in treatment matching. In a setting where there is a high level of uncertainty around both the investigational agents and the biomarkers, more data are needed to fully evaluate the impact of routine use of WGTA.
Advanced and metastatic tumors with complex treatment histories drive cancer mortality. Here we describe the POG570 cohort, a comprehensive whole-genome, transcriptome and clinical dataset, amenable for exploration of the impacts of therapies on genomic landscapes. Previous exposure to DNA-damaging chemotherapies and mutations affecting DNA repair genes, including POLQ and genes encoding Polζ, were associated with genome-wide, therapy-induced mutagenesis. Exposure to platinum therapies coincided with signatures SBS31 and DSB5 and, when combined with DNA synthesis inhibitors, signature SBS17b. Alterations in ESR1, EGFR, CTNNB1, FGFR1, VEGFA and DPYD were consistent with drug resistance and sensitivity. Recurrent noncoding events were found in regulatory region hotspots of genes including TERT, PLEKHS1, AP2A1 and ADGRG6. Mutation burden and immune signatures corresponded with overall survival and response to immunotherapy. Our data offer a rich resource for investigation of advanced cancers and interpretation of whole-genome and transcriptome sequencing in the context of a cancer clinic.
Read our News Story for this publication.
Read the accepted publication at Nature Cancer.
Download author manuscript and figures.
Download supplementary figures, supplementary tables and supplementary methods.
Further data available at https://www.bcgsc.ca/downloads/POG570/
POG-associated publications
Next generation RNA-sequencing (RNA-seq) is a flexible approach that can be applied to a range of applications including global quantification of transcript expression, the characterization of RNA structure such as splicing patterns and profiling of expressed mutations. Many RNA-seq protocols require up to microgram levels of total RNA input amounts to generate high quality data, and thus remain impractical for the limited starting material amounts typically obtained from rare cell populations, such as those from early developmental stages or from laser micro-dissected clinical samples. Here, we present an assessment of the contemporary ribosomal RNA depletion-based protocols, and identify those that are suitable for inputs as low as 1-10 ng of intact total RNA and 100-500 ng of partially degraded RNA from formalin-fixed paraffin-embedded tissues.
Tissues used in pathology laboratories are typically stored in the form of formalin-fixed, paraffin-embedded (FFPE) samples. One important consideration in repurposing FFPE material for next generation sequencing (NGS) analysis is the sequencing artifacts that can arise from the significant damage to nucleic acids due to treatment with formalin, storage at room temperature and extraction. One such class of artifacts consists of chimeric reads that appear to be derived from non-contiguous portions of the genome. Here, we show that a major proportion of such chimeric reads align to both the 'Watson' and 'Crick' strands of the reference genome. We refer to these as strand-split artifact reads (SSARs). This study provides a conceptual framework for the mechanistic basis of the genesis of SSARs and other chimeric artifacts along with supporting experimental evidence, which have led to approaches to reduce the levels of such artifacts. We demonstrate that one of these approaches, involving S1 nuclease-mediated removal of single-stranded fragments and overhangs, also reduces sequence bias, base error rates, and false positive detection of copy number and single nucleotide variants. Finally, we describe an analytical approach for quantifying SSARs from NGS data.
Curation and storage of formalin-fixed, paraffin-embedded (FFPE) samples are standard procedures in hospital pathology laboratories around the world. Many thousands of such samples exist and could be used for next generation sequencing analysis. Retrospective analyses of such samples are important for identifying molecular correlates of carcinogenesis, treatment history and disease outcomes. Two major hurdles in using FFPE material for sequencing are the damaged nature of the nucleic acids and the labor-intensive nature of nucleic acid purification. These limitations and a number of other issues that span multiple steps from nucleic acid purification to library construction are addressed here. We optimized and automated a 96-well magnetic bead-based extraction protocol that can be scaled to large cohorts and is compatible with automation. Using sets of 32 and 91 individual FFPE samples respectively, we generated libraries from 100 ng of total RNA and DNA starting amounts with 95-100% success rate. The use of the resulting RNA in micro-RNA sequencing was also demonstrated. In addition to offering the potential of scalability and rapid throughput, the yield obtained with lower input requirements makes these methods applicable to clinical samples where tissue abundance is limiting.
Pancreatic adenocarcinoma presents as a spectrum of a highly aggressive disease in patients. The basis of this disease heterogeneity has proved difficult to resolve due to poor tumor cellularity and extensive genomic instability. To address this, a dataset of whole genomes and transcriptomes was generated from purified epithelium of primary and metastatic tumors. Transcriptome analysis demonstrated that molecular subtypes are a product of a gene expression continuum driven by a mixture of intratumoral subpopulations, which was confirmed by single-cell analysis. Integrated whole-genome analysis uncovered that molecular subtypes are linked to specific copy number aberrations in genes such as mutant KRAS and GATA6. By mapping tumor genetic histories, tetraploidization emerged as a key mutational process behind these events. Taken together, these data support the premise that the constellation of genomic aberrations in the tumor gives rise to the molecular subtype, and that disease heterogeneity is due to ongoing genomic instability during progression.