American Association for Cancer Research
Browse
00085472can141020-sup-129482_2_supp_2611607_nyyq1x.txt (21.22 kB)

Supplementary Script A from Discrepancies in Cancer Genomic Sequencing Highlight Opportunities for Driver Mutation Discovery

Download (21.22 kB)
dataset
posted on 2023-03-30, 22:47 authored by Andrew M. Hudson, Tim Yates, Yaoyong Li, Eleanor W. Trotter, Shameem Fawdar, Phil Chapman, Paul Lorigan, Andrew Biankin, Crispin J. Miller, John Brognard

Groovy program script used to make the 18 cell line comparison

History

ARTICLE ABSTRACT

Cancer genome sequencing is being used at an increasing rate to identify actionable driver mutations that can inform therapeutic intervention strategies. A comparison of two of the most prominent cancer genome sequencing databases from different institutes (Cancer Cell Line Encyclopedia and Catalogue of Somatic Mutations in Cancer) revealed marked discrepancies in the detection of missense mutations in identical cell lines (57.38% conformity). The main reason for this discrepancy is inadequate sequencing of GC-rich areas of the exome. We have therefore mapped over 400 regions of consistent inadequate sequencing (cold-spots) in known cancer-causing genes and kinases, in 368 of which neither institute finds mutations. We demonstrate, using a newly identified PAK4 mutation as proof of principle, that specific targeting and sequencing of these GC-rich cold-spot regions can lead to the identification of novel driver mutations in known tumor suppressors and oncogenes. We highlight that cross-referencing between genomic databases is required to comprehensively assess genomic alterations in commonly used cell lines and that there are still significant opportunities to identify novel drivers of tumorigenesis in poorly sequenced areas of the exome. Finally, we assess other reasons for the observed discrepancy, such as variations in dbSNP filtering and the acquisition/loss of mutations, to give explanations as to why there is a discrepancy in pharmacogenomic studies, given recent concerns with poor reproducibility of data. Cancer Res; 74(22); 6390–6. ©2014 AACR.

Usage metrics

    Cancer Research

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC