Toward best practice in cancer mutation detection with whole-genome and whole-exome sequencing

W. Xiao, L. Ren, Z. Chen, LT. Fang, Y. Zhao, J. Lack, M. Guan, B. Zhu, E. Jaeger, L. Kerrigan, TM. Blomquist, T. Hung, M. Sultan, K. Idler, C. Lu, A. Scherer, R. Kusko, M. Moos, C. Xiao, ST. Sherry, OD. Abaan, W. Chen, X. Chen, J. Nordlund, U....

Nature biotechnology. 2021 ; 39 (9) : 1141-1150. [pub] 20210909

Language English Country United States

Document type Journal Article, Research Support, N.I.H., Extramural, Research Support, Non-U.S. Gov't

Grant support
S10OD019960 U.S. Department of Health & Human Services | National Institutes of Health (NIH)
HHSN261201400008C NCI NIH HHS - United States
HHSN261201500003I NCI NIH HHS - United States
75N910D00024 NIH HHS - United States

Clinical applications of precision oncology require accurate tests that can distinguish true cancer-specific mutations from errors introduced at each step of next-generation sequencing (NGS). To date, no bulk sequencing study has addressed the effects of cross-site reproducibility, nor the biological, technical and computational factors that influence variant identification. Here we report a systematic interrogation of somatic mutations in paired tumor-normal cell lines to identify factors affecting detection reproducibility and accuracy at six different centers. Using whole-genome sequencing (WGS) and whole-exome sequencing (WES), we evaluated the reproducibility of different sample types with varying input amount and tumor purity, and multiple library construction protocols, followed by processing with nine bioinformatics pipelines. We found that read coverage and callers affected both WGS and WES reproducibility, but WES performance was influenced by insert fragment size, genomic copy content and the global imbalance value (GIV; G > T/C > A). Finally, taking into account library preparation protocol, tumor content, read coverage and bioinformatics processes concomitantly, we recommend actionable practices to improve the reproducibility and accuracy of NGS experiments for cancer mutation detection.

Advanced Biomedical and Computational Sciences Biomedical Informatics and Data Science Directorate Frederick National Laboratory for Cancer Research Frederick MD USA

AstraZeneca Gaithersburg MD USA

ATCC Manassas VA USA

Bioinformatics and Computational Biology Core National Heart Lung and Blood Institute National Institutes of Health Bethesda MD USA

Bioinformatics Research and Early Development Roche Sequencing Solutions Inc Belmont CA USA

Biomarker Development Novartis Institutes for Biomedical Research Basel Switzerland

CCR Collaborative Bioinformatics Resource Office of Science and Technology Resources Center for Cancer Research Bethesda MD USA

Center for Genomics Loma Linda University School of Medicine Loma Linda CA USA

Center for Information Technology National Institutes of Health Bethesda MD USA

Centre for Molecular Medicine and Innovative Therapeutics Murdoch University Murdoch Perth Western Australia Australia

Centro di Riferimento Oncologico di Aviano IRCCS National Cancer Institute Unit of Oncogenetics and Functional Oncogenomics Aviano Italy

Computational Genomics and Bioinformatics Branch Center for Biomedical Informatics and Information Technology National Cancer Institute Rockville MD USA

Computational Genomics Genomics Research Center AbbVie North Chicago IL USA

Department of Biological Sciences Virginia Polytechnic Institute and State University Blacksburg VA USA

Department of Medical Sciences Molecular Medicine and Science for Life Laboratory Uppsala University Uppsala Sweden

Department of Physiology and Biophysics Weill Cornell Medicine New York NY USA

Departments of Medicine and Pathology University of Toledo Medical Center Toledo OH USA

Digicon McLean VA USA

Division of Cancer Epidemiology and Genetics National Cancer Institute National Institutes of Health Rockville MD USA

Estonian Genome Centre Institute of Genomics University of Tartu Tartu Estonia

European Infrastructure for Translational Medicine Amsterdam the Netherlands

Garvan Institute of Medical Research The Kinghorn Cancer Centre Darlinghurst New South Wales Australia

Genentech South San Francisco CA USA

Illumina Inc Foster City CA USA

Immuneering Corporation Cambridge MA USA

IMTM Faculty of Medicine and Dentistry Palacky University Olomouc Olomouc Czech Republic

Institute for Molecular Medicine Finland University of Helsinki Helsinki Finland

Integrative Bioinformatics National Institute of Environmental Health Sciences Durham NC USA

Lymphoid Malignancies Branch Center for Cancer Research National Cancer Institute National Institutes of Health Bethesda MD USA

National Center for Biotechnology Information National Library of Medicine National Institutes of Health Bethesda MD USA

National Center for Toxicological Research US Food and Drug Administration Jefferson AR USA

National Institute of Metrology Beijing China

Office of the Chief Scientist Office of the Commissioner US Food and Drug Administration Silver Spring MD USA

Perron Institute for Neurological and Translational Science Nedlands Perth Western Australia Australia

Q2 Solutions EA Genomics Morrisville NC USA

SAS Institute Inc Cary NC USA

Sentieon Inc Mountain View CA USA

Sequencing Facility Cancer Research Technology Program Frederick National Laboratory for Cancer Research Frederick MD USA

Seven Bridges Genomics Inc Cambridge MA USA

State Key Laboratory of Genetic Engineering Human Phenome Institute School of Life Sciences and Shanghai Cancer Center Fudan University Shanghai China

The Center for Biologics Evaluation and Research US Food and Drug Administration Silver Spring MD USA

The Center for Devices and Radiological Health US Food and Drug Administration Silver Spring MD USA

The Center for Drug Evaluation and Research US Food and Drug Administration Silver Spring MD USA

000      
00000naa a2200000 a 4500
001      
bmc21025051
003      
CZ-PrNML
005      
20211026134153.0
007      
ta
008      
211013s2021 xxu f 000 0|eng||
009      
AR
024    7_
$a 10.1038/s41587-021-00994-5 $2 doi
035    __
$a (PubMed)34504346
040    __
$a ABA008 $b cze $d ABA008 $e AACR2
041    0_
$a eng
044    __
$a xxu
100    1_
$a Xiao, Wenming $u The Center for Devices and Radiological Health, US Food and Drug Administration, Silver Spring, MD, USA. wenming.xiao@fda.hhs.gov
245    10
$a Toward best practice in cancer mutation detection with whole-genome and whole-exome sequencing / $c W. Xiao, L. Ren, Z. Chen, LT. Fang, Y. Zhao, J. Lack, M. Guan, B. Zhu, E. Jaeger, L. Kerrigan, TM. Blomquist, T. Hung, M. Sultan, K. Idler, C. Lu, A. Scherer, R. Kusko, M. Moos, C. Xiao, ST. Sherry, OD. Abaan, W. Chen, X. Chen, J. Nordlund, U. Liljedahl, R. Maestro, M. Polano, J. Drabek, P. Vojta, S. Kõks, E. Reimann, BS. Madala, T. Mercer, C. Miller, H. Jacob, T. Truong, A. Moshrefi, A. Natarajan, A. Granat, GP. Schroth, R. Kalamegham, E. Peters, V. Petitjean, A. Walton, TW. Shen, K. Talsania, CJ. Vera, K. Langenbach, M. de Mars, JA. Hipp, JC. Willey, J. Wang, J. Shetty, Y. Kriga, A. Raziuddin, B. Tran, Y. Zheng, Y. Yu, M. Cam, P. Jailwala, C. Nguyen, D. Meerzaman, Q. Chen, C. Yan, B. Ernest, U. Mehra, RV. Jensen, W. Jones, JL. Li, BN. Papas, M. Pirooznia, YC. Chen, F. Seifuddin, Z. Li, X. Liu, W. Resch, J. Wang, L. Wu, G. Yavas, C. Miles, B. Ning, W. Tong, CE. Mason, E. Donaldson, S. Lababidi, LM. Staudt, Z. Tezak, H. Hong, C. Wang, L. Shi
520    9_
$a Clinical applications of precision oncology require accurate tests that can distinguish true cancer-specific mutations from errors introduced at each step of next-generation sequencing (NGS). To date, no bulk sequencing study has addressed the effects of cross-site reproducibility, nor the biological, technical and computational factors that influence variant identification. Here we report a systematic interrogation of somatic mutations in paired tumor-normal cell lines to identify factors affecting detection reproducibility and accuracy at six different centers. Using whole-genome sequencing (WGS) and whole-exome sequencing (WES), we evaluated the reproducibility of different sample types with varying input amount and tumor purity, and multiple library construction protocols, followed by processing with nine bioinformatics pipelines. We found that read coverage and callers affected both WGS and WES reproducibility, but WES performance was influenced by insert fragment size, genomic copy content and the global imbalance value (GIV; G > T/C > A). Finally, taking into account library preparation protocol, tumor content, read coverage and bioinformatics processes concomitantly, we recommend actionable practices to improve the reproducibility and accuracy of NGS experiments for cancer mutation detection.
650    12
$a benchmarking $7 D019985
650    _2
$a cell line $7 D002460
650    _2
$a cell line, tumor $7 D045744
650    _2
$a high-throughput nucleotide sequencing $x methods $7 D059014
650    _2
$a humans $7 D006801
650    _2
$a mutation $7 D009154
650    _2
$a neoplasms $x genetics $x pathology $7 D009369
650    _2
$a reproducibility of results $7 D015203
650    _2
$a sequence analysis, DNA $x standards $7 D017422
650    _2
$a exome sequencing $x standards $7 D000073359
650    _2
$a whole genome sequencing $x standards $7 D000073336
655    _2
$a Journal Article $7 D016428
655    _2
$a Research Support, N.I.H., Extramural $7 D052061
655    _2
$a Research Support, Non-U.S. Gov't $7 D013485
700    1_
$a Ren, Luyao $u State Key Laboratory of Genetic Engineering, Human Phenome Institute, School of Life Sciences and Shanghai Cancer Center, Fudan University, Shanghai, China
700    1_
$a Chen, Zhong $u Center for Genomics, Loma Linda University School of Medicine, Loma Linda, CA, USA
700    1_
$a Fang, Li Tai $u Bioinformatics Research & Early Development, Roche Sequencing Solutions Inc., Belmont, CA, USA
700    1_
$a Zhao, Yongmei $u Advanced Biomedical and Computational Sciences, Biomedical Informatics and Data Science Directorate, Frederick National Laboratory for Cancer Research, Frederick, MD, USA
700    1_
$a Lack, Justin $u Advanced Biomedical and Computational Sciences, Biomedical Informatics and Data Science Directorate, Frederick National Laboratory for Cancer Research, Frederick, MD, USA
700    1_
$a Guan, Meijian $u SAS Institute Inc., Cary, NC, USA
700    1_
$a Zhu, Bin $u Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Rockville, MD, USA
700    1_
$a Jaeger, Erich $u Illumina Inc., Foster City, CA, USA
700    1_
$a Kerrigan, Liz $u ATCC, Manassas, VA, USA
700    1_
$a Blomquist, Thomas M $u Departments of Medicine and Pathology, University of Toledo Medical Center, Toledo, OH, USA
700    1_
$a Hung, Tiffany $u Genentech, South San Francisco, CA, USA
700    1_
$a Sultan, Marc $u Biomarker Development, Novartis Institutes for Biomedical Research, Basel, Switzerland
700    1_
$a Idler, Kenneth $u Computational Genomics, Genomics Research Center, AbbVie, North Chicago, IL, USA
700    1_
$a Lu, Charles $u Computational Genomics, Genomics Research Center, AbbVie, North Chicago, IL, USA
700    1_
$a Scherer, Andreas $u Institute for Molecular Medicine Finland, University of Helsinki, Helsinki, Finland $u European Infrastructure for Translational Medicine, Amsterdam, the Netherlands
700    1_
$a Kusko, Rebecca $u Immuneering Corporation, Cambridge, MA, USA
700    1_
$a Moos, Malcolm $u The Center for Biologics Evaluation and Research, US Food and Drug Administration, Silver Spring, MD, USA
700    1_
$a Xiao, Chunlin $u National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
700    1_
$a Sherry, Stephen T $u National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
700    1_
$a Abaan, Ogan D $u Illumina Inc., Foster City, CA, USA $u Seven Bridges Genomics Inc., Cambridge, MA, USA
700    1_
$a Chen, Wanqiu $u Center for Genomics, Loma Linda University School of Medicine, Loma Linda, CA, USA
700    1_
$a Chen, Xin $u Center for Genomics, Loma Linda University School of Medicine, Loma Linda, CA, USA
700    1_
$a Nordlund, Jessica $u European Infrastructure for Translational Medicine, Amsterdam, the Netherlands $u Department of Medical Sciences, Molecular Medicine and Science for Life Laboratory, Uppsala University, Uppsala, Sweden
700    1_
$a Liljedahl, Ulrika $u European Infrastructure for Translational Medicine, Amsterdam, the Netherlands $u Department of Medical Sciences, Molecular Medicine and Science for Life Laboratory, Uppsala University, Uppsala, Sweden
700    1_
$a Maestro, Roberta $u European Infrastructure for Translational Medicine, Amsterdam, the Netherlands $u Centro di Riferimento Oncologico di Aviano IRCCS, National Cancer Institute, Unit of Oncogenetics and Functional Oncogenomics, Aviano, Italy
700    1_
$a Polano, Maurizio $u European Infrastructure for Translational Medicine, Amsterdam, the Netherlands $u Centro di Riferimento Oncologico di Aviano IRCCS, National Cancer Institute, Unit of Oncogenetics and Functional Oncogenomics, Aviano, Italy
700    1_
$a Drabek, Jiri $u European Infrastructure for Translational Medicine, Amsterdam, the Netherlands $u IMTM, Faculty of Medicine and Dentistry, Palacky University Olomouc, Olomouc, Czech Republic
700    1_
$a Vojta, Petr $u European Infrastructure for Translational Medicine, Amsterdam, the Netherlands $u IMTM, Faculty of Medicine and Dentistry, Palacky University Olomouc, Olomouc, Czech Republic
700    1_
$a Kõks, Sulev $u European Infrastructure for Translational Medicine, Amsterdam, the Netherlands $u Perron Institute for Neurological and Translational Science, Nedlands, Perth, Western Australia, Australia $u Centre for Molecular Medicine and Innovative Therapeutics, Murdoch University, Murdoch, Perth, Western Australia, Australia
700    1_
$a Reimann, Ene $u European Infrastructure for Translational Medicine, Amsterdam, the Netherlands $u Estonian Genome Centre, Institute of Genomics, University of Tartu, Tartu, Estonia
700    1_
$a Madala, Bindu Swapna $u Garvan Institute of Medical Research, The Kinghorn Cancer Centre, Darlinghurst, New South Wales, Australia
700    1_
$a Mercer, Timothy $u Garvan Institute of Medical Research, The Kinghorn Cancer Centre, Darlinghurst, New South Wales, Australia
700    1_
$a Miller, Chris $u Computational Genomics, Genomics Research Center, AbbVie, North Chicago, IL, USA
700    1_
$a Jacob, Howard $u Computational Genomics, Genomics Research Center, AbbVie, North Chicago, IL, USA
700    1_
$a Truong, Tiffany $u Illumina Inc., Foster City, CA, USA
700    1_
$a Moshrefi, Ali $u Illumina Inc., Foster City, CA, USA
700    1_
$a Natarajan, Aparna $u Illumina Inc., Foster City, CA, USA
700    1_
$a Granat, Ana $u Illumina Inc., Foster City, CA, USA
700    1_
$a Schroth, Gary P $u Illumina Inc., Foster City, CA, USA
700    1_
$a Kalamegham, Rasika $u Genentech, South San Francisco, CA, USA
700    1_
$a Peters, Eric $u Genentech, South San Francisco, CA, USA
700    1_
$a Petitjean, Virginie $u Biomarker Development, Novartis Institutes for Biomedical Research, Basel, Switzerland
700    1_
$a Walton, Ashley $u Advanced Biomedical and Computational Sciences, Biomedical Informatics and Data Science Directorate, Frederick National Laboratory for Cancer Research, Frederick, MD, USA
700    1_
$a Shen, Tsai-Wei $u Advanced Biomedical and Computational Sciences, Biomedical Informatics and Data Science Directorate, Frederick National Laboratory for Cancer Research, Frederick, MD, USA
700    1_
$a Talsania, Keyur $u Advanced Biomedical and Computational Sciences, Biomedical Informatics and Data Science Directorate, Frederick National Laboratory for Cancer Research, Frederick, MD, USA
700    1_
$a Vera, Cristobal Juan $u Advanced Biomedical and Computational Sciences, Biomedical Informatics and Data Science Directorate, Frederick National Laboratory for Cancer Research, Frederick, MD, USA
700    1_
$a Langenbach, Kurt $u ATCC, Manassas, VA, USA
700    1_
$a de Mars, Maryellen $u ATCC, Manassas, VA, USA
700    1_
$a Hipp, Jennifer A $u Departments of Medicine and Pathology, University of Toledo Medical Center, Toledo, OH, USA
700    1_
$a Willey, James C $u Departments of Medicine and Pathology, University of Toledo Medical Center, Toledo, OH, USA
700    1_
$a Wang, Jing $u National Institute of Metrology, Beijing, China
700    1_
$a Shetty, Jyoti $u Sequencing Facility, Cancer Research Technology Program, Frederick National Laboratory for Cancer Research, Frederick, MD, USA
700    1_
$a Kriga, Yuliya $u Sequencing Facility, Cancer Research Technology Program, Frederick National Laboratory for Cancer Research, Frederick, MD, USA
700    1_
$a Raziuddin, Arati $u Sequencing Facility, Cancer Research Technology Program, Frederick National Laboratory for Cancer Research, Frederick, MD, USA
700    1_
$a Tran, Bao $u Sequencing Facility, Cancer Research Technology Program, Frederick National Laboratory for Cancer Research, Frederick, MD, USA
700    1_
$a Zheng, Yuanting $u State Key Laboratory of Genetic Engineering, Human Phenome Institute, School of Life Sciences and Shanghai Cancer Center, Fudan University, Shanghai, China
700    1_
$a Yu, Ying $u State Key Laboratory of Genetic Engineering, Human Phenome Institute, School of Life Sciences and Shanghai Cancer Center, Fudan University, Shanghai, China
700    1_
$a Cam, Margaret $u CCR Collaborative Bioinformatics Resource, Office of Science and Technology Resources, Center for Cancer Research, Bethesda, MD, USA
700    1_
$a Jailwala, Parthav $u CCR Collaborative Bioinformatics Resource, Office of Science and Technology Resources, Center for Cancer Research, Bethesda, MD, USA
700    1_
$a Nguyen, Cu $u Computational Genomics and Bioinformatics Branch, Center for Biomedical Informatics and Information Technology, National Cancer Institute, Rockville, MD, USA
700    1_
$a Meerzaman, Daoud $u Computational Genomics and Bioinformatics Branch, Center for Biomedical Informatics and Information Technology, National Cancer Institute, Rockville, MD, USA
700    1_
$a Chen, Qingrong $u Computational Genomics and Bioinformatics Branch, Center for Biomedical Informatics and Information Technology, National Cancer Institute, Rockville, MD, USA
700    1_
$a Yan, Chunhua $u Computational Genomics and Bioinformatics Branch, Center for Biomedical Informatics and Information Technology, National Cancer Institute, Rockville, MD, USA
700    1_
$a Ernest, Ben $u Digicon, McLean, VA, USA
700    1_
$a Mehra, Urvashi $u Digicon, McLean, VA, USA
700    1_
$a Jensen, Roderick V $u Department of Biological Sciences, Virginia Polytechnic Institute and State University, Blacksburg, VA, USA
700    1_
$a Jones, Wendell $u Q2 Solutions-EA Genomics, Morrisville, NC, USA
700    1_
$a Li, Jian-Liang $u Integrative Bioinformatics, National Institute of Environmental Health Sciences, Durham, NC, USA
700    1_
$a Papas, Brian N $u Integrative Bioinformatics, National Institute of Environmental Health Sciences, Durham, NC, USA
700    1_
$a Pirooznia, Mehdi $u Bioinformatics and Computational Biology Core, National Heart Lung and Blood Institute, National Institutes of Health, Bethesda, MD, USA
700    1_
$a Chen, Yun-Ching $u Bioinformatics and Computational Biology Core, National Heart Lung and Blood Institute, National Institutes of Health, Bethesda, MD, USA
700    1_
$a Seifuddin, Fayaz $u Bioinformatics and Computational Biology Core, National Heart Lung and Blood Institute, National Institutes of Health, Bethesda, MD, USA
700    1_
$a Li, Zhipan $u Sentieon Inc., Mountain View, CA, USA
700    1_
$a Liu, Xuelu $u Center for Information Technology, National Institutes of Health, Bethesda, MD, USA
700    1_
$a Resch, Wolfgang $u Center for Information Technology, National Institutes of Health, Bethesda, MD, USA
700    1_
$a Wang, Jingya $u AstraZeneca, Gaithersburg, MD, USA
700    1_
$a Wu, Leihong $u National Center for Toxicological Research, US Food and Drug Administration, Jefferson, AR, USA
700    1_
$a Yavas, Gokhan $u National Center for Toxicological Research, US Food and Drug Administration, Jefferson, AR, USA
700    1_
$a Miles, Corey $u National Center for Toxicological Research, US Food and Drug Administration, Jefferson, AR, USA
700    1_
$a Ning, Baitang $u National Center for Toxicological Research, US Food and Drug Administration, Jefferson, AR, USA
700    1_
$a Tong, Weida $u National Center for Toxicological Research, US Food and Drug Administration, Jefferson, AR, USA
700    1_
$a Mason, Christopher E $u Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, USA
700    1_
$a Donaldson, Eric $u The Center for Drug Evaluation and Research, US Food and Drug Administration, Silver Spring, MD, USA
700    1_
$a Lababidi, Samir $u Office of the Chief Scientist, Office of the Commissioner, US Food and Drug Administration, Silver Spring, MD, USA
700    1_
$a Staudt, Louis M $u Lymphoid Malignancies Branch, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
700    1_
$a Tezak, Zivana $u The Center for Devices and Radiological Health, US Food and Drug Administration, Silver Spring, MD, USA
700    1_
$a Hong, Huixiao $u National Center for Toxicological Research, US Food and Drug Administration, Jefferson, AR, USA
700    1_
$a Wang, Charles $u Center for Genomics, Loma Linda University School of Medicine, Loma Linda, CA, USA. oxwang@gmail.com
700    1_
$a Shi, Leming $u State Key Laboratory of Genetic Engineering, Human Phenome Institute, School of Life Sciences and Shanghai Cancer Center, Fudan University, Shanghai, China. lemingshi@fudan.edu.cn
773    0_
$w MED00003457 $t Nature biotechnology $x 1546-1696 $g Vol. 39, No. 9 (2021), pp. 1141-1150
856    41
$u https://pubmed.ncbi.nlm.nih.gov/34504346 $y Pubmed
910    __
$a ABA008 $b sig $c sign $y p $z 0
990    __
$a 20211013 $b ABA008
991    __
$a 20211026134159 $b ABA008
999    __
$a ok $b bmc $g 1714210 $s 1145558
BAS    __
$a 3
BAS    __
$a PreBMC
BMC    __
$a 2021 $b 39 $c 9 $d 1141-1150 $e 20210909 $i 1546-1696 $m Nature biotechnology $n Nat Biotechnol $x MED00003457
GRA    __
$a S10OD019960 $p U.S. Department of Health & Human Services | National Institutes of Health (NIH)
GRA    __
$a HHSN261201400008C $p NCI NIH HHS $2 United States
GRA    __
$a HHSN261201500003I $p NCI NIH HHS $2 United States
GRA    __
$a 75N910D00024 $p NIH HHS $2 United States
LZP    __
$a Pubmed-20211013
