Patterns of somatic structural variation in human cancer genomes

Yilong Li; Nicola D Roberts; Jeremiah A Wala; Ofer Shapira; Steven E Schumacher; Kiran Kumar; Ekta Khurana; Sebastian Waszak; Jan O Korbel; James E Haber; Marcin Imielinski; PCAWG Structural Variation Working Group; Joachim Weischenfeldt; Rameen Beroukhim; Peter J Campbell; PCAWG Consortium

doi:10.1038/s41586-019-1913-9

2020 Feb 5;578(7793):112–121. doi: 10.1038/s41586-019-1913-9

Yilong Li ¹,²,#, Nicola D Roberts ¹,#, Jeremiah A Wala ³,⁴,⁵,#, Ofer Shapira ³,⁴,⁵,#, Steven E Schumacher ³,⁴,⁵, Kiran Kumar ³,⁴,⁵, Ekta Khurana ⁶, Sebastian Waszak ⁷, Jan O Korbel ⁷, James E Haber ⁸, Marcin Imielinski ⁹; PCAWG Structural Variation Working Group, Joachim Weischenfeldt ¹¹,✉, Rameen Beroukhim ³,⁴,⁵,✉, Peter J Campbell ¹,¹²,✉; PCAWG Consortium

¹Cancer Genome Project, Wellcome Trust Sanger Institute, Hinxton, UK

²Totient Inc, Cambridge, MA USA

³The Broad Institute of Harvard and MIT, Cambridge, MA USA

⁴Bioinformatics and Integrative Genomics, Harvard University, Cambridge, MA USA

⁵Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA USA

⁶Weill Cornell Medical College, New York, NY USA

⁷European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany

⁸Department of Molecular Biology, Rosenstiel Basic Medical Sciences Research Center, Brandeis University, Waltham, MA USA

⁹New York Genome Center, New York, NY USA

¹¹Biotech Research & Innovation Centre (BRIC), The Finsen Laboratory, Rigshospitalet, University of Copenhagen, Copenhagen, Denmark

¹²Department of Haematology, University of Cambridge, Cambridge, UK

¹⁴University of Texas MD Anderson Cancer Center, Houston, TX USA

¹⁵Department of Zoology, Genetics and Physical Anthropology, University of Santiago de Compostela, Santiago de Compostela, Spain

¹⁶Centre for Research in Molecular Medicine and Chronic Diseases (CIMUS), University of Santiago de Compostela, Santiago de Compostela, Spain

¹⁷The Biomedical Research Centre (CINBIO), University of Vigo, Vigo, Spain

¹⁸Transmissible Cancer Group, Department of Veterinary Medicine, University of Cambridge, Cambridge, UK

¹⁹Computational Biology Program, Ontario Institute for Cancer Research, Toronto, Ontario Canada

²⁰Department of Medical Biophysics, University of Toronto, Toronto, Ontario Canada

²¹Department of Pharmacology, University of Toronto, Toronto, Ontario Canada

²²University of California Los Angeles, Los Angeles, CA USA

²³Peter MacCallum Cancer Centre, Melbourne, Victoria Australia

²⁴Sir Peter MacCallum Department of Oncology, University of Melbourne, Melbourne, Victoria Australia

²⁵National Center for Tumor Diseases (NCT) Heidelberg, Heidelberg, Germany

²⁶Division of Applied Bioinformatics, German Cancer Research Center (DKFZ), Heidelberg, Germany

²⁷German Cancer Genome Consortium (DKTK), Heidelberg, Germany

²⁸Johns Hopkins School of Medicine, Baltimore, MD USA

²⁹Faculty of Medicine, Department of Biochemistry, Microbiology and Immunology, University of Ottawa, Ottawa, Ontario Canada

³⁰Centre for Molecular Science Informatics, Department of Chemistry, University of Cambridge, Cambridge, UK

³¹Department of Biomedical Informatics, Harvard Medical School, Boston, MA USA

³²Ludwig Center, Harvard Medical School, Boston, MA USA

³³Barcelona Supercomputing Center (BSC), Barcelona, Spain

³⁴Cancer Research UK Cambridge Institute, University of Cambridge, Cambridge, UK

³⁵University of Cambridge, Cambridge, UK

³⁶Sidra Medicine, Doha, Qatar

³⁷Barcelona Supercomputing Center (BSC), Barcelona, Spain

³⁸Queensland Centre for Medical Genomics, Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland Australia

³⁹The Azrieli Faculty of Medicine, Bar-Ilan University, Safed, Israel

⁴⁰Department of Computer Science, Princeton University, Princeton, NJ USA

⁴¹Department of Computer Science, Yale University, New Haven, CT USA

⁴²Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT USA

⁴³Genome Integrity and Structural Biology Laboratory, National Institute of Environmental Health Sciences (NIEHS), Durham, NC USA

⁴⁴Biomolecular Engineering Department, University of California, Santa Cruz, Santa Cruz, CA USA

⁴⁵Massachusetts General Hospital Center for Cancer Research, Charlestown, MA USA

⁴⁶Heidelberg Center for Personalized Oncology (DKFZ-HIPO), German Cancer Research Center (DKFZ), Heidelberg, Germany

⁴⁷Hopp Children’s Cancer Center (KiTZ), Heidelberg, Germany

⁴⁸Pediatric Glioma Research Group, German Cancer Research Center (DKFZ), Heidelberg, Germany

⁴⁹Korea Advanced Institute of Science and Technology, Daejeon, South Korea

⁵⁰Skolkovo Institute of Science and Technology, Moscow, Russia

⁵¹A. A. Kharkevich Institute of Information Transmission Problems, Moscow, Russia

⁵²Dmitry Rogachev National Research Center of Pediatric Hematology, Oncology and Immunology, Moscow, Russia

⁵³Integrative Bioinformatics Support Group, National Institute of Environmental Health Sciences (NIEHS), Durham, NC USA

⁵⁴Center For Medical Innovation, Seoul National University Hospital, Seoul, South Korea

⁵⁵Department of Internal Medicine, Seoul National University Hospital, Seoul, South Korea

⁵⁶Division of Genetics and Genomics, Harvard Medical School, Boston, MA USA

⁵⁷Boston Children’s Hospital, Boston, MA USA

⁵⁸School of Medicine/School of Mathematics and Statistics, University of St Andrews, St Andrews, UK

⁵⁹Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY USA

⁶⁰Institute for Computational Biomedicine, Weill Cornell Medicine, New York, NY USA

⁶¹Englander Institute for Precision Medicine, Weill Cornell Medicine, New York, NY USA

⁶²Dana-Farber Cancer Institute, Boston, MA USA

⁶³The Institute of Medical Science, The University of Tokyo, Tokyo, Japan

⁶⁴RIKEN Center for Integrative Medical Sciences, Yokohama, Japan

⁶⁵Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT USA

⁶⁶Universitat Pompeu Fabra (UPF), Barcelona, Spain

⁶⁷Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain

⁶⁸Institute of Medical Genetics and Applied Genomics, University of Tübingen, Tübingen, Germany

⁶⁹Department of Genetics and Computational Biology, QIMR Berghofer Medical Research Institute, Brisbane, Queensland Australia

⁷⁰Institute for Molecular Bioscience, University of Queensland, Brisbane, Queensland Australia

⁷¹German Cancer Research Center (DKFZ), Heidelberg, Germany

⁷²School of Molecular Biosciences and Center for Reproductive Biology, Washington State University, Pullman, WA USA

⁷³Cancer Research Institute, Beth Israel Deaconess Medical Center, Boston, MA USA

⁷⁴Faculty of Biosciences, Heidelberg University, Heidelberg, Germany

⁷⁵Institució Catalana de Recerca i Estudis Avançats (ICREA), Barcelona, Spain

⁷⁶Ben May Department for Cancer Research, Department of Human Genetics, The University of Chicago, Chicago, IL USA

⁷⁷Tri-institutional PhD Program of Computational Biology and Medicine, Weill Cornell Medicine, New York, NY USA

²⁰⁰Applied Tumor Genomics Research Program, Research Programs Unit, University of Helsinki, Helsinki, Finland

²⁰¹Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, UK

²⁰²Memorial Sloan Kettering Cancer Center, New York, NY USA

²⁰³Genome Science Division, Research Center for Advanced Science and Technology, University of Tokyo, Tokyo, Japan

²⁰⁴Department of Surgery, University of Chicago, Chicago, IL USA

²⁰⁵Department of Surgery, Division of Hepatobiliary and Pancreatic Surgery, School of Medicine, Keimyung University Dongsan Medical Center, Daegu, South Korea

²⁰⁶Department of Oncology, Gil Medical Center, Gachon University, Incheon, South Korea

²⁰⁷Hiroshima University, Hiroshima, Japan

²⁰⁸Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, TX USA

²⁰⁹University of Texas MD Anderson Cancer Center, Houston, TX USA

²¹⁰King Faisal Specialist Hospital and Research Centre, Al Maather, Riyadh, Saudi Arabia

²¹¹Bioinformatics Unit, Spanish National Cancer Research Centre (CNIO), Madrid, Spain

²¹²Bioinformatics Core Facility, University Medical Center Hamburg, Hamburg, Germany

²¹³Heinrich Pette Institute, Leibniz Institute for Experimental Virology, Hamburg, Germany

²¹⁴Ontario Tumour Bank, Ontario Institute for Cancer Research, Toronto, ON Canada

²¹⁵Department of Pathology, The University of Texas MD Anderson Cancer Center, Houston, TX USA

²¹⁶Laboratory of Pathology, Center for Cancer Research, National Cancer Institute, Bethesda, MD USA

²¹⁷Department of Cellular and Molecular Medicine and Department of Bioengineering, University of California San Diego, La Jolla, CA USA

²¹⁸UC San Diego Moores Cancer Center, San Diego, CA USA

²¹⁹Canada’s Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, BC Canada

²²⁰Sir Peter MacCallum Department of Oncology, Peter MacCallum Cancer Centre, University of Melbourne, Melbourne, VIC Australia

²²¹Centre for Research in Molecular Medicine and Chronic Diseases (CiMUS), Universidade de Santiago de Compostela, Santiago de Compostela, Spain

²²²Department of Zoology, Genetics and Physical Anthropology, (CiMUS), Universidade de Santiago de Compostela, Santiago de Compostela, Spain

²²³The Biomedical Research Centre (CINBIO), Universidade de Vigo, Vigo, Spain

²²⁴Royal National Orthopaedic Hospital - Bolsover, London, UK

²²⁵Department of Genomic Medicine, The University of Texas MD Anderson Cancer Center, Houston, TX USA

²²⁶Quantitative and Computational Biosciences Graduate Program, Baylor College of Medicine, Houston, TX USA

²²⁷The Jackson Laboratory for Genomic Medicine, Farmington, CT USA

²²⁸Genome Informatics Program, Ontario Institute for Cancer Research, Toronto, ON Canada

²²⁹Institute of Human Genetics, Christian-Albrechts-University, Kiel, Germany

²³⁰Institute of Human Genetics, Ulm University and Ulm University Medical Center, Ulm, Germany

²³¹Queensland Centre for Medical Genomics, Institute for Molecular Bioscience, University of Queensland, St. Lucia, Brisbane, QLD Australia

²³²Salford Royal NHS Foundation Trust, Salford, UK

²³³Department of Surgery, Pancreas Institute, University and Hospital Trust of Verona, Verona, Italy

²³⁴Molecular and Medical Genetics, OHSU Knight Cancer Institute, Oregon Health and Science University, Portland, OR USA

²³⁵Department of Molecular Oncology, BC Cancer Research Centre, Vancouver, BC Canada

²³⁶The McDonnell Genome Institute at Washington University, St. Louis, MO USA

²³⁷University College London, London, UK

²³⁸Division of Cancer Genomics, National Cancer Center Research Institute, National Cancer Center, Tokyo, Japan

²³⁹DLR Project Management Agency, Bonn, Germany

²⁴⁰Tokyo Women’s Medical University, Tokyo, Japan

²⁴¹Center for Molecular Oncology, Memorial Sloan Kettering Cancer Center, New York, NY USA

²⁴²Los Alamos National Laboratory, Los Alamos, NM USA

²⁴³Department of Pathology, University Health Network, Toronto General Hospital, Toronto, ON Canada

²⁴⁴Nottingham University Hospitals NHS Trust, Nottingham, UK

²⁴⁵Epigenomics and Cancer Risk Factors, German Cancer Research Center (DKFZ), Heidelberg, Germany

²⁴⁶Computational Biology Program, Ontario Institute for Cancer Research, Toronto, ON Canada

²⁴⁷Department of Molecular Genetics, University of Toronto, Toronto, ON Canada

²⁴⁸Vector Institute, Toronto, ON Canada

²⁴⁹Hematopathology Section, Institute of Pathology, Christian-Albrechts-University, Kiel, Germany

²⁵⁰Department of Pathology and Laboratory Medicine, School of Medicine, University of North Carolina at Chapel Hill, Chapel Hill, NC USA

²⁵¹Department of Cancer Genetics, Institute for Cancer Research, Oslo University Hospital, The Norwegian Radium Hospital, Oslo, Norway

²⁵²Pathology, Hospital Clinic, Institut d’Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), University of Barcelona, Barcelona, Spain

²⁵³Department of Veterinary Medicine, Transmissible Cancer Group, University of Cambridge, Cambridge, UK

²⁵⁴Alvin J. Siteman Cancer Center, Washington University School of Medicine, St. Louis, MO USA

²⁵⁵Wolfson Wohl Cancer Research Centre, Institute of Cancer Sciences, University of Glasgow, Glasgow, UK

²⁵⁶Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC USA

²⁵⁷Broad Institute of MIT and Harvard, Cambridge, MA USA

²⁵⁸Dana-Farber/Boston Children’s Cancer and Blood Disorders Center, Boston, MA USA

²⁵⁹Department of Pediatrics, Harvard Medical School, Boston, MA USA

²⁶⁰Leeds Institute of Medical Research @ St. James’s, University of Leeds, St. James’s University Hospital, Leeds, UK

²⁶¹Department of Pathology and Diagnostics, University and Hospital Trust of Verona, Verona, Italy

²⁶²Department of Surgery, Princess Alexandra Hospital, Brisbane, QLD Australia

²⁶³Surgical Oncology Group, Diamantina Institute, University of Queensland, Brisbane, QLD Australia

²⁶⁴Department of Population and Quantitative Health Sciences, Case Western Reserve University School of Medicine, Cleveland, OH USA

²⁶⁵Research Health Analytics and Informatics, University Hospitals Cleveland Medical Center, Cleveland, OH USA

²⁶⁶Gloucester Royal Hospital, Gloucester, UK

²⁶⁷European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, UK

²⁶⁸Diagnostic Development, Ontario Institute for Cancer Research, Toronto, ON Canada

²⁶⁹Barcelona Supercomputing Center (BSC), Barcelona, Spain

²⁷⁰Arnie Charbonneau Cancer Institute, University of Calgary, Calgary, AB Canada

²⁷¹Departments of Surgery and Oncology, University of Calgary, Calgary, AB Canada

²⁷²Department of Pathology, Oslo University Hospital, The Norwegian Radium Hospital, Oslo, Norway

²⁷³PanCuRx Translational Research Initiative, Ontario Institute for Cancer Research, Toronto, ON Canada

²⁷⁴Department of Oncology, Sidney Kimmel Comprehensive Cancer Center at Johns Hopkins University School of Medicine, Baltimore, MD USA

²⁷⁵University Hospital Southampton NHS Foundation Trust, Southampton, UK

²⁷⁶Royal Stoke University Hospital, Stoke-on-Trent, UK

²⁷⁷Genome Sequence Informatics, Ontario Institute for Cancer Research, Toronto, ON Canada

²⁷⁸Human Longevity Inc, San Diego, CA USA

²⁷⁹Olivia Newton-John Cancer Research Institute, La Trobe University, Heidelberg, VIC Australia

²⁸⁰Computer Network Information Center, Chinese Academy of Sciences, Beijing, China

²⁸¹Genome Canada, Ottawa, ON Canada

²⁸²CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain

²⁸³Universitat Pompeu Fabra (UPF), Barcelona, Spain

²⁸⁴Buck Institute for Research on Aging, Novato, CA USA

²⁸⁵Duke University Medical Center, Durham, NC USA

²⁸⁶Department of Human Genetics, Hannover Medical School, Hannover, Germany

²⁸⁷Center for Bioinformatics and Functional Genomics, Cedars-Sinai Medical Center, Los Angeles, CA USA

²⁸⁸Department of Biomedical Sciences, Cedars-Sinai Medical Center, Los Angeles, CA USA

²⁸⁹The Hebrew University Faculty of Medicine, Jerusalem, Israel

²⁹⁰Barts Cancer Institute, Barts and the London School of Medicine and Dentistry, Queen Mary University of London, London, UK

²⁹¹Department of Computer Science, Bioinformatics Group, University of Leipzig, Leipzig, Germany

²⁹²Interdisciplinary Center for Bioinformatics, University of Leipzig, Leipzig, Germany

²⁹³Transcriptome Bioinformatics, LIFE Research Center for Civilization Diseases, University of Leipzig, Leipzig, Germany

²⁹⁴Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA USA

²⁹⁵Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA USA

²⁹⁶Harvard Medical School, Boston, MA USA

²⁹⁷USC Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, CA USA

²⁹⁸Department of Diagnostics and Public Health, University and Hospital Trust of Verona, Verona, Italy

²⁹⁹Department of Mathematics, Aarhus University, Aarhus, Denmark

³⁰⁰Department of Molecular Medicine (MOMA), Aarhus University Hospital, Aarhus N, Denmark

³⁰¹Instituto Carlos Slim de la Salud, Mexico City, Mexico

³⁰²Department of Medical Biophysics, University of Toronto, Toronto, ON Canada

³⁰³Cancer Division, Garvan Institute of Medical Research, Kinghorn Cancer Centre, University of New South Wales (UNSW Sydney), Sydney, NSW Australia

³⁰⁴South Western Sydney Clinical School, Faculty of Medicine, University of New South Wales (UNSW Sydney), Liverpool, NSW Australia

³⁰⁵West of Scotland Pancreatic Unit, Glasgow Royal Infirmary, Glasgow, UK

³⁰⁶Center for Digital Health, Berlin Institute of Health and Charité - Universitätsmedizin Berlin, Berlin, Germany

³⁰⁷Heidelberg Center for Personalized Oncology (DKFZ-HIPO), German Cancer Research Center (DKFZ), Heidelberg, Germany

³⁰⁸The Preston Robert Tisch Brain Tumor Center, Duke University Medical Center, Durham, NC USA

³⁰⁹Massachusetts General Hospital, Boston, MA USA

³¹⁰National Institute of Biomedical Genomics, Kalyani, West Bengal India

³¹¹Institute of Clinical Medicine and Institute of Oral Biology, University of Oslo, Oslo, Norway

³¹²University of North Carolina at Chapel Hill, Chapel Hill, NC USA

³¹³ARC-Net Centre for Applied Research on Cancer, University and Hospital Trust of Verona, Verona, Italy

³¹⁴The Institute of Cancer Research, London, UK

³¹⁵Centre for Computational Biology, Duke-NUS Medical School, Singapore, Singapore

³¹⁶Programme in Cancer and Stem Cell Biology, Duke-NUS Medical School, Singapore, Singapore

³¹⁷Division of Oncology and Pathology, Department of Clinical Sciences Lund, Lund University, Lund, Sweden

³¹⁸Department of Pediatric Oncology, Hematology and Clinical Immunology, Heinrich-Heine-University, Düsseldorf, Germany

³¹⁹Laboratory for Medical Science Mathematics, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan

³²⁰RIKEN Center for Integrative Medical Sciences, Yokohama, Japan

³²¹Department of Internal Medicine/Hematology, Friedrich-Ebert-Hospital, Neumünster, Germany

³²²Departments of Dermatology and Pathology, Yale University, New Haven, CT USA

³²³Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain

³²⁴Radcliffe Department of Medicine, University of Oxford, Oxford, UK

³²⁵Canadian Center for Computational Genomics, McGill University, Montreal, QC Canada

³²⁶Department of Human Genetics, McGill University, Montreal, QC Canada

³²⁷Department of Human Genetics, University of California Los Angeles, Los Angeles, CA USA

³²⁸Department of Pharmacology, University of Toronto, Toronto, ON Canada

³²⁹Faculty of Medicine and Health Technology, Tampere University and Tays Cancer Center, Tampere University Hospital, Tampere, Finland

³³⁰Haematology, Leeds Teaching Hospitals NHS Trust, Leeds, UK

³³¹Translational Research and Innovation, Centre Léon Bérard, Lyon, France

³³²Fox Chase Cancer Center, Philadelphia, PA USA

³³³International Agency for Research on Cancer, World Health Organization, Lyon, France

³³⁴Earlham Institute, Norwich, UK

³³⁵Norwich Medical School, University of East Anglia, Norwich, UK

³³⁶Department of Molecular Biology, Faculty of Science, Radboud Institute for Molecular Life Sciences, Radboud University, Nijmegen, HB The Netherlands

³³⁷CRUK Manchester Institute and Centre, Manchester, UK

³³⁸Department of Radiation Oncology, University of Toronto, Toronto, ON Canada

³³⁹Division of Cancer Sciences, Manchester Cancer Research Centre, University of Manchester, Manchester, UK

³⁴⁰Radiation Medicine Program, Princess Margaret Cancer Centre, Toronto, ON Canada

³⁴¹Department of Pathology, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA USA

³⁴²Department of Surgery, Division of Thoracic Surgery, The Johns Hopkins University School of Medicine, Baltimore, MD USA

³⁴³Division of Molecular Pathology, The Netherlands Cancer Institute, Oncode Institute, Amsterdam, CX The Netherlands

³⁴⁴Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, CA USA

³⁴⁵UC Santa Cruz Genomics Institute, University of California Santa Cruz, Santa Cruz, CA USA

³⁴⁶Division of Applied Bioinformatics, German Cancer Research Center (DKFZ), Heidelberg, Germany

³⁴⁷German Cancer Consortium (DKTK), German Cancer Research Center (DKFZ), Heidelberg, Germany

³⁴⁸National Center for Tumor Diseases (NCT) Heidelberg, Heidelberg, Germany

³⁴⁹Center for Biological Sequence Analysis, Department of Bio and Health Informatics, Technical University of Denmark, Lyngby, Denmark

³⁵⁰Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, Copenhagen, Denmark

³⁵¹Institute for Molecular Bioscience, University of Queensland, St. Lucia, Brisbane, QLD Australia

³⁵²Biomedical Engineering, Oregon Health and Science University, Portland, OR USA

³⁵³Division of Theoretical Bioinformatics, German Cancer Research Center (DKFZ), Heidelberg, Germany

³⁵⁴Institute of Pharmacy and Molecular Biotechnology and BioQuant, Heidelberg University, Heidelberg, Germany

³⁵⁵Federal Ministry of Education and Research, Berlin, Germany

³⁵⁶Melanoma Institute Australia, University of Sydney, Sydney, NSW Australia

³⁵⁷Pediatric Hematology and Oncology, University Hospital Muenster, Muenster, Germany

³⁵⁸Department of Pathology, Johns Hopkins University School of Medicine, Baltimore, MD USA

³⁵⁹McKusick-Nathans Institute of Genetic Medicine, Sidney Kimmel Comprehensive Cancer Center at Johns Hopkins University School of Medicine, Baltimore, MD USA

³⁶⁰Foundation Medicine, Inc, Cambridge, MA USA

³⁶¹Department of Biomedical Data Science, Stanford University School of Medicine, Stanford, CA USA

³⁶²Department of Genetics, Stanford University School of Medicine, Stanford, CA USA

³⁶³Bakar Computational Health Sciences Institute and Department of Pediatrics, University of California, San Francisco, CA USA

³⁶⁴Institute of Clinical Medicine, Faculty of Medicine, University of Oslo, Oslo, Norway

³⁶⁵National Cancer Institute, National Institutes of Health, Bethesda, MD USA

³⁶⁶Royal Marsden NHS Foundation Trust, London and Sutton, UK

³⁶⁷Genome Biology Unit, European Molecular Biology Laboratory (EMBL), Heidelberg, Germany

³⁶⁸Department of Oncology, University of Cambridge, Cambridge, UK

³⁶⁹Li Ka Shing Centre, Cancer Research UK Cambridge Institute, University of Cambridge, Cambridge, UK

³⁷⁰Institut Gustave Roussy, Villejuif, France

³⁷¹Cambridge University Hospitals NHS Foundation Trust, Cambridge, UK

³⁷²Department of Haematology, University of Cambridge, Cambridge, UK

³⁷³Anatomia Patológica, Hospital Clinic, Institut d’Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), University of Barcelona, Barcelona, Spain

³⁷⁴Spanish Ministry of Science and Innovation, Madrid, Spain

³⁷⁵University of Michigan Comprehensive Cancer Center, Ann Arbor, MI USA

³⁷⁶Department for BioMedical Research, University of Bern, Bern, Switzerland

³⁷⁷Department of Medical Oncology, Inselspital, University Hospital and University of Bern, Bern, Switzerland

³⁷⁸Graduate School for Cellular and Biomedical Sciences, University of Bern, Bern, Switzerland

³⁷⁹University of Pavia, Pavia, Italy

³⁸⁰University of Alabama at Birmingham, Birmingham, AL USA

³⁸¹UHN Program in BioSpecimen Sciences, Toronto General Hospital, Toronto, ON Canada

³⁸²Department of Urology, Icahn School of Medicine at Mount Sinai, New York, NY USA

³⁸³Centre for Law and Genetics, University of Tasmania, Sandy Bay Campus, Hobart, TAS Australia

³⁸⁴Faculty of Biosciences, Heidelberg University, Heidelberg, Germany

³⁸⁵Department of Biochemistry, Microbiology and Immunology, Faculty of Medicine, University of Ottawa, Ottawa, ON Canada

³⁸⁶Division of Anatomic Pathology, Mayo Clinic, Rochester, MN USA

³⁸⁷Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Bethesda, MD USA

³⁸⁸Illawarra Shoalhaven Local Health District L3 Illawarra Cancer Care Centre, Wollongong Hospital, Wollongong, NSW Australia

³⁸⁹BioForA, French National Institute for Agriculture, Food, and Environment (INRAE), ONF, Orléans, France

³⁹⁰Department of Biostatistics, Bloomberg School of Public Health, Johns Hopkins University, Baltimore, MD USA

³⁹¹University of California San Diego, San Diego, CA USA

³⁹²Division of Experimental Pathology, Mayo Clinic, Rochester, MN USA

³⁹³Centre for Cancer Research, The Westmead Institute for Medical Research, University of Sydney, Sydney, NSW Australia

³⁹⁴Department of Gynaecological Oncology, Westmead Hospital, Sydney, NSW Australia

³⁹⁵PDXen Biosystems Inc, Seoul, South Korea

³⁹⁶Korea Advanced Institute of Science and Technology, Daejeon, South Korea

³⁹⁷Electronics and Telecommunications Research Institute, Daejeon, South Korea

³⁹⁸Institut National du Cancer (INCA), Boulogne-Billancourt, France

³⁹⁹Department of Genetics, Informatics Institute, University of Alabama at Birmingham, Birmingham, AL USA

⁴⁰⁰Division of Medical Oncology, National Cancer Centre, Singapore, Singapore

⁴⁰¹Medical Oncology, University and Hospital Trust of Verona, Verona, Italy

⁴⁰²Department of Pediatrics, University Hospital Schleswig-Holstein, Kiel, Germany

⁴⁰³Hepatobiliary/Pancreatic Surgical Oncology Program, University Health Network, Toronto, ON Canada

⁴⁰⁴School of Biological Sciences, University of Auckland, Auckland, New Zealand

⁴⁰⁵Department of Surgery, University of Melbourne, Parkville, VIC Australia

⁴⁰⁶The Murdoch Children’s Research Institute, Royal Children’s Hospital, Parkville, VIC Australia

⁴⁰⁷Walter and Eliza Hall Institute, Parkville, VIC Australia

⁴⁰⁸Vancouver Prostate Centre, Vancouver, Canada

⁴⁰⁹Lunenfeld-Tanenbaum Research Institute, Mount Sinai Hospital, Toronto, ON Canada

⁴¹⁰University of East Anglia, Norwich, UK

⁴¹¹Norfolk and Norwich University Hospital NHS Trust, Norwich, UK

⁴¹²Victorian Institute of Forensic Medicine, Southbank, VIC Australia

⁴¹³Department of Biomedical Informatics, Harvard Medical School, Boston, MA USA

⁴¹⁴Department of Chemistry, Centre for Molecular Science Informatics, University of Cambridge, Cambridge, UK

⁴¹⁵Ludwig Center at Harvard Medical School, Boston, MA USA

⁴¹⁶Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX USA

⁴¹⁷Peter MacCallum Cancer Centre, University of Melbourne, Melbourne, VIC Australia

⁴¹⁸Physics Division, Optimization and Systems Biology Lab, Massachusetts General Hospital, Boston, MA USA

⁴¹⁹Department of Medicine, Baylor College of Medicine, Houston, TX USA

⁴²⁰University of Cologne, Cologne, Germany

⁴²¹International Genomics Consortium, Phoenix, AZ USA

⁴²²Genomics Research Program, Ontario Institute for Cancer Research, Toronto, ON Canada

⁴²³Barking Havering and Redbridge University Hospitals NHS Trust, Romford, UK

⁴²⁴Children’s Hospital at Westmead, University of Sydney, Sydney, NSW Australia

⁴²⁵Department of Medicine, Section of Endocrinology, University and Hospital Trust of Verona, Verona, Italy

⁴²⁶Computational Biology Center, Memorial Sloan Kettering Cancer Center, New York, NY USA

⁴²⁷Department of Biology, ETH Zurich, Zürich, Switzerland

⁴²⁸Department of Computer Science, ETH Zurich, Zurich, Switzerland

⁴²⁹SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland

⁴³⁰Weill Cornell Medical College, New York, NY USA

⁴³¹Academic Department of Medical Genetics, University of Cambridge, Addenbrooke’s Hospital, Cambridge, UK

⁴³²MRC Cancer Unit, University of Cambridge, Cambridge, UK

⁴³³Departments of Pediatrics and Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC USA

⁴³⁴Seven Bridges Genomics, Charlestown, MA USA

⁴³⁵Annai Systems, Inc, Carlsbad, CA USA

⁴³⁶Department of Pathology, General Hospital of Treviso, Department of Medicine, University of Padua, Treviso, Italy

⁴³⁷Department of Computational Biology, University of Lausanne, Lausanne, Switzerland

⁴³⁸Department of Genetic Medicine and Development, University of Geneva Medical School, Geneva, CH Switzerland

⁴³⁹Swiss Institute of Bioinformatics, University of Geneva, Geneva, CH Switzerland

⁴⁴⁰The Francis Crick Institute, London, UK

⁴⁴¹University of Leuven, Leuven, Belgium

⁴⁴²Institute of Medical Genetics and Applied Genomics, University of Tübingen, Tübingen, Germany

⁴⁴³Computational and Systems Biology, Genome Institute of Singapore, Singapore, Singapore

⁴⁴⁴School of Computing, National University of Singapore, Singapore, Singapore

⁴⁴⁵Big Data Institute, Li Ka Shing Centre, University of Oxford, Oxford, UK

⁴⁴⁶Biomedical Data Science Laboratory, Francis Crick Institute, London, UK

⁴⁴⁷Bioinformatics Group, Department of Computer Science, University College London, London, UK

⁴⁴⁸The Edward S. Rogers Sr. Department of Electrical and Computer Engineering, University of Toronto, Toronto, ON Canada

⁴⁴⁹Breast Cancer Translational Research Laboratory JC Heuson, Institut Jules Bordet, Brussels, Belgium

⁴⁵⁰Department of Oncology, Laboratory for Translational Breast Cancer Research, KU Leuven, Leuven, Belgium

⁴⁵¹Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Barcelona, Spain

⁴⁵²Research Program on Biomedical Informatics, Universitat Pompeu Fabra, Barcelona, Spain

⁴⁵³Division of Medical Oncology, Princess Margaret Cancer Centre, Toronto, ON Canada

⁴⁵⁴Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY USA

⁴⁵⁵Institute for Computational Biomedicine, Weill Cornell Medicine, New York, NY USA

⁴⁵⁶Department of Pathology, UPMC Shadyside, Pittsburgh, PA USA

⁴⁵⁷Independent Consultant, Wellesley, USA

⁴⁵⁸Department of Cell and Molecular Biology, Science for Life Laboratory, Uppsala University, Uppsala, Sweden

⁴⁵⁹Department of Medicine and Department of Genetics, Washington University School of Medicine, St. Louis, St. Louis, MO USA

⁴⁶⁰Hefei University of Technology, Anhui, China

⁴⁶¹Translational Cancer Research Unit, GZA Hospitals St.-Augustinus, Center for Oncological Research, Faculty of Medicine and Health Sciences, University of Antwerp, Antwerp, Belgium

⁴⁶²Simon Fraser University, Burnaby, BC Canada

⁴⁶³University of Pennsylvania, Philadelphia, PA USA

⁴⁶⁴Faculty of Science and Technology, University of Vic—Central University of Catalonia (UVic-UCC), Vic, Spain

⁴⁶⁵The Wellcome Trust, London, UK

⁴⁶⁶The Hospital for Sick Children, Toronto, ON Canada

⁴⁶⁷Department of Pathology, Queen Elizabeth University Hospital, Glasgow, UK

⁴⁶⁸Department of Genetics and Computational Biology, QIMR Berghofer Medical Research Institute, Brisbane, QLD Australia

⁴⁶⁹Department of Oncology, Centre for Cancer Genetic Epidemiology, University of Cambridge, Cambridge, UK

⁴⁷⁰Department of Public Health and Primary Care, Centre for Cancer Genetic Epidemiology, University of Cambridge, Cambridge, UK

⁴⁷¹Prostate Cancer Canada, Toronto, ON Canada

⁴⁷²University of Cambridge, Cambridge, UK

⁴⁷³Department of Laboratory Medicine, Translational Cancer Research, Lund University Cancer Center at Medicon Village, Lund University, Lund, Sweden

⁴⁷⁴Heidelberg University, Heidelberg, Germany

⁴⁷⁵New BIH Digital Health Center, Berlin Institute of Health (BIH) and Charité - Universitätsmedizin Berlin, Berlin, Germany

⁴⁷⁶CIBER Epidemiología y Salud Pública (CIBERESP), Madrid, Spain

⁴⁷⁷Research Group on Statistics, Econometrics and Health (GRECS), UdG, Barcelona, Spain

⁴⁷⁸Quantitative Genomics Laboratories (qGenomics), Barcelona, Spain

⁴⁷⁹Icelandic Cancer Registry, Icelandic Cancer Society, Reykjavik, Iceland

⁴⁸⁰State Key Laboratory of Cancer Biology, and Xijing Hospital of Digestive Diseases, Fourth Military Medical University, Shaanxi, China

⁴⁸¹Department of Medicine (DIMED), Surgical Pathology Unit, University of Padua, Padua, Italy

⁴⁸²Rigshospitalet, Copenhagen, Denmark

⁴⁸³Center for Cancer Genomics, National Cancer Institute, National Institutes of Health, Bethesda, MD USA

⁴⁸⁴Department of Biochemistry and Molecular Medicine, University of Montreal, Montreal, QC Canada

⁴⁸⁵Australian Institute of Tropical Health and Medicine, James Cook University, Douglas, QLD Australia

⁴⁸⁶Department of Neuro-Oncology, Istituto Neurologico Besta, Milano, Italy

⁴⁸⁷Bioplatforms Australia, North Ryde, NSW Australia

⁴⁸⁸Department of Pathology (Research), University College London Cancer Institute, London, UK

⁴⁸⁹Department of Surgical Oncology, Princess Margaret Cancer Centre, Toronto, ON Canada

⁴⁹⁰Department of Medical Oncology, Josephine Nefkens Institute and Cancer Genomics Centre, Erasmus Medical Center, Rotterdam, The Netherlands

⁴⁹¹The University of Queensland Thoracic Research Centre, The Prince Charles Hospital, Brisbane, QLD Australia

⁴⁹²CIBIO/InBIO - Research Center in Biodiversity and Genetic Resources, Universidade do Porto, Vairão, Portugal

⁴⁹³HCA Laboratories, London, UK

⁴⁹⁴University of Liverpool, Liverpool, UK

⁴⁹⁵The Azrieli Faculty of Medicine, Bar-Ilan University, Safed, Israel

⁴⁹⁶Department of Neurosurgery, University of Florida, Gainesville, FL USA

⁴⁹⁷Department of Pathology, Graduate School of Medicine, University of Tokyo, Tokyo, Japan

⁴⁹⁸University of Milano Bicocca, Monza, Italy

⁴⁹⁹BGI-Shenzhen, Shenzhen, China

⁵⁰⁰Department of Pathology, Oslo University Hospital Ulleval, Oslo, Norway

⁵⁰¹Center for Biomedical Informatics, Harvard Medical School, Boston, MA USA

⁵⁰²Department of Biochemistry and Molecular Biomedicine, University of Barcelona, Barcelona, Spain

⁵⁰³Office of Cancer Genomics, National Cancer Institute, National Institutes of Health, Bethesda, MD USA

⁵⁰⁴Cancer Epigenomics, German Cancer Research Center (DKFZ), Heidelberg, Germany

⁵⁰⁵Department of Cancer Biology, The University of Texas MD Anderson Cancer Center, Houston, TX USA

⁵⁰⁶Department of Surgical Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX USA

⁵⁰⁷Department of Computer Science, Yale University, New Haven, CT USA

⁵⁰⁸Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT USA

⁵⁰⁹Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT USA

⁵¹⁰Center for Cancer Research, Massachusetts General Hospital, Boston, MA USA

⁵¹¹Department of Pathology, Massachusetts General Hospital, Boston, MA USA

⁵¹²Department of Pathology, Memorial Sloan Kettering Cancer Center, New York, NY USA

⁵¹³Division of Gastroenterology and Hepatology, Mayo Clinic, Rochester, MN USA

⁵¹⁴University of Sydney, Sydney, NSW Australia

⁵¹⁵University of Oxford, Oxford, UK

⁵¹⁶Department of Surgery, Academic Urology Group, University of Cambridge, Cambridge, UK

⁵¹⁷Department of Medicine II, University of Würzburg, Wuerzburg, Germany

⁵¹⁸Sylvester Comprehensive Cancer Center, University of Miami, Miami, FL USA

⁵¹⁹Institut Hospital del Mar d’Investigacions Mèdiques (IMIM), Barcelona, Spain

⁵²⁰Genome Integrity and Structural Biology Laboratory, National Institute of Environmental Health Sciences (NIEHS), Durham, NC USA

⁵²¹St. Thomas’s Hospital, London, UK

⁵²²Osaka International Cancer Center, Osaka, Japan

⁵²³Department of Pathology, Skåne University Hospital, Lund University, Lund, Sweden

⁵²⁴Department of Medical Oncology, Beatson West of Scotland Cancer Centre, Glasgow, UK

⁵²⁵National Human Genome Research Institute, National Institutes of Health, Bethesda, MD USA

⁵²⁶Centre for Cancer Research, Victorian Comprehensive Cancer Centre, University of Melbourne, Melbourne, VIC Australia

⁵²⁷Department of Medicine, Section of Hematology/Oncology, University of Chicago, Chicago, IL USA

⁵²⁸German Center for Infection Research (DZIF), Partner Site Hamburg-Borstel-Lübeck-Riems, Hamburg, Germany

⁵²⁹Bioinformatics Research Centre (BiRC), Aarhus University, Aarhus, Denmark

⁵³⁰Department of Biotechnology, Ministry of Science and Technology, Government of India, New Delhi, Delhi India

⁵³¹National Cancer Centre Singapore, Singapore, Singapore

⁵³²Brandeis University, Waltham, MA USA

⁵³³Department of Urologic Sciences, University of British Columbia, Vancouver, BC Canada

⁵³⁴Department of Internal Medicine, Stanford University, Stanford, CA USA

⁵³⁵The University of Texas Health Science Center at Houston, Houston, TX USA

⁵³⁶Imperial College NHS Trust, Imperial College, London, UK

⁵³⁷Senckenberg Institute of Pathology, University of Frankfurt Medical School, Frankfurt, Germany

⁵³⁸Department of Medicine, Division of Biomedical Informatics, UC San Diego School of Medicine, San Diego, CA USA

⁵³⁹Center for Precision Health, School of Biomedical Informatics, The University of Texas Health Science Center, Houston, TX USA

⁵⁴⁰Oxford Nanopore Technologies, New York, NY USA

⁵⁴¹Institute of Medical Science, University of Tokyo, Tokyo, Japan

⁵⁴²Howard Hughes Medical Institute, University of California Santa Cruz, Santa Cruz, CA USA

⁵⁴³Wakayama Medical University, Wakayama, Japan

⁵⁴⁴Department of Internal Medicine, Division of Medical Oncology, Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC USA

⁵⁴⁵University of Tennessee Health Science Center for Cancer Research, Memphis, TN USA

⁵⁴⁶Department of Histopathology, Salford Royal NHS Foundation Trust, Salford, UK

⁵⁴⁷Faculty of Biology, Medicine and Health, University of Manchester, Manchester, UK

⁵⁴⁸BIOPIC, ICG and College of Life Sciences, Peking University, Beijing, China

⁵⁴⁹Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, China

⁵⁵⁰Children’s Hospital of Philadelphia, Philadelphia, PA USA

⁵⁵¹Department of Bioinformatics and Computational Biology and Department of Systems Biology, The University of Texas MD Anderson Cancer Center, Houston, TX USA

⁵⁵²Karolinska Institute, Stockholm, Sweden

⁵⁵³The Donnelly Centre, University of Toronto, Toronto, ON Canada

⁵⁵⁴Department of Medical Genetics, College of Medicine, Hallym University, Chuncheon, South Korea

⁵⁵⁵Department of Experimental and Health Sciences, Institute of Evolutionary Biology (UPF-CSIC), Universitat Pompeu Fabra, Barcelona, Spain

⁵⁵⁶Health Data Science Unit, University Clinics, Heidelberg, Germany

⁵⁵⁷Massachusetts General Hospital Center for Cancer Research, Charlestown, MA USA

⁵⁵⁸Hokkaido University, Sapporo, Japan

⁵⁵⁹Department of Pathology and Clinical Laboratory, National Cancer Center Hospital, Tokyo, Japan

⁵⁶⁰Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC USA

⁵⁶¹Computational Biology, Leibniz Institute on Aging - Fritz Lipmann Institute (FLI), Jena, Germany

⁵⁶²University of Melbourne Centre for Cancer Research, Melbourne, VIC Australia

⁵⁶³University of Nebraska Medical Center, Omaha, NE USA

⁵⁶⁴Syntekabio Inc, Daejeon, South Korea

⁵⁶⁵Department of Pathology, Academic Medical Center, Amsterdam, The Netherlands

⁵⁶⁶China National GeneBank-Shenzhen, Shenzhen, China

⁵⁶⁷Division of Molecular Genetics, German Cancer Research Center (DKFZ), Heidelberg, Germany

⁵⁶⁸Division of Life Science and Applied Genomics Center, Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong, China

⁵⁶⁹Icahn School of Medicine at Mount Sinai, New York, NY USA

⁵⁷⁰Geneplus-Shenzhen, Shenzhen, China

⁵⁷¹School of Computer Science and Technology, Xi’an Jiaotong University, Xi’an, China

⁵⁷²AbbVie, North Chicago, IL USA

⁵⁷³Institute of Pathology, Charité – University Medicine Berlin, Berlin, Germany

⁵⁷⁴Centre for Translational and Applied Genomics, British Columbia Cancer Agency, Vancouver, BC Canada

⁵⁷⁵Edinburgh Royal Infirmary, Edinburgh, UK

⁵⁷⁶Berlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular Medicine, Berlin, Germany

⁵⁷⁷Department of Pediatric Immunology, Hematology and Oncology, University Hospital, Heidelberg, Germany

⁵⁷⁸German Cancer Research Center (DKFZ), Heidelberg, Germany

⁵⁷⁹Heidelberg Institute for Stem Cell Technology and Experimental Medicine (HI-STEM), Heidelberg, Germany

⁵⁸⁰Institute for Computational Biomedicine, Weill Cornell Medical College, New York, NY USA

⁵⁸¹New York Genome Center, New York, NY USA

⁵⁸²Department of Urology, James Buchanan Brady Urological Institute, Johns Hopkins University School of Medicine, Baltimore, MD USA

⁵⁸³Department of Preventive Medicine, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan

⁵⁸⁴Department of Molecular and Cellular Biology, Baylor College of Medicine, Houston, TX USA

⁵⁸⁵Department of Pathology and Immunology, Baylor College of Medicine, Houston, TX USA

⁵⁸⁶Michael E. DeBakey Veterans Affairs Medical Center, Houston, TX USA

⁵⁸⁷Technical University of Denmark, Lyngby, Denmark

⁵⁸⁸Department of Pathology, College of Medicine, Hanyang University, Seoul, South Korea

⁵⁸⁹Academic Unit of Surgery, School of Medicine, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow Royal Infirmary, Glasgow, UK

⁵⁹⁰Department of Pathology, Asan Medical Center, College of Medicine, Ulsan University, Songpa-gu, Seoul South Korea

⁵⁹¹Science Writer, Garrett Park, MD USA

⁵⁹²International Cancer Genome Consortium (ICGC)/ICGC Accelerating Research in Genomic Oncology (ARGO) Secretariat, Ontario Institute for Cancer Research, Toronto, ON Canada

⁵⁹³University of Ljubljana, Ljubljana, Slovenia

⁵⁹⁴Department of Public Health Sciences, University of Chicago, Chicago, IL USA

⁵⁹⁵Research Institute, NorthShore University HealthSystem, Evanston, IL USA

⁵⁹⁶Department for Biomedical Research, University of Bern, Bern, Switzerland

⁵⁹⁷Centre of Genomics and Policy, McGill University and Génome Québec Innovation Centre, Montreal, QC Canada

⁵⁹⁸Carolina Center for Genome Sciences, University of North Carolina at Chapel Hill, Chapel Hill, NC USA

⁵⁹⁹Hopp Children’s Cancer Center (KiTZ), Heidelberg, Germany

⁶⁰⁰Pediatric Glioma Research Group, German Cancer Research Center (DKFZ), Heidelberg, Germany

⁶⁰¹Cancer Research UK, London, UK

⁶⁰²Indivumed GmbH, Hamburg, Germany

⁶⁰³Genome Integration Data Center, Syntekabio, Inc, Daejeon, South Korea

⁶⁰⁴University Hospital Zurich, Zurich, Switzerland

⁶⁰⁵Clinical Bioinformatics, Swiss Institute of Bioinformatics, Geneva, Switzerland

⁶⁰⁶Institute for Pathology and Molecular Pathology, University Hospital Zurich, Zurich, Switzerland

⁶⁰⁷Institute of Molecular Life Sciences, University of Zurich, Zurich, Switzerland

⁶⁰⁸MRC Human Genetics Unit, MRC IGMM, University of Edinburgh, Edinburgh, UK

⁶⁰⁹Women’s Cancer Program at the Samuel Oschin Comprehensive Cancer Institute, Cedars-Sinai Medical Center, Los Angeles, CA USA

⁶¹⁰Department of Biology, Bioinformatics Group, Division of Molecular Biology, Faculty of Science, University of Zagreb, Zagreb, Croatia

⁶¹¹Department for Internal Medicine II, University Hospital Schleswig-Holstein, Kiel, Germany

⁶¹²Genetics and Molecular Pathology, SA Pathology, Adelaide, SA Australia

⁶¹³Department of Gastric Surgery, National Cancer Center Hospital, Tokyo, Japan

⁶¹⁴Department of Bioinformatics, Division of Cancer Genomics, National Cancer Center Research Institute, Tokyo, Japan

⁶¹⁵A.A. Kharkevich Institute of Information Transmission Problems, Moscow, Russia

⁶¹⁶Oncology and Immunology, Dmitry Rogachev National Research Center of Pediatric Hematology, Moscow, Russia

⁶¹⁷Skolkovo Institute of Science and Technology, Moscow, Russia

⁶¹⁸Department of Surgery, The George Washington University, School of Medicine and Health Science, Washington, DC USA

⁶¹⁹Endocrine Oncology Branch, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD USA

⁶²⁰Melanoma Institute Australia, Macquarie University, Sydney, NSW Australia

⁶²¹MIT Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA USA

⁶²²Tissue Pathology and Diagnostic Oncology, Royal Prince Alfred Hospital, Sydney, NSW Australia

⁶²³Cholangiocarcinoma Screening and Care Program and Liver Fluke and Cholangiocarcinoma Research Centre, Faculty of Medicine, Khon Kaen University, Khon Kaen, Thailand

⁶²⁴Controlled Department and Institution, New York, NY USA

⁶²⁵Englander Institute for Precision Medicine, Weill Cornell Medicine, New York, NY USA

⁶²⁶National Cancer Center, Gyeonggi, South Korea

⁶²⁷Department of Biochemistry, College of Medicine, Ewha Womans University, Seoul, South Korea

⁶²⁸Health Sciences Department of Biomedical Informatics, University of California San Diego, La Jolla, CA USA

⁶²⁹Research Core Center, National Cancer Centre Korea, Goyang-si, South Korea

⁶³⁰Department of Health Sciences and Technology, Sungkyunkwan University School of Medicine, Seoul, South Korea

⁶³¹Samsung Genome Institute, Seoul, South Korea

⁶³²Breast Oncology Program, Dana-Farber/Brigham and Women’s Cancer Center, Boston, MA USA

⁶³³Department of Surgery, Memorial Sloan Kettering Cancer Center, New York, NY USA

⁶³⁴Division of Breast Surgery, Brigham and Women’s Hospital, Boston, MA USA

⁶³⁵Integrative Bioinformatics Support Group, National Institute of Environmental Health Sciences (NIEHS), Durham, NC USA

⁶³⁶Department of Clinical Science, University of Bergen, Bergen, Norway

⁶³⁷Center For Medical Innovation, Seoul National University Hospital, Seoul, South Korea

⁶³⁸Department of Internal Medicine, Seoul National University Hospital, Seoul, South Korea

⁶³⁹Institute of Computer Science, Polish Academy of Sciences, Warsaw, Poland

⁶⁴⁰Functional and Structural Genomics, German Cancer Research Center (DKFZ), Heidelberg, Germany

⁶⁴¹Laboratory of Translational Genomics, Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Bethesda, MD USA

⁶⁴²Institute for Medical Informatics Statistics and Epidemiology, University of Leipzig, Leipzig, Germany

⁶⁴³Morgan Welch Inflammatory Breast Cancer Research Program and Clinic, The University of Texas MD Anderson Cancer Center, Houston, TX USA

⁶⁴⁴Department of Hematology and Oncology, Georg-August-University of Göttingen, Göttingen, Germany

⁶⁴⁵Institute of Cell Biology (Cancer Research), University of Duisburg-Essen, Essen, Germany

⁶⁴⁶King’s College London and Guy’s and St. Thomas’ NHS Foundation Trust, London, UK

⁶⁴⁷Center for Epigenetics, Van Andel Research Institute, Grand Rapids, MI USA

⁶⁴⁸The University of Queensland Centre for Clinical Research, Royal Brisbane and Women’s Hospital, Herston, QLD Australia

⁶⁴⁹Department of Pediatric Oncology and Hematology, University of Cologne, Cologne, Germany

⁶⁵⁰University of Düsseldorf, Düsseldorf, Germany

⁶⁵¹Department of Pathology, Institut Jules Bordet, Brussels, Belgium

⁶⁵²Institute of Biomedicine, Sahlgrenska Academy at University of Gothenburg, Gothenburg, Sweden

⁶⁵³Children’s Medical Research Institute, Sydney, NSW Australia

⁶⁵⁴ILSbio, LLC Biobank, Chestertown, MD USA

⁶⁵⁵Division of Genetics and Genomics, Boston Children’s Hospital, Harvard Medical School, Boston, MA USA

⁶⁵⁶Institute for Bioengineering and Biopharmaceutical Research (IBBR), Hanyang University, Seoul, South Korea

⁶⁵⁷Department of Statistics, University of California Santa Cruz, Santa Cruz, CA USA

⁶⁵⁸National Genotyping Center, Institute of Biomedical Sciences, Academia Sinica, Taipei, Taiwan

⁶⁵⁹Department of Vertebrate Genomics/Otto Warburg Laboratory Gene Regulation and Systems Biology of Cancer, Max Planck Institute for Molecular Genetics, Berlin, Germany

⁶⁶⁰McGill University and Genome Quebec Innovation Centre, Montreal, QC Canada

⁶⁶¹biobyte solutions GmbH, Heidelberg, Germany

⁶⁶²Gynecologic Oncology, NYU Laura and Isaac Perlmutter Cancer Center, New York University, New York, NY USA

⁶⁶³Division of Oncology, Stem Cell Biology Section, Washington University School of Medicine, St. Louis, MO USA

⁶⁶⁴Department of Systems Biology, The University of Texas MD Anderson Cancer Center, Houston, TX USA

⁶⁶⁵Harvard University, Cambridge, MA USA

⁶⁶⁶Urologic Oncology Branch, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD USA

⁶⁶⁷University of Oslo, Oslo, Norway

⁶⁶⁸University of Toronto, Toronto, ON Canada

⁶⁶⁹Peking University, Beijing, China

⁶⁷⁰School of Life Sciences, Peking University, Beijing, China

⁶⁷¹Leidos Biomedical Research, Inc, McLean, VA USA

⁶⁷²Hematology, Hospital Clinic, Institut d’Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), University of Barcelona, Barcelona, Spain

⁶⁷³Second Military Medical University, Shanghai, China

⁶⁷⁴Chinese Cancer Genome Consortium, Shenzhen, China

⁶⁷⁵Department of Medical Oncology, Beijing Hospital, Beijing, China

⁶⁷⁶Laboratory of Molecular Oncology, Key Laboratory of Carcinogenesis and Translational Research (Ministry of Education), Peking University Cancer Hospital and Institute, Beijing, China

⁶⁷⁷School of Medicine/School of Mathematics and Statistics, University of St. Andrews, St. Andrews, Fife UK

⁶⁷⁸Institute for Systems Biology, Seattle, WA USA

⁶⁷⁹Department of Biochemistry and Molecular Biology, Faculty of Medicine, University Institute of Oncology-IUOPA, Oviedo, Spain

⁶⁸⁰Institut Bergonié, Bordeaux, France

⁶⁸¹Cancer Unit, MRC University of Cambridge, Cambridge, UK

⁶⁸²Department of Pathology and Laboratory Medicine, Center for Personalized Medicine, Children’s Hospital Los Angeles, Los Angeles, CA USA

⁶⁸³John Curtin School of Medical Research, Canberra, ACT Australia

⁶⁸⁴MVZ Department of Oncology, PraxisClinic am Johannisplatz, Leipzig, Germany

⁶⁸⁵Department of Information Technology, Ghent University, Ghent, Belgium

⁶⁸⁶Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium

⁶⁸⁷Institute for Genomic Medicine, Nationwide Children’s Hospital, Columbus, OH USA

⁶⁸⁸Computational Biology Program, School of Medicine, Oregon Health and Science University, Portland, OR USA

⁶⁸⁹Department of Surgery, Duke University, Durham, NC USA

⁶⁹⁰Institució Catalana de Recerca i Estudis Avançats (ICREA), Barcelona, Spain

⁶⁹¹Institut Català de Paleontologia Miquel Crusafont, Universitat Autònoma de Barcelona, Barcelona, Spain

⁶⁹²University of Glasgow, Glasgow, UK

⁶⁹³Institut d’Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), Barcelona, Spain

⁶⁹⁴Division of Oncology, Washington University School of Medicine, St. Louis, MO USA

⁶⁹⁵Department of Surgery and Cancer, Imperial College, London, UK

⁶⁹⁶Applications Department, Oxford Nanopore Technologies, Oxford, UK

⁶⁹⁷Department of Obstetrics, Gynecology and Reproductive Services, University of California San Francisco, San Francisco, CA USA

⁶⁹⁸Department of Biochemistry and Molecular Medicine, University of California at Davis, Sacramento, CA USA

⁶⁹⁹STTARR Innovation Facility, Princess Margaret Cancer Centre, Toronto, ON Canada

⁷⁰⁰Discipline of Surgery, Western Sydney University, Penrith, NSW Australia

⁷⁰¹Yale School of Medicine, Yale University, New Haven, CT USA

⁷⁰²Department of Genetics, Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC USA

⁷⁰³Departments of Neurology and Neurosurgery, Henry Ford Hospital, Detroit, MI USA

⁷⁰⁴Precision Oncology, OHSU Knight Cancer Institute, Oregon Health and Science University, Portland, OR USA

⁷⁰⁵Institute of Pathology, University Medical Center Hamburg-Eppendorf, Hamburg, Germany

⁷⁰⁶Department of Health Sciences, Faculty of Medical Sciences, Kyushu University, Fukuoka, Japan

⁷⁰⁷Heidelberg Academy of Sciences and Humanities, Heidelberg, Germany

⁷⁰⁸Department of Clinical Pathology, University of Melbourne, Melbourne, VIC Australia

⁷⁰⁹Department of Pathology, Roswell Park Cancer Institute, Buffalo, NY USA

⁷¹⁰Department of Computer Science, University of Helsinki, Helsinki, Finland

⁷¹¹Institute of Biotechnology, University of Helsinki, Helsinki, Finland

⁷¹²Organismal and Evolutionary Biology Research Programme, University of Helsinki, Helsinki, Finland

⁷¹³Department of Obstetrics and Gynecology, Division of Gynecologic Oncology, Washington University School of Medicine, St. Louis, MO USA

⁷¹⁴Penrose St. Francis Health Services, Colorado Springs, CO USA

⁷¹⁵Institute of Pathology, Ulm University and University Hospital of Ulm, Ulm, Germany

⁷¹⁶National Cancer Center, Tokyo, Japan

⁷¹⁷Genome Institute of Singapore, Singapore, Singapore

⁷¹⁸Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT USA

⁷¹⁹German Cancer Aid, Bonn, Germany

⁷²⁰Programme in Cancer and Stem Cell Biology, Centre for Computational Biology, Duke-NUS Medical School, Singapore, Singapore

⁷²¹The Chinese University of Hong Kong, Shatin, NT, Hong Kong China

⁷²²Fourth Military Medical University, Shaanxi, China

⁷²³The University of Cambridge School of Clinical Medicine, Cambridge, UK

⁷²⁴St. Jude Children’s Research Hospital, Memphis, TN USA

⁷²⁵University Health Network, Princess Margaret Cancer Centre, Toronto, ON Canada

⁷²⁶Center for Biomolecular Science and Engineering, University of California Santa Cruz, Santa Cruz, CA USA

⁷²⁷Department of Medicine, University of Chicago, Chicago, IL USA

⁷²⁸Department of Neurology, Mayo Clinic, Rochester, MN USA

⁷²⁹Cambridge Oesophagogastric Centre, Cambridge University Hospitals NHS Foundation Trust, Cambridge, UK

⁷³⁰Department of Computer Science, Carleton College, Northfield, MN USA

⁷³¹Institute of Cancer Sciences, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, UK

⁷³²Department of Epidemiology, University of Alabama at Birmingham, Birmingham, AL USA

⁷³³HudsonAlpha Institute for Biotechnology, Huntsville, AL USA

⁷³⁴O’Neal Comprehensive Cancer Center, University of Alabama at Birmingham, Birmingham, AL USA

⁷³⁵Department of Pathology, Keio University School of Medicine, Tokyo, Japan

⁷³⁶Department of Hepatobiliary and Pancreatic Oncology, National Cancer Center Hospital, Tokyo, Japan

⁷³⁷Sage Bionetworks, Seattle, WA USA

⁷³⁸Lymphoma Genomic Translational Research Laboratory, National Cancer Centre, Singapore, Singapore

⁷³⁹Department of Clinical Pathology, Robert-Bosch-Hospital, Stuttgart, Germany

⁷⁴⁰Department of Cell and Systems Biology, University of Toronto, Toronto, ON Canada

⁷⁴¹Department of Biosciences and Nutrition, Karolinska Institutet, Stockholm, Sweden

⁷⁴²Center for Liver Cancer, Research Institute and Hospital, National Cancer Center, Gyeonggi, South Korea

⁷⁴³Division of Hematology-Oncology, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, South Korea

⁷⁴⁴Samsung Advanced Institute for Health Sciences and Technology, Sungkyunkwan University School of Medicine, Seoul, South Korea

⁷⁴⁵Cheonan Industry-Academic Collaboration Foundation, Sangmyung University, Cheonan, South Korea

⁷⁴⁶NYU Langone Medical Center, New York, NY USA

⁷⁴⁷Department of Hematology and Medical Oncology, Cleveland Clinic, Cleveland, OH USA

⁷⁴⁸Department of Radiation Oncology, University of California San Francisco, San Francisco, CA USA

⁷⁴⁹Department of Health Sciences Research, Mayo Clinic, Rochester, MN USA

⁷⁵⁰Helen F. Graham Cancer Center at Christiana Care Health Systems, Newark, DE USA

⁷⁵¹Heidelberg University Hospital, Heidelberg, Germany

⁷⁵²CSRA Incorporated, Fairfax, VA USA

⁷⁵³Research Department of Pathology, University College London Cancer Institute, London, UK

⁷⁵⁴Department of Research Oncology, Guy’s Hospital, King’s Health Partners AHSC, King’s College London School of Medicine, London, UK

⁷⁵⁵Faculty of Medicine and Health Sciences, Macquarie University, Sydney, NSW Australia

⁷⁵⁶University Hospital of Minjoz, INSERM UMR 1098, Besançon, France

⁷⁵⁷Spanish National Cancer Research Centre, Madrid, Spain

⁷⁵⁸Center of Digestive Diseases and Liver Transplantation, Fundeni Clinical Institute, Bucharest, Romania

⁷⁵⁹Cureline, Inc, South San Francisco, CA USA

⁷⁶⁰St. Luke’s Cancer Centre, Royal Surrey County Hospital NHS Foundation Trust, Guildford, UK

⁷⁶¹Cambridge Breast Unit, Addenbrooke’s Hospital, Cambridge University Hospital NHS Foundation Trust and NIHR Cambridge Biomedical Research Centre, Cambridge, UK

⁷⁶²East of Scotland Breast Service, Ninewells Hospital, Aberdeen, UK

⁷⁶³Department of Genetics, Microbiology and Statistics, University of Barcelona, IRSJD, IBUB, Barcelona, Spain

⁷⁶⁴Department of Obstetrics and Gynecology, Medical College of Wisconsin, Milwaukee, WI USA

⁷⁶⁵Hematology and Medical Oncology, Winship Cancer Institute of Emory University, Atlanta, GA USA

⁷⁶⁶Department of Computer Science, Princeton University, Princeton, NJ USA

⁷⁶⁷Vanderbilt Ingram Cancer Center, Vanderbilt University, Nashville, TN USA

⁷⁶⁸Ohio State University College of Medicine and Arthur G. James Comprehensive Cancer Center, Columbus, OH USA

⁷⁶⁹Department of Surgery, Yokohama City University Graduate School of Medicine, Kanagawa, Japan

⁷⁷⁰Division of Chromatin Networks, German Cancer Research Center (DKFZ) and BioQuant, Heidelberg, Germany

⁷⁷¹Research Computing Center, University of North Carolina at Chapel Hill, Chapel Hill, NC USA

⁷⁷²School of Molecular Biosciences and Center for Reproductive Biology, Washington State University, Pullman, WA USA

⁷⁷³Finsen Laboratory and Biotech Research and Innovation Centre (BRIC), University of Copenhagen, Copenhagen, Denmark

⁷⁷⁴Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, ON Canada

⁷⁷⁵Department of Pathology, Human Oncology and Pathogenesis Program, Memorial Sloan Kettering Cancer Center, New York, NY USA

⁷⁷⁶University Hospital Giessen, Pediatric Hematology and Oncology, Giessen, Germany

⁷⁷⁷Oncologie Sénologie, ICM Institut Régional du Cancer, Montpellier, France

⁷⁷⁸Institute of Clinical Molecular Biology, Christian-Albrechts-University, Kiel, Germany

⁷⁷⁹Institute of Pathology, University of Wuerzburg, Wuerzburg, Germany

⁷⁸⁰Department of Urology, North Bristol NHS Trust, Bristol, UK

⁷⁸¹SingHealth, Duke-NUS Institute of Precision Medicine, National Heart Centre Singapore, Singapore, Singapore

⁷⁸²Department of Computer Science, University of Toronto, Toronto, ON Canada

⁷⁸³Bern Center for Precision Medicine, University Hospital of Bern, University of Bern, Bern, Switzerland

⁷⁸⁴Englander Institute for Precision Medicine, Weill Cornell Medicine and New York Presbyterian Hospital, New York, NY USA

⁷⁸⁵Meyer Cancer Center, Weill Cornell Medicine, New York, NY USA

⁷⁸⁶Pathology and Laboratory, Weill Cornell Medical College, New York, NY USA

⁷⁸⁷Vall d’Hebron Institute of Oncology: VHIO, Barcelona, Spain

⁷⁸⁸General and Hepatobiliary-Biliary Surgery, Pancreas Institute, University and Hospital Trust of Verona, Verona, Italy

⁷⁸⁹National Centre for Biological Sciences, Tata Institute of Fundamental Research, Bangalore, India

⁷⁹⁰Indiana University, Bloomington, IN USA

⁷⁹¹Department of Pathology, GZA-ZNA Hospitals, Antwerp, Belgium

⁷⁹²Analytical Biological Services, Inc, Wilmington, DE USA

⁷⁹³Sydney Medical School, University of Sydney, Sydney, NSW Australia

⁷⁹⁴cBio Center, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA USA

⁷⁹⁵Department of Cell Biology, Harvard Medical School, Boston, MA USA

⁷⁹⁶Advanced Centre for Treatment Research and Education in Cancer, Tata Memorial Centre, Navi Mumbai, Maharashtra India

⁷⁹⁷School of Environmental and Life Sciences, Faculty of Science, The University of Newcastle, Ourimbah, NSW Australia

⁷⁹⁸Department of Dermatology, University Hospital of Essen, Essen, Germany

⁷⁹⁹Bioinformatics and Omics Data Analytics, German Cancer Research Center (DKFZ), Heidelberg, Germany

⁸⁰⁰Department of Urology, Charité Universitätsmedizin Berlin, Berlin, Germany

⁸⁰¹Martini-Clinic, Prostate Cancer Center, University Medical Center Hamburg-Eppendorf, Hamburg, Germany

⁸⁰²Department of General Internal Medicine, University of Kiel, Kiel, Germany

⁸⁰³German Cancer Consortium (DKTK), Partner site Berlin, Berlin, Germany

⁸⁰⁴Cancer Research Institute, Beth Israel Deaconess Medical Center, Boston, MA USA

⁸⁰⁵University of Pittsburgh, Pittsburgh, PA USA

⁸⁰⁶Department of Ophthalmology and Ocular Genomics Institute, Massachusetts Eye and Ear, Harvard Medical School, Boston, MA USA

⁸⁰⁷Center for Psychiatric Genetics, NorthShore University HealthSystem, Evanston, IL USA

⁸⁰⁸Van Andel Research Institute, Grand Rapids, MI USA

⁸⁰⁹Laboratory of Molecular Medicine, Human Genome Center, Institute of Medical Science, University of Tokyo, Tokyo, Japan

⁸¹⁰Japan Agency for Medical Research and Development, Tokyo, Japan

⁸¹¹Korea University, Seoul, South Korea

⁸¹²Murtha Cancer Center, Walter Reed National Military Medical Center, Bethesda, MD USA

⁸¹³Human Genetics, University of Kiel, Kiel, Germany

⁸¹⁴Department of Oncologic Pathology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA USA

⁸¹⁵Oregon Health and Science University, Portland, OR USA

⁸¹⁶Center for RNA Interference and Noncoding RNA, The University of Texas MD Anderson Cancer Center, Houston, TX USA

⁸¹⁷Department of Experimental Therapeutics, The University of Texas MD Anderson Cancer Center, Houston, TX USA

⁸¹⁸Department of Gynecologic Oncology and Reproductive Medicine, The University of Texas MD Anderson Cancer Center, Houston, TX USA

⁸¹⁹University Hospitals Coventry and Warwickshire NHS Trust, Coventry, UK

⁸²⁰Department of Radiation Oncology, Radboud University Nijmegen Medical Centre, Nijmegen, The Netherlands

⁸²¹Institute for Genomics and Systems Biology, University of Chicago, Chicago, IL USA

⁸²²Clinic for Hematology and Oncology, St.-Antonius-Hospital, Eschweiler, Germany

⁸²³Computational and Systems Biology Program, Memorial Sloan Kettering Cancer Center, New York, NY USA

⁸²⁴University of Iceland, Reykjavik, Iceland

⁸²⁵Division of Computational Genomics and Systems Genetics, German Cancer Research Center (DKFZ), Heidelberg, Germany

⁸²⁶Dundee Cancer Centre, Ninewells Hospital, Dundee, UK

⁸²⁷Department for Internal Medicine III, University of Ulm and University Hospital of Ulm, Ulm, Germany

⁸²⁸Institut Curie, INSERM Unit 830, Paris, France

⁸²⁹Department of Gastroenterology and Hepatology, Yokohama City University Graduate School of Medicine, Kanagawa, Japan

⁸³⁰Department of Laboratory Medicine, Radboud University Nijmegen Medical Centre, Nijmegen, The Netherlands

⁸³¹Division of Cancer Genome Research, German Cancer Research Center (DKFZ), Heidelberg, Germany

⁸³²Department of General Surgery, Singapore General Hospital, Singapore, Singapore

⁸³³Cancer Science Institute of Singapore, National University of Singapore, Singapore, Singapore

⁸³⁴Department of Medical and Clinical Genetics, Genome-Scale Biology Research Program, University of Helsinki, Helsinki, Finland

⁸³⁵East Anglian Medical Genetics Service, Cambridge University Hospitals NHS Foundation Trust, Cambridge, UK

⁸³⁶Irving Institute for Cancer Dynamics, Columbia University, New York, NY USA

⁸³⁷Institute of Molecular and Cell Biology, Singapore, Singapore

⁸³⁸Laboratory of Cancer Epigenome, Division of Medical Science, National Cancer Centre Singapore, Singapore, Singapore

⁸³⁹Universite Lyon, INCa-Synergie, Centre Léon Bérard, Lyon, France

⁸⁴⁰Department of Urology, Mayo Clinic, Rochester, MN USA

⁸⁴¹Royal National Orthopaedic Hospital - Stanmore, Stanmore, Middlesex UK

⁸⁴²Department of Biochemistry, Genetics and Immunology, University of Vigo, Vigo, Spain

⁸⁴³Giovanni Paolo II / I.R.C.C.S. Cancer Institute, Bari, BA Italy

⁸⁴⁴Neuroblastoma Genomics, German Cancer Research Center (DKFZ), Heidelberg, Germany

⁸⁴⁵Fondazione Policlinico Universitario Gemelli IRCCS, Rome, Italy

⁸⁴⁶University of Verona, Verona, Italy

⁸⁴⁷Centre National de Génotypage, CEA - Institut de Génomique, Evry, France

⁸⁴⁸CAPHRI Research School, Maastricht University, Maastricht, The Netherlands

⁸⁴⁹Department of Biopathology, Centre Léon Bérard, Lyon, France

⁸⁵⁰Université Claude Bernard Lyon 1, Villeurbanne, France

⁸⁵¹Core Research for Evolutional Science and Technology (CREST), JST, Tokyo, Japan

⁸⁵²Department of Biological Sciences, Laboratory for Medical Science Mathematics, Graduate School of Science, University of Tokyo, Yokohama, Japan

⁸⁵³Department of Medical Science Mathematics, Medical Research Institute, Tokyo Medical and Dental University (TMDU), Tokyo, Japan

⁸⁵⁴Cancer Ageing and Somatic Mutation Programme, Wellcome Sanger Institute, Hinxton, UK

⁸⁵⁵University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK

⁸⁵⁶Centre for Cancer Research and Cell Biology, Queen’s University, Belfast, UK

⁸⁵⁷Breast Medical Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX USA

⁸⁵⁸Department of Surgery, Johns Hopkins University School of Medicine, Baltimore, MD USA

⁸⁵⁹Department of Oncology-Pathology, Science for Life Laboratory, Karolinska Institute, Stockholm, Sweden

⁸⁶⁰School of Cancer Sciences, Faculty of Medicine, University of Southampton, Southampton, UK

⁸⁶¹Department of Gene Technology, Tallinn University of Technology, Tallinn, Estonia

⁸⁶²Genetics and Genome Biology Program, SickKids Research Institute, The Hospital for Sick Children, Toronto, ON Canada

⁸⁶³Departments of Neurosurgery and Hematology and Medical Oncology, Winship Cancer Institute and School of Medicine, Emory University, Atlanta, GA USA

⁸⁶⁴Department of Clinical and Molecular Medicine, Faculty of Medicine and Health Sciences, Norwegian University of Science and Technology, Trondheim, Norway

⁸⁶⁵Argmix Consulting, North Vancouver, BC Canada

⁸⁶⁶Department of Information Technology, Ghent University, Interuniversitair Micro-Electronica Centrum (IMEC), Ghent, Belgium

⁸⁶⁷Nuffield Department of Surgical Sciences, John Radcliffe Hospital, University of Oxford, Oxford, UK

⁸⁶⁸Institute of Mathematics and Computer Science, University of Latvia, Riga, LV Latvia

⁸⁶⁹Discipline of Pathology, Sydney Medical School, University of Sydney, Sydney, NSW Australia

⁸⁷⁰Department of Applied Mathematics and Theoretical Physics, Centre for Mathematical Sciences, University of Cambridge, Cambridge, UK

⁸⁷¹Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY USA

⁸⁷²Department of Statistics, Columbia University, New York, NY USA

⁸⁷³Department of Immunology, Genetics and Pathology, Science for Life Laboratory, Uppsala University, Uppsala, Sweden

⁸⁷⁴School of Electronic and Information Engineering, Xi’an Jiaotong University, Xi’an, China

⁸⁷⁵Department of Histopathology, Cambridge University Hospitals NHS Foundation Trust, Cambridge, UK

⁸⁷⁶Oxford NIHR Biomedical Research Centre, University of Oxford, Oxford, UK

⁸⁷⁷Georgia Regents University Cancer Center, Augusta, GA USA

⁸⁷⁸Wythenshawe Hospital, Manchester, UK

⁸⁷⁹Department of Genetics, Washington University School of Medicine, St. Louis, MO USA

⁸⁸⁰Department of Biological Oceanography, Leibniz Institute of Baltic Sea Research, Rostock, Germany

⁸⁸¹Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK

⁸⁸²Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX USA

⁸⁸³Thoracic Oncology Laboratory, Mayo Clinic, Rochester, MN USA

⁸⁸⁴Institute for Genomic Medicine, Nationwide Children’s Hospital, Columbus, OH USA

⁸⁸⁵Department of Obstetrics and Gynecology, Division of Gynecologic Oncology, Mayo Clinic, Rochester, MN USA

⁸⁸⁶International Institute for Molecular Oncology, Poznań, Poland

⁸⁸⁷Poznan University of Medical Sciences, Poznań, Poland

⁸⁸⁸Genomics and Proteomics Core Facility High Throughput Sequencing Unit, German Cancer Research Center (DKFZ), Heidelberg, Germany

⁸⁸⁹NCCS-VARI Translational Research Laboratory, National Cancer Centre Singapore, Singapore, Singapore

⁸⁹⁰Edison Family Center for Genome Sciences and Systems Biology, Washington University, St. Louis, MO USA

⁸⁹¹MRC-University of Glasgow Centre for Virus Research, Glasgow, UK

⁸⁹²Department of Medical Informatics and Clinical Epidemiology, Division of Bioinformatics and Computational Biology, OHSU Knight Cancer Institute, Oregon Health and Science University, Portland, OR USA

⁸⁹³School of Electronic Information and Communications, Huazhong University of Science and Technology, Wuhan, China

⁸⁹⁴Department of Applied Mathematics and Statistics, Johns Hopkins University, Baltimore, MD USA

⁸⁹⁵Department of Cancer Genome Informatics, Graduate School of Medicine, Osaka University, Osaka, Japan

⁸⁹⁶Institute of Computer Science, Heidelberg University, Heidelberg, Germany

⁸⁹⁷School of Mathematics and Statistics, University of Sydney, Sydney, NSW Australia

⁸⁹⁸Ben May Department for Cancer Research, University of Chicago, Chicago, IL USA

⁸⁹⁹Department of Human Genetics, University of Chicago, Chicago, IL USA

⁹⁰⁰Tri-Institutional PhD Program in Computational Biology and Medicine, Weill Cornell Medicine, New York, NY USA

⁹⁰¹The First Affiliated Hospital, Xi’an Jiaotong University, Xi’an, China

⁹⁰²Department of Medicine and Therapeutics, The Chinese University of Hong Kong, Shatin, NT, Hong Kong China

⁹⁰³Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, TX USA

⁹⁰⁴Duke-NUS Medical School, Singapore, Singapore

⁹⁰⁵Department of Surgery, Ruijin Hospital, Shanghai Jiaotong University School of Medicine, Shanghai, China

⁹⁰⁶School of Computing Science, University of Glasgow, Glasgow, UK

⁹⁰⁷Division of Orthopaedic Surgery, Oslo University Hospital, Oslo, Norway

⁹⁰⁸Eastern Clinical School, Monash University, Melbourne, VIC Australia

⁹⁰⁹Epworth HealthCare, Richmond, VIC Australia

⁹¹⁰Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute and Harvard Medical School, Boston, MA USA

⁹¹¹Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH USA

⁹¹²The Ohio State University Comprehensive Cancer Center (OSUCCC – James), Columbus, OH USA

⁹¹³The University of Texas School of Biomedical Informatics (SBMI) at Houston, Houston, TX USA

⁹¹⁴Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC USA

⁹¹⁵Department of Biochemistry and Molecular Genetics, Feinberg School of Medicine, Northwestern University, Chicago, IL USA

⁹¹⁶Faculty of Medicine and Health, University of Sydney, Sydney, NSW Australia

⁹¹⁷Department of Pathology, Erasmus Medical Center Rotterdam, Rotterdam, GD The Netherlands

⁹¹⁸Division of Molecular Carcinogenesis, The Netherlands Cancer Institute, Amsterdam, CX The Netherlands

⁹¹⁹Institute of Molecular Life Sciences and Swiss Institute of Bioinformatics, University of Zurich, Zurich, Switzerland

^✉ Corresponding author.

^# Contributed equally.

PMCID: PMC7025897 EMSID: EMS84967 PMID: 32025012

Abstract

A key mutational process in cancer is structural variation, in which rearrangements delete, amplify or reorder genomic segments that range in size from kilobases to whole chromosomes^1–7. Here we develop methods to group, classify and describe somatic structural variants, using data from the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA), which aggregated whole-genome sequencing data from 2,658 cancers across 38 tumour types⁸. Sixteen signatures of structural variation emerged. Deletions have a multimodal size distribution, assort unevenly across tumour types and patients, are enriched in late-replicating regions and correlate with inversions. Tandem duplications also have a multimodal size distribution, but are enriched in early-replicating regions—as are unbalanced translocations. Replication-based mechanisms of rearrangement generate varied chromosomal structures with low-level copy-number gains and frequent inverted rearrangements. One prominent structure consists of 2–7 templates copied from distinct regions of the genome strung together within one locus. Such cycles of templated insertions correlate with tandem duplications, and—in liver cancer—frequently activate the telomerase gene TERT. A wide variety of rearrangement processes are active in cancer, which generate complex configurations of the genome upon which selection can act.

Subject terms: Cancer genomics, Genomic instability

Whole-genome sequencing data from more than 2,500 cancers of 38 tumour types reveal 16 signatures that can be used to classify somatic structural variants, highlighting the diversity of genomic rearrangements in cancer.

Main

Mutations that arise in somatic cells are the driving force of cancer development. Structural variation—in which genomic rearrangement acts to amplify, delete or reorder chromosomal material at scales that range from single genes to entire chromosomes—is an especially important class of somatic mutation. Previous analyses of both cancer and germline genomes have enabled the description of several distinctive patterns of structural variants^1–7, and hypotheses about the underlying basis of several of these patterns have been proposed on the basis of their clustering, orientation and associated copy-number changes. Hypothesis-driven in vitro studies are now beginning to reveal some of the mechanistic processes that generate these structures^9–13, and generate further predictions that can be assessed in the genomic data. However, the landscape of structural variation in human cancer remains incompletely mapped and there are many complex structures that elude formal description.

The PCAWG Consortium aggregated whole-genome sequencing data from 2,658 cancers across 38 tumour types, generated by the ICGC and TCGA projects. These sequencing data were aligned to the human genome (reference build hs37d5) and analysed with standardized, high-accuracy pipelines to call somatic and germline variants of all classes⁸. Here, we analyse the patterns and signatures of structural variants across the PCAWG data. We propose a working classification scheme that encompasses known and newly identified classes of structural variants. We develop methods for annotating the observed structural variants in a given cancer genome, identifying a class of replication-based rearrangement processes that generate clusters of several structural variants. We explore the size, activity and genome-wide distribution of classifiable structural variant types across the cohort, using signature analysis to define how they correlate within patients. Other papers produced by PCAWG address complementary aspects of structural variants, including inference of positive selection acting on recurrently rearranged regions of the genome¹⁴, how structural variants affect the transcriptome¹⁵ and chromosome topology¹⁶, patterns of somatic retrotransposition¹⁷ and distribution of chromothripsis across cancer types¹⁸.

Classification of structural variants

A ‘structural variant’ manifests as a ‘junction’ between two ‘breakpoints’ in the genome (terms in inverted commas here and below refer to those defined in the glossary in Extended Data Table 1). Generally, there will be a change in copy number across a given breakpoint if only one side of the break is rescued by a structural variant; if both sides of a double-stranded DNA break are rescued, a ‘reciprocal’ or ‘balanced’ structural variant will result, without substantial copy-number change. We sometimes observe ‘clusters of structural variants’ in which several breakpoints occur close together, in time or in genomic space—usually both. Such spatial and/or temporal proximity generally, but not always, implies that the structural variants within a cluster are mechanistically linked. Clusters can be ‘phased’ (in which case all structural variants in the cluster resolve to a single derivative chromosome) or ‘unphased’, in which case the structural variants are carried on different derivative chromosomes. An example of the latter is a reciprocal translocation that results in two derivative chromosomes, each with a single interchromosomal breakpoint junction (Fig. 1).

Extended Data Table 1.

Glossary of key terms

Fig. 1 — Schematics of major structural-variant (SV) classes, grouped according to whether they are simple or complex and arise through cut-and-paste or copy-and-paste processes. Each schematic comprises three parts. The top segment shows dotted arcs for each rearrangement junction that joins two chromosomal segments together. The middle segment shows the copy number of genomic segments that are involved. The bottom segment shows the configuration of the final derivative chromosome that results from the structural variant; the colour of the segments corresponds to the colour of that segment in the copy-number schematic. + indicates the different derivative chromosomes created for some of the classes: that is, the structural variants are not phased to a single derivative.

We recognize distinct ‘classes of structural variant’ from the orientation of the two segments at the junction and associated copy-number changes (Fig. 1, Supplementary Fig. 1). Some classes of structural variant (such as isochromosomes and rearrangements between extended, highly homologous sequences) are difficult to detect with short-read sequencing data; these classes are not considered further here. We propose categorizing classes of structural variant across two facets: the number of breakpoints involved (simple or complex) and whether the patterns are likely to arise from ‘cut-and-paste’ or ‘copy-and-paste’ rearrangement processes. A cut-and-paste process generates a cluster of structural variants consistent with reshuffling or loss of extant genomic segments, and a copy-and-paste process is one in which copies of genomic ‘templates’ are newly replicated or synthesized and inserted during the rearrangement process. Deletions, reciprocal inversions, unbalanced translocations and reciprocal translocations are examples of simple cut-and-paste structural variants, as they can be reconstructed from the incorrect religation of chromosomal breaks. Tandem duplications are simple copy-and-paste structural variants, as they arise through the local insertion of a newly generated extra copy of a genomic template.
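As an illustration of how junction orientation relates to these classes, the following minimal sketch assigns an orientation type to a single breakpoint junction. The strand convention (a '+' breakpoint retains the sequence to its left, a '-' breakpoint retains the sequence to its right), the `Junction` record and the function name are assumptions made for this example, not the notation used in the PCAWG pipelines. The result is only an orientation type: as discussed in the next section, a junction with deletion-type orientation inside a complex cluster is not necessarily a true deletion.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Junction:
    """One breakpoint junction (hypothetical record for illustration)."""
    chrom1: str
    pos1: int
    strand1: str  # '+': sequence left of pos1 is retained (assumed convention)
    chrom2: str
    pos2: int
    strand2: str  # '-': sequence right of pos2 is retained (assumed convention)


def orientation_type(j: Junction) -> str:
    """Return the orientation type implied by a single junction."""
    if j.chrom1 != j.chrom2:
        return "translocation-type"
    # Order the two breakpoints by position so the test is symmetric.
    (_, s_low), (_, s_high) = sorted([(j.pos1, j.strand1), (j.pos2, j.strand2)])
    if s_low == "+" and s_high == "-":
        return "deletion-type"
    if s_low == "-" and s_high == "+":
        return "tandem-duplication-type"
    return "inversion-type"  # '+/+' (head-to-head) or '-/-' (tail-to-tail)


if __name__ == "__main__":
    print(orientation_type(Junction("1", 100, "+", "1", 500, "-")))
```

Copy-number context and neighbouring junctions would still be needed to decide whether such a junction is a simple event or part of a cluster.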

More-complex cut-and-paste processes that produce structural variants also occur in cancer. ‘Breakage–fusion–bridge’ events result from cycles of DNA breakage, end-to-end sister chromatid fusions, mitotic bridges and further DNA breakage. These events manifest as one or a few proximate, inverted breakpoint junctions with associated copy-number change, which we call ‘fold-back inversions’^1,2,19 (Fig. 1). ‘Chromoplexy’^5,20—which is particularly frequent in prostate cancers—results from several simultaneous double-stranded DNA breaks in several chromosomes that are rejoined incorrectly, leading to balanced chains of rearrangements. ‘Chromothripsis’³, in which chromosome shattering and rearrangement occur in a single catastrophic event^9,21, leads to a pattern of oscillating copy-number changes and localized clustering of tens to hundreds of breakpoints²².

In the germline, more-complex copy-and-paste classes of structural variant have previously been described, which involve small duplications and triplications and are thought to arise from the stalling of the replication fork leading to template switching^4,23,24. Here we describe a wide range of complex copy-and-paste types of somatic structural variant that occur in human cancers, and that are typically characterized by copy-number gains and frequent inverted rearrangements.

Annotation of structural-variant classes

We analysed 2,559 whole cancer genomes across 38 tumour types (alongside matched germline DNA) that passed the most stringent PCAWG quality-control criteria: 1 or more somatic structural variants were detected in 2,429 tumours⁸. As described in an accompanying Article⁸, structural variants were identified using aberrantly mapping and/or split reads in paired-end sequencing data²⁵. We used four somatic structural-variant callers^20,25–27, and the final structural-variant dataset comprised events that were returned by ≥2 callers, merged by a graph-based consensus method⁸. We consider only somatically acquired structural variants in this analysis, and exclude somatic retrotransposition events. Validation of structural-variant calls was undertaken using both manual inspection and pull-down with resequencing of breakpoints. With these approaches, we estimate the sensitivity of the consensus structural-variant call set to be 90% for true calls generated by any 1 of the 4 callers; specificity was estimated as 97.5%⁸. A mean of 3.22 algorithms of the 4 that we used called each structural variant in the consensus set genome-wide, and this differed little across repetitive elements: the mean for short interspersed nuclear elements was 3.22, and the mean for long interspersed nuclear elements was 3.21.
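The idea of retaining events returned by at least two callers after a graph-based merge can be sketched as follows. This is a simplified illustration, not the PCAWG consensus method itself: the 200-bp matching tolerance, the dictionary fields and the caller labels are all assumptions for the example. Calls are nodes, two calls are linked when their breakpoints and orientations agree within the tolerance, and each connected component supported by enough distinct callers is kept.

```python
from itertools import combinations


def consensus_calls(calls, tol=200, min_callers=2):
    """Group compatible calls and keep groups backed by >= min_callers callers.

    calls: list of dicts with keys caller, chrom1, pos1, strand1,
           chrom2, pos2, strand2 (hypothetical schema).
    tol:   maximum breakpoint distance in bp for two calls to match (assumed).
    """
    parent = list(range(len(calls)))

    def find(i):
        # Union-find root lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    def compatible(a, b):
        return (
            a["chrom1"] == b["chrom1"] and a["chrom2"] == b["chrom2"]
            and a["strand1"] == b["strand1"] and a["strand2"] == b["strand2"]
            and abs(a["pos1"] - b["pos1"]) <= tol
            and abs(a["pos2"] - b["pos2"]) <= tol
        )

    # Add an edge between every pair of compatible calls.
    for i, j in combinations(range(len(calls)), 2):
        if compatible(calls[i], calls[j]):
            parent[find(i)] = find(j)

    # Collect connected components.
    groups = {}
    for i, call in enumerate(calls):
        groups.setdefault(find(i), []).append(call)

    return [g for g in groups.values()
            if len({c["caller"] for c in g}) >= min_callers]
```

Counting distinct callers per component, rather than calls, prevents a single caller that reports the same junction twice from satisfying the threshold on its own.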

Because the structural variants from a given cancer are often highly clustered, we grouped rearrangements into clusters on the basis of the proximity of breakpoints, the overall number of events in that genome and the size distribution of these events (Supplementary Methods). Essentially, a particular cluster contains structural variants that are significantly closer together than expected by chance, given the overall number and orientation of structural variants in that patient. Alongside the clustering, we computed an in silico library of all possible genomic configurations that result from sequential simple structural variants (deletions, tandem duplications, inversions, translocations, and chromosome duplications or losses), to a depth of five rearrangements. We could then compare the genomic configuration of each observed cluster of structural variants against the library to determine how it might have arisen.
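The library of configurations can be pictured with a much-reduced sketch. Here a derivative chromosome is a tuple of signed integers (segment identifiers, negative when inverted), and a breadth-first search applies deletions, tandem duplications and inversions up to a chosen depth. This is an assumption-laden simplification for illustration: it handles one chromosome only, omits translocations and whole-chromosome gains or losses, and the function names are hypothetical.

```python
def simple_rearrangements(config):
    """Yield every configuration one simple event away from `config`.

    config: tuple of signed ints; e.g. (1, 2, 3) is the reference order and
    -2 denotes segment 2 in inverted orientation.
    """
    n = len(config)
    for i in range(n):
        for j in range(i + 1, n + 1):
            seg = config[i:j]
            if j - i < n:  # do not delete the entire chromosome
                yield config[:i] + config[j:]                      # deletion
            yield config[:j] + seg + config[j:]                    # tandem duplication
            yield config[:i] + tuple(-s for s in reversed(seg)) + config[j:]  # inversion


def build_library(reference, depth):
    """Map each reachable configuration to the minimum number of events."""
    library = {reference: 0}
    frontier = {reference}
    for d in range(1, depth + 1):
        next_frontier = set()
        for cfg in frontier:
            for new in simple_rearrangements(cfg):
                if new not in library:  # first visit gives the shortest route
                    library[new] = d
                    next_frontier.add(new)
        frontier = next_frontier
    return library


if __name__ == "__main__":
    lib = build_library((1, 2, 3), depth=2)
    # Minimum number of simple events needed for an observed configuration,
    # or None if it is not reachable within the search depth.
    print(lib.get((1, 2, -2, 3)))
```

An observed cluster would then be looked up in the library to see whether, and in how few steps, sequential simple events could produce it. The number of configurations grows quickly with depth and segment count, so a practical library would need pruning or canonicalization that this sketch does not attempt.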

This methodology has the advantage that breakpoint junctions are classified according to the wider genomic context in which they occur. This means that, for example, true deletions will be identifiably different from breakpoint junctions that happen to have a deletion-type orientation but arise within (for instance) a chromothripsis event of markedly different mechanism and properties. Over half the breakpoint junctions that we observed arise within clusters of several or many structural variants (Fig. 2a): removing these junctions from the catalogues of true deletions, tandem duplications and inversions enables a more-precise description of the properties of simple structural variants.

Fig. 2 — a, Violin plots of density of classified structural-variant categories across patients within each histology group. Tumour type panels are sorted in descending order of the average number of structural-variant breakpoints per sample. Within each tumour type, the frequency distribution (y axis) of different structural-variant categories (x axis) across patients is shown as a density: regions of highest density have the greatest width of shaded area. In each panel, the number of patients is indicated at the top right. AdenoCA, adenocarcinoma; BNHL, B-cell non-Hodgkin lymphoma; ChRCC, chromophobe renal cell carcinoma; CLL, chronic lymphocytic leukaemia; CNS, central nervous system; GBM, glioblastoma; HCC, hepatocellular carcinoma; leiomyo, leiomyosarcoma; medullo, medulloblastoma; MPN, myeloproliferative neoplasm; eso, oesophageal; oligo, oligodendrocytic; panc, pancreatic; piloastro, pilocytic astrocytoma; prost, prostate; RCC, renal cell carcinoma; sarc, sarcoma; SCC, squamous cell carcinoma; TCC, transitional cell carcinoma; thy, thyroid. b, Per-sample counts of complex (bottom) and classified (top) structural-variant breakpoint junctions for oesophageal adenocarcinoma. c, Per-sample counts of complex (bottom) and classified (top) structural-variant breakpoint junctions for ovarian adenocarcinoma.

Among the classes of simple structural variants, deletion was the most common, followed by tandem duplication and then unbalanced translocation. Reciprocal translocations and reciprocal inversions were uncommon events (Fig. 2a). There was considerable variability in the overall numbers and distribution of classes of structural variant across tumour types and across patients within a given tumour type (Extended Data Fig. 1). For example, oesophageal adenocarcinomas were characterized by many deletions and a large number of complex clustered rearrangements (Fig. 2b), and ovarian cancers often carried high numbers of tandem duplications and/or deletions with moderate numbers of unbalanced translocations (Fig. 2c).

Extended Data Fig. 1 — Counts of simple, classified structural variants are shown above the x axis and counts of complex breakpoint junctions below the x axis. Patients within each tumour type are ranked by frequency of simple structural variants.

Cycles of templated insertions

We next examined clusters that contain 2–10 structural variants. One newly identified configuration consisted of several segments of copy-number gains, typically on different reference chromosomes, linked together through structural variants (Fig. 3, Extended Data Fig. 2). A sequential path through consecutive segments can be formed by following the breakpoint junctions, which suggests that each cluster represents a string of duplicated templates inserted into a single derivative chromosome, probably acquired concurrently. Although it is theoretically possible that the structural variants in such clusters are not phased on the same derivative chromosome or do not occur concurrently, we think this is unlikely for several reasons. First, we found examples of RNA transcripts that spliced together exons separated by two junctions in the structural-variant cluster (Supplementary Fig. 2), which suggests that they are phased on the same derivative chromosome. Second, long-read sequencing data (reported in an accompanying Article⁸) supported the phasing of structural variants that link templated insertions. Third, we found that the clonal fraction of tumour cells tended to be more similar for structural variants within these clusters than for randomly chosen structural variants in each patient (Supplementary Fig. 3), which suggests that they co-occur in evolutionary time. Fourth, the level of copy-number gain for individual segments in the cluster tended to be identical (Fig. 3, Extended Data Fig. 2).
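Following the breakpoint junctions to form a sequential path through the gained segments can be sketched as a simple walk. In this illustrative example each segment has a left end ('L') and a right end ('R'), each junction joins two segment ends, and the walk enters a segment at one end and leaves from the other. The data layout, segment labels and function name are assumptions for the example; real data would also require copy-number and orientation checks that are omitted here.

```python
def walk_templates(junctions, start):
    """Return the ordered, oriented segments visited from `start`.

    junctions: list of ((segment, side), (segment, side)) pairs, side in {'L', 'R'}.
    start:     (segment, side) end at which the path leaves the host chromosome.
    Each visited segment is reported with '+' if traversed left-to-right
    and '-' if traversed right-to-left.
    """
    link = {}
    for a, b in junctions:
        link[a] = b
        link[b] = a

    path = []
    seen = set()
    end = start
    while end in link and end not in seen:
        seen.add(end)
        seg, side = link[end]               # arrive at this end of the next segment
        seen.add((seg, side))
        path.append((seg, "+" if side == "L" else "-"))
        end = (seg, "R" if side == "L" else "L")  # traverse to the opposite end
    return path


if __name__ == "__main__":
    example = [
        (("host_left", "R"), ("A", "L")),    # leave the host, enter template A
        (("A", "R"), ("B", "R")),            # A joined to B in inverted orientation
        (("B", "L"), ("host_right", "L")),   # return to the host chromosome
    ]
    print(walk_templates(example, ("host_left", "R")))
```

A cluster for which such a walk visits every gained segment exactly once is consistent with a single string of inserted templates on one derivative chromosome.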

Fig. 3 — a–c, Examples of a typical cycle (a), chain (b) and bridge (c) of templated insertions. The estimated copy-number profile is shown as in Fig. 1, with structural variants shown as dotted arcs linking two copy-number segments. The derivative chromosome(s) that could explain the copy-number and structural-variant profile is shown below. d, e, Cycles of templated insertions that affect the *TERT* gene, in two hepatocellular carcinomas. *KIAA1024* is also known as *MINAR1*.

Extended Data Fig. 2 — Schematics follow the same structure as in Fig. 3.

We define three basic categories on the basis of whether or not the string of inserted segments returns to the original chromosome: we term strings of inserted segments that do not return ‘chains’ of templated insertions and those strings that do return ‘bridges’ (which leave a gap on the host chromosome) or ‘cycles’ (which rereplicate a segment on the host chromosome). In the PCAWG dataset overall, we observed 1,467 cycles and 1,275 bridges of templated insertions (Fig. 3a, c, Extended Data Fig. 2). In chains of templated insertions, the string of genomic segments does not return to the chromosome of departure (Fig. 3b, Extended Data Fig. 2) but it is similarly associated with copy-number gains at each templated segment. There were 285 instances of such chains in the dataset, commonly manifesting as unbalanced translocations joined through one or more intermediary templated insertions.
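The three categories can be expressed as a small decision rule. The sketch below is one possible reading of the definitions, under stated assumptions: the derivative chromosome is read left to right along the host, the string of templates leaves the host at `depart_pos`, and the host sequence resumes at `return_pos`. The function name and arguments are hypothetical, and host-segment orientation is ignored for simplicity.

```python
from typing import Optional


def classify_templated_insertion(
    host_chrom: str,
    depart_pos: int,
    return_chrom: Optional[str],
    return_pos: Optional[int],
) -> str:
    """Classify a string of templated insertions as chain, bridge or cycle.

    return_chrom / return_pos are None when the string never rejoins a
    chromosome of departure (assumed encoding).
    """
    if return_chrom is None or return_pos is None or return_chrom != host_chrom:
        # The string does not return to the chromosome of departure.
        return "chain"
    if return_pos > depart_pos:
        # Host sequence between depart_pos and return_pos is missing: a gap.
        return "bridge"
    # Host sequence between return_pos and depart_pos is present twice.
    return "cycle"


if __name__ == "__main__":
    print(classify_templated_insertion("5", 1_300_000, "5", 1_250_000))
```

Under this reading, the only difference between a bridge and a cycle is whether the return point lies beyond or before the departure point on the host chromosome.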

Most templated insertion events involve only two breakpoint junctions, but this can extend to three, four or more linked rearrangements (Extended Data Fig. 3a). The longest such event—from a cervical squamous cell cancer—had seven templated insertions strung together on an eighth host chromosome (Fig. 3c; other examples of long templated insertion events are shown in Extended Data Fig. 3).

Extended Data Fig. 3 — a, Histogram of numbers of breakpoint junctions in templated insertion cycles, chains and bridges across all samples in all tumour types in the cohort. b, c, Two examples of particularly long cycles of templated insertions in the cohort. Examples are depicted in a similar manner to those in Fig. 3.

Templated insertions that affect TERT

Structural variants drive tumour development through their effects on cancer genes, whether by altering gene copy number, disrupting tumour-suppressor genes, creating fusion genes or juxtaposing the coding sequence of one gene with the regulatory apparatus of another. We found that many liver cancers had cycles of templated insertions that affect TERT (Fig. 3d, e, Extended Data Fig. 4). Point mutations in the TERT promoter are present in 54% of liver cancers, and a further 5–10% of liver cancers have structural variants that activate the gene²⁸. Of the 30 patients with liver cancer that had structural variants that affect TERT, we find that 10 of these variants were templated insertion events (mostly cycles). All of these events duplicated the entire TERT gene and linked it to duplications of whole genes, fragments of genes or regulatory elements from elsewhere in the genome, and led to increased expression of TERT (Extended Data Fig. 4e). Thus, this particular rearrangement process is distinctive for the precision with which cancer copy-and-pastes normally disparate functional elements of its genome together without wholesale instability.

Extended Data Fig. 4 — a, The positions of all structural-variant breakpoints in the *TERT* region in the PCAWG cohort (including 50-kb flanks either side of *TERT*), coloured by classification and vertically spaced by the distance to the next breakpoint in the cohort. If the two sides of a breakpoint junction are contained within the plotting window, they are joined by a curved line. The number of samples with a breakpoint in the plotting window is annotated in the table in the top left. b–d, Examples of two cycles and a chain of templated insertions that affect *TERT* in hepatocellular carcinomas. e, Expression levels of *TERT* in patients with hepatocellular carcinoma (n = 187 patients), separated by whether *TERT* was wild type, had an activating promoter point mutation, structural variants in a templated insertion or other class. Individual patient data are shown as points. The box shows the median expression level as a thick black line, with the range of the box denoting the interquartile range. The whiskers show the range of data or 1.5× the interquartile range (whichever is lower).

Tumour-suppressor genes were also inactivated by templated insertions (Extended Data Fig. 5). For example, among many straightforward deletions, RB1 was hit by cycles of templated insertions, a templated insertion with deletion and one instance of the linked, inverted duplications detailed in ‘Local n-jumps and local–distant clusters’. These events typically generated duplications of internal exons in RB1 and/or insertions of exons from other genes, all of which presumably rendered a non-functional transcript.

Local n-jumps and local–distant clusters

Many clusters of 2–10 structural variants in the dataset were confined to a single genomic region. Of those clusters that comprised two local rearrangements, some had straightforward explanations, such as nested or adjacent tandem duplications. However, many did not have a trivial explanation (Fig. 4a). These included a duplication–inverted-triplication–duplication structure that has previously been observed in germline structural variants²⁴ (349 instances); a structure of two duplications linked by inverted rearrangements (531 instances); and structures of copy-number loss plus nearby duplication linked by inverted rearrangements (472 instances). All of these patterns had solutions in which breakpoints were phased to a single derivative chromosome (Fig. 4a), although non-phased solutions are theoretically possible (if unlikely). Beyond clusters of two rearrangements (two-jumps), we also found examples involving three, four or more rearrangements confined to one genomic locale (Fig. 4b). All of these configurations of clusters of structural variants can be phased to a single derivative chromosome, with tightly grouped breakpoints.

Fig. 4 — a, Structures created by two local rearrangements that cannot easily be explained by simple structural-variant classes (which we call local 2-jumps). The estimated copy-number profile is shown as in Fig. 1, with structural variants shown as dotted arcs linking two copy-number segments. Possible configurations of the derivative chromosome are shown below; multiple solutions are possible for each example. Dup, duplication; invDup, duplication linked by inverted rearrangement; trp, triplication. b, Structures created by 3–4 local rearrangements that cannot easily be explained by simple structural-variant categories. c, Structures created by one local rearrangement and one rearrangement that reaches elsewhere in the genome (local–distant clusters).

Beyond clusters confined to a single genomic region, we found clusters of 2–10 structural variants that combined local jumps with rearrangements that reach into one or more distant regions of the genome (Fig. 4c). Simple examples of these events include unbalanced translocations or large deletions with a locally derived fragment inserted at the breakpoint, but there was also an extensive range of more-complex patterns. In some cases, the source of the inserted fragment was distal to the major break, and the structural variant could feasibly result from several concurrent DNA breaks in close spatial proximity, with the capture of a short DNA fragment during repair (cut-and-paste). In other cases, the origin of the inserted fragment was proximal to the major break and associated with a gain in copy number. This pattern is difficult to explain by a cut-and-paste mechanism, because the copy-number gain implies the inserted segment was a duplicate of the original template rather than a separated fragment redistributed from its original locus. Instead, a copy-and-paste mechanism may be the more parsimonious explanation for these events.

A comparison of local footprints linked together through distant rearrangements revealed a strong connectivity of footprints with the same or similar structure, often enriched tenfold or more than expected by chance (see ‘Footprint connectivity analysis’ in Supplementary Results). The reasons for this are unclear, but it may reflect innate structural symmetry introduced through the generation or the resolution of rearrangements, or through the repeated action of a mechanism that imparts consistent structural motifs.

Copy-and-paste patterns of clusters

The diverse patterns of 2–10 clustered structural variants (Figs. 3, 4) share important morphological features: (1) genomic configurations that can be phased to a single derivative chromosome; (2) low-level gains in copy number, especially duplications and triplications; (3) a high frequency of inverted rearrangements in addition to noninverted rearrangements; (4) occurrence on a chromosome background with similar average copy number to the tumour overall; and (5) tight proximity of breakpoints within the local footprint (typically <1 Mb).

Using our in silico library of genomic configurations, we could define all possible routes by which sequential structural variants could generate these structures through the classically defined repertoire of deletion, tandem duplication, inversion and translocation (Supplementary Fig. 4). These routes typically would require implausible machinations of chromosomes (Supplementary Results). In particular, the high prevalence of inverted breakpoint junctions and local copy-number gains is difficult to recreate using sequential simple rearrangements. Simple inversion events are uncommon in cancers (Fig. 2a) and they tend not to generate copy-number gains, except through breakage–fusion–bridge cycles: these latter also cause terminal deletions², which are not seen in the events discussed here.

If these events cannot be satisfactorily explained by sequential simple rearrangements, another possible explanation is a complex cut-and-paste mechanism such as chromothripsis, chromoplexy or repeated breakage–fusion–bridge cycles. However, the patterns of the 2–10 clustered structural variants do not fit with these processes either (Supplementary Results). Although chromothripsis with copy-number gain has previously been described^3,11,19,22, the resulting copy number and rearrangement patterns have different properties to those we observed. Chromoplexy, in which chromosome breaks lead to a balanced interchange at multiple breakpoint junctions^5,20, typically generates unphased solutions. Repeated breakage–fusion–bridge cycles tend to cause high-level copy-number gains associated with inverted, fold-back rearrangements^1,2, unlike the structures reported here.

Instead, we believe that many of these locally complex clusters of structural variants with low-level copy-number gains are generated in a single event by a copy-and-paste process. That is, the copying of genomic templates is an intrinsic aspect of the structural variation process in these events, with the extra copies being inserted in the resulting derivative chromosome. If the genomic templates all originate locally, we would observe local n-jumps (such as in Fig. 3a, b) with a tight clustering of breakpoints, phased solutions, frequent copy-number gains and a mix of inverted and noninverted breakpoint junctions. If the original templates for the copied segments derive from across the genome, chains, cycles and bridges of templated insertions would arise (Fig. 2).

Genomic properties of structural variants

The size of tandem duplications and deletions followed complex—often multimodal—distributions across tumour types (Fig. 5a, Extended Data Fig. 6a). However, as previously reported^6,29, individual patients tend to have a simpler—usually unimodal—distribution of deletions or tandem duplications (Extended Data Fig. 6b), which implies that the complexity seen in a given tumour type results from combining samples with different profiles. The sizes of individual fragments in templated insertion events were also distinctly multimodal, with varying peak heights across tumour types (Fig. 5b). When correlating template sizes within a given event, two patterns emerged: one in which template sizes were closely correlated with one another, and one in which a small (<1 kb) template was linked with one of any size (Extended Data Fig. 7a, b). Likewise, the sizes of segments within a given local two-jump event showed moderately strong correlations with one another (Extended Data Fig. 7c).

Extended Data Fig. 7 — a, Comparison of the minimum and maximum templated-insert size for multi-insert cycles, chains and bridges of templated insertions. b, All events with three or more templated inserts, grouped by combination of insert sizes. c, Correlations (Pearson’s correlation coefficient) and raw sizes of individual genomic segments for reciprocal inversions and local two-jumps. Each individual event is shown as a line that links the size of the individual segments in that event. The sample sizes for each event class are shown in the labels for each panel.

A number of genomic properties (such as replication timing, transcriptional activity and chromatin state) influence the density of point mutations^30,31 and copy-number alterations³², but how this relates to individual classes of structural variant is unclear. From the literature, we compiled a library of the genome-wide distribution of 38 features including replication timing, GC content, repeat density, gene density and distance to G-quadruplex motifs, among others. Replication timing had the strongest association with the occurrence of structural variants; deletions are enriched in late-replicating regions, and tandem duplications and unbalanced translocations occur preferentially in early-replicating regions (Fig. 5c, Extended Data Fig. 8). For individual patients with high numbers of deletions or tandem duplications, we observed notable heterogeneity in the distribution of these structural variants according to replication timing: some had events that occurred predominantly in late-replicating regions, others had events that occurred exclusively in early-replicating regions, and in others events were distributed more evenly (Supplementary Fig. 5). Regions of active chromatin and increased gene density correlated positively with the rate of rearrangement.

Extended Data Fig. 8 — Associations between a subset of the genomic properties (rows) and classes of structural variant (columns). Each density curve represents the quantile distribution of the genomic property values at observed breakpoints, compared to random genome positions. Asterisks indicate significant departures from uniform quantiles after multiple hypothesis correction by the Benjamini–Yekutieli method on a one-sided Kolmogorov–Smirnov test, based on a sample size of 2,559 genomes containing structural variants: *false-discovery rate < 0.01, **false-discovery rate < 0.001, ***false-discovery rate < 10⁻⁶. Cells with significant property associations are shaded by the magnitude of the shift of the median observed quantile above (blue) or below (red) 0.5. The interpretation of each property from left to right is indicated by the axes to the right of the property label.

A structural variant requires DNA repair pathways to join two sequences together, and several repair mechanisms are available to somatic cells. Some require sequence homology between the two ends, and others can operate to join non-homologous sequences. As previously reported^2,25,33, we find across the PCAWG data that many structural variants do not have sequence homology at the breakpoint junction (Fig. 5d) and therefore arise through non-homologous end joining. Nonetheless, a sizable fraction of structural variants has more microhomology than expected by chance, with an apparently bimodal distribution of microhomology lengths. One set of structural variants has 2–7 bp of microhomology, probably generated by microhomology-mediated end joining, and a second set of structural variants has 10–30 bp of microhomology, probably generated through single-strand annealing or other forms of homologous recombination (including microhomology-mediated break-induced replication). Repetitive sequences in the genome, such as short and long interspersed nuclear elements, are the likely substrate of such structural variants, and we find enrichment for structural variants joining such elements (Fig. 5e, Supplementary Fig. 6).
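To make the notion of microhomology at a breakpoint junction concrete, the following minimal sketch counts the bases that are identical in both reference contexts around a junction, so that the exact breakpoint position is ambiguous over that stretch. It is an illustration under stated assumptions rather than the procedure used for Fig. 5d: the function names, the input representation (two reference strings and an index into each) and the handling of lengths outside the two ranges named above are all hypothetical.

```python
def microhomology_length(left_ref, left_end, right_ref, right_start):
    """Count bases shared by both reference contexts around a junction.

    The rearranged sequence is assumed to be
    left_ref[:left_end] + right_ref[right_start:].
    Any run of identical bases on either side of the cut in both
    references means the breakpoint could be placed anywhere in that run.
    """
    # Extend to the right of the cut while the two references agree.
    forward = 0
    while (left_end + forward < len(left_ref)
           and right_start + forward < len(right_ref)
           and left_ref[left_end + forward] == right_ref[right_start + forward]):
        forward += 1

    # Extend to the left of the cut while the two references agree.
    backward = 0
    while (left_end - 1 - backward >= 0
           and right_start - 1 - backward >= 0
           and left_ref[left_end - 1 - backward] == right_ref[right_start - 1 - backward]):
        backward += 1

    return forward + backward


def likely_repair_class(mh_len):
    """Map a microhomology length onto the ranges discussed in the text.

    Lengths outside those ranges (1, 8-9 or more than 30 bp) are left
    unassigned; that choice is an assumption of this sketch.
    """
    if mh_len == 0:
        return "no homology (consistent with non-homologous end joining)"
    if 2 <= mh_len <= 7:
        return "2-7 bp (consistent with microhomology-mediated end joining)"
    if 10 <= mh_len <= 30:
        return "10-30 bp (consistent with single-strand annealing or related pathways)"
    return "unassigned"
```

In practice the two contexts would be taken from the reference genome on the appropriate strands for each side of the junction; strand handling is omitted here for brevity.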

Signatures of structural variation

The heterogeneous spectrum of point mutations across cancers can be reconstructed from the differential action of a relatively limited repertoire of mutational processes, each with a characteristic signature³⁴. The differences across patients in the size distribution of tandem duplication and deletion—together with the widely varying frequency and patterns of structural variant across tumour types and genome topology—suggested that we could similarly learn such correlations across individual classes of structural variant.

We divided the set of structural variants of each patient into mutually exclusive categories. We split the most frequent classes of simple structural variant (deletions and tandem duplications) into 11 categories according to size, replication timing and occurrence at fragile sites. Other configurations of structural variants and copy-number changes seen more than 50 times in the cohort were included as further categories, including cycles, chains and bridges of templated insertions (also split by size), local n-jumps and local–distant clusters.

We applied two methods for signature discovery, which yielded comparable results. We identified 16 structural-variant signatures: the 12 most prevalent of these signatures are shown in Fig. 6a. Signature extraction on the cohort randomly split into two halves identified ten highly correlated signatures (Supplementary Fig. 7), which closely matched the signatures called in the full cohort despite the lower power. Three signatures of deletions emerged, split by size: the signature of small (<50-kb) deletions included small reciprocal inversions and the signature of large (>500-kb) deletions included large reciprocal inversions. This implies that the frequencies of deletions and reciprocal inversions are correlated across the cohort, and both follow similar size distributions within an individual patient.

Fig. 6 — a, The 12 most distinctive structural-variant signatures extracted by the Bayesian hierarchical Dirichlet process algorithm, run on a sample size of 2,559 genomes containing structural variants. Here the lengths of the bars represent the estimated proportion of each event class assigned to each signature (rows sum to one); the black line segments represent the 95% posterior interval for bar length from the Markov chain. FB, fold-back; mid, mid-sized. b, Association of pathogenic mutations (germline and somatic combined) in key DNA repair genes with structural-variant signatures. The sample size of patients who have pathogenic variants in the specific genes assessed is shown in brackets after each gene label (y axis). Hypothesis tests and effect sizes for each gene are derived from linear models for signature intensity after correction for histology. Significant associations from two-sided tests with correction for multiple hypothesis testing are shown. The colour and size of the points represent the estimated effect sizes. MSH refers to *MSH2*, *MSH3*, *MSH4* and *MSH6*, genes in the mismatch repair pathway; FANC refers to genes associated with Fanconi anaemia, namely *FANCA*, *FANCC*, *FANCD2*, *FANCE*, *FANCF*, *FANCG*, *FANCI*, *FANCL* and *FANCM*.

We identified five signatures of tandem duplications, split by size and replication timing. Cycles, bridges and chains of templated insertions were particularly prominent in signatures of early-replicating tandem duplications, whereas local two-jump structures were more closely associated with late-replicating tandem duplications. All of these patterns exemplify the copy-and-paste concept, in which extra copies of genomic templates are produced and inserted as an integral feature of the structural-variant process.

Another signature was characterized by deletions and tandem duplications at chromosomal fragile sites³⁵. Tandem duplications were more prominent at the edges of the fragile site, and deletions were concentrated in the centre (Extended Data Fig. 9a, b). The size range of fragile site deletions peaked at around 100 kb, similar to the larger deletion signature, whereas the rarer fragile-site tandem duplications showed no strong size peak (Extended Data Fig. 9c). Sites of fragility varied extensively across tumour types (Extended Data Fig. 9d).

Extended Data Fig. 9 — a, Structural-variant breakpoints in the most affected fragile sites: *FHIT*, *MACROD2* and *WWOX*. These are coloured by classification and vertically spaced by the distance to the next breakpoint in the cohort. If the two sides of a breakpoint junction are contained within the plotting window, they are joined by a curved line. The number of samples with a breakpoint in the plotting window is annotated in the tables at the top left. b, Number of deletions and tandem duplications (top) and number of affected samples (bottom) for the 18 fragile sites considered in this analysis. c, Size distribution of deletions and tandem duplications in fragile sites (FS) compared to the rest of the genome. d, Fragile-site preference for 20 cancer histology groups as indicated by the proportion of samples that contains a deletion in each of the 18 fragile sites considered here. The number of samples is indicated in parentheses.

Unbalanced translocations comprised their own signature, which suggests that they derive from a distinct rearrangement process in cancer genomes. A further signature comprised both the fold-back inversions that are a hallmark of breakage–fusion–bridge cycles and similar structures such as translocations adjacent to fold-back inversions. Finally, there was a signature of balanced rearrangements, including reciprocal translocations and chromoplexy clusters⁵. This signature probably arises from several double-stranded DNA breaks (potentially occurring in interphase), in which both sides of the break are incorrectly repaired through ligation to other, simultaneously broken regions of the genome.

DNA repair genes and tumour type

We grouped annotations of pathogenic germline variants and somatic driver mutations in DNA-repair genes across the cohort⁸, correlating their presence with activity of the structural-variant signatures (Fig. 6b). As previously described for breast and ovarian cancers^6,29, BRCA1 mutations are significantly associated with small tandem duplication signatures, the mechanistic basis of which is increasingly well understood¹⁰. As previously described^6,36, CDK12 variants predicted signatures of mid-sized-to-large tandem duplications. BRCA2 variants correlated with small deletions, as expected from previous work²⁹, and also with the reciprocal structural-variant signature that includes chromoplexy. PALB2 variants showed the same correlations with signatures of small deletions and reciprocal structural variants as does BRCA2: PALB2 colocalizes with, stabilizes and assists BRCA2 during homologous recombination³⁷, so we might have predicted that inactivation of either gene would lead to a similar structural-variant signature. These associations between driver mutations and structural-variant signatures were consistently evident across many types of tumour (Extended Data Fig. 10).

Extended Data Fig. 10 — a, Box-and-whisker plots showing the number of structural variants attributed to the small-deletion signature in different types of tumour, split by *BRCA2* status (*BRCA2* wild type in orange; *BRCA2* mutant in cyan). The box denotes the interquartile range, with the median marked as a horizontal line. The whiskers extend as far as the range or 1.5× the interquartile range, whichever is lower. Outlier patients are shown as points. There is an increase in events attributed to the small-deletion signature when *BRCA2* is mutated, across multiple types of tumour (breast, pancreatic, ovarian, prostate, lung squamous and so on). b, Box-and-whisker plots as for a, showing the number of structural variants attributed to the small-deletion signature in different types of tumour, split by *PALB2* status. c, Box-and-whisker plots as for a, showing the number of structural variants attributed to the early-replicating, small-tandem-duplication signature in different types of tumour, split by *BRCA1* status. d, Box-and-whisker plots as for a, showing the number of structural variants attributed to the large-tandem-duplication signature in different types of tumour, split by *CDK12* status.

The structural-variant signatures showed considerable heterogeneity in their activity across tumour types and among patients within a given tumour type (Supplementary Fig. 8). Tumours of the gastrointestinal tract—including colorectal and oesophageal adenocarcinomas—showed high rates of the fragile-site signature. Prostate cancer was notable for the prevalence of the chromoplexy signature, as previously reported^5,20, and squamous cell carcinomas of the lung were characterized by the fold-back inversion signature.

We assessed how classes of structural variant altered known cancer genes (Supplementary Table 1). Some cancer genes acquire oncogenic potential only with specific structural events, such as fusion genes or enhancer hijacking. Not surprisingly, these genes typically showed little variability in which classes of structural variant could generate such events (Extended Data Fig. 11a–c)—although there were exceptions. The TMPRSS2-ERG fusion gene of prostate cancer, for example, was generated by a range of processes (including simple deletions, chromoplexy and chromothripsis), all of which are prevalent signatures in this tumour type (Extended Data Fig. 11d–f).

Extended Data Fig. 11 — a, Rainfall plot of structural-variant breakpoints in the genes *KIAA1549* and *BRAF*, commonly fused together through a tandem duplication in pilocytic astrocytomas. Structural variants are coloured by classification and arranged vertically by the distance to the next breakpoint in the cohort. If the two sides of a breakpoint junction are contained within the plotting window, they are joined by a curved line. The number of samples with a breakpoint in the plotting window is annotated in the table at the top of each panel. b, Rainfall plot of structural-variant breakpoints that affect *RET*, commonly fused to *CCDC6* by inversion in papillary thyroid cancer. c, Rainfall plot of structural-variant breakpoints that affect *BCL2*, commonly hijacked to the *IGH* immunoglobulin locus by translocations in B cell lymphomas. d, Rainfall plot of structural-variant breakpoints that affect *ERG*, commonly fused with *TMPRSS2* by deletion or more-complex events in prostate adenocarcinoma. e, Example of a *TMPRSS2-ERG* fusion gene in a prostate adenocarcinoma created by a chromoplexy cycle. The estimated copy-number profile is shown as black horizontal segments, with structural variants shown as dotted arcs linking the edges of two copy-number segments. f, Example of a *TMPRSS2-ERG* fusion gene in a prostate adenocarcinoma created by chromothripsis.

Tumour-suppressor genes and recurrently amplified genes showed more variability in which types of structural variant were observed, and these were shaped by signatures active in the relevant tumour types. For example, the tumour-suppressor genes, PTEN and RAD51B, which are commonly inactivated in breast and ovarian cancers, were often targeted by tandem duplications generating out-of-frame exon duplications (Extended Data Fig. 12a, b). By contrast, deletions were the predominant events that inactivated SMAD4 and CDKN2A, in keeping with their prevalence in cancers of the gastrointestinal tract (Extended Data Fig. 12c, d). MYC, one of the most commonly amplified genes across all types of cancer, showed considerable diversity in the mechanisms of its rearrangement: nested tandem duplications in breast cancer, translocations or chromoplexy with IGH in lymphoma, as well as chromothripsis, cycles of templated insertions, local n-jumps and local–distant clusters in other types of tumour (Extended Data Fig. 13).

Extended Data Fig. 12 — a, Rainfall plot of structural-variant breakpoints in the gene *PTEN*, commonly inactivated in breast and ovarian adenocarcinomas, in which tandem-duplication signatures are frequent. Structural variants are coloured by classification and arranged vertically by the distance to the next breakpoint in the cohort. If the two sides of a breakpoint junction are contained within the plotting window, they are joined by a curved line. The number of samples with a breakpoint in the plotting window is annotated in the table at the top of each panel. b, Rainfall plot of structural-variant breakpoints that affect *RAD51B*, commonly inactivated in breast and ovarian adenocarcinomas. c, Rainfall plot of structural-variant breakpoints that affect *CDKN2A*, commonly inactivated in tumours of the gastrointestinal tract, in which deletion signatures are common. d, Rainfall plot of structural-variant breakpoints that affect *SMAD4*, commonly inactivated in tumours of the gastrointestinal tract.

Extended Data Fig. 13 — The estimated copy-number profile is shown as black horizontal segments, with structural variants shown as dotted arcs linking the edges of two copy-number segments.

Discussion

We have described the patterns and signatures of structural variation in a large cohort of uniformly analysed cancer genomes. A major grouping of patterns in structural variants that emerges from our study is one in which extra copies of genomic templates are inserted during the rearrangement process. This includes simple events such as tandem duplications, as well as a range of more-complex events with duplications and triplications that are rearranged locally as well as inserted distantly. Our signature analysis grouped a large proportion of these more-complex events together with tandem duplications, which suggests that they represent a continuum of processes that share underlying properties. A replication-based mechanism has previously been proposed to explain local two-jumps^4,23,24, in which stalled replication forks or other DNA lesions cause the DNA polymerase to switch templates and continue replication in a new location. Studies in experimental models are now revealing that a wide range of mechanisms and DNA lesions can result in templated insertions: these mechanisms include tandem duplications in BRCA1 deficiency¹⁰, translocations with templated insertions caused by dysregulated strand invasion³⁸ and distant templated insertions in the absence of replication helicases³⁹.

Genomic instability in cancer is not a single phenomenon. Instead, many different mutational processes can act to restructure the genome and, in doing so, generate a notably flexible array of possible structures. Any given tumour draws on a subset of the available processes, shaped by the cell of origin, germline predisposition and other, unknown, factors: selection then does the rest, promoting the clone that has chanced on the structure that increases its potential for self-determination.

Methods

No statistical methods were used to predetermine sample size. The experiments were not randomized and investigators were not blinded to allocation during experiments and outcome assessment.

The methods used in this paper and many additional results are described in detail in the Supplementary Information. Here, we summarize the key aspects of the analysis.

Generation of the structural-variant call set

The final set of structural variants used in this Article was generated by the Technical Working Group of the PCAWG Consortium and is described in the main PCAWG paper⁸. In brief, four variant callers were used to identify somatically acquired structural variants from matched tumour and germline whole genome sequencing data: SvABA (Broad pipeline), DELLY (DKFZ pipeline), BRASS (Sanger pipeline) and dRanger (Broad pipeline). These were merged into a final call set using a graph-based algorithm to identify overlapping breakpoint junctions across algorithms. Detailed visual inspection of structural-variant calls suggested that a simple approach of accepting all structural-variant calls made by two or more of the four algorithms gave the best trade-off between sensitivity and specificity.
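The two-of-four consensus rule can be sketched as a small graph problem: calls are nodes, an edge joins two calls from different algorithms that describe the same breakpoint junction, and each connected component supported by at least two algorithms is kept. The code below is a simplified illustration and not the PCAWG merging implementation; the record layout, the function name `consensus_calls` and the 200-bp matching tolerance are assumptions made for the example.

```python
from collections import defaultdict


def consensus_calls(calls, tol=200, min_callers=2):
    """Group calls describing the same junction and keep well-supported groups.

    Each call is a dict with keys: caller, chrom1, pos1, strand1,
    chrom2, pos2, strand2 (a hypothetical layout). `tol` is an assumed
    tolerance in base pairs, not a value taken from the paper.
    """
    parent = list(range(len(calls)))

    def find(i):
        # Union-find root lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    def same_junction(a, b):
        same_ends = (a["chrom1"], a["strand1"], a["chrom2"], a["strand2"]) == (
            b["chrom1"], b["strand1"], b["chrom2"], b["strand2"])
        return (same_ends
                and abs(a["pos1"] - b["pos1"]) <= tol
                and abs(a["pos2"] - b["pos2"]) <= tol)

    # Add an edge between matching calls made by different algorithms.
    for i in range(len(calls)):
        for j in range(i + 1, len(calls)):
            if (calls[i]["caller"] != calls[j]["caller"]
                    and same_junction(calls[i], calls[j])):
                parent[find(i)] = find(j)

    # Collect connected components.
    groups = defaultdict(list)
    for i, call in enumerate(calls):
        groups[find(i)].append(call)

    # Keep components supported by at least `min_callers` distinct algorithms.
    merged = []
    for members in groups.values():
        callers = sorted({m["caller"] for m in members})
        if len(callers) >= min_callers:
            merged.append({"callers": callers, "representative": members[0]})
    return merged
```

The all-pairs comparison is quadratic in the number of calls, which is acceptable for a sketch; a production version would index calls by chromosome and position first.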

Structural-variant clustering and annotation

To identify clusters of structural variants, we developed a method for grouping structural variants into clusters and footprints to allow structural and mechanistic inferences to be made systematically. In parallel, we processed the somatic copy-number data and merged it with structural-variant junctions to enable us to produce rearrangement patterns from the generated structural-variant clusters and footprints. We produced normalized representations of structural-variant cluster patterns, which enabled us to tabulate the number of different cluster and footprint patterns and analyse their features. Finally, we performed manual and simulation-assisted interpretation of the recurrently observed cluster and footprint patterns. The individual steps of the structural-variant classification pipeline are outlined below and detailed in the subsequent subsections: (1) computing the exact breakpoint coordinates from clipped reads; (2) removing redundant ‘segment-bypassing’ structural variants; (3) merging rearrangement breakpoints with copy-number data to yield structural-variant breakpoint-demarcated, normalized, absolute copy-number data; (4) clustering individual structural variants into structural-variant clusters and footprints; (5) heuristically refining structural-variant clusters and footprints; (6) filtering artefactual fold-back-type structural variants with insufficient support; (7) determining balanced overlapping breakpoints (this step is to distinguish very short templated insertions from mutually overlapping balanced breakpoints); and (8) computing rearrangement patterns and categories.

Distribution of structural variants across the genome

We divided the hg19 human reference genome (autosomes and chromosome X) into 3,036,315 pixels of 1 kb, and calculated a suite of metrics per pixel to summarize a variety of genome properties with potential relevance to the distribution of rearrangements, as listed in the Supplementary Information. Properties were matched as closely as possible to the tissue of origin for cancer samples from the PCAWG data. All other genome properties were held fixed across all tissues. To test for associations between structural-variant event classes and the library of genome properties, the genome property metrics were compared between real structural-variant positions (randomly choosing one side of each breakpoint junction to reduce dependence between observations) and one million uniform random positions from the callable genome space. To compare the tissue-specific properties, each random position was assigned a random tissue type, drawing from the observed tissue-type distribution in the structural-variant call set. For each genome property and each event class, the real observations were pooled amongst the random ones, and then rank-transformed and normalized on a scale from 0 to 1. Under the null hypothesis of no event-versus-property association, the ranks of the real observations would follow a uniform distribution. We tested this in each case with a Kolmogorov–Smirnov test then applied a Benjamini–Yekutieli correction for false-discovery rate across the entire suite of tests and set the threshold for significance reporting at 0.01.
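The testing procedure described above can be sketched in a few short functions: pool the real and random observations, convert the real observations to normalized ranks, compare those ranks with a uniform distribution by a Kolmogorov–Smirnov statistic, and adjust the resulting P values by the Benjamini–Yekutieli method. This is an illustrative sketch, not the analysis code: the function names are hypothetical, ties are given their average rank, and the P value uses the asymptotic two-sided Kolmogorov series, which is an assumption of this example.

```python
import math
from bisect import bisect_left, bisect_right


def pooled_quantiles(real, random_bg):
    """Rank each real value among the pooled values, scaled to (0, 1]."""
    pooled = sorted(list(real) + list(random_bg))
    n = len(pooled)
    quantiles = []
    for value in real:
        lo = bisect_left(pooled, value)
        hi = bisect_right(pooled, value)
        average_rank = (lo + 1 + hi) / 2  # 1-based rank, averaged over ties
        quantiles.append(average_rank / n)
    return quantiles


def ks_uniform(quantiles):
    """Return (D, approximate P) for a KS test against Uniform(0, 1)."""
    xs = sorted(quantiles)
    n = len(xs)
    d_plus = max((i + 1) / n - x for i, x in enumerate(xs))
    d_minus = max(x - i / n for i, x in enumerate(xs))
    d = max(d_plus, d_minus)
    lam = math.sqrt(n) * d
    if lam < 0.2:
        # The series converges slowly here and the P value is effectively 1.
        return d, 1.0
    p = 2.0 * sum((-1) ** (k - 1) * math.exp(-2.0 * k * k * lam * lam)
                  for k in range(1, 101))
    return d, min(1.0, max(0.0, p))


def benjamini_yekutieli(pvalues):
    """Benjamini-Yekutieli adjusted P values, returned in the input order."""
    m = len(pvalues)
    c_m = sum(1.0 / k for k in range(1, m + 1))
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m * c_m / rank)
        adjusted[i] = running_min
    return adjusted
```

Under this scheme, one P value would be computed per combination of genome property and event class, the full set passed to `benjamini_yekutieli`, and associations with an adjusted value below 0.01 reported.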

Structural-variant-signature analysis

We used two algorithms for extracting structural-variant signatures. Both used the same input files, comprising a matrix of counts per patient (across all patients) of structural-variant clusters falling into a number of mutually exclusive categories. These categories included the major classes of structural variants, with the more-common events (deletions, tandem duplications and inversions) split by size and/or replication timing. The two algorithms that were used for extracting the signatures were (1) a hierarchical Dirichlet process and (2) non-negative matrix factorization. Further details on the implementation of these algorithms are available in the Supplementary Information.
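As a rough illustration of the second approach, the sketch below builds a patient-by-category count matrix and factorizes it by non-negative matrix factorization. It is not the implementation described in the Supplementary Information: the choice of a squared-error objective with Lee–Seung multiplicative updates, the function names, the category labels and the iteration count are all assumptions, and the hierarchical Dirichlet process is not shown.

```python
import random


def build_count_matrix(events, categories):
    """events: iterable of (patient, category) pairs, one per event."""
    patients = sorted({patient for patient, _ in events})
    row = {patient: i for i, patient in enumerate(patients)}
    col = {category: j for j, category in enumerate(categories)}
    matrix = [[0] * len(categories) for _ in patients]
    for patient, category in events:
        matrix[row[patient]][col[category]] += 1
    return patients, matrix


def matmul(a, b):
    return [[sum(x * y for x, y in zip(r, c)) for c in zip(*b)] for r in a]


def transpose(m):
    return [list(r) for r in zip(*m)]


def frob_error(v, w, h):
    """Squared Frobenius distance between v and the product w * h."""
    wh = matmul(w, h)
    return sum((v[i][j] - wh[i][j]) ** 2
               for i in range(len(v)) for j in range(len(v[0])))


def nmf(v, k, n_iter=200, seed=0, eps=1e-9):
    """Factorize v (patients x categories) as w (patients x k) times h (k x categories).

    Rows of h play the role of signatures; rows of w give each patient's
    exposure to them. All entries stay non-negative by construction.
    """
    rng = random.Random(seed)
    n, m = len(v), len(v[0])
    w = [[rng.random() + 0.1 for _ in range(k)] for _ in range(n)]
    h = [[rng.random() + 0.1 for _ in range(m)] for _ in range(k)]
    for _ in range(n_iter):
        # h <- h * (w^T v) / (w^T w h)
        wt = transpose(w)
        num = matmul(wt, v)
        den = matmul(matmul(wt, w), h)
        h = [[h[a][j] * num[a][j] / (den[a][j] + eps) for j in range(m)]
             for a in range(k)]
        # w <- w * (v h^T) / (w h h^T)
        ht = transpose(h)
        num = matmul(v, ht)
        den = matmul(w, matmul(h, ht))
        w = [[w[i][a] * num[i][a] / (den[i][a] + eps) for a in range(k)]
             for i in range(n)]
    return w, h
```

The number of signatures `k` must be supplied by the user in this sketch; how that number was selected in the study is described in the Supplementary Information rather than here.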

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this paper.

Online content

Any methods, additional references, Nature Research reporting summaries, source data, extended data, supplementary information, acknowledgements, peer review information; details of author contributions and competing interests; and statements of data and code availability are available at 10.1038/s41586-019-1913-9.

Supplementary information

Supplementary Information (28.7 MB, pdf)

This file contains Supplementary Figures 1-8, Supplementary Methods, Supplementary Results, References and a list of participants in ICGC/TCGA Pan-Cancer Analysis of Whole Genomes Consortium.

Reporting Summary (98.5 KB, pdf)

Supplementary Table (52.3 KB, xlsx)

Supplementary Table 1: Counts of patients with SVs in different classes affecting genes in the Cancer Gene Census.

Acknowledgements

This work was supported by the Wellcome Trust, Pediatric Low-Grade Astrocytoma Fund and the Fund for Innovation in Cancer Informatics. P.J.C. is a Wellcome Trust Senior Clinical Fellow (WT088340MA). We acknowledge the contributions of the many clinical networks across ICGC and TCGA, which provided samples and data to the PCAWG Consortium, and the contributions of the Technical Working Group and the Germline Working Group of the PCAWG Consortium for the collation, realignment and harmonized variant-calling of the cancer genomes used in this study. We thank the patients and their families for their participation in the individual ICGC and TCGA projects.

Author contributions

Y.L., N.D.R., J.A.W. and O.S. contributed equally to this manuscript, undertaking evaluation and curation of structural-variant calls, merging structural-variant call sets from four separate algorithms into a final dataset. Y.L. performed the clustering and classification of structural variants, and identified patterns of rearrangement, with assistance from N.D.R. and M.I. N.D.R. performed the analysis of structural-variant signatures with assistance from Y.L. N.D.R., J.A.W. and O.S. analysed the distribution of structural variants across the genome, with input from J.E.H., E.K., K.K. and S.E.S. S.W. and J.O.K. contributed to the analysis of how germline variants influenced signatures of structural variants. J.W., R.B. and P.J.C. jointly oversaw the project, assisted with data interpretation and wrote the paper, with input from all authors.

Data availability

Somatic and germline variant calls, mutational signatures, subclonal reconstructions, transcript abundance, splice calls and other core data generated by the ICGC/TCGA PCAWG Consortium are described in an accompanying Article⁸ and are available for download at https://dcc.icgc.org/releases/PCAWG. Additional information on accessing the data, including raw read files, can be found at https://docs.icgc.org/pcawg/data/. In accordance with the data access policies of the ICGC and TCGA projects, most molecular, clinical and specimen data are in an open tier that does not require access approval. To access information that could potentially identify participants, such as germline alleles and the underlying sequencing data, researchers will need to apply to the TCGA data access committee via dbGaP (https://dbgap.ncbi.nlm.nih.gov/aa/wga.cgi?page=login) for access to the TCGA portion of the dataset, and to the ICGC data access compliance office (http://icgc.org/daco) for the ICGC portion of the dataset. In addition, to access somatic single-nucleotide variants derived from TCGA donors, researchers will also need to obtain dbGaP authorization.

Code availability

The core computational pipelines used by the PCAWG Consortium for alignment, quality control and variant calling are available to the public at https://dockstore.org/search?search=pcawg under the GNU General Public License v.3.0, which allows for reuse and distribution. These are described in detail in an accompanying Article⁸. The code for grouping structural variants into structural-variant clusters and footprints is available at https://github.com/cancerit/ClusterSV/ (version 1.0). The code for simulating rearrangements can be found at https://github.com/cancerit/SimSvGenomes (version 1.0). The code for sampling from the hierarchical Dirichlet process for identification of mutational signatures is implemented as an R package at https://github.com/nicolaroberts/hdp (version 0.1.1).

Competing interests

R.B. owns equity in Ampressa Therapeutics; M.M. is the scientific advisory board chair of, and consultant for, OrigiMed, and receives research funding from Bayer and Ono Pharma, and patent royalties from LabCorp; J.W. is a consultant for Nference Inc.; C.-Z.Z. is a cofounder and equity holder of Pillar Biosciences, a for-profit company specializing in the development of targeted sequencing assays.

Footnotes

Peer review information: Nature thanks Don Conrad, Ben Lehner and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

These authors contributed equally: Yilong Li, Nicola D. Roberts, Jeremiah A. Wala, Ofer Shapira

A list of members and their affiliations appears at the end of the paper

A list of members and their affiliations appears online

Change history

25 January 2023

A Correction to this paper has been published: 10.1038/s41586-022-05597-x

Contributor Information

Joachim Weischenfeldt, Email: joachim.weischenfeldt@bric.ku.dk.

Rameen Beroukhim, Email: rameen_beroukhim@dfci.harvard.edu.

Peter J. Campbell, Email: pc8@sanger.ac.uk.

PCAWG Structural Variation Working Group:

Kadir C. Akdemir, Eva G. Alvarez, Adrian Baez-Ortega, Rameen Beroukhim, Paul C. Boutros, David D. L. Bowtell, Benedikt Brors, Kathleen H. Burns, Peter J. Campbell, Kin Chan, Ken Chen, Isidro Cortés-Ciriano, Ana Dueso-Barroso, Andrew J. Dunford, Paul A. Edwards, Xavier Estivill, Dariush Etemadmoghadam, Lars Feuerbach, J. Lynn Fink, Milana Frenkel-Morgenstern, Dale W. Garsed, Mark Gerstein, Dmitry A. Gordenin, David Haan, James E. Haber, Julian M. Hess, Barbara Hutter, Marcin Imielinski, David T. W. Jones, Young Seok Ju, Marat D. Kazanov, Leszek J. Klimczak, Youngil Koh, Jan O. Korbel, Kiran Kumar, Eunjung Alice Lee, Jake June-Koo Lee, Yilong Li, Andy G. Lynch, Geoff Macintyre, Florian Markowetz, Iñigo Martincorena, Alexander Martinez-Fundichely, Matthew Meyerson, Satoru Miyano, Hidewaki Nakagawa, Fabio C. P. Navarro, Stephan Ossowski, Peter J. Park, John V. Pearson, Montserrat Puiggròs, Karsten Rippe, Nicola D. Roberts, Steven A. Roberts, Bernardo Rodriguez-Martin, Steven E. Schumacher, Ralph Scully, Mark Shackleton, Nikos Sidiropoulos, Lina Sieverling, Chip Stewart, David Torrents, Jose M. C. Tubio, Izar Villasante, Nicola Waddell, Jeremiah A. Wala, Joachim Weischenfeldt, Lixing Yang, Xiaotong Yao, Sung-Soo Yoon, Jorge Zamora, and Cheng-Zhong Zhang

PCAWG Consortium:

Lauri A. Aaltonen, Federico Abascal, Adam Abeshouse, Hiroyuki Aburatani, David J. Adams, Nishant Agrawal, Keun Soo Ahn, Sung-Min Ahn, Hiroshi Aikata, Rehan Akbani, Kadir C. Akdemir, Hikmat Al-Ahmadie, Sultan T. Al-Sedairy, Fatima Al-Shahrour, Malik Alawi, Monique Albert, Kenneth Aldape, Ludmil B. Alexandrov, Adrian Ally, Kathryn Alsop, Eva G. Alvarez, Fernanda Amary, Samirkumar B. Amin, Brice Aminou, Ole Ammerpohl, Matthew J. Anderson, Yeng Ang, Davide Antonello, Pavana Anur, Samuel Aparicio, Elizabeth L. Appelbaum, Yasuhito Arai, Axel Aretz, Koji Arihiro, Shun-ichi Ariizumi, Joshua Armenia, Laurent Arnould, Sylvia Asa, Yassen Assenov, Gurnit Atwal, Sietse Aukema, J. Todd Auman, Miriam R. R. Aure, Philip Awadalla, Marta Aymerich, Gary D. Bader, Adrian Baez-Ortega, Matthew H. Bailey, Peter J. Bailey, Miruna Balasundaram, Saianand Balu, Pratiti Bandopadhayay, Rosamonde E. Banks, Stefano Barbi, Andrew P. Barbour, Jonathan Barenboim, Jill Barnholtz-Sloan, Hugh Barr, Elisabet Barrera, John Bartlett, Javier Bartolome, Claudio Bassi, Oliver F. Bathe, Daniel Baumhoer, Prashant Bavi, Stephen B. Baylin, Wojciech Bazant, Duncan Beardsmore, Timothy A. Beck, Sam Behjati, Andreas Behren, Beifang Niu, Cindy Bell, Sergi Beltran, Christopher Benz, Andrew Berchuck, Anke K. Bergmann, Erik N. Bergstrom, Benjamin P. Berman, Daniel M. Berney, Stephan H. Bernhart, Rameen Beroukhim, Mario Berrios, Samantha Bersani, Johanna Bertl, Miguel Betancourt, Vinayak Bhandari, Shriram G. Bhosle, Andrew V. Biankin, Matthias Bieg, Darell Bigner, Hans Binder, Ewan Birney, Michael Birrer, Nidhan K. Biswas, Bodil Bjerkehagen, Tom Bodenheimer, Lori Boice, Giada Bonizzato, Johann S. De Bono, Arnoud Boot, Moiz S. Bootwalla, Ake Borg, Arndt Borkhardt, Keith A. Boroevich, Ivan Borozan, Christoph Borst, Marcus Bosenberg, Mattia Bosio, Jacqueline Boultwood, Guillaume Bourque, Paul C. Boutros, G. Steven Bova, David T. Bowen, Reanne Bowlby, David D. L. 
Bowtell, Sandrine Boyault, Rich Boyce, Jeffrey Boyd, Alvis Brazma, Paul Brennan, Daniel S. Brewer, Arie B. Brinkman, Robert G. Bristow, Russell R. Broaddus, Jane E. Brock, Malcolm Brock, Annegien Broeks, Angela N. Brooks, Denise Brooks, Benedikt Brors, Søren Brunak, Timothy J. C. Bruxner, Alicia L. Bruzos, Alex Buchanan, Ivo Buchhalter, Christiane Buchholz, Susan Bullman, Hazel Burke, Birgit Burkhardt, Kathleen H. Burns, John Busanovich, Carlos D. Bustamante, Adam P. Butler, Atul J. Butte, Niall J. Byrne, Anne-Lise Børresen-Dale, Samantha J. Caesar-Johnson, Andy Cafferkey, Declan Cahill, Claudia Calabrese, Carlos Caldas, Fabien Calvo, Niedzica Camacho, Peter J. Campbell, Elias Campo, Cinzia Cantù, Shaolong Cao, Thomas E. Carey, Joana Carlevaro-Fita, Rebecca Carlsen, Ivana Cataldo, Mario Cazzola, Jonathan Cebon, Robert Cerfolio, Dianne E. Chadwick, Dimple Chakravarty, Don Chalmers, Calvin Wing Yiu Chan, Kin Chan, Michelle Chan-Seng-Yue, Vishal S. Chandan, David K. Chang, Stephen J. Chanock, Lorraine A. Chantrill, Aurélien Chateigner, Nilanjan Chatterjee, Kazuaki Chayama, Hsiao-Wei Chen, Jieming Chen, Ken Chen, Yiwen Chen, Zhaohong Chen, Andrew D. Cherniack, Jeremy Chien, Yoke-Eng Chiew, Suet-Feung Chin, Juok Cho, Sunghoon Cho, Jung Kyoon Choi, Wan Choi, Christine Chomienne, Zechen Chong, Su Pin Choo, Angela Chou, Angelika N. Christ, Elizabeth L. Christie, Eric Chuah, Carrie Cibulskis, Kristian Cibulskis, Sara Cingarlini, Peter Clapham, Alexander Claviez, Sean Cleary, Nicole Cloonan, Marek Cmero, Colin C. Collins, Ashton A. Connor, Susanna L. Cooke, Colin S. Cooper, Leslie Cope, Vincenzo Corbo, Matthew G. Cordes, Stephen M. Cordner, Isidro Cortés-Ciriano, Kyle Covington, Prue A. Cowin, Brian Craft, David Craft, Chad J. Creighton, Yupeng Cun, Erin Curley, Ioana Cutcutache, Karolina Czajka, Bogdan Czerniak, Rebecca A. Dagg, Ludmila Danilova, Maria Vittoria Davi, Natalie R. Davidson, Helen Davies, Ian J. Davis, Brandi N. Davis-Dusenbery, Kevin J. Dawson, Francisco M. 
De La Vega, Ricardo De Paoli-Iseppi, Timothy Defreitas, Angelo P. Dei Tos, Olivier Delaneau, John A. Demchok, Jonas Demeulemeester, German M. Demidov, Deniz Demircioğlu, Nening M. Dennis, Robert E. Denroche, Stefan C. Dentro, Nikita Desai, Vikram Deshpande, Amit G. Deshwar, Christine Desmedt, Jordi Deu-Pons, Noreen Dhalla, Neesha C. Dhani, Priyanka Dhingra, Rajiv Dhir, Anthony DiBiase, Klev Diamanti, Li Ding, Shuai Ding, Huy Q. Dinh, Luc Dirix, HarshaVardhan Doddapaneni, Nilgun Donmez, Michelle T. Dow, Ronny Drapkin, Oliver Drechsel, Ruben M. Drews, Serge Serge, Tim Dudderidge, Ana Dueso-Barroso, Andrew J. Dunford, Michael Dunn, Lewis Jonathan Dursi, Fraser R. Duthie, Ken Dutton-Regester, Jenna Eagles, Douglas F. Easton, Stuart Edmonds, Paul A. Edwards, Sandra E. Edwards, Rosalind A. Eeles, Anna Ehinger, Juergen Eils, Roland Eils, Adel El-Naggar, Matthew Eldridge, Kyle Ellrott, Serap Erkek, Georgia Escaramis, Shadrielle M. G. Espiritu, Xavier Estivill, Dariush Etemadmoghadam, Jorunn E. Eyfjord, Bishoy M. Faltas, Daiming Fan, Yu Fan, William C. Faquin, Claudiu Farcas, Matteo Fassan, Aquila Fatima, Francesco Favero, Nodirjon Fayzullaev, Ina Felau, Sian Fereday, Martin L. Ferguson, Vincent Ferretti, Lars Feuerbach, Matthew A. Field, J. Lynn Fink, Gaetano Finocchiaro, Cyril Fisher, Matthew W. Fittall, Anna Fitzgerald, Rebecca C. Fitzgerald, Adrienne M. Flanagan, Neil E. Fleshner, Paul Flicek, John A. Foekens, Kwun M. Fong, Nuno A. Fonseca, Christopher S. Foster, Natalie S. Fox, Michael Fraser, Scott Frazer, Milana Frenkel-Morgenstern, William Friedman, Joan Frigola, Catrina C. Fronick, Akihiro Fujimoto, Masashi Fujita, Masashi Fukayama, Lucinda A. Fulton, Robert S. Fulton, Mayuko Furuta, P. Andrew Futreal, Anja Füllgrabe, Stacey B. Gabriel, Steven Gallinger, Carlo Gambacorti-Passerini, Jianjiong Gao, Shengjie Gao, Levi Garraway, Øystein Garred, Erik Garrison, Dale W. Garsed, Nils Gehlenborg, Josep L. L. Gelpi, Joshy George, Daniela S. 
Gerhard, Clarissa Gerhauser, Jeffrey E. Gershenwald, Mark Gerstein, Moritz Gerstung, Gad Getz, Mohammed Ghori, Ronald Ghossein, Nasra H. Giama, Richard A. Gibbs, Bob Gibson, Anthony J. Gill, Pelvender Gill, Dilip D. Giri, Dominik Glodzik, Vincent J. Gnanapragasam, Maria Elisabeth Goebler, Mary J. Goldman, Carmen Gomez, Santiago Gonzalez, Abel Gonzalez-Perez, Dmitry A. Gordenin, James Gossage, Kunihito Gotoh, Ramaswamy Govindan, Dorthe Grabau, Janet S. Graham, Robert C. Grant, Anthony R. Green, Eric Green, Liliana Greger, Nicola Grehan, Sonia Grimaldi, Sean M. Grimmond, Robert L. Grossman, Adam Grundhoff, Gunes Gundem, Qianyun Guo, Manaswi Gupta, Shailja Gupta, Ivo G. Gut, Marta Gut, Jonathan Göke, Gavin Ha, Andrea Haake, David Haan, Siegfried Haas, Kerstin Haase, James E. Haber, Nina Habermann, Faraz Hach, Syed Haider, Natsuko Hama, Freddie C. Hamdy, Anne Hamilton, Mark P. Hamilton, Leng Han, George B. Hanna, Martin Hansmann, Nicholas J. Haradhvala, Olivier Harismendy, Ivon Harliwong, Arif O. Harmanci, Eoghan Harrington, Takanori Hasegawa, David Haussler, Steve Hawkins, Shinya Hayami, Shuto Hayashi, D. Neil Hayes, Stephen J. Hayes, Nicholas K. Hayward, Steven Hazell, Yao He, Allison P. Heath, Simon C. Heath, David Hedley, Apurva M. Hegde, David I. Heiman, Michael C. Heinold, Zachary Heins, Lawrence E. Heisler, Eva Hellstrom-Lindberg, Mohamed Helmy, Seong Gu Heo, Austin J. Hepperla, José María Heredia-Genestar, Carl Herrmann, Peter Hersey, Julian M. Hess, Holmfridur Hilmarsdottir, Jonathan Hinton, Satoshi Hirano, Nobuyoshi Hiraoka, Katherine A. Hoadley, Asger Hobolth, Ermin Hodzic, Jessica I. Hoell, Steve Hoffmann, Oliver Hofmann, Andrea Holbrook, Aliaksei Z. Holik, Michael A. Hollingsworth, Oliver Holmes, Robert A. Holt, Chen Hong, Eun Pyo Hong, Jongwhi H. Hong, Gerrit K. Hooijer, Henrik Hornshøj, Fumie Hosoda, Yong Hou, Volker Hovestadt, William Howat, Alan P. Hoyle, Ralph H. 
Hruban, Jianhong Hu, Taobo Hu, Xing Hua, Kuan-lin Huang, Mei Huang, Mi Ni Huang, Vincent Huang, Yi Huang, Wolfgang Huber, Thomas J. Hudson, Michael Hummel, Jillian A. Hung, David Huntsman, Ted R. Hupp, Jason Huse, Matthew R. Huska, Barbara Hutter, Carolyn M. Hutter, Daniel Hübschmann, Christine A. Iacobuzio-Donahue, Charles David Imbusch, Marcin Imielinski, Seiya Imoto, William B. Isaacs, Keren Isaev, Shumpei Ishikawa, Murat Iskar, S. M. Ashiqul Islam, Michael Ittmann, Sinisa Ivkovic, Jose M. G. Izarzugaza, Jocelyne Jacquemier, Valerie Jakrot, Nigel B. Jamieson, Gun Ho Jang, Se Jin Jang, Joy C. Jayaseelan, Reyka Jayasinghe, Stuart R. Jefferys, Karine Jegalian, Jennifer L. Jennings, Seung-Hyup Jeon, Lara Jerman, Yuan Ji, Wei Jiao, Peter A. Johansson, Amber L. Johns, Jeremy Johns, Rory Johnson, Todd A. Johnson, Clemency Jolly, Yann Joly, Jon G. Jonasson, Corbin D. Jones, David R. Jones, David T. W. Jones, Nic Jones, Steven J. M. Jones, Jos Jonkers, Young Seok Ju, Hartmut Juhl, Jongsun Jung, Malene Juul, Randi Istrup Juul, Sissel Juul, Natalie Jäger, Rolf Kabbe, Andre Kahles, Abdullah Kahraman, Vera B. Kaiser, Hojabr Kakavand, Sangeetha Kalimuthu, Christof von Kalle, Koo Jeong Kang, Katalin Karaszi, Beth Karlan, Rosa Karlić, Dennis Karsch, Katayoon Kasaian, Karin S. Kassahn, Hitoshi Katai, Mamoru Kato, Hiroto Katoh, Yoshiiku Kawakami, Jonathan D. Kay, Stephen H. Kazakoff, Marat D. Kazanov, Maria Keays, Electron Kebebew, Richard F. Kefford, Manolis Kellis, James G. Kench, Catherine J. Kennedy, Jules N. A. Kerssemakers, David Khoo, Vincent Khoo, Narong Khuntikeo, Ekta Khurana, Helena Kilpinen, Hark Kyun Kim, Hyung-Lae Kim, Hyung-Yong Kim, Hyunghwan Kim, Jaegil Kim, Jihoon Kim, Jong K. Kim, Youngwook Kim, Tari A. King, Wolfram Klapper, Kortine Kleinheinz, Leszek J. Klimczak, Stian Knappskog, Michael Kneba, Bartha M. Knoppers, Youngil Koh, Jan Komorowski, Daisuke Komura, Mitsuhiro Komura, Gu Kong, Marcel Kool, Jan O. 
Korbel, Viktoriya Korchina, Andrey Korshunov, Michael Koscher, Roelof Koster, Zsofia Kote-Jarai, Antonios Koures, Milena Kovacevic, Barbara Kremeyer, Helene Kretzmer, Markus Kreuz, Savitri Krishnamurthy, Dieter Kube, Kiran Kumar, Pardeep Kumar, Sushant Kumar, Yogesh Kumar, Ritika Kundra, Kirsten Kübler, Ralf Küppers, Jesper Lagergren, Phillip H. Lai, Peter W. Laird, Sunil R. Lakhani, Christopher M. Lalansingh, Emilie Lalonde, Fabien C. Lamaze, Adam Lambert, Eric Lander, Pablo Landgraf, Luca Landoni, Anita Langerød, Andrés Lanzós, Denis Larsimont, Erik Larsson, Mark Lathrop, Loretta M. S. Lau, Chris Lawerenz, Rita T. Lawlor, Michael S. Lawrence, Alexander J. Lazar, Ana Mijalkovic Lazic, Xuan Le, Darlene Lee, Donghoon Lee, Eunjung Alice Lee, Hee Jin Lee, Jake June-Koo Lee, Jeong-Yeon Lee, Juhee Lee, Ming Ta Michael Lee, Henry Lee-Six, Kjong-Van Lehmann, Hans Lehrach, Dido Lenze, Conrad R. Leonard, Daniel A. Leongamornlert, Ignaty Leshchiner, Louis Letourneau, Ivica Letunic, Douglas A. Levine, Lora Lewis, Tim Ley, Chang Li, Constance H. Li, Haiyan Irene Li, Jun Li, Lin Li, Shantao Li, Siliang Li, Xiaobo Li, Xiaotong Li, Xinyue Li, Yilong Li, Han Liang, Sheng-Ben Liang, Peter Lichter, Pei Lin, Ziao Lin, W. M. Linehan, Ole Christian Lingjærde, Dongbing Liu, Eric Minwei Liu, Fei-Fei Fei Liu, Fenglin Liu, Jia Liu, Xingmin Liu, Julie Livingstone, Dimitri Livitz, Naomi Livni, Lucas Lochovsky, Markus Loeffler, Georgina V. Long, Armando Lopez-Guillermo, Shaoke Lou, David N. Louis, Laurence B. Lovat, Yiling Lu, Yong-Jie Lu, Youyong Lu, Claudio Luchini, Ilinca Lungu, Xuemei Luo, Hayley J. Luxton, Andy G. Lynch, Lisa Lype, Cristina López, Carlos López-Otín, Eric Z. Ma, Yussanne Ma, Gaetan MacGrogan, Shona MacRae, Geoff Macintyre, Tobias Madsen, Kazuhiro Maejima, Andrea Mafficini, Dennis T. Maglinte, Arindam Maitra, Partha P. Majumder, Luca Malcovati, Salem Malikic, Giuseppe Malleo, Graham J. Mann, Luisa Mantovani-Löffler, Kathleen Marchal, Giovanni Marchegiani, Elaine R. 
Mardis, Adam A. Margolin, Maximillian G. Marin, Florian Markowetz, Julia Markowski, Jeffrey Marks, Tomas Marques-Bonet, Marco A. Marra, Luke Marsden, John W. M. Martens, Sancha Martin, Jose I. Martin-Subero, Iñigo Martincorena, Alexander Martinez-Fundichely, Yosef E. Maruvka, R. Jay Mashl, Charlie E. Massie, Thomas J. Matthew, Lucy Matthews, Erik Mayer, Simon Mayes, Michael Mayo, Faridah Mbabaali, Karen McCune, Ultan McDermott, Patrick D. McGillivray, Michael D. McLellan, John D. McPherson, John R. McPherson, Treasa A. McPherson, Samuel R. Meier, Alice Meng, Shaowu Meng, Andrew Menzies, Neil D. Merrett, Sue Merson, Matthew Meyerson, William Meyerson, Piotr A. Mieczkowski, George L. Mihaiescu, Sanja Mijalkovic, Tom Mikkelsen, Michele Milella, Linda Mileshkin, Christopher A. Miller, David K. Miller, Jessica K. Miller, Gordon B. Mills, Ana Milovanovic, Sarah Minner, Marco Miotto, Gisela Mir Arnau, Lisa Mirabello, Chris Mitchell, Thomas J. Mitchell, Satoru Miyano, Naoki Miyoshi, Shinichi Mizuno, Fruzsina Molnár-Gábor, Malcolm J. Moore, Richard A. Moore, Sandro Morganella, Quaid D. Morris, Carl Morrison, Lisle E. Mose, Catherine D. Moser, Ferran Muiños, Loris Mularoni, Andrew J. Mungall, Karen Mungall, Elizabeth A. Musgrove, Ville Mustonen, David Mutch, Francesc Muyas, Donna M. Muzny, Alfonso Muñoz, Jerome Myers, Ola Myklebost, Peter Möller, Genta Nagae, Adnan M. Nagrial, Hardeep K. Nahal-Bose, Hitoshi Nakagama, Hidewaki Nakagawa, Hiromi Nakamura, Toru Nakamura, Kaoru Nakano, Tannistha Nandi, Jyoti Nangalia, Mia Nastic, Arcadi Navarro, Fabio C. P. Navarro, David E. Neal, Gerd Nettekoven, Felicity Newell, Steven J. Newhouse, Yulia Newton, Alvin Wei Tian Ng, Anthony Ng, Jonathan Nicholson, David Nicol, Yongzhan Nie, G. Petur Nielsen, Morten Muhlig Nielsen, Serena Nik-Zainal, Michael S. Noble, Katia Nones, Paul A. Northcott, Faiyaz Notta, Brian D. O’Connor, Peter O’Donnell, Maria O’Donovan, Sarah O’Meara, Brian Patrick O’Neill, J. 
Robert O’Neill, David Ocana, Angelica Ochoa, Layla Oesper, Christopher Ogden, Hideki Ohdan, Kazuhiro Ohi, Lucila Ohno-Machado, Karin A. Oien, Akinyemi I. Ojesina, Hidenori Ojima, Takuji Okusaka, Larsson Omberg, Choon Kiat Ong, Stephan Ossowski, German Ott, B. F. Francis Ouellette, Christine P’ng, Marta Paczkowska, Salvatore Paiella, Chawalit Pairojkul, Marina Pajic, Qiang Pan-Hammarström, Elli Papaemmanuil, Irene Papatheodorou, Nagarajan Paramasivam, Ji Wan Park, Joong-Won Park, Keunchil Park, Kiejung Park, Peter J. Park, Joel S. Parker, Simon L. Parsons, Harvey Pass, Danielle Pasternack, Alessandro Pastore, Ann-Marie Patch, Iris Pauporté, Antonio Pea, John V. Pearson, Chandra Sekhar Pedamallu, Jakob Skou Pedersen, Paolo Pederzoli, Martin Peifer, Nathan A. Pennell, Charles M. Perou, Marc D. Perry, Gloria M. Petersen, Myron Peto, Nicholas Petrelli, Robert Petryszak, Stefan M. Pfister, Mark Phillips, Oriol Pich, Hilda A. Pickett, Todd D. Pihl, Nischalan Pillay, Sarah Pinder, Mark Pinese, Andreia V. Pinho, Esa Pitkänen, Xavier Pivot, Elena Piñeiro-Yáñez, Laura Planko, Christoph Plass, Paz Polak, Tirso Pons, Irinel Popescu, Olga Potapova, Aparna Prasad, Shaun R. Preston, Manuel Prinz, Antonia L. Pritchard, Stephenie D. Prokopec, Elena Provenzano, Xose S. Puente, Sonia Puig, Montserrat Puiggròs, Sergio Pulido-Tamayo, Gulietta M. Pupo, Colin A. Purdie, Michael C. Quinn, Raquel Rabionet, Janet S. Rader, Bernhard Radlwimmer, Petar Radovic, Benjamin Raeder, Keiran M. Raine, Manasa Ramakrishna, Kamna Ramakrishnan, Suresh Ramalingam, Benjamin J. Raphael, W. Kimryn Rathmell, Tobias Rausch, Guido Reifenberger, Jüri Reimand, Jorge Reis-Filho, Victor Reuter, Iker Reyes-Salazar, Matthew A. Reyna, Sheila M. Reynolds, Esther Rheinbay, Yasser Riazalhosseini, Andrea L. Richardson, Julia Richter, Matthew Ringel, Markus Ringnér, Yasushi Rino, Karsten Rippe, Jeffrey Roach, Lewis R. Roberts, Nicola D. Roberts, Steven A. Roberts, A. Gordon Robertson, Alan J. 
Robertson, Javier Bartolomé Rodriguez, Bernardo Rodriguez-Martin, F. Germán Rodríguez-González, Michael H. A. Roehrl, Marius Rohde, Hirofumi Rokutan, Gilles Romieu, Ilse Rooman, Tom Roques, Daniel Rosebrock, Mara Rosenberg, Philip C. Rosenstiel, Andreas Rosenwald, Edward W. Rowe, Romina Royo, Steven G. Rozen, Yulia Rubanova, Mark A. Rubin, Carlota Rubio-Perez, Vasilisa A. Rudneva, Borislav C. Rusev, Andrea Ruzzenente, Gunnar Rätsch, Radhakrishnan Sabarinathan, Veronica Y. Sabelnykova, Sara Sadeghi, S. Cenk Sahinalp, Natalie Saini, Mihoko Saito-Adachi, Gordon Saksena, Adriana Salcedo, Roberto Salgado, Leonidas Salichos, Richard Sallari, Charles Saller, Roberto Salvia, Michelle Sam, Jaswinder S. Samra, Francisco Sanchez-Vega, Chris Sander, Grant Sanders, Rajiv Sarin, Iman Sarrafi, Aya Sasaki-Oku, Torill Sauer, Guido Sauter, Robyn P. M. Saw, Maria Scardoni, Christopher J. Scarlett, Aldo Scarpa, Ghislaine Scelo, Dirk Schadendorf, Jacqueline E. Schein, Markus B. Schilhabel, Matthias Schlesner, Thorsten Schlomm, Heather K. Schmidt, Sarah-Jane Schramm, Stefan Schreiber, Nikolaus Schultz, Steven E. Schumacher, Roland F. Schwarz, Richard A. Scolyer, David Scott, Ralph Scully, Raja Seethala, Ayellet V. Segre, Iris Selander, Colin A. Semple, Yasin Senbabaoglu, Subhajit Sengupta, Elisabetta Sereni, Stefano Serra, Dennis C. Sgroi, Mark Shackleton, Nimish C. Shah, Sagedeh Shahabi, Catherine A. Shang, Ping Shang, Ofer Shapira, Troy Shelton, Ciyue Shen, Hui Shen, Rebecca Shepherd, Ruian Shi, Yan Shi, Yu-Jia Shiah, Tatsuhiro Shibata, Juliann Shih, Eigo Shimizu, Kiyo Shimizu, Seung Jun Shin, Yuichi Shiraishi, Tal Shmaya, Ilya Shmulevich, Solomon I. Shorser, Charles Short, Raunak Shrestha, Suyash S. Shringarpure, Craig Shriver, Shimin Shuai, Nikos Sidiropoulos, Reiner Siebert, Anieta M. Sieuwerts, Lina Sieverling, Sabina Signoretti, Katarzyna O. Sikora, Michele Simbolo, Ronald Simon, Janae V. Simons, Jared T. Simpson, Peter T. 
Simpson, Samuel Singer, Nasa Sinnott-Armstrong, Payal Sipahimalani, Tara J. Skelly, Marcel Smid, Jaclyn Smith, Karen Smith-McCune, Nicholas D. Socci, Heidi J. Sofia, Matthew G. Soloway, Lei Song, Anil K. Sood, Sharmila Sothi, Christos Sotiriou, Cameron M. Soulette, Paul N. Span, Paul T. Spellman, Nicola Sperandio, Andrew J. Spillane, Oliver Spiro, Jonathan Spring, Johan Staaf, Peter F. Stadler, Peter Staib, Stefan G. Stark, Lucy Stebbings, Ólafur Andri Stefánsson, Oliver Stegle, Lincoln D. Stein, Alasdair Stenhouse, Chip Stewart, Stephan Stilgenbauer, Miranda D. Stobbe, Michael R. Stratton, Jonathan R. Stretch, Adam J. Struck, Joshua M. Stuart, Henk G. Stunnenberg, Hong Su, Xiaoping Su, Ren X. Sun, Stephanie Sungalee, Hana Susak, Akihiro Suzuki, Fred Sweep, Monika Szczepanowski, Holger Sültmann, Takashi Yugawa, Angela Tam, David Tamborero, Benita Kiat Tee Tan, Donghui Tan, Patrick Tan, Hiroko Tanaka, Hirokazu Taniguchi, Tomas J. Tanskanen, Maxime Tarabichi, Roy Tarnuzzer, Patrick Tarpey, Morgan L. Taschuk, Kenji Tatsuno, Simon Tavaré, Darrin F. Taylor, Amaro Taylor-Weiner, Jon W. Teague, Bin Tean Teh, Varsha Tembe, Javier Temes, Kevin Thai, Sarah P. Thayer, Nina Thiessen, Gilles Thomas, Sarah Thomas, Alan Thompson, Alastair M. Thompson, John F. F. Thompson, R. Houston Thompson, Heather Thorne, Leigh B. Thorne, Adrian Thorogood, Grace Tiao, Nebojsa Tijanic, Lee E. Timms, Roberto Tirabosco, Marta Tojo, Stefania Tommasi, Christopher W. Toon, Umut H. Toprak, David Torrents, Giampaolo Tortora, Jörg Tost, Yasushi Totoki, David Townend, Nadia Traficante, Isabelle Treilleux, Jean-Rémi Trotta, Lorenz H. P. Trümper, Ming Tsao, Tatsuhiko Tsunoda, Jose M. C. Tubio, Olga Tucker, Richard Turkington, Daniel J. Turner, Andrew Tutt, Masaki Ueno, Naoto T. Ueno, Christopher Umbricht, Husen M. Umer, Timothy J. Underwood, Lara Urban, Tomoko Urushidate, Tetsuo Ushiku, Liis Uusküla-Reimand, Alfonso Valencia, David J. Van Den Berg, Steven Van Laere, Peter Van Loo, Erwin G. 
Van Meir, Gert G. Van den Eynden, Theodorus Van der Kwast, Naveen Vasudev, Miguel Vazquez, Ravikiran Vedururu, Umadevi Veluvolu, Shankar Vembu, Lieven P. C. Verbeke, Peter Vermeulen, Clare Verrill, Alain Viari, David Vicente, Caterina Vicentini, K. VijayRaghavan, Juris Viksna, Ricardo E. Vilain, Izar Villasante, Anne Vincent-Salomon, Tapio Visakorpi, Douglas Voet, Paresh Vyas, Ignacio Vázquez-García, Nick M. Waddell, Nicola Waddell, Claes Wadelius, Lina Wadi, Rabea Wagener, Jeremiah A. Wala, Jian Wang, Jiayin Wang, Linghua Wang, Qi Wang, Wenyi Wang, Yumeng Wang, Zhining Wang, Paul M. Waring, Hans-Jörg Warnatz, Jonathan Warrell, Anne Y. Warren, Sebastian M. Waszak, David C. Wedge, Dieter Weichenhan, Paul Weinberger, John N. Weinstein, Joachim Weischenfeldt, Daniel J. Weisenberger, Ian Welch, Michael C. Wendl, Johannes Werner, Justin P. Whalley, David A. Wheeler, Hayley C. Whitaker, Dennis Wigle, Matthew D. Wilkerson, Ashley Williams, James S. Wilmott, Gavin W. Wilson, Julie M. Wilson, Richard K. Wilson, Boris Winterhoff, Jeffrey A. Wintersinger, Maciej Wiznerowicz, Stephan Wolf, Bernice H. Wong, Tina Wong, Winghing Wong, Youngchoon Woo, Scott Wood, Bradly G. Wouters, Adam J. Wright, Derek W. Wright, Mark H. Wright, Chin-Lee Wu, Dai-Ying Wu, Guanming Wu, Jianmin Wu, Kui Wu, Yang Wu, Zhenggang Wu, Liu Xi, Tian Xia, Qian Xiang, Xiao Xiao, Rui Xing, Heng Xiong, Qinying Xu, Yanxun Xu, Hong Xue, Shinichi Yachida, Sergei Yakneen, Rui Yamaguchi, Takafumi N. Yamaguchi, Masakazu Yamamoto, Shogo Yamamoto, Hiroki Yamaue, Fan Yang, Huanming Yang, Jean Y. Yang, Liming Yang, Lixing Yang, Shanlin Yang, Tsun-Po Yang, Yang Yang, Xiaotong Yao, Marie-Laure Yaspo, Lucy Yates, Christina Yau, Chen Ye, Kai Ye, Venkata D. Yellapantula, Christopher J. Yoon, Sung-Soo Yoon, Fouad Yousif, Jun Yu, Kaixian Yu, Willie Yu, Yingyan Yu, Ke Yuan, Yuan Yuan, Denis Yuen, Christina K. Yung, Olga Zaikova, Jorge Zamora, Marc Zapatka, Jean C. 
Zenklusen, Thorsten Zenz, Nikolajs Zeps, Cheng-Zhong Zhang, Fan Zhang, Hailei Zhang, Hongwei Zhang, Hongxin Zhang, Jiashan Zhang, Jing Zhang, Junjun Zhang, Xiuqing Zhang, Xuanping Zhang, Yan Zhang, Zemin Zhang, Zhongming Zhao, Liangtao Zheng, Xiuqing Zheng, Wanding Zhou, Yong Zhou, Bin Zhu, Hongtu Zhu, Jingchun Zhu, Shida Zhu, Lihua Zou, Xueqing Zou, Anna deFazio, Nicholas van As, Carolien H. M. van Deurzen, Marc J. van de Vijver, L. van’t Veer, and Christian von Mering

Supplementary information is available for this paper at 10.1038/s41586-019-1913-9.

Extended data is available for this paper at 10.1038/s41586-019-1913-9.

References

1. Bignell, G. R. et al. Architectures of somatic genomic rearrangement in human cancer amplicons at sequence-level resolution. Genome Res. 17, 1296–1303 (2007).
2. Campbell, P. J. et al. The patterns and dynamics of genomic instability in metastatic pancreatic cancer. Nature 467, 1109–1113 (2010).
3. Stephens, P. J. et al. Massive genomic rearrangement acquired in a single catastrophic event during cancer development. Cell 144, 27–40 (2011).
4. Lee, J. A., Carvalho, C. M. & Lupski, J. R. A DNA replication mechanism for generating nonrecurrent rearrangements associated with genomic disorders. Cell 131, 1235–1247 (2007).
5. Baca, S. C. et al. Punctuated evolution of prostate cancer genomes. Cell 153, 666–677 (2013).
6. Menghi, F. et al. The tandem duplicator phenotype is a prevalent genome-wide cancer configuration driven by distinct gene mutations. Cancer Cell 34, 197–210 (2018).
7. Liu, P. et al. An organismal CNV mutator phenotype restricted to early human development. Cell 168, 830–842 (2017).
8. The ICGC/TCGA Pan-Cancer Analysis of Whole Genomes Consortium. Pan-cancer analysis of whole genomes. Nature 10.1038/s41586-020-1969-6 (2020).
9. Zhang, C.-Z. et al. Chromothripsis from DNA damage in micronuclei. Nature 522, 179–184 (2015).
10. Willis, N. A. et al. Mechanism of tandem duplication formation in BRCA1-mutant cells. Nature 551, 590–595 (2017).
11. Maciejowski, J., Li, Y., Bosco, N., Campbell, P. J. & de Lange, T. Chromothripsis and kataegis induced by telomere crisis. Cell 163, 1641–1654 (2015).
12. Ly, P. et al. Chromosome segregation errors generate a diverse spectrum of simple and complex genomic rearrangements. Nat. Genet. 51, 705–715 (2019).
13. Ghezraoui, H. et al. Chromosomal translocations in human cells are generated by canonical nonhomologous end-joining. Mol. Cell 55, 829–842 (2014).
14. Rheinbay, E. et al. Analyses of non-coding somatic drivers in 2,658 cancer whole genomes. Nature 10.1038/s41586-020-1965-x (2020).
15. PCAWG Transcriptome Core Group et al. Genomic basis for RNA alterations in cancer. Nature 10.1038/s41586-020-1970-0 (2020).
16. Akdemir, K. C. et al. Disruption of chromatin folding domains by somatic genomic rearrangements in human cancer. Nat. Genet. 10.1038/s41588-019-0564-y (2020).
17. Rodriguez-Martin, B. et al. Pan-cancer analysis of whole genomes identifies driver rearrangements promoted by LINE-1 retrotransposition. Nat. Genet. 10.1038/s41588-019-0562-0 (2020).
18. Cortes-Ciriano, I. et al. Comprehensive analysis of chromothripsis in 2,658 human cancers using whole-genome sequencing. Nat. Genet. 10.1038/s41588-019-0576-7 (2020).
19. Li, Y. et al. Constitutional and somatic rearrangement of chromosome 21 in acute lymphoblastic leukaemia. Nature 508, 98–102 (2014).
20. Berger, M. F. et al. The genomic complexity of primary human prostate cancer. Nature 470, 214–220 (2011).
21. Crasta, K. et al. DNA breaks and chromosome pulverization from errors in mitosis. Nature 482, 53–58 (2012).
22. Rausch, T. et al. Genome sequencing of pediatric medulloblastoma links catastrophic DNA rearrangements with TP53 mutations. Cell 148, 59–71 (2012).
23. Hastings, P. J., Ira, G. & Lupski, J. R. A microhomology-mediated break-induced replication model for the origin of human copy number variation. PLoS Genet. 5, e1000327 (2009).
24. Carvalho, C. M. B. et al. Inverted genomic segments and complex triplication rearrangements are mediated by inverted repeats in the human genome. Nat. Genet. 43, 1074–1081 (2011).
25. Campbell, P. J. et al. Identification of somatically acquired rearrangements in cancer using genome-wide massively parallel paired-end sequencing. Nat. Genet. 40, 722–729 (2008).
26. Rausch, T. et al. DELLY: structural variant discovery by integrated paired-end and split-read analysis. Bioinformatics 28, i333–i339 (2012).
27. Wala, J. A. et al. SvABA: genome-wide detection of structural variants and indels by local assembly. Genome Res. 28, 581–591 (2018).
28. Totoki, Y. et al. Trans-ancestry mutational landscape of hepatocellular carcinoma genomes. Nat. Genet. 46, 1267–1273 (2014).
29. Nik-Zainal, S. et al. Landscape of somatic mutations in 560 breast cancer whole-genome sequences. Nature 534, 47–54 (2016).
30. Supek, F. & Lehner, B. Differential DNA mismatch repair underlies mutation rate variation across the human genome. Nature 521, 81–84 (2015).
31. Schuster-Böckler, B. & Lehner, B. Chromatin organization is a major influence on regional mutation rates in human cancer cells. Nature 488, 504–507 (2012).
32. De, S. & Michor, F. DNA replication timing and long-range DNA interactions predict mutational landscapes of cancer genomes. Nat. Biotechnol. 29, 1103–1108 (2011).
33. Yang, L. et al. Diverse mechanisms of somatic structural variations in human cancer genomes. Cell 153, 919–929 (2013).
34. Alexandrov, L. B. et al. The repertoire of mutational signatures in human cancer. Nature 10.1038/s41586-020-1943-3 (2020).
35. Lukusa, T. & Fryns, J. P. Human chromosome fragility. Biochim. Biophys. Acta 1779, 3–16 (2008).
36. Popova, T. et al. Ovarian cancers harboring inactivating mutations in CDK12 display a distinct genomic instability pattern characterized by large tandem duplications. Cancer Res. 76, 1882–1891 (2016).
37. Xia, B. et al. Control of BRCA2 cellular and clinical functions by a nuclear partner, PALB2. Mol. Cell 22, 719–729 (2006).
38. Piazza, A., Wright, W. D. & Heyer, W. D. Multi-invasions are recombination byproducts that induce chromosomal rearrangements. Cell 170, 760–773 (2017).
39. Yu, Y. et al. Dna2 nuclease deficiency results in large and complex DNA insertions at chromosomal breaks. Nature 564, 287–290 (2018).

Associated Data

Supplementary Materials

Supplementary Information (PDF, 28.7 MB)

This file contains Supplementary Figures 1–8, Supplementary Methods, Supplementary Results, References and a list of participants in the ICGC/TCGA Pan-Cancer Analysis of Whole Genomes Consortium.

Reporting Summary (PDF, 98.5 KB)

Supplementary Table (XLSX, 52.3 KB)

Supplementary Table 1: Counts of patients with SVs in different classes affecting genes in the Cancer Gene Census.
