Skip to main content
Bioinformatics logoLink to Bioinformatics
. 2023 Dec 7;39(12):btad719. doi: 10.1093/bioinformatics/btad719

ReConPlot: an R package for the visualization and interpretation of genomic rearrangements

Jose Espejo Valle-Inclán 1, Isidro Cortés-Ciriano 2,
Editor: Can Alkan
PMCID: PMC10710371  PMID: 38058190

Abstract

Motivation

Whole-genome sequencing studies of human tumours have revealed that complex forms of structural variation, collectively known as complex genome rearrangements (CGRs), are pervasive across diverse cancer types. Detection, classification, and mechanistic interpretation of CGRs requires the visualization of complex patterns of somatic copy number aberrations (SCNAs) and structural variants (SVs). However, there is a lack of tools specifically designed to facilitate the visualization and study of CGRs.

Results

We present ReConPlot (REarrangement and COpy Number PLOT), an R package that provides functionalities for the joint visualization of SCNAs and SVs across one or multiple chromosomes. ReConPlot is based on the popular ggplot2 package, thus allowing customization of plots and the generation of publication-quality figures with minimal effort. Overall, ReConPlot facilitates the exploration, interpretation, and reporting of CGR patterns.

Availability and implementation

The R package ReConPlot is available at https://github.com/cortes-ciriano-lab/ReConPlot. Detailed documentation and a tutorial with examples are provided with the package.

1 Introduction

The advent of whole genome sequencing (WGS) has enabled a more nuanced characterization of the diversity, rates and underlying mechanisms of chromosomal alterations than was ever possible using cytogenetic or pathology analyses (Greenman et al. 2007, Mardis and Wilson 2009, Garraway and Lander 2013, Cortés-Ciriano et al. 2021). WGS studies of human cancers have revealed that genomic instability, a hallmark of cancer, manifests as alterations in the structure and number of chromosomes (aneuploidy), whole genome doubling, repeat instability, and remarkably diverse forms of structural variants (SVs) (Macintyre et al. 2018, Priestley et al. 2019, ICGC/TCGA Pan-Cancer Analysis of Whole Genomes Consortium 2020, Steele et al. 2022). SVs, which account for most driver events in some cancer types (Zack et al. 2013, ICGC/TCGA Pan-Cancer Analysis of Whole Genomes Consortium 2020), refer to the rearrangement of the genome leading to the deletion, amplification or reshuffling of genomic segments. In cancer genomes, genomic rearrangements manifest as (i) simple events, such as deletions, duplications, inversions, and insertions occurring in isolation, or (ii) complex events involving multiple breakpoints across one or multiple chromosomes and showing complex patterns of both spatial and temporal clustering (Cortés-Ciriano et al. 2020, Li et al. 2020, Hadi et al. 2020, Bao et al. 2022). Such complex patterns, collectively referred to as complex genomic rearrangements (CGRs), include those recently discovered in cancer genome studies, such as chromothripsis (Stephens et al. 2011, Rausch et al. 2012, Cortés-Ciriano et al. 2020), chromoanasynthesis (Liu et al. 2011), chromoplexy (Baca et al. 2013), pyrgo, rigma, and tyfonas (Hadi et al. 2020), as well as others initially described in cytogenetic studies, such as breakage–fusion–bridge cycles (Campbell et al. 2010) and double minutes or extrachromosomal DNA elements (Turner et al. 2017, Deshpande et al. 2019). Multiple algorithms have been developed to detect and classify CGRs (Notta et al. 2016, Cortés-Ciriano et al. 2020, Hadi et al. 2020, Bao et al. 2022) based on the analysis of the patterns of SVs and somatic copy number aberrations (SCNAs) detected through computational cancer genome analysis. However, due to the diversity, complexity, variable scale and overlapping features of CGRs, coupled to their co-localization (Cortés-Ciriano et al. 2020), their detection and classification remains a challenging task. As a result, manual inspection of SCNA and SV data is often required to resolve the most complex cases (Li et al. 2014, Notta et al. 2016, Cortés-Ciriano et al. 2020). This task requires versatile methods to visualize SCNAs and SVs across genomic regions ranging from a few kbp to multiple chromosomes. A popular approach for genomics data visualization, termed Circos plot (Krzywinski et al. 2009), allows exploration of CGRs by displaying the cancer genome in a circular layout where concentric circles show different types of mutations and genomic features (Nusrat et al. 2019). Although versatile to provide an overview of the cancer genome (Davies et al. 2017, Goldman et al. 2020, Shale et al. 2022), Circos plots are often too complex to visualize CGRs involving large numbers of SVs and SCNAs. An alternative approach consists of displaying genomic regions of interest in a linear layout where regions of equal copy number are represented by segments, and SVs by arcs (Li et al. 2014, ICGC/TCGA Pan-Cancer Analysis of Whole Genomes Consortium 2020). This visualization strategy, usually referred to as genomic rearrangement plots, has become popular in the cancer genomics community for visualizing and reporting the patterns and consequences of CGRs (e.g., disruption of tumour suppressor genes by SVs) (Cortés-Ciriano et al. 2020, ICGC/TCGA Pan-Cancer Analysis of Whole Genomes Consortium 2020). However, there is lack of easy-to-use software packages to visualize genomic rearrangement profiles and generate publication-quality figures for reporting cancer genome analysis results. Here we present ReConPlot (REarrangement and COpy Number PLOT), an R package that provides functionalities for the joint visualization of SCNAs and SVs across one or multiple chromosomes.

2 Methods

ReConPlot relies on the popular package ggplot2 (Wickham 2016) for the visualization of SCNA and SV profiles, thus allowing for user-specific customization and integration with functionalities from other R packages to compose multi-panel figures easily. The main function of the package, ReConPlot, only requires as input the genomic coordinates for the regions to be visualized, integer minor and total copy number data, and SV information in browser extensible data paired-end (BEDPE) format (Quinlan and Hall 2010). ReConPlot permits the visualization of genomic rearrangement profiles across one or multiple chromosomes (Fig. 1).

Figure 1.

Figure 1.

Examples of ReConPlots visualizing complex genomic rearrangements detected in four cancer genomes from the PCAWG cohort. (a) Artificial example of a simple deletion (left, orange) and a simple duplication (right, blue). (b) Example of a canonical chromothripsis event detected in a liver adenocarcinoma. The ReConPlot shows the characteristic cluster of interleaved SVs and copy number oscillations between two copy number states accompanied by loss of heterozygosity, which is indicated by the regions with a minor copy number of 0. (c) Example of a CGR spanning two chromosomes detected in an ovarian adenocarcinoma. (d) Example of a CGR showing high-level amplifications detected in an osteosarcoma genome. The chromothripsis event occurred after whole-genome doubling, as evidenced by the minor copy number oscillations between copy number states 1 and 2, and caused the high-level amplification of CDK4 and MDM2. Tail-to-tail (t2tINV) inversions, head-to-head (h2hINV) inversions, duplication-like SVs (DUP), and deletion-like SVs (DEL) are depicted in black, green, blue, and orange, respectively. Total and minor copy number values are represented by black and grey segments, respectively. ICGC IDs are shown on top of each ReConPlot.

Each ReConPlot consists of three main panels. The bottom panel shows Giemsa binding data (Cheung et al. 2001, Furey and Haussler 2003) for the genomic regions of interest. The middle panel reports total and minor copy number information. Finally, the top panel shows SVs. SVs whose breakpoints fall within the regions selected to be displayed are represented by arcs. In cases where only one breakpoint maps to the selected genomic regions, the SV is represented as a vertical line ending with a 45-degree overhang. SVs are categorized into four groups depending on the read orientation at the breakpoints (i.e. type of fragment joins) following the notation established by the Pan-Cancer Analysis of Whole Genomes project [PCAWG (ICGC/TCGA Pan-Cancer Analysis of Whole Genomes Consortium 2020); Fig. 1]: deletion-like SVs (DEL) are represented as ‘+ −’, duplication-like SVs (DUP) as ‘− +’, tail-to-tail inversions (t2tInv) as ‘− −’, and head-to-head inversions (h2hInv) as ‘+ +’. Using the same notation as PCAWG allows for smooth integration with other software packages designed for the detection, classification and interpretation of CGR, such as ShatterSeek (Cortés-Ciriano et al. 2020). In addition, ReConPlot provides functionalities to highlight the location of genes (see Fig. 1 for examples). Currently, ReConPlot supports the following builds of the human reference genome: GRCh37, GRCh38, and T2T-CHM13, and the mouse reference genome: mm10, mm39. While default parameter values ensure the generation of publication-quality figures, the function ReConPlot is highly versatile, as it allows customization of the layout of the plots, font sizes, font colours, and other graphical parameters (see the documentation and tutorial of the package for a full list of customizable graphical options). Additionally, an optional module is available to depict a user-defined annotation below the chromosome ideogram with aligned genomic coordinates. The optional annotation module can be used to, for example, annotate point mutations of interest and their variant allele frequency, or any other genomic features of interest to the user.

3 Results

We have extensively validated the functionalities of ReConPlot using SV and SCNA calls from the PCAWG project, allowing us to identify and classify diverse CGRs, such as chromothripsis, CGRs involving multiple chromosomes, and CGRs showing high-level oncogene amplifications (Fig. 1). In sum, ReConPlot provides functionalities for the visualization and interpretation of complex genomic rearrangement profiles detected in cancer genomes and rare disease patients.

Acknowledgements

J.E.V.-I. and I.C.-C. thank the European Molecular Biology Laboratory (EMBL) for funding.

Contributor Information

Jose Espejo Valle-Inclán, European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom.

Isidro Cortés-Ciriano, European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom.

Conflict of interest

None declared.

Funding

This work was supported by the EMBL.

References

  1. Baca SC, Prandi D, Lawrence MS. et al. Punctuated evolution of prostate cancer genomes. Cell 2013;153:666–77. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Bao L, Zhong X, Yang Y. et al. Starfish infers signatures of complex genomic rearrangements across human cancers. Nat Cancer 2022;3:1247–59. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Campbell PJ, Yachida S, Mudie LJ. et al. The patterns and dynamics of genomic instability in metastatic pancreatic cancer. Nature 2010;467:1109–13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Cheung VG, Nowak N, Jang W. et al. ; BAC Resource Consortium. Integration of cytogenetic landmarks into the draft sequence of the human genome. Nature 2001;409:953–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Cortés-Ciriano I, Lee JJ-K, Xi R. et al. ; PCAWG Consortium. Comprehensive analysis of chromothripsis in 2,658 human cancers using whole-genome sequencing. Nat Genet 2020;52:331–41. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Cortés-Ciriano I, Gulhan DC, Lee JJ-K. et al. Computational analysis of cancer genome sequencing data. Nat Rev Genet 2021;23:298–314. [DOI] [PubMed] [Google Scholar]
  7. Davies H, Glodzik D, Morganella S. et al. HRDetect is a predictor of BRCA1 and BRCA2 deficiency based on mutational signatures. Nat Med 2017;23:517–25. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Deshpande V, Luebeck J, Nguyen N-PD. et al. Exploring the landscape of focal amplifications in cancer using AmpliconArchitect. Nat Commun 2019;10:392. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Furey TS, Haussler D.. Integration of the cytogenetic map with the draft human genome sequence. Hum Mol Genet 2003;12:1037–44. [DOI] [PubMed] [Google Scholar]
  10. Garraway LA, Lander ES.. Lessons from the cancer genome. Cell 2013;153:17–37. [DOI] [PubMed] [Google Scholar]
  11. Goldman M, Zhang J, Fonseca NA. et al. A user’s guide to the online resources for data exploration, visualization, and discovery for the Pan-Cancer Analysis of Whole Genomes project (PCAWG). Nat Commun. 2020;11:3400. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Greenman C, Stephens P, Smith R. et al. Patterns of somatic mutation in human cancer genomes. Nature 2007;446:153–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Hadi K, Yao X, Behr JM. et al. Distinct classes of complex structural variation uncovered across thousands of cancer genome graphs. Cell 2020;183:197–210.e32. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. ICGC/TCGA Pan-Cancer Analysis of Whole Genomes Consortium. Pan-cancer analysis of whole genomes. Nature 2020;578:82–93. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Krzywinski M, Schein J, Birol I. et al. Circos: an information aesthetic for comparative genomics. Genome Res 2009;19:1639–45. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Li Y, Schwab C, Ryan S. et al. Constitutional and somatic rearrangement of chromosome 21 in acute lymphoblastic leukaemia. Nature 2014;508:98–102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Li Y, Roberts ND, Wala JA. et al. ; PCAWG Consortium. Patterns of somatic structural variation in human cancer genomes. Nature 2020;578:112–21. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Liu P, Erez A, Nagamani SCS. et al. Chromosome catastrophes involve replication mechanisms generating complex genomic rearrangements. Cell 2011;146:889–903. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Macintyre G, Goranova TE, De Silva D. et al. Copy number signatures and mutational processes in ovarian carcinoma. Nat Genet 2018;50:1262–70. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Mardis ER, Wilson RK.. Cancer genome sequencing: a review. Hum Mol Genet 2009;18:R163–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Notta F, Chan-Seng-Yue M, Lemire M. et al. A renewed model of pancreatic cancer evolution based on genomic rearrangement patterns. Nature 2016;538:378–82. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Nusrat S, Harbig T, Gehlenborg N. et al. Tasks, techniques, and tools for genomic data visualization. Comput Graph Forum 2019;38:781–805. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Priestley P, Baber J, Lolkema MP. et al. Pan-cancer whole-genome analyses of metastatic solid tumours. Nature 2019;575:210–6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Quinlan AR, Hall IM.. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 2010;26:841–2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Rausch T, Jones DTW, Zapatka M. et al. Genome sequencing of pediatric medulloblastoma links catastrophic DNA rearrangements with TP53 mutations. Cell 2012;148:59–71. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Shale C, Cameron DL, Baber J. et al. Unscrambling cancer genomes via integrated analysis of structural variation and copy number. Cell Genom 2022;2:100112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Steele CD, Abbasi A, Islam SMA. et al. Signatures of copy number alterations in human cancer. Nature 2022;606:984–91. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Stephens PJ, Greenman CD, Fu B. et al. Massive genomic rearrangement acquired in a single catastrophic event during cancer development. Cell 2011;144:27–40. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Turner KM, Deshpande V, Beyter D. et al. Extrachromosomal oncogene amplification drives tumour evolution and genetic heterogeneity. Nature 2017;543:122–5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Wickham H. ggplot2. Cham: Springer International Publishing, 2016. [Google Scholar]
  31. Zack TI, Schumacher SE, Carter SL. et al. Pan-cancer patterns of somatic copy number alteration. Nat Genet 2013;45:1134–40. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Bioinformatics are provided here courtesy of Oxford University Press

RESOURCES