Dear Editor,
Preexisting genetic mutations in cells may influence phenotypes together with other molecular biology manipulations. For instance, the bladder cancer cell RT4 harbors a TACC3‐FGFR3 fusion rather than TP53 mutation (Figure 1F), whereas the other bladder cancer cell T24 have TP53 mutation but not FGFR3 mutation. 1 We questioned whether those preexisting genotypes could influence research conclusions and systematically overviewed urothelial carcinoma‐related 1589 articles either used or not used RT4 (Table S1).
FIGURE 1.

Comprehensive genomic profiling of urothelial carcinoma cell lines reveals hidden research bias and caveats. A, Distance from given target gene to TP53 on KEGG pathway is negatively correlated with RT4 usage. X‐axis: distance to TP53 and FGFR3 (1, 2, and 3 denote direct interacting with TP53, one‐step away from TP53, and two‐step away from TP53, etc.). Y‐axis: PubMed search result number of target gene name with RT4. B, The relative distance from given target gene to TP53 and FGFR3 based on protein‐protein interaction network is correlated with RT4 usage. X‐axis: The difference between target‐gene‐to‐TP53 distance and target‐gene‐to‐FGFR3 distance. A minus number denotes the gene is closer to FGFR3 compared to TP53, and a positive number denotes the gene is closer to TP53 compared to FGFR3. Y‐axis: Frequency of RT4‐containing search result in all PubMed search results with target gene and “bladder cancer cell line.” C, Papers studying FGFR3 pathway are more likely to use RT4 density distribution of the RT4‐containing search result frequency on PubMed for target genes in the KEGG FGFR3‐related signaling pathway or TP53‐related signaling pathway. D, Morphology of 2 BCa patient‐derived CRCs (a‐b). Scale bar is 50 μm. Visually pan‐genome copy number profile (yellow) with sequencing depth of probe region (gray) generated by CNVkit from 2 CRCs (c‐d) and 2 BCa tissue (e‐f) together with BCa cell lines T24 and RT4 (g‐h) showing the significant deviation from genuine BCa cell lines. E, Sequencing evidence for heterogeneity of driver oncogene fusion in SW780 cell line. Top panel showing the raw sequencing reads (yellow, cross‐chromosomal DNA fragments; gray, normal DNA fragments; green, fragments in inverted direction) of FGFR3 (top‐left) and BAIAP2L1 (top‐right) loci. Middle panel showing the chromosomal bands and gene structure, denoting the breakpoint locus. Bottom panel showing the original transcript (Tx) and respective functional domains of each fusion partner gene, and the predicted fusion product transcript with a functional FGFR3 kinase domain and longer C‐termini from BAIAP2L1. DNA fragments in support of fusion consist 7.3% (five in 81) of all sequenced fragments, suggesting that the fusion driver oncogene is lost in a part of cells. F, Sequencing evidence for TACC3‐FGFR3 driver oncogene fusion in RT4 cell line. Panel layout is similar to (E). DNA fragments in support of fusion consist 26.6% (53 in 268) of all sequenced fragments, suggesting that the fusion driver oncogene likely to exist in homogeneous one‐in‐a‐tetraploid cell state or in a heterogeneous manner. Copy number profile (a in panel D), however, did suggest that significant level of mosaicism exists in the RT4 population.
We found a significant association between target gene corresponding pathways and RT4 usage frequency. For example, the research was less likely to include RT4 (Figures 1A and 1B and Table S2) if the target gene was associate with TP53 pathways. We calculated the frequency of using RT4 and/or T24 for research on particular target gene (Table S3). Using PPI networks, 2 we calculated the “interaction distance” between these genes to TP53 and FGFR3. RT4 usage frequency was increased in research of genes with closer relationship to FGFR3 compared to TP53 (Figure 1C). Those results suggested the existence of latent bias in research, that is, researcher may implicitly select consistent results by selecting carcinoma cell lines. The preexisting carcinoma cell line mutation landscape could determine its response to particular molecular biology manipulation, which can be inconsistent, even though only consistent results would be reported.
Moreover, to define the genotype of carcinoma, cell line is critical to the research result. State‐of‐the‐art practices require genotyping cell lines using short tandem repeat (STR). 3 However, STR only identifies whether cell lines are cross‐contaminated, but does not fully report the genomic changes of cell lines. We collected eight most widely used bladder cancer cell lines and confirmed the identity by STR. Whole‐exome sequencing (WES) of the eight cell lines together with two conditionally reprogrammed cells (CRCs) 4 and two bladder cancer tissues (Table S4) showed the CRCs have good consistency with tumor tissues; however, the cell lines showed significant deviation from genuine bladder cancer by wide‐spread shattering copy number variation (Figures 1D and S1). Furthermore, comparing with the previous reports, 5 we detected only a tiny fraction of the known driver FGFR3 fusion in SW780 (Figure 1E). Those results indicated that the carcinoma cell lines may undergone pervasive genetic drifts characterized by copy number variation and random loss of driver mutation.
In conclusion, carcinoma cell line genotypes could influence research results by modifying molecular manipulation outcomes. Such influences could be implicit or explicit, and represent researcher's selection bias in research. Furthermore, the genotypes undergone neutral genetic drift, which might lead to loss of important driver mutation and gain of novel genetic identity. Thus, it is important that WES, instead of STR, should be widely used for determining carcinoma cell line genotypes. The CRCs closely resemble original tumor tissue and represent a better alternative for in vitro research to benefit the development of future precise medicine.
CONFLICT OF INTEREST
The authors declare no conflict of interest.
Supporting information
Supplement Information
ACKNOWLEDGMENTS
We gratefully acknowledge excellent technical assistance provided by Ms. Yuan Zhu, Ms. Mengxue Yu, and Ms. Yayun Fang from Zhongnan Hospital of Wuhan University. This study was supported in part by grants from the Science and Technology Department of Hubei Province Key Project (2018ACA159) and Medical Science Advancement Program (Clinical Medicine) of Wuhan University (TFLC2018002).
AVAILABILITY OF DATA AND MATERIALS
All the data and materials are available.
CONSENT FOR PUBLICATION
All the authors consent for publication.
ETHICS APPROVAL AND CONSENT TO PARTICIPATE
The study using clinical information and human samples (including surgical tissue specimens and primary cancer cells) was approved by the Ethics Review Committee at Zhongnan Hospital of Wuhan University (approval number: 2015029). Human sample preservation by the Zhongnan Hospital Biobank, the official member of the International Society for Biological and Environmental Repositories (https://irlocator.isber.org/details/60), was approved by the Ethics Review Committee at Zhongnan Hospital of Wuhan University (approval number: 2017038) and China Human Genetic Resources Management Office, Ministry of Science and Technology of China (approval number: 20171793).
AUTHOR CONTRIBUTIONS
YL and LJ performed experiments. GW, CC, YW, and LC performed literature review. All authors contributed to the research. YL, LJ, YZ, YX, and XW designed the study and wrote the manuscript. All authors approved the final manuscript.
Contributor Information
Yu Xiao, Email: yu.xiao@whu.edu.cn.
Xinghuan Wang, Email: wangxinghuan@whu.edu.cn.
REFERENCES
- 1. Hovelson DH, Udager AM, McDaniel AS, etal. Targeted DNA and RNA sequencing of paired urothelial and squamous bladder cancers reveals discordant genomic and transcriptomic events and unique therapeutic implications. Eur Urol. 2018;74(6):741‐753. [DOI] [PubMed] [Google Scholar]
- 2. Szklarczyk D, Gable AL, Lyon D, etal. STRING v11: protein‐protein association networks with increased coverage, supporting functional discovery in genome‐wide experimental datasets. Nucleic Acids Res. 2019;47(D1):D607‐D613. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. American Type Culture Collection Standards Development Organization Workgroup ASN . Cell line misidentification: the beginning of the end. Nat Rev Cancer. 2010;10(6):441‐448. [DOI] [PubMed] [Google Scholar]
- 4. Kettunen K, Bostrom PJ, Lamminen T, etal. Personalized drug sensitivity screening for bladder cancer using conditionally reprogrammed patient‐derived cells. Eur Urol. 2019;76(4):430‐434. [DOI] [PubMed] [Google Scholar]
- 5. Acquaviva J, He S, Zhang C, etal. FGFR3 translocations in bladder cancer: differential sensitivity to HSP90 inhibition based on drug metabolism. Mol Cancer Res. 2014;12(7):1042‐1054. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Supplement Information
