Skip to main content
Wiley Open Access Collection logoLink to Wiley Open Access Collection
. 2017 Feb 24;241(5):600–613. doi: 10.1002/path.4864

Derivation of marker gene signatures from human skin and their use in the interpretation of the transcriptional changes associated with dermatological disorders

Barbara B Shih 1, Ajit J Nirmal 1, Denis J Headon 1, Arne N Akbar 2, Neil A Mabbott 1,, Tom C Freeman 1,
PMCID: PMC5363360  PMID: 28008606

Abstract

Numerous studies have explored the altered transcriptional landscape associated with skin diseases to understand the nature of these disorders. However, data interpretation represents a significant challenge due to a lack of good maker sets for many of the specialized cell types that make up this tissue, whose composition may fundamentally alter during disease. Here we have sought to derive expression signatures that define the various cell types and structures that make up human skin, and demonstrate how they can be used to aid the interpretation of transcriptomic data derived from this organ. Two large normal skin transcriptomic datasets were identified, one RNA‐seq (n = 578), the other microarray (n = 165), quality controlled and subjected separately to network‐based analyses to identify clusters of robustly co‐expressed genes. The biological significance of these clusters was then assigned using a combination of bioinformatics analyses, literature, and expert review. After cross comparison between analyses, 20 gene signatures were defined. These included expression signatures for hair follicles, glands (sebaceous, sweat, apocrine), keratinocytes, melanocytes, endothelia, muscle, adipocytes, immune cells, and a number of pathway systems. Collectively, we have named this resource SkinSig. SkinSig was then used in the analysis of transcriptomic datasets for 18 skin conditions, providing in‐context interpretation of these data. For instance, conventional analysis has shown there to be a decrease in keratinization and fatty metabolism with age; we more accurately define these changes to be due to loss of hair follicles and sebaceous glands. SkinSig also highlighted the over‐/under‐representation of various cell types in skin diseases, reflecting an influx in immune cells in inflammatory disorders and a relative reduction in other cell types. Overall, our analyses demonstrate the value of this new resource in defining the functional profile of skin cell types and appendages, and in improving the interpretation of disease data. © 2016 The Authors. The Journal of Pathology published by John Wiley & Sons Ltd on behalf of Pathological Society of Great Britain and Ireland.

Keywords: transcriptomics, gene expression, skin, sebaceous gland, apocrine gland, sweat gland, psoriasis

Introduction

Disorders of the skin are ranked as the fourth leading cause of non‐fatal disease burden 1, and consequently there is significant interest in better understanding the biology of this organ and its dysregulation. Modern platforms allow the quantitative analysis of the complete set of transcripts expressed in a given sample. These technologies have been used to characterize the transcriptome of normal skin 2 and to determine how this is altered in certain skin diseases 3. However, interpretation of such data remains a significant challenge. During disease, the biology and the cellular composition of the skin may change significantly: for example, due to the influx of immune cells. In addition, when comparing samples gathered from different sites or by different methods, the cellular composition of individual samples may vary significantly. Such differences will be reflected in the transcriptional profile of the sample, but at present, many of the genes expressed in skin appendages (e.g. hair follicles and eccrine, apocrine, and sebaceous glands) or cell types are largely uncharacterized and therefore changes in their abundance may be difficult to interpret. If we knew the genes specifically expressed by cells found in the skin, we could use them to assess their relative abundance in samples, thereby allowing us to better interpret observed changes in transcriptomic data derived from this organ.

In an effort to address this problem, Swindell et al 4 used publicly available microarray data from isolated cell populations to define 24 ‘cell‐type specific’ signatures and used them to aid the interpretation of genes differentially expressed in psoriasis, and subsequently in multiple skin disorders 3. Although comprehensive gene marker sets for numerous immune cell subtypes were obtained, those for many types of skin appendages and cell types were not. Li et al 5 used a weighted gene co‐expression network analysis approach 6 to derive co‐expression clusters from RNA‐seq data of normal and psoriasis skin biopsies. The gene clusters reported were mostly large in size, ranging from 32 to 5427 genes, with several cell types overrepresented in multiple modules 5. Gene signatures for skin appendages and some cell types were also lacking in this study, as the annotation of these co‐expression clusters was predominately based on those reported by Swindell et al 4.

We therefore set out to identify sets of genes diagnostic of the many cell types present in human skin. Our approach is based on the fact that the expression level of a gene expressed specifically in a particular cell type or skin appendage will vary according to its abundance in a given sample. When measured across multiple samples, any other genes expressed in a similar manner will be observed to be co‐expressed. When correlation networks are constructed from transcriptomic data 7, 8, 9, groups of co‐expressed genes form highly connected cliques within the network, which can then be defined by network clustering algorithms 10. Taking advantage of this and the subtle variation in cellular composition between human skin samples (either intrinsic to a sample or due to variation in sampling), cell‐specific gene signatures may be extrapolated without the need to physically isolate specific cell types or skin appendages.

Here, we describe the derivation of 20 highly conserved gene expression signatures, collectively named SkinSig, for skin appendages, cells, and processes present in human skin. This resource aids the interpretation of transcriptomic data derived from human skin, allowing the cellular composition of samples to be explored. Furthermore, these signatures enable the pathological and physiological changes associated with skin conditions, disease subtypes or therapeutic interventions to be characterized.

Materials and methods

Data acquisition, quality control, and processing

The RNA‐seq dataset (mapped to gene level and RPKM‐normalized) was obtained from the GTEx project (http://www.gtexportal.org) 2, which at the time of download consisted of 607 post‐mortem skin samples. Further details on the dataset are provided in the supplementary material, Table S1. Quality control (QC) for the RNA‐seq dataset involved sample–sample correlation analyses, performed using the analysis software Miru (Kajeka Ltd, Edinburgh, UK). The topology of the sample–sample correlation was examined against the sample metadata, which identified major discrepancies between earlier and later RNA‐seq batches. Removal of early batches of data (LCSET‐1156 to LCSET‐1480) left a total of 578 samples, comprising 250 suprapubic and 328 lower leg samples.

A microarray dataset of normal skin was generated by combining the normal samples from two large psoriasis studies (GSE13355 and GSE30999) performed on the Affymetrix U133 plus 2.0 array and downloaded from the Gene Expression Omnibus (GEO) 11, 12, 13. Further details on these datasets are provided in the supplementary material, Table S1. Both QC and batch correction were carried out on the microarray dataset. Samples detected as outliers by the metrics report of arrayQualityMetrics 14 were excluded from further analysis. Samples passing QC were normalized using frozen robust multi‐array analysis 15 within study, followed by intra‐ and inter‐study batch correction using ComBat (sva, surrogate variable analysis; Bioconductor package) 16. Normalized data were loaded into Miru for sample–sample correlation analysis. Following the removal of outliers and all samples from psoriatic psoriasis skin, 165 normal (healthy subjects or non‐lesional skin from psoriasis subjects) samples were available for downstream analyses.

Co‐expression network analysis and cluster annotation for individual datasets

Of the 56 318 transcripts (genes defined in GENCODE v19) in the original RNA‐seq dataset, 24 128 transcripts were expressed in normal skin (>1 RPKM in at least one sample). The filtered dataset of skin‐expressed transcripts was loaded into Miru, with a gene–gene Pearson correlation threshold (r) set at r ≥ 0.73. A similar process was repeated for the microarray dataset at probe level with a correlation threshold of r ≥ 0.66. The Markov clustering (MCL) algorithm 10 was used with an inflation value of 2.2 for identifying co‐expression clusters.

In order to identify the functional relevance of transcript clusters, we used a combination of bioinformatics tools, literature review, as well as similarity to previously defined co‐expression clusters 17, 18. Each co‐expression cluster was examined using a number of bioinformatics tools, including gene ontology (GO) annotation enrichment analysis (http://pantherdb.org; Gene Ontology database release 2016‐04‐23), pathway inspection (Reactome, http://www.reactome.org; KEGG, http://www.genome.jp/kegg), and protein localization (Human Protein Atlas, http://www.proteinatlas.org). In addition, co‐expression signatures from previous studies were manually compared with clusters derived from skin, allowing the naming of some of the signatures. Clusters without GO enrichment or without similarity to previously reported co‐expression signatures were further investigated by checking individual genes against the literature and the phenotypes reported for knockout mice.

SkinSig derivation

In order to compare across analyses, the RNA‐seq and the microarray data were mapped to HGNC gene symbols (Ensembl BioMart, release 84), which were used as common identifiers. Where more than one microarray probe targeted a given symbol, the probe set with the highest median absolute deviation for signal intensity was used. Both datasets were further filtered to include only genes common to both platforms and with an expression greater than 1 RPKM in at least one sample in the RNA‐seq dataset. A total of 15 736 transcripts met these criteria.

Correlation networks were created from the common‐symbol filtered datasets at r ≥ 0.66 for the microarray dataset and r ≥ 0.73 for the RNA‐seq dataset. Network‐derived clusters from each study were individually annotated with the assistance of signature tracks imported from the network analysis of the full transcript/probe set analyses. Genes with the same annotation for both studies were used to construct SkinSig, together with three study‐specific annotations (circadian clock, skeletal muscle, and apocrine gland).

Application of SkinSig to gene expression data from skin conditions

The utilities of SkinSig in analysing gene expression data were demonstrated using existing microarray or RNA‐seq datasets from a variety of skin conditions. The combined dataset originally organized and normalized by Inkeles et al 3 was included, alongside six additional datasets identified in GEO, covering a total of 18 skin conditions. The dataset from ref 3 comprised microarray data from 15 skin conditions and normal skin, all based on the Affymetrix U133 plus 2.0 array platform. The six additional studies used a variety of platforms, covering six different skin conditions, three of which were not investigated in ref 3. An overview on each dataset is provided in the supplementary material, Table S1. In ref 3, the control group comprised normal skin samples from multiple studies (supplementary material, Table S1). For all other datasets, the test groups (i.e. skin condition) were compared against the control groups within the same study. Complex study designs were simplified accordingly (supplementary material, Table S1). Ageing gene expression data from Glass et al 19 were analysed by comparing each age group (50–60, 60–70, and > 70 years old) with the youngest group (≤50 years).

For each dataset, the geometric means were used to average across multiple probes for the same HGNC symbol. The expression level of each signature was calculated as an average of all genes within a signature (transformed to log2 prior to averaging). The log fold changes between these values from the pairing test and control groups were used to plot a heatmap, thereby allowing comparison across the different skin conditions. Positive fold change reflects overrepresentation of the signature in the test group, i.e. a particular skin condition. Details on each test–control pairing are described in the supplementary material, Table S1. Rotation gene set test (ROAST) (limma, linear models for microarray data; Bioconductor package) 20 was used to assess the statistical difference in expression between the control and the test groups, treating each gene within a signature as a separate measurement. Signatures were considered to be significantly altered when the false discovery rates (FDRs) ≤ 0.01 and ≥ 80% genes changed in the same direction (increased or decreased).

The effect of psoriasis on the keratinocyte differentiation signature was examined using quality‐controlled raw data from GSE13355 and GSE30999 and processed data for GSE54456 (supplementary material, Table S1). Co‐expression network analyses were carried out on the keratinocyte differentiation signature for each study, with only normal skin, only psoriatic skin or all samples. By using the MCL algorithm (inflation value = 2.2) on the co‐expression network for all samples from GSE13355, two subgroups of the keratinocyte differentiation signature were identified. ROAST was used to determine whether the keratinocyte differentiation subgroups were significantly differentially expressed. An FDR ≤ 0.01 was considered to be significant.

The study only involved publicly available de‐identified data; ethical approval was not required in the country/region in which the study was carried out (Scotland, UK).

Results

Network analysis of transcriptomic data from normal skin

Two datasets representing large collections of normal skin biopsies were selected (GSE13355 and GSE30999) 11, 12, 13. These comprised microarray data from healthy subjects or non‐lesional skin from psoriasis subjects of mixed anatomical origin (n = 165). In addition, a collection of RNA‐seq data from biopsies of post‐mortem normal skin from lower leg or suprapubic regions (n = 578) was obtained from the Genotype‐Tissue Expression (GTEx) project 2.

We interrogated these datasets using co‐expression network analysis 18, 21. Here, subtle differences between the normal skin biopsies result in groups of co‐expressed genes forming highly connected cliques within the network's overall topology. This method relies on characterizing groups of co‐expressed genes, rather than conventional analysis of statistically significant differences between pre‐determined sample groups. Co‐expression networks were independently constructed for the microarray and RNA‐seq datasets.

Following QC and batch correction, both the RNA‐seq and the microarray datasets showed little overall variation between samples, suggesting little variation attributable to technical or biological factors (supplementary material, Figure S1A, B). Although the microarray and RNA‐seq datasets differed in a number of respects (analysis platform, sampling site, live versus dead donors, etc.), the median expression levels for the majority of genes expressed across the two studies were consistent (supplementary material, Figure S1C). However, genes with little expression (<100 signal intensity, microarray; < 1 RPKM, RNA‐seq) demonstrated a non‐linear relationship between the two platforms. Saturation of microarray probes and a non‐linear signal response for highly expressed genes is a known limitation of microarray analyses, and here, the signal intensity plateaued at a signal intensity value of approximately 16 000 (supplementary material, Figure S1C). A small number of genes are observed to be very highly expressed in RNA‐seq data relative to others (supplementary material, Figure S1D).

Correlation analysis is based on the analysis of statistically improbable relationships (Figure S2) which are used in the generation of co‐expression networks. Use of the MCL algorithm 10 enables these networks to be divided into gene clusters that share a similar expression pattern across the dataset. The optimal Pearson correlation coefficient r value used to construct each network was determined empirically. At an r value of 0.73, the RNA‐seq dataset yielded a co‐expression network which was composed of 10 336 nodes (genes), connected by 114 904 edges, and contained 927 clusters (Figure 1A). The majority of the clusters were small; only 24 clusters had more than 50 genes, and 123 clusters had more than ten genes. Similar analyses were carried out on the microarray dataset at r ≥ 0.66, where the co‐expression network was composed of 15 158 nodes, 169 889 edges, and 1549 clusters (Figure 1B). The gene clusters derived from each dataset were mined extensively to understand their gene content. The significance of some clusters was easy to explain as their contents shared a high degree of similarity to those observed previously and/or were enriched in genes with informative GO annotations. Other gene clusters were less easy to interpret and required manual curation and expert review. The full cluster list, their gene composition, and functional annotations are provided in the supplementary material, Table S2.

Figure 1.

PATH-4864-FIG-0001-c

Network analysis of the RNA‐seq and microarray datasets. The IDs for some of the clusters are noted on the co‐expression network for (A) the RNA‐seq and (B) microarray datasets. Nodes are coloured according to cluster membership. Nodes represent genes or transcripts, and edges correlations between them above the Pearson correlation threshold value. (C) The average expression profile for a number of the clusters (highlighted in B) found in the RNA‐seq dataset, in some cases corresponding to the expected trends across gender and sampling location (suprapubic or leg) or gender.

The RNA‐seq dataset contained several gene clusters that appeared to be derived from the ‘contamination’ of samples from explicable sources, including skeletal muscle (Cluster 12), neurones (Cluster 43), and glial cells (Cluster 83) (see supplementary material, Table S2). Three clusters of genes were unexpected and inexplicable in skin samples; Cluster 1 (spermatids, expression predominately associated with male suprapubic samples) contained many genes involved in spermatogenesis, Cluster 29 (pancreas) contained genes encoding pancreatic enzymes (such as pancreatic lipases, proteases, and insulin), whereas Cluster 97 (stomach mucosa) contained genes encoding gastrokine, gastric lipase, and pepsinogens (see supplementary material, Table S2). However, many of the remaining clusters observed in the RNA‐seq dataset were found to show significant overlap with gene clusters present in the microarray dataset (see below). Exceptions included a cluster of genes exclusively expressed in the suprapubic samples which contained several apocrine gland markers, e.g. ABCC11 and ACSM1 (Figure 1C), reflecting the restricted presence of these glands to pubic and axillary skin regions 22. Conversely, a small cluster observed only in the microarray dataset consisted exclusively of circadian clock‐associated genes, such as PER1 and PER3, and may reflect the difference in the time of sampling or the use of post‐mortem samples in the RNA‐seq dataset.

Derivation conserved skin gene signatures (SkinSig)

We identified 17 signatures with overlapping gene membership in both the RNA‐seq and the microarray datasets (Figure 2A). These consisted of clusters identified as being derived from the majority of the appendages and cell types present in skin, as well as clusters of genes associated with core biological pathways, such as the cell cycle (Table 1 and supplementary material, Table S3). Within each signature, the overlap in gene membership in the RNA‐seq and the microarray datasets was 59 ± 18% and 63 ± 22% (mean ± SD), respectively (Figure 2A and Table 1). In order to define a robust set of marker genes, only those present in the same signature in both datasets were included in the final signature lists. The overlapping gene sets should be considered a highly conservative list of transcripts associated with skin appendages, cells, and pathways.

Figure 2.

PATH-4864-FIG-0002-c

Cross validation of signatures between the RNA‐seq and microarray datasets. (A) When gene clusters derived individually from the two datasets were compared, a portion of the genes was common to both analyses (red). These genes form the basis of SkinSig. Some clusters consisted of more genes in the RNA‐seq dataset (grey) than the microarray dataset (white) and vice versa. (B) A schematic diagram of skin illustrates the number of marker genes for each signature. Study‐specific signatures. The keratinocyte (subset) is a small group of highly co‐expressed genes but whose expression is independent of the keratinocyte differentiation signature genes.

Table 1.

Overlap between annotation made on the co‐expression clusters derived from network analysis of the RNA‐seq and microarray datasets

Cluster annotation RNA‐seq‐specific Microarray‐specific SkinSig (No. of genes) % Common (relative to the RNA‐seq dataset) % Common (relative to the microarray dataset)
Hair follicle 76 10 62 45% 86%
Sebaceous gland 17 229 105 86% 31%
Eccrine sweat gland 62 79 101 62% 56%
Apocrine gland 25 0 25 Dataset‐specific
Keratinocyte differentiation 48 114 78 62% 41%
Keratinocyte (subset) 4 4 7 64% 64%
Melanocyte 1 10 7 88% 41%
Fibroblast 69 160 132 66% 45%
Endothelium 150 18 40 21% 69%
Smooth muscle 49 19 49 50% 72%
Skeletal muscle 64 0 64 Dataset‐specific
Adipocyte 10 21 30 75% 59%
Macrophage/DC 45 39 49 52% 56%
T cell 29 13 17 37% 57%
FcϵR signalling 9 0 7 44% 100%
Plasma cell 2 17 7 78% 29%
IFN 21 5 22 51% 81%
Cell cycle (S/M) 69 8 65 49% 89%
Circadian clock 0 8 8 Dataset‐specific
Y‐chromosome 3 1 13 81% 93%

In addition to these 17 signatures, three additional signatures that were only observed in one of the two datasets were also included: circadian clock (microarray only), apocrine gland, and skeletal muscle (RNA‐seq only) (Figure 2B and Table 1). Apocrine glands are only found in certain skin regions, such as external genitalia (suprapubic samples) present only in the RNA‐seq samples. Skeletal muscle contamination, presumably due to biopsy depth, was also only evident in the RNA‐seq dataset. We have collectively named these gene signatures SkinSig.

GO enrichment terms and key marker genes present in each signature are detailed in the supplementary material, Table S3. However, GO enrichment analysis did not help with the functional assignation of some signatures, including sebaceous gland, apocrine gland, eccrine sweat gland, keratinocyte (subset), and Y‐chromosome. The justification and relevant literature supporting the functional annotation of these signatures are discussed in the supplementary material, Supplementary discussion. Our analyses obtained several novel marker gene signatures defined here for the first time. These include signatures for hair follicles, sebaceous glands, eccrine sweat glands, apocrine glands, and melanocytes. The localization of proteins encoded by some of these genes is independently confirmed in immunohistochemistry images derived from the Human Protein Atlas 23 (Figure 3).

Figure 3.

PATH-4864-FIG-0003-c

Examples of immunohistochemical staining of proteins encoded by SkinSig marker genes. Localization of a number of the proteins encoded by a selection of SkinSig marker genes, verifying the assignment given. Reprinted from the Human Protein Atlas database 23, with permission.

Application of SkinSig to the interpretation of gene expression data

To validate and test the utility of SkinSig, we gathered transcriptomic data from studies of 18 different skin conditions (pathological or physiologically altered). These included the combined dataset reported by Inkeles et al 3 and a further six datasets downloaded from GEO (see supplementary material, Table S1). For each skin condition, the fold change in expression level of all genes within each SkinSig gene signature was calculated and compared with control samples (supplementary material, Table S1). Each skin condition had a specific profile of altered gene expression of the SkinSig signatures. Hierarchical clustering of the signatures (excluding skeletal muscle, Y‐chromosome, and apocrine gland) based on these analyses revealed three main groupings of skin conditions (Figure 4).

Figure 4.

PATH-4864-FIG-0004-c

Skin signatures applied in analysis for 18 skin conditions. The heatmap and dendrogram were derived from the log2 fold change for each skin condition (test versus control group) for the 17 skin signatures on the left‐hand side of the dashed line. Expression of the remaining three signatures, apocrine gland, skeletal muscle, and Y‐chromosome, is highly dependent on sample properties unrelated to the condition (such as gender, contamination or sampling sites) and was therefore not included in the clustering. Full details of the datasets can be found in the supplementary material, Table S1. Significantly altered signatures (FDR ≤ 0.01 and ≥ 80% of the genes altered in the same direction) are indicated with a white dot. Due to the stated criteria for a comparison to be considered significant, there are instances where the average expression appears strongly dysregulated but is not considered significant; in these cases, it may be that FDR ≤ 0.01 but only < 80% of the signature is altered in the same direction. Data for each skin condition derived from ref 3 were compared against the data for the same group of normal skin derived from multiple studies. Sampling designs for these studies may introduce an artificially‐altered balance between cell populations (such as complete removal of epidermis) in the test group, but not the control group. Further details may be found in the supplementary material, Table S1.

We also analysed an additional dataset comprising normal and psoriatic skin (GSE13355; n = 46 external normal skin; n = 46 patient‐matched normal skin; n = 45 patient‐matched psoriatic skin) 11. The keratinocyte differentiation signature was found to split into two subgroups of genes which were significantly up‐ (38 genes) and down‐regulated (19 genes) in psoriasis (FDR ≤ 0.01) (Figure 5A). When these two subgroups of genes were applied to two additional psoriasis datasets (GSE30999 and GSE54456), we observed a similar trend (Figure 5B), demonstrating that this loss of co‐expression between the two subgroups was replicable across independent studies.

Figure 5.

PATH-4864-FIG-0005-c

Alteration in the keratinocyte differentiation signature in psoriasis. Co‐expression networks were constructed using only genes in the keratinocyte differentiation signature for psoriasis studies, GSE13355 (left), GSE30999 (middle), and GSE54456 (right). (A) Using the MCL algorithm on the network graph for the full dataset of GSE13355, the signature was split into two subgroups, one up‐regulated (red) and one down‐regulated (blue) in psoriasis. This clustering has been overlaid on the networks derived from the other studies. The separation of the keratinocyte differentiation signature subgroups is dependent on the sample types included in the network analysis: normal skin only (top), psoriatic lesions only (middle), and both sample types (bottom). (B) The average expression for all genes within each subgroup for each sample. (C) The log2 ratios between the expression of the two subgroups are generally lower than zero in the control samples. On the other hand, these ratios are higher and more variable in between psoriatic samples, perhaps reflecting disease severity.

Lastly, we applied SkinSig to a collection of data from ageing human skin (infra‐umbilical skin from female twins) 19. SkinSig revealed significant changes (FDR ≤ 0.01) to gene signatures associated with hair follicles and sebaceous gland in ageing skin (Figure 6). Approximately 44–62% of the content of these signatures was reported to be differentially expressed in ageing skin in the original study 19 (supplementary material, Table S4).

Figure 6.

PATH-4864-FIG-0006-c

Effects of ageing on SkinSig. (A) The heatmap illustrates the changes in SkinSig during ageing by comparing each age group with the youngest age group (≤50 years old), with comparisons yielding statistical significance (FDR ≤ 0.01 and ≥ 80% genes altered in the same direction) highlighted with a white dot. The fold change was calculated from the log2 mean expression for genes present in each signature. Rotation gene set tests were employed for statistical analysis. (B) The violin plots show the average expression for the hair follicle and sebaceous gland signatures for each individual. These plots show the change in distribution across age groups. The median expression value for each age group is indicated by a blue bar.

Discussion

Here, co‐expression network analysis has been used to interrogate transcriptomic data derived from normal human skin. Using this approach, we derived 20 co‐expression signatures that characterize the function‐specific profile of many cell types and appendages present in the skin. The most conserved of these co‐expressed genes across the two primary datasets used for this study have been named SkinSig. Evidence supporting these gene set annotations is available in Table S3 and the Supplementary discussion (supplementary material), and an expanded list of annotated gene clusters derived from these studies, including the analysis of the data from the MuTHER study 19, is presented in Table S2 (supplementary material).

The utility of SkinSig was demonstrated by using the signatures to describe the transcriptional changes associated with 18 skin conditions and ageing. SkinSig broadly separated the skin conditions into three groups (Figure 4). Group 1 included psoriasis, discoid lupus erythematosus, squamous cell carcinoma, and basal cell carcinoma (Figure 4). Each of these presents epidermal changes such as hyperkeratosis and epidermal hyperproliferation 24, 25, and was characterized by marked increases (FDR ≤ 0.01) in the keratinocyte (subset) and IFN signatures. Most conditions within this group also demonstrated a significant increase in the cell cycle signature (FDR ≤ 0.01). This group also included burns, post‐operative wounds, and chancroid, each of which was associated with wound healing or ulceration.

Group 2 was characterized by strongly up‐regulated T‐cell, IFN, and macrophage/dendritic cell (DC) signatures (FDR ≤ 0.01 for all but acute wound), together with down‐regulated keratinocyte differentiation and melanocyte signatures (FDR ≤ 0.01 for all but sarcoid) (Figure 4). The overall SkinSig expression pattern for groups 1 and 2 suggested that the relative contribution of other cell types to the transcriptional profiles of these samples was diluted: their signatures appear to be down‐regulated by the influx of immune cells and associated up‐regulation of immune response genes.

Group 3 generally showed less transcriptional perturbation relative to normal skin. UV‐challenged skin and vitiligo were most similar to normal skin, with the former showing an increase and the latter a decrease in the expression of the melanocyte signature (Figure 4). UV exposure would be expected to induce melanocyte proliferation 26, while vitiligo involves melanocyte loss from patches of skin 27. Other disorders in this group showed changes in the SkinSig signatures consistent with the known pathology of those conditions, such as the increased expression of the T‐cell signature in allergic contact dermatitis 28.

This study highlights the need to consider the sampling protocols used when interpreting transcriptomic data (Figure 4 and supplementary material, Table S1). In the case of the acute and post‐operative wound samples, the epidermis was removed prior to analysis 29, whereas the dermis was removed from the irritant contact dermatitis samples 30, and only blister fluid was used in the analysis for the Stevens–Johnson syndrome dataset 31. The complete removal of epidermis in the acute wound dataset, and the use of only epidermis for the irritant contact dermatitis dataset, may explain the unusual pattern within the heatmap for these two skin conditions; the former shows underrepresentation of epidermal cell types and overrepresentation of dermal cell types, whilst the reverse is true for the latter (Figure 4). Macro‐dissection or enrichment in tumour cell populations may likewise result in reduced resident skin cell diversity and abundance in samples such as those from melanoma and mycosis fungoides 32, 33.

Intriguingly, the keratinocyte differentiation signature did not appear to be up‐regulated in any of the skin conditions, including psoriasis. Upon closer examination of the expression of this signature in three independent psoriasis datasets, two subgroups of the keratinocyte differentiation signature genes were identified, and the ratio between the expression levels of these two subgroups was altered in psoriatic skin (Figure 5C). Higher variation of this ratio was also noted across the psoriatic samples, which may reflect the magnitude of the disease severity (Figure 5C). Within the keratinocyte differentiation signature subgroup that is down‐regulated in psoriasis, several genes, such as LCE1B, LCE2B, FLG2, and LOR, are known to be associated with the terminally differentiated keratinocytes (cornecytes) that make up the stratum corneum. Furthermore, LCE3B and LCE3C deletions have been identified as risk factors for psoriasis in multiple ethnic groups 34. Several genes known to be expressed in suprabasal keratinocytes were present within the keratinocyte differentiation signature subgroup up‐regulated in psoriasis. The altered expression profile of these genes most likely reflects the hyperproliferation of the epidermis that is associated with psoriasis 35. In short, genes in these subgroups of the keratinocyte differentiation signature were expressed at a similar ratio across normal skin, but become uncoordinated in psoriatic lesions. Instead of considering lists of dysregulated genes, this co‐expression approach with SkinSig allows the recognition of a disrupted system by defining the dynamics between genes in a physiologically normal state.

Glass et al 19 have reported a decrease in the expression of genes during skin ageing associated with keratinization and fatty acid metabolism. Using SkinSig, these changes may more accurately be described as a reduction in the number or functional activity of sebaceous glands and hair follicles with age (Figure 6). These observations are consistent with the rapid decline in scalp hair coverage in women over 45 years old 36. The reduction in the sebaceous gland signature is also consistent with previous reports showing gradually decreased activity of this gland after menopause 37. A small increase in the expression of smooth muscle signature is also seen in the age groups 50–60 and above 70 years (Figure 6); migration and accumulation of vascular smooth muscle cells into the tunica intima have been implicated in ageing 38.

In summary, we have defined a set of marker genes, collectively named SkinSig, which comprise a useful resource of gene signatures derived from skin appendages, cell types, and pathways present in normal human skin. SkinSig not only includes potential new marker genes for skin‐resident cell types and processes, but can also be used to interrogate gene expression data derived from whole human skin. Importantly, SkinSig can be used to obtain novel insights into the physiological and pathological changes that occur in the skin.

Author contributions statement

BS carried out the collection, analysis, and interpretation of data, as well as compilation of the figures and tables. AN and DH contributed to data analysis and interpretation. NA, NM, and TF contributed to the study design. NM and TF contributed equally. All authors were involved in writing the paper and had final approval of the submitted and published versions.

SUPPLEMENTARY MATERIAL ONLINE.

Supplementary discussion. Further discussion on other gene clusters of interest and justifications for the assignation of annotation of co‐expression signatures without a significant or relevant gene ontology term

Supplementary figure and table legends

Figure S1. Sample–sample correlation and signal comparisons between the RNA‐seq and microarray datasets

Figure S2. Pearson correlation thresholds in randomized and original data

Table S1. Details on the datasets used in this study

Table S2. SkinSig and co‐expression signatures derived from the analysis of different datasets derived from normal human skin.

Table S3. Gene ontology enrichment and example genes for each signature

Table S4. Comparison between SkinSig and genes found to be significantly altered during ageing

Supporting information

Supplementary discussion. Further discussion on other gene clusters of interest and justifications for the assignation of annotation of co‐expression signatures without a significant or relevant gene ontology term

Supplementary figure and table legends

Figure S1 Sample–sample correlation and signal comparisons between the RNA‐seq and microarray datasets. (A) Sample–sample correlation plots of data used in these studies, using the maximum Pearson correlation coefficient threshold that still retained all samples for the RNA‐seq (r ≥ 0.93) and (B) the microarray (r ≥ 0.97) datasets. There is minimal sample separation due to the main sample attribute, i.e. study (microarray) or site of sampling (RNA‐seq). (C) Log‐scale plot of median expression values, showing that expression levels for the majority of genes are positively correlated in the two datasets. (D) Of the top ten genes with the highest expression for the microarray (red) and RNA‐seq (blue) datasets, there were only two genes in common (RPS18 and KRT14). A higher dynamic range was also observed in the RNA‐seq dataset.

Figure S2 Pearson correlation thresholds in randomized and original data. (A) When expression values for each gene were randomized across the samples of the RNA‐seq dataset, only 95 pairing (edges) were observed at a threshold used in this analysis (r ≥ 0.73), whilst the untransformed data yielded 87 121 edges. Taking into account that a total of 123 802 980 calculations were made for every possible combination of gene–gene Pearson correlation analysis, the frequency of a pair of genes reaching the r threshold is 7.7 × 10−7 (blue), compared with a frequency of 7.0 × 10−4 for non‐random correlations observed in the actual dataset (red). This supports the notion that the vast majority of relationships used to build the network analysed here are non‐random. (B) Similar analysis was done to the microarray dataset. The randomized version of the dataset shows a frequency of 1 × 10−8 for a random correlation (blue) to occur at r ≥ 0.66, compared with a frequency of 6.7 × 10−4 observed in the actual dataset (red).

Table S1 Details on the datasets used in this study The subject details, experimental design, data source, and comparisons for all datasets included in this study are listed in Table S1. This includes the three primary data datasets, 22 validation datasets for skin conditions, three psoriasis datasets for integrating the keratinocyte differentiation signature, and one ageing dataset.

Table S2 SkinSig and co‐expression signatures derived from the analysis of different datasets derived from normal human skin The supplementary file includes the co‐expression signatures for all network analysis, including the individual full original datasets, gene‐symbol restricted datasets, and SkinSig.

Table S3 Gene ontology enrichment and example genes for each signature In addition to highlighting a number of key genes and significant gene ontology (GO) enrichment result for each SkinSig, the supplementary file also includes the full set results from gene ontology enrichment.

Table S4 Comparison between SkinSig and genes found to be significantly altered during ageing Genes reported to change in their expression during ageing are compared with SkinSig, showing a high percentage of the sebaceous gland and hair signatures being reported to alter with ageing by Glass et al [19].

Acknowledgements

This work was supported by Experimental Medicine Challenge Grant funding from the Medical Research Council (MR/M003833/1) and Institute Strategic Programme Grant funding from the Biotechnology and Biological Sciences Research Council (BB/J004227/1). We thank Dr William Swindell for sharing the cell‐specific signatures originally published in [4] and Professor Robert L Modlin for sharing normalized data originally published in [3].

No conflicts of interest were declared.

Contributor Information

Neil A Mabbott, Email: neil.mabbott@roslin.ed.ac.uk.

Tom C Freeman, Email: tom.freeman@roslin.ed.ac.uk.

References

*Cited only in supplementary material.

  • 1. Hay RJ, Johns NE, Williams HC, et al The global burden of skin disease in 2010: an analysis of the prevalence and impact of skin conditions. J Invest Dermatol 2014; 134 : 1527–1534. [DOI] [PubMed] [Google Scholar]
  • 2. Lonsdale J, Thomas J, Salvatore M, et al The Genotype‐Tissue Expression (GTEx) project. Nature Genet 2013; 45: 580‐585. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Inkeles MS, Scumpia PO, Swindell WR, et al Comparison of molecular signatures from multiple skin diseases identifies mechanisms of immunopathogenesis. J Invest Dermatol 2015; 135:151–159. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Swindell WR, Johnston A, Voorhees JJ, et al Dissecting the psoriasis transcriptome: inflammatory‐ and cytokine‐driven gene expression in lesions from 163 patients. BMC Genomics 2013; 14:527. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Li B, Tsoi LC, Swindell WR, et al Transcriptome analysis of psoriasis in a large case–control sample: RNA‐seq provides insights into disease mechanisms. J Invest Dermatol 2014; 134:1828–1838. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Zhang B, Horvath S. A general framework for weighted gene co‐expression network analysis. Stat Appl Genet Mol Biol 2005;4:article 17. [DOI] [PubMed] [Google Scholar]
  • 7. Freeman TC, Ivens A, Baillie JK, et al A gene expression atlas of the domestic pig. BMC Biol 2012; 10:1–22. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Hume DA, Summers KM, Raza S, et al Functional clustering and lineage markers: insights into cellular differentiation and gene function from large‐scale microarray studies of purified primary cell populations. Genomics 2010; 95:328–338. [DOI] [PubMed] [Google Scholar]
  • 9. Natividad A, Freeman TC, Jeffries D, et al Human conjunctival transcriptome analysis reveals the prominence of innate defense in Chlamydia trachomatis infection. Infect Immun 2010; 78:4895–4911. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Enright AJ, Van Dongen S, Ouzounis CA. An efficient algorithm for large‐scale detection of protein families. Nucleic Acids Res 2002; 30:1575–1584. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Nair RP, Duffin KC, Helms C, et al Genome‐wide scan reveals association of psoriasis with IL‐23 and NF‐kappaB pathways. Nature Genet 2009; 41:199–204. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Swindell WR, Johnston A, Carbajal S, et al Genome‐wide expression profiling of five mouse models identifies similarities and differences with human psoriasis. PLoS One 2011; 6:e18266. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Suarez‐Farinas M, Li K, Fuentes‐Duculan J, et al Expanding the psoriasis disease profile: interrogation of the skin and serum of patients with moderate‐to‐severe psoriasis. J Invest Dermatol 2012; 132:2552–2564. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Kauffmann A, Gentleman R, Huber W. arrayQualityMetrics – a bioconductor package for quality assessment of microarray data. Bioinformatics 2009; 25:415–416. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. McCall MN, Bolstad BM, Irizarry RA. Frozen robust multiarray analysis (fRMA). Biostatistics 2010; 11:242–253. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Johnson WE, Li C, Rabinovic A. Adjusting batch effects in microarray expression data using empirical Bayes methods. Biostatistics 2007; 8:118–127. [DOI] [PubMed] [Google Scholar]
  • 17. Doig TN, Hume DA, Theocharidis T, et al Coexpression analysis of large cancer datasets provides insight into the cellular phenotypes of the tumour microenvironment. BMC Genomics 2013; 14:469. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Mabbott NA, Baillie JK, Brown H, et al An expression atlas of human primary cells: inference of gene function from coexpression networks. BMC Genomics 2013; 14:632. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Glass D, Vinuela A, Davies MN, et al Gene expression changes with age in skin, adipose tissue, blood and brain. Genome Biol 2013; 14:R75. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Wu D, Lim E, Vaillant F, et al ROAST: rotation gene set tests for complex microarray experiments. Bioinformatics 2010; 26:2176–2182. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Freeman TC, Goldovsky L, Brosch M, et al Construction, visualisation, and clustering of transcription networks from microarray expression data. PLoS Comput Biol 2007; 3:2032–2042. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Woollard HH. The cutaneous glands of man. J Anat 1930; 64:415–421. [PMC free article] [PubMed] [Google Scholar]
  • 23. Uhlen M, Bjorling E, Agaton C, et al A human protein atlas for normal and cancer tissues based on antibody proteomics. Mol Cell Proteomics 2005; 4 : 1920–1932. [DOI] [PubMed] [Google Scholar]
  • 24. Fuchs E, Raghavan S. Getting under the skin of epidermal morphogenesis. Nature Rev Genet 2002; 3:199–209. [DOI] [PubMed] [Google Scholar]
  • 25. Panjwani S. Early diagnosis and treatment of discoid lupus erythematosus. J Am Board Fam Med 2009; 22:206–213. [DOI] [PubMed] [Google Scholar]
  • 26. Sato T, Kawada A. Uptake of tritiated thymidine by epidermal melanocytes of hairless mice during ultraviolet light radiation. J Invest Dermatol 1972; 58:71–73. [DOI] [PubMed] [Google Scholar]
  • 27. Bastonini E, Kovacs D, Picardo M. Skin pigmentation and pigmentary disorders: focus on epidermal/dermal cross‐talk. Ann Dermatol 2016; 28:279–289. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Kaplan DH, Igyarto BZ, Gaspari AA. Early immune events in the induction of allergic contact dermatitis. Nature Rev Immunol 2012; 12:114–124. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Nuutila K, Siltanen A, Peura M, et al Human skin transcriptome during superficial cutaneous wound healing. Wound Repair Regen 2012; 20:830–839. [DOI] [PubMed] [Google Scholar]
  • 30. Clemmensen A, Andersen KE, Clemmensen O, et al Genome‐wide expression analysis of human in vivo irritated epidermis: differential profiles induced by sodium lauryl sulfate and nonanoic acid. J Invest Dermatol 2010; 130:2201–2210. [DOI] [PubMed] [Google Scholar]
  • 31. Chung WH, Hung SI, Yang JY, et al Granulysin is a key mediator for disseminated keratinocyte death in Stevens–Johnson syndrome and toxic epidermal necrolysis. Nature Med 2008; 14:1343–1350. [DOI] [PubMed] [Google Scholar]
  • 32. Riker AI, Enkemann SA, Fodstad O, et al The gene expression profiles of primary and metastatic melanoma yields a transition point of tumor progression and metastasis. BMC Med Genomics 2008; 1:1–16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. van Doorn R, van Kester MS, Dijkman R, et al Oncogenomic analysis of mycosis fungoides reveals major differences with Sézary syndrome. Blood 2009; 113:127–136. [DOI] [PubMed] [Google Scholar]
  • 34. Riveira‐Munoz E, He SM, Escaramis G, et al Meta‐analysis confirms the LCE3C_LCE3B deletion as a risk factor for psoriasis in several ethnic groups and finds interaction with HLA‐Cw6. J Invest Dermatol 2011; 131:1105–1109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Schön MP, Boehncke W‐H. Psoriasis. N Engl J Med 2005; 352:1899–1912. [DOI] [PubMed] [Google Scholar]
  • 36. Robbins C, Mirmirani P, Messenger AG, et al What women want – quantifying the perception of hair amount: an analysis of hair diameter and density changes with age in caucasian women. Br J Dermatol 2012; 167:324–332. [DOI] [PubMed] [Google Scholar]
  • 37. Pochi PE, Strauss JS, Downing DT. Age‐related changes in sebaceous gland activity. J Invest Dermatol 1979; 73:108–111. [DOI] [PubMed] [Google Scholar]
  • 38. Orlandi A, Bochaton‐Piallat ML, Gabbiani G, et al Aging, smooth muscle cells and vascular pathobiology: implications for atherosclerosis. Atherosclerosis 2006; 188:221‐230. [DOI] [PubMed] [Google Scholar]
  • 39*. Zhang S, Shui G, Wang G, et al Cidea control of lipid storage and secretion in mouse and human sebaceous glands. Mol Cell Biol 2014; 34:1827–1838. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40*. Wakimoto K, Chiba H, Michibata H, et al A novel diacylglycerol acyltransferase (DGAT2) is decreased in human psoriatic skin and increased in diabetic mice. Biochem Biophys Res Commun 2003; 310:296–302. [DOI] [PubMed] [Google Scholar]
  • 41*. Ge L, Gordon JS, Hsuan C, et al Identification of the delta‐6 desaturase of human sebaceous glands: expression and enzyme activity. J Invest Dermatol 2003; 120:707–714. [DOI] [PubMed] [Google Scholar]
  • 42*. Zhang L, Li WH, Anthonavage M, et al Melanocortin‐5 receptor: a marker of human sebocyte differentiation. Peptides 2006; 27:413–420. [DOI] [PubMed] [Google Scholar]
  • 43*. Westerberg R, Tvrdik P, Unden AB, et al Role for ELOVL3 and fatty acid chain length in development of hair and skin function. J Biol Chem 2004; 279:5621–5629. [DOI] [PubMed] [Google Scholar]
  • 44*. Maier H, Meixner M, Hartmann D, et al Normal fur development and sebum production depends on fatty acid 2‐hydroxylase expression in sebaceous glands. J Biol Chem 2011; 286:25922–25934. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45*. Jong MC, Gijbels MJ, Dahlmans VE, et al Hyperlipidemia and cutaneous abnormalities in transgenic mice overexpressing human apolipoprotein C1. J Clin Invest 1998; 101:145–152. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46*. Kieri CF, Bergendal B, Lind LK, et al EDAR‐induced hypohidrotic ectodermal dysplasia: a clinical study on signs and symptoms in individuals with a heterozygous c.1072C > T mutation. BMC Med Genet 2014; 15:57. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47*. Rowe SM, Miller S, Sorscher EJ. Cystic fibrosis. N Engl J Med 2005; 352:1992–2001. [DOI] [PubMed] [Google Scholar]
  • 48*. Cui CY, Childress V, Piao Y, et al Forkhead transcription factor FoxA1 regulates sweat secretion through bestrophin 2 anion channel and Na–K–Cl cotransporter 1. Proc Natl Acad Sci U S A 2012; 109:1199–1203. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49*. Nejsum LN, Kwon TH, Jensen UB, et al Functional requirement of aquaporin‐5 in plasma membranes of sweat glands. Proc Natl Acad Sci U S A 2002; 99:511–516. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50*. Cui CY, Schlessinger D. Eccrine sweat gland development and sweat secretion. Exp Dermatol 2015; 24:644–650. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51*. Martin A, Saathoff M, Kuhn F, et al A functional ABCC11 allele is essential in the biochemical formation of human axillary odor. J Invest Dermatol 2010; 130:529–540. [DOI] [PubMed] [Google Scholar]
  • 52*. Celis JE, Cabezon T, Moreira JM, et al Molecular characterization of apocrine carcinoma of the breast: validation of an apocrine protein signature in a well‐defined cohort. Mol Oncol 2009; 3:220–237. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53*. Roy S, Shafi NQ, Rose MG. Locally recurrent and metastatic apocrine‐gland carcinoma in an elderly man. Nature Clin Prac Oncol 2007; 4:56–59. [DOI] [PubMed] [Google Scholar]
  • 54*. Martinsson H, Yhr M, Enerback C. Expression patterns of S100A7 (psoriasin) and S100A9 (calgranulin‐B) in keratinocyte differentiation. Exp Dermatol 2005; 14:161–168. [DOI] [PubMed] [Google Scholar]
  • 55*. Wolf R, Lewerenz V, Büchau AS, et al Human S100A15 splice variants are differentially expressed in inflammatory skin diseases and regulated through Th1 cytokines and calcium. Exp Dermatol 2007; 16:685–691. [DOI] [PubMed] [Google Scholar]
  • 56*. Dinulos JG, Mentele L, Fredericks LP, et al Keratinocyte expression of human beta defensin 2 following bacterial infection: role in cutaneous host defense. Clin Diagn Lab Immunol 2003; 10:161–166. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57*. Meyer JE, Harder J, Sipos B, et al Psoriasin (S100A7) is a principal antimicrobial peptide of the human tongue. Mucosal Immunol 2008; 1:239–243. [DOI] [PubMed] [Google Scholar]
  • 58*. Kerkhoff C, Voss A, Scholzen TE, et al Novel insights into the role of S100A8/A9 in skin biology. Exp Dermatol 2012; 21:822–826. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59*. Iotzova‐Weiss G, Dziunycz PJ, Freiberger SN, et al S100A8/A9 stimulates keratinocyte proliferation in the development of squamous cell carcinoma of the skin via the receptor for advanced glycation‐end products. PLoS One 2015; 10:e0120971. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60*. Gibbs S, Fijneman R, Wiegant J, et al Molecular characterization and evolution of the SPRR family of keratinocyte differentiation markers encoding small proline‐rich proteins. Genomics 1993; 16:630–637. [DOI] [PubMed] [Google Scholar]
  • 61*. Coelho SG, Valencia JC, Yin L, et al UV exposure modulates hemidesmosome plasticity, contributing to long‐term pigmentation in human skin. J Pathol 2015; 236:17–29. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62*. Coelho SG, Yin L, Smuda C, et al Photobiological implications of melanin photoprotection after UVB‐induced tanning of human skin but not UVA‐induced tanning. Pigment Cell Melanoma Res 2015; 28:210–216. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63*. Fan F, Jin S, Amundson SA, et al ATF3 induction following DNA damage is regulated by distinct signaling pathways and over‐expression of ATF3 protein suppresses cells growth. Oncogene 2002; 21:7488–7496. [DOI] [PubMed] [Google Scholar]
  • 64*. Staples CJ, Owens DM, Maier JV, et al Cross‐talk between the p38alpha and JNK MAPK pathways mediated by MAP kinase phosphatase‐1 determines cellular sensitivity to UV radiation. J Biol Chem 2010; 285:25928–25940. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65*. Schreiber M, Baumann B, Cotten M, et al Fos is an essential component of the mammalian UV response. EMBO J 1995; 14:5338–5349. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66*. Devary Y, Gottlieb RA, Lau LF, et al Rapid and preferential activation of the c‐jun gene during the mammalian UV response. Mol Cell Biol 1991; 11:2804–2811. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67*. Li T, Dai W, Lu L. Ultraviolet‐induced junD activation and apoptosis in myeloblastic leukemia ML‐1 cells. J Biol Chem 2002; 277:32668–32676. [DOI] [PubMed] [Google Scholar]
  • 68*. Angel P, Szabowski A, Schorpp‐Kistner M. Function and regulation of AP‐1 subunits in skin physiology and pathology. Oncogene 2001; 20:2413–2423. [DOI] [PubMed] [Google Scholar]
  • 69*. Rinn JL, Bondre C, Gladstone HB, et al Anatomic demarcation by positional variation in fibroblast gene expression programs. PLoS Genet 2006; 2:e119. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70*. Schweizer J, Langbein L, Rogers MA, et al Hair follicle‐specific keratins and their diseases. Exp Cell Res 2007; 313:2010–2020. [DOI] [PubMed] [Google Scholar]
  • 71*. Kunisada M, Cui CY, Piao Y, et al Requirement for Shh and Fox family genes at different stages in sweat gland development. Hum Mol Genet 2009; 18:1769–1778. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72*. Iotzova‐Weiss G, Dziunycz PJ, Freiberger SN, et al S100A8/A9 stimulates keratinocyte proliferation in the development of squamous cell carcinoma of the skin via the receptor for advanced glycation‐end products. PloS One 2015; 10:e0120971. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73*. Hoashi T, Watabe H, Muller J, et al MART‐1 is required for the function of the melanosomal matrix protein PMEL17/GP100 and the maturation of melanosomes. J Biol Chem 2005; 280:14006–14016. [DOI] [PubMed] [Google Scholar]
  • 74*. Theos AC, Truschel ST, Raposo G, et al The Silver locus product Pmel17/gp100/Silv/ME20: controversial in name and in function. Pigment Cell Res 2005; 18:322–336. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75*. Giebel LB, Strunk KM, King RA, et al A frequent tyrosinase gene mutation in classic, tyrosinase‐negative (type IA) oculocutaneous albinism. Proc Natl Acad Sci U S A 1990; 87:3255–3258. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76*. Armstrong LJ, Heath VL, Sanderson S, et al ECSM2, an endothelial specific filamin A binding protein that mediates chemotaxis. Arterioscler Thromb Vasc Biol 2008; 28:1640–1646. [DOI] [PubMed] [Google Scholar]
  • 77*. Chen SH, Babichev Y, Rodrigues N, et al Gene expression analysis of Tek/Tie2 signaling. Physiol Genomics 2005; 22:257–267. [DOI] [PubMed] [Google Scholar]
  • 78*. Ueyama H, Hamada H, Battula N, et al Structure of a human smooth muscle actin gene (aortic type) with a unique intron site. Mol Cell Biol 1984; 4:1073–1078. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79*. Gillett GT, Fox MF, Rowe PS, et al Mapping of human non‐muscle type cofilin (CFL1) to chromosome 11q13 and muscle‐type cofilin (CFL2) to chromosome 14. Ann Hum Genet 1996; 60:201–211. [DOI] [PubMed] [Google Scholar]
  • 80*. Robin YM, Penel N, Perot G, et al Transgelin is a novel marker of smooth muscle differentiation that improves diagnostic accuracy of leiomyosarcomas: a comparative immunohistochemical reappraisal of myogenic markers in 900 soft tissue tumors. Mod Pathol 2013; 26:502–510. [DOI] [PubMed] [Google Scholar]
  • 81*. Hu E, Liang P, Spiegelman BM. AdipoQ is a novel adipose‐specific gene dysregulated in obesity. J Biol Chem 1996; 271:10697–10703. [DOI] [PubMed] [Google Scholar]
  • 82*. Gonzales AM, Orlando RA. Role of adipocyte‐derived lipoprotein lipase in adipocyte hypertrophy. Nutr Metab (Lond) 2007; 4:22. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83*. Murray PJ, Wynn TA. Protective and pathogenic functions of macrophage subsets. Nature Rev Immunol 2011; 11:723–737. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84*. Cruse G, Kaur D, Leyland M, et al A novel FcϵRIβ‐chain truncation regulates human mast cell proliferation and survival. FASEB J 2010; 24:4047–4057. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85*. van Anken E, Pena F, Hafkemeijer N, et al Efficient IgM assembly and secretion require the plasma cell induced endoplasmic reticulum protein pERp1. Proc Natl Acad Sci U S A 2009; 106:17019–17024. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86*. Kitamura A, Takahashi K, Okajima A, et al Induction of the human gene for p44, a hepatitis‐C‐associated microtubular aggregate protein, by interferon‐alpha/beta. Eur J Biochem 1994; 224:877–883. [DOI] [PubMed] [Google Scholar]
  • 87*. Marie I, Svab J, Robert N, et al Differential expression and distinct structure of 69‐ and 100‐kDa forms of 2‐5A synthetase in human cells treated with interferon. J Biol Chem 1990; 265:18601–18607. [PubMed] [Google Scholar]
  • 88*. Goriki A, Hatanaka F, Myung J, et al A novel protein, CHRONO, functions as a core component of the mammalian circadian clock. PLoS Biol 2014; 12:e1001839. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89*. Yan L, Miyake S, Okamura H. Distribution and circadian expression of dbp in SCN and extra‐SCN areas in the mouse brain. J Neurosci Res 2000; 59:291–295. [DOI] [PubMed] [Google Scholar]
  • 90*. Albrecht U, Sun ZS, Eichele G, et al A differential response of two putative mammalian circadian regulators, mper1 and mper2, to light. Cell 1997; 91:1055–1064. [DOI] [PubMed] [Google Scholar]
  • 91*. Mattiuzzo NR, Toulza E, Jonca N, et al A large‐scale multi‐technique approach identifies forty‐nine new players of keratinocyte terminal differentiation in human epidermis. Exp Dermatol 2011; 20:113–118. [DOI] [PubMed] [Google Scholar]
  • 92*. Toulza E, Mattiuzzo NR, Galliano MF, et al Large‐scale identification of human genes implicated in epidermal barrier function. Genome Biol 2007; 8:R107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 93*. Kretz M, Siprashvili Z, Chu C, et al Control of somatic tissue differentiation by the long non‐coding RNA TINCR. Nature 2013; 493:231–235. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 94*. Dhingra N, Shemer A, Correa da Rosa J, et al Molecular profiling of contact dermatitis skin identifies allergen‐dependent differences in immune response. J Allergy Clin Immunol 2014; 134:362–372. [DOI] [PubMed] [Google Scholar]
  • 95*. Pedersen MB, Skov L, Menne T, et al Gene expression time course in the human skin during elicitation of allergic contact dermatitis. J Invest Dermatol 2007; 127:2585–2595. [DOI] [PubMed] [Google Scholar]
  • 96*. Guttman‐Yassky E, Suarez‐Farinas M, Chiricozzi A, et al Broad defects in epidermal cornification in atopic dermatitis identified through genomic analysis. J Allergy Clin Immunol 2009; 124:1235–1244.e58. [DOI] [PubMed] [Google Scholar]
  • 97*. Suarez‐Farinas M, Tintle SJ, Shemer A, et al Nonlesional atopic dermatitis skin is characterized by broad terminal differentiation defects and variable immune abnormalities. J Allergy Clin Immunol 2011; 127:954–964.e1‐4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 98*. Riker AI, Enkemann SA, Fodstad O, et al The gene expression profiles of primary and metastatic melanoma yields a transition point of tumor progression and metastasis. BMC Med Genomics 2008; 1:13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 99*. Greco JA 3rd, Pollins AC, Boone BE, et al A microarray analysis of temporal gene expression profiles in thermally injured human skin. Burns 2010; 36:192–204. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 100*. Humphreys TL, Li L, Li X, et al Dysregulated immune profiles for skin and dendritic cells are associated with increased host susceptibility to Haemophilus ducreyi infection in human volunteers. Infect Immun 2007; 75:5686–5697. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 101*. Jabbari A, Suarez‐Farinas M, Fuentes‐Duculan J, et al Dominant Th1 and minimal Th17 skewing in discoid lupus revealed by transcriptomic comparison with psoriasis. J Invest Dermatol 2014; 134:87–95. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 102*. Belone AdFF, Rosa PS, Trombone APF, et al Genome‐wide screening of mRNA expression in leprosy patients. Front Genet 2015; 6:334. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 103*. Montoya D, Cruz D, Teles RM, et al Divergence of macrophage phagocytic and antimicrobial programs in leprosy. Cell Host Microbe 2009; 6:343–353. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 104*. Lee DJ, Li H, Ochoa MT, et al Integrated pathways for neutrophil recruitment and inflammation in leprosy. J Infect Dis 2010; 201:558–569. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 105*. Talantov D, Mazumder A, Yu JX, et al Novel genes associated with malignant melanoma but not benign melanocytic lesions. Clin Cancer Res 2005; 11:7234–7242. [DOI] [PubMed] [Google Scholar]
  • 106*. Yao Y, Richman L, Morehouse C, et al Type I interferon: potential therapeutic target for psoriasis? PLoS One 2008; 3:e2737. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 107*. Tsoi LC, Iyer MK, Stuart PE, et al Analysis of long non‐coding RNAs highlights tissue‐specific expression patterns and epigenetic profiles in normal and psoriatic skin. Genome Biol 2015; 16:24. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 108*. Judson MA, Marchell RM, Mascelli M, et al Molecular profiling and gene expression analysis in cutaneous sarcoidosis: the role of interleukin‐12, interleukin‐23, and the T‐helper 17 pathway. J Am Acad Dermatol 2012; 66:901–910, 910.e1‐2. [DOI] [PubMed] [Google Scholar]
  • 109*. Regazzetti C, Joly F, Marty C, et al Transcriptional analysis of vitiligo skin reveals the alteration of WNT pathway: a promising target for repigmenting vitiligo patients. J Invest Dermatol 2015; 135:3105–3114. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary discussion. Further discussion on other gene clusters of interest and justifications for the assignation of annotation of co‐expression signatures without a significant or relevant gene ontology term

Supplementary figure and table legends

Figure S1 Sample–sample correlation and signal comparisons between the RNA‐seq and microarray datasets. (A) Sample–sample correlation plots of data used in these studies, using the maximum Pearson correlation coefficient threshold that still retained all samples for the RNA‐seq (r ≥ 0.93) and (B) the microarray (r ≥ 0.97) datasets. There is minimal sample separation due to the main sample attribute, i.e. study (microarray) or site of sampling (RNA‐seq). (C) Log‐scale plot of median expression values, showing that expression levels for the majority of genes are positively correlated in the two datasets. (D) Of the top ten genes with the highest expression for the microarray (red) and RNA‐seq (blue) datasets, there were only two genes in common (RPS18 and KRT14). A higher dynamic range was also observed in the RNA‐seq dataset.

Figure S2 Pearson correlation thresholds in randomized and original data. (A) When expression values for each gene were randomized across the samples of the RNA‐seq dataset, only 95 pairing (edges) were observed at a threshold used in this analysis (r ≥ 0.73), whilst the untransformed data yielded 87 121 edges. Taking into account that a total of 123 802 980 calculations were made for every possible combination of gene–gene Pearson correlation analysis, the frequency of a pair of genes reaching the r threshold is 7.7 × 10−7 (blue), compared with a frequency of 7.0 × 10−4 for non‐random correlations observed in the actual dataset (red). This supports the notion that the vast majority of relationships used to build the network analysed here are non‐random. (B) Similar analysis was done to the microarray dataset. The randomized version of the dataset shows a frequency of 1 × 10−8 for a random correlation (blue) to occur at r ≥ 0.66, compared with a frequency of 6.7 × 10−4 observed in the actual dataset (red).

Table S1 Details on the datasets used in this study The subject details, experimental design, data source, and comparisons for all datasets included in this study are listed in Table S1. This includes the three primary data datasets, 22 validation datasets for skin conditions, three psoriasis datasets for integrating the keratinocyte differentiation signature, and one ageing dataset.

Table S2 SkinSig and co‐expression signatures derived from the analysis of different datasets derived from normal human skin The supplementary file includes the co‐expression signatures for all network analysis, including the individual full original datasets, gene‐symbol restricted datasets, and SkinSig.

Table S3 Gene ontology enrichment and example genes for each signature In addition to highlighting a number of key genes and significant gene ontology (GO) enrichment result for each SkinSig, the supplementary file also includes the full set results from gene ontology enrichment.

Table S4 Comparison between SkinSig and genes found to be significantly altered during ageing Genes reported to change in their expression during ageing are compared with SkinSig, showing a high percentage of the sebaceous gland and hair signatures being reported to alter with ageing by Glass et al [19].


Articles from The Journal of Pathology are provided here courtesy of Wiley

RESOURCES