Skip to main content
  • Research article
  • Open access
  • Published:

Gene expression profiles from discordant monozygotic twins suggest that molecular pathways are shared among multiple systemic autoimmune diseases



The objective of this study is to determine if multiple systemic autoimmune diseases (SAID) share gene expression pathways that could provide insights into pathogenic mechanisms common to these disorders.


RNA microarray analyses (Agilent Human 1A(V2) 20K oligo arrays) were used to quantify gene expression in peripheral blood cells from 20 monozygotic (MZ) twin pairs discordant for SAID. Six affected probands with systemic lupus erythematosus (SLE), six with rheumatoid arthritis (RA), eight with idiopathic inflammatory myopathies (IIM), and their same-gendered unaffected twins, were enrolled. Comparisons were made between discordant twin pairs and these were also each compared to 40 unrelated control subjects (matched 2:1 to each twin by age, gender and ethnicity) using statistical and molecular pathway analyses. Relative quantitative PCR was used to verify independently measures of differential gene expression assessed by microarray analysis.


Probands and unrelated, matched controls differed significantly in gene expression for 104 probes corresponding to 92 identifiable genes (multiple-comparison adjusted P values < 0.1). Differentially expressed genes involved several overlapping pathways including immune responses (16%), signaling pathways (24%), transcription/translation regulators (26%), and metabolic functions (15%). Interferon (IFN)-response genes (IFI27, OASF, PLSCR1, EIF2AK2, TNFAIP6, and TNFSF10) were up-regulated in probands compared to unrelated controls. Many of the abnormally expressed genes played regulatory roles in multiple cellular pathways. We did not detect any probes expressed differentially in comparisons among the three SAID phenotypes. Similarly, we found no significant differences in gene expression when comparing probands to unaffected twins or unaffected twins to unrelated controls. Gene expression levels for unaffected twins appeared intermediate between that of probands and unrelated controls for 6535 probes (32% of the total probes) as would be expected by chance. By contrast, in unaffected twins intermediate ordering was observed for 84 of the 104 probes (81%) whose expression differed significantly between probands and unrelated controls.


Alterations in expression of a limited number of genes may influence the dysregulation of numerous, integrated immune response, cell signaling and regulatory pathways that are common to a number of SAID. Gene expression profiles in peripheral blood suggest that for genes in these critical pathways, unaffected twins may be in a transitional or intermediate state of immune dysregulation between twins with SAID and unrelated controls, perhaps predisposing them to the development of SAID given the necessary and sufficient environmental exposures.


Previous studies suggest that patterns of gene expression as measured by RNA microarray are correlated among healthy twins, siblings, and other first-degree family members [13]. These data support the importance of shared genetic and perhaps environmental influences on transcription and other controls of steady-state mRNA expression. Interestingly, similar profiles of global RNA expression are observed among first-degree family members discordant for autoimmune disease, suggesting that other healthy, first-degree relatives may have an underlying genetic predisposition to disease [2, 4, 5].

Shared patterns of global gene expression have also been observed among genetically unrelated individuals with systemic autoimmune diseases (SAIDs), including systemic lupus erythematosus (SLE) and rheumatoid arthritis (RA) [69]. These data support the hypothesis that different autoimmune diseases share certain clinical features and perhaps mechanisms of disease pathogenesis. Most notably, the type I IFN 'signature' associated with the transcriptional control of many IFN-regulated gene products has been consistently detected among patients with SLE, dermatomyositis (DM), or other SAIDs [8, 1016]. Conversely, disease-specific gene expression profiles are associated with particular autoimmune pathologies and may reflect target tissue specificity and other distinctions in genetic susceptibilities or environmental exposure histories or both [7, 1720].

To minimize the role of polymorphic gene effects, we enrolled 20 pairs of monozygotic (MZ) twins discordant for SAID (six probands with SLE, six with RA, and eight with idiopathic inflammatory myopathies, or IIMs) and used RNA microarrays (Agilent Human 1A(V2); 20K) to examine gene expression patterns in their peripheral blood cells. To determine whether unaffected twins might show preclinical evidence of enhanced susceptibility to SAID, we also compared probands and their unaffected twins with 40 unrelated, matched control subjects. One might expect etiologically relevant genes to show expression patterns in the unaffected twins that are intermediate between those in their affected twin counterparts and in unrelated, matched controls. Moreover, in an effort to identify shared molecular pathways involved in the pathogenesis of these SAIDs, we asked whether particular gene expression profiles were characteristic of affected probands regardless of specific disease diagnosis.

Materials and methods

Study subjects

Five adult (at least 18 years of age) and 15 juvenile MZ twin pairs discordant for SAID and 40 unrelated control subjects (two controls per twin pair) matched on age within 6 years, gender, and ethnicity were subjects in this study. These subjects were among those enrolled between 2001 and 2006 in the National Institutes of Health (NIH) investigational review board-approved Twins-Sib study assessing the pathogenesis of SAID. Ethical approval for this microarray study was also obtained from the NIH investigational review board, and all human subjects provided informed consent. Twin pairs enrolled within 4 years of probands' diagnoses included 19 non-Hispanic Caucasian twin pairs and a single Hispanic twin pair (with SLE). Probands fulfilled American College of Rheumatology criteria for adult or juvenile SLE (n = 4 and 2, respectively), RA or JRA (n = 1 and 5, respectively), juvenile dermatomyositis (JDM) (n = 7), or juvenile polymyositis (JPM) (n = 1); we excluded patients with inherited, metabolic, infectious, or other causes of disease. The juvenile probands ranged in age from 3 to 18 years (mean of 11.2 years), whereas adults ranged from 19 to 43 years (mean of 29.2 years). Twins included 14 female and 6 male pairs. Monozygosity was confirmed by short tandem repeat analysis of genomic DNAs (Proactive Genetics, Inc., Augusta, GA, USA). Unrelated, matched controls were free of infections, trauma, vaccines, and surgeries for 8 weeks and had no first-degree family members with SAID.

RNA was purified from peripheral whole blood samples collected in PAXgene RNA tubes (VWR Scientific, Radnor, PA, USA) by using a PAXgene RNA Isolation Kit (Qiagen, Inc., Valencia, CA, USA) in accordance with the recommendations of the manufacturer. Total RNA was quantified spectrophotometrically and then stored at -80°C until analysis. To minimize sources of variability, we collected whole blood samples in the morning, and immunosuppressive therapy was held at least 24 hours before collection.

Microarray analysis

Gene expression analysis was conducted by using Agilent Human 1A(V2) Oligo arrays with approximately 20,000 probes for known genes and expressed sequence tags (Agilent Technologies, Inc., Santa Clara, CA, USA). Each sample was hybridized against a human universal RNA control (Stratagene, La Jolla, CA, USA). Total RNA (500 ng) was amplified and labeled by using the Agilent Low RNA Input Fluorescent Linear Amplification Kit in accordance with the protocol of the manufacturer. For each two-color comparison, 750 ng of each Cy3- (universal control) and Cy5-labeled sample cRNA was mixed and fragmented by using the Agilent in situ Hybridization Kit. Hybridizations were performed for 17 hours in a rotating hybridization oven. Slides were washed and then scanned with an Agilent scanner. Data were obtained by using Agilent Feature Extraction software (version 7.5), with defaults for all parameters. The Agilent Feature Extraction Software performed error modeling, adjusting for additive and multiplicative noise. The resulting data were processed by using the Rosetta Resolver system (version 7.2) (Rosetta Biosoftware, now part of Microsoft Corporation, Redmond, WA, USA). Resulting base 2 logarithms of expression ratios (subject over universal control) were exported from Rosetta Resolver for further analysis. The data reported in this publication have been deposited in the National Center for Biotechnology Information's Gene Expression Omnibus (GEO) [21] and are accessible through GEO Series accession number [GEO:GSE24060].

Microarray data normalization and quality control

We used principal component analysis (PCA) to investigate the effect of known sample variables such as subject age, gender, race, disease status, disease activity, medications, and peripheral white blood cell counts and differentials as well as technical variables such as sample hybridizations performed at different times. We observed no obvious differences across the first three principal components due to age, gender, ethnicity, disease status, disease activity, or treatment. We noticed, however, that samples clustered according to their hybridization batch. To correct for this, we centered the ratios for each probe by using the median for their hybridization group. The PCA plots generated by using the corrected data showed no obvious groupings, so we used these data for further analyses.

Identification of differentially expressed probes

To identify probes that were differentially expressed in probands (P) as compared with their unaffected twin (U) and with unrelated, matched controls (C), we employed a mixed-effects model to calculate P values for each of the approximately 20,000 probes. Random effects included indicators for twin pairs, for P-versus-U comparisons, and for the strata defined by the matching variables for the P-versus-C and U-versus-C comparisons. With these adjustments, we performed three binary comparisons: (a) P versus C, in which all probands with SAID were compared with two unrelated, matched controls; (b) U versus C, in which all unaffected twins were compared with two unrelated, matched controls; (c) P versus U, in which all probands with SAID were compared with their unaffected twins in a paired analysis. We adjusted P values from each of the above three comparisons to control the false discovery rate (FDR) at 0.1 by using the Benjamini-Hochberg method [22].

Disease-specific gene expression analyses

The goal was to investigate whether blood cell gene expression profiles for probands with SAID - namely (J)RA, (J)SLE, and (J)IIM (that is, JDM + JPM) - show characteristic differences in gene expression. To answer this question, we used all probe expression data and employed three different types of statistical analyses: (a) Hierarchical clustering - we performed unsupervised hierarchical clustering by using all 20 proband samples to check whether probands cluster by their known SAID subtype. (b) Multiclass analysis of variance (ANOVA) - to assess whether the difference between the gene expression of a proband and their respective twin is SAID subtype-specific, we first obtained a measurement of the gene expression difference per twin pair for each probe by subtracting the log-transformed expression value of the unaffected twin from that of the respective affected twin. We next grouped twin pairs into the three major SAID diagnostic classes. Finally, we performed a three-category, one-way ANOVA by using the differences in log-transformed expression values for each oligo probe. The P values were then adjusted for multiple testing by using the Benjamini-Hochberg method with the FDR of 0.1 [22]. (c) Mixed-effects model - to assess whether the difference between the gene expression of an affected proband and their corresponding unrelated but matched controls is SAID subtype-specific, we employed a mixed-effects model allowing a random effect for each matched set and treating the disease subtype as a fixed effect. The P values were then adjusted for multiple testing by using the Benjamini-Hochberg method with the FDR of 0.1 [22].

Functional analysis of differentially expressed genes

We used Ingenuity Pathways Analysis (IPA) software (Ingenuity® Systems, Redwood City, CA, USA) to assess functional class membership for probes identified as differentially expressed between probands and unrelated, matched controls. Each gene of interest was mapped and overlaid onto a global molecular network developed from information contained in the IPA Knowledge Base. Networks of genes were then generated algorithmically on the basis of their connectivity as established in the published literature. Fisher's exact test was used to assess whether the number of probes mapped to each biologic function or pathway (or both) differs from the number expected due to chance alone.

Real-time polymerase chain reaction analysis

Relative quantitation (RQ) measurements of gene expression were by real-time polymerase chain reaction (RT-PCR) assays using cDNA prepared from Paxgene-purified whole peripheral blood RNA preparations collected from disease-discordant twins and unrelated, matched controls as described above. cDNA was prepared by using a High Capacity cDNA Reverse Transcription kit (1.0 μg of input RNA per subject) in accordance with the recommendations of the manufacturer (Applied Biosystems, Foster City, CA, USA). RT-PCRs were performed in triplicate by using commercially prepared cDNA primers and probes (TaqMan Gene Expression Assays; Applied Biosystems) for the following genes of interest: MAP2K6 (TaqMan Assay Hs00992389), IL1RN (Hs00893626), IFI27 (Hs00271467), FCER1A (Hs01090134), FYN (Hs00176628), LTK (Hs00914334), KRTCAP2 (Hs00744717), SYTL2 (Hs00262988), ANXA3 (Hs00971411), CEACAM6 (Hs00366002), DEFA4 (Hs00157252), TNFAIP6 (Hs01113602), TNFSF10 (Hs00234356), EIF2AK2 (Hs00169345), and LGR6 (Hs00663887). Genes of interest were selected from the microarray analysis by a combination of criteria, including degree of statistical significance, magnitude of differential gene expression, and possible biologic relevance to disease. Standard TaqMan assays (20 μL) were performed by using an Applied Biosystems Universal Assay Buffer with 10 ng of input cDNA per reaction and amplified for 40 cycles in an ABI Prism 7900HT Sequence Detection System (SDS software version 2.3). Co-amplication of human glyceraldehyde-3-phosphate dehydrogenase (GAPDH) served as an endogenous assay and standardization control. RQ of gene expression was calculated from log-phase mean threshold cycle values normalized to GAPDH expression by using Applied Biosystems RQ Manager software (version 1.2).


Microarray differential expression analysis

Significant differences (FDR-adjusted P values of less than 0.1) were observed for 104 oligo probes in analyses of SAID probands compared with the unrelated, matched controls (Table 1). Upon probe annotation, 92 genes representing several non-mutually exclusive categories, including immune function (16%), signaling pathways (24%), transcription/translation regulators (26%), and metabolic functions (15%), were identified (Table 2). Of these 92 genes, 73.9% were underexpressed (mean fold change of 0.71 and range of 0.91 to 0.56) and 26.1% were overexpressed (mean fold change of 2.2 and range of 1.1 to 7.2) in probands relative to unrelated, matched controls.

Table 1 Genes that were differentially expressed between probands (n = 20) with systemic autoimmune disease and unrelated, matched controls (n = 40)
Table 2 Functional categorization of genes differentially expressed between SAID probands and unrelated, matched controls

Comparisons between probands with SAID and their unaffected twins produced no significant differences in gene expression values (that is, the smallest adjusted P values were greater than 0.1), although several of the top-ranking genes were shared with those identified as significant in comparisons of probands and unrelated, matched controls. Comparisons of unaffected twins and unrelated, matched controls likewise revealed no significant differences in gene expression.

We conducted an unsupervised, hierarchical cluster analysis for all 80 subjects (20 probands, 20 unaffected twins, and 40 unrelated, matched controls) by using the expression data for the 104 probes with significant differential expression on the basis of the comparison of probands versus unrelated controls. The resulting heat map (Figure 1) demonstrated a clear segregation of the probands with SAID and the unrelated, matched controls. In contrast, the unaffected twins were interspersed among and between the probands and unrelated controls. The first two major branches of the dendrogram divided the heat map into approximately equal numbers of study subjects (that is, 40 in the top half and 40 in the bottom half of the heat map). While all probands segregate into the top half of the map (that is, Figure 1, branch 1), the unaffected twins were equally distributed between the two major branches. The unrelated, matched controls were distributed between the two major branches (approximately 1/3 in branch 1 and 2/3 in branch 2). We also observed that several probands mapped outside the primary proband cluster (branch 1). Further analyses of proband outliers did not reveal any significant differences in age, medications, disease duration, physical exam findings, patient- and physician-assessed levels of disease activity, white blood or platelet counts, or hematocrit when compared with probands in the primary cluster (data not shown).

Figure 1
figure 1

Heat map representing an unsupervised, hierarchical cluster analysis of the three study groups. The groups consisted of 20 probands with a systemic autoimmune disease, their 20 unaffected twins, and 40 unrelated, matched controls, respectively. The heat map uses 104 oligo probes that exhibited statistically significant differential gene expression (multiple comparison-adjusted P values (false discovery rate) of less than 0.1) between probands and unrelated, matched controls. Color codes to the immediate right of the dendrogram correspond to probands (yellow), unaffected twins (orange), and unrelated, matched controls (blue). The first major partition in the dendrogram (branches 1 and 2) is marked to the left of the heat map.

A PCA of the 80 study subjects by using the same 104 differentially expressed probes again revealed segregation of probands from unaffected twins and unrelated, matched controls (Figure 2). No differences due to age, gender, ethnicity, disease status, or treatment were observed across the first three principal components.

Figure 2
figure 2

Principal component analysis of the three study groups. The groups consisted of 20 probands with a systemic autoimmune disease (green), their 20 unaffected twins (blue), or 40 unrelated, matched controls (red), respectively. The analysis uses 104 oligo probes that exhibited statistically significant differential gene expression (multiple comparison-adjusted P values (false discovery rate) of less than 0.1) between probands and unrelated, matched controls. C, unrelated, matched control; P, proband; PC, principal component; U, unaffected twin.

While the distinction between unaffected twins and unrelated, matched controls was less obvious in this PCA, our inability to detect any significant differences in gene expression between unaffected twins and either probands or unrelated, matched controls suggested that the gene expression profile of unaffected twins may signify an intermediate or transitional state between the healthy and disease phenotypes. We observed further that the average gene expression for unaffected twins was intermediate between that of twins with SAID and unrelated, matched controls for 6,535 (32%) of a total of approximately 20,000 probes on the chip (about one third of all probes might be predicted to show this pattern by chance alone). By contrast, the same intermediate ordering was observed for 84 out of 104 probes (81%) that were significantly different between probands and controls.

Disease subtypes

By performing an unsupervised hierarchical clustering of all proband samples, we further examined whether probands might also segregate by disease subtype (that is, SLE, RA, and IIM). The clustering did not segregate probands into groups dominated by SAID subtypes (data not shown). Second, to identify probes that were significantly different among the disease groups, for each probe and each twin pair we calculated differences in log-transformed gene expression levels between unaffected and affected twins and compared these differences among disease groups with ANOVA. After adjustments for multiple testing, we did not detect any probes with significant P values among twin pairs whose probands had different diagnoses (data not shown).

To further assess these findings, we employed a mixed-effects model comparing probe expression values from probands and unrelated, matched controls, stratified by including a random effect for the matched sets, and comparing across disease phenotypes. Again, after adjustments for multiple testing, no significant differences in the change in gene expression (that is, the fixed-effects interaction between disease status and phenotype category) were observed. Together, these data suggest that either our disease-specific sample sizes were too small to detect significant differences or patients with different SAIDs possess similar gene expression patterns.

Pathway analysis

To examine the possible functional clustering of genes that were differentially expressed in SAID, we performed IPAs to assess whether differentially expressed genes might be linked by common biologic pathways. As shown in Figure 3, those findings having the greatest significance (that is, lowest P values) included several canonical biologic pathways involved in immune response and inflammation (signaling molecules for NF-κB, IL-6, FcεRI, Toll-like receptor signaling, apoptotic, and acute-phase responses). In some instances, genes expressed differentially in SAID were found to play a regulatory role in multiple immune response and inflammatory pathways. For example, mitogen-activated protein kinase kinase 6 (MAP2K6) plays a regulatory role in NF-κB, IL-6, FcεRI, Toll-like receptor, and acute-phase response pathways. Similarly, the interleukin 1 receptor antagonist (IL1RN) influences the NF-κB, IL-6, and acute-phase response pathways, and eukaryotic translation initiation factor 2-alpha kinase 2 (EIF2AK2) is likewise involved in NF-κB and Toll-like receptor signaling. These results suggest that alterations in expression of a limited number of genes may, in turn, influence the dysregulation of numerous integrated immune response and inflammatory pathways that are common to several SAIDs.

Figure 3
figure 3

Ingenuity pathways analysis of 104 probes with false discovery rate-adjusted P values of less than 0.1. The analysis compares 20 twins affected with systemic autoimmune disease and 40 unrelated, matched controls. P values (blue bars) describing the confidence of the association of the data set with a given pathway are shown on the left vertical axis (-log(P value)). The cutoff threshold value (defined as P = 0.05) is also shown (horizontal yellow line). The ratio of the number of genes from the data set that map to a given pathway divided by the total number of molecules that comprise the pathway (orange line connecting blue bars) is shown on the right vertical axis. IL-6, interleukin-6; NF-κB, nuclear factor-kappa-B.

RQ-PCR validation

RQ of differential gene expression among disease-discordant twin pairs and unrelated, matched controls was independently evaluated by RT-PCR analyses for several genes of interest (KRTCAP2, MAP2K6, IL1RN, IFI27, SYTL2, FYN, LTK, EIF2AK2, FCERA1, LGR6, DEFA4, ANXA3, TNFAIP6, CEACAM6, and TNFSF10) as detailed in Materials and methods. As summarized in Table 3, 13 of the 15 genes evaluated by RQ-PCR showed the same trend (that is, increased or decreased levels of gene expression) as those observed from the microarray analysis in comparisons of twin probands and unrelated, matched controls. Moreover, the observed magnitude of differential expression for these genes was generally similar for these independent microarray and RQ-PCR assays. An opposite trend of differential expression was observed for the FYN gene by these assays, although the magnitudes of differential gene expression were small (approximately 1.2-fold). For the LTK gene, RQ-PCR did not detect a difference in gene expression between probands and unrelated, matched controls whereas microarray analysis estimated a 2.3-fold increase in gene expression. Similar comparative analyses of unaffected twins demonstrated intermediate levels of differential gene expression versus unrelated, matched controls for the majority of genes surveyed.

Table 3 Comparison of differential gene expression values determined by relative quantitative-polymerase chain reaction and microarray analyses


Pathology in human autoimmune diseases is thought to result from a complex interplay of multiple, often polymorphic, genetic loci and accumulating environmental exposures. The importance of genetic predisposition is evidenced by high disease concordance rates and genetic linkage analyses among family members [23, 24]. Further evidence for the role of genetics in SAID comes from the identification of candidate susceptibility loci, especially genes encoding polymorphic variants of HLA class II antigen-presenting molecules [25]. These findings are supported by single-nucleotide polymorphism-based genome-wide association studies (GWASs) identifying large numbers of putative susceptibility markers positioned along multiple chromosomes [26]. Many of the confirmed GWAS disease markers map in close proximity to or within genes controlling immune response signaling pathways (PTPN22, TNFAIP3, TRAF1, CD40, and REL), transcriptional activation (STAT4 and IRF5), and cytokine production (IL-12 and IL-23) [26, 27]. Interestingly, the majority of these disease susceptibility loci individually confer only modest disease risks (relative risk values of 1.2 to 1.5) [28]. Moreover, many genetic risk factors are shared among multiple autoimmune diseases (for example, SLE, RA, type 1 diabetes, and multiple sclerosis) and across different ethnic backgrounds, suggesting that the dysregulation of fundamental immune regulatory pathways underlies shared pathogenetic mechanisms [2, 4, 29].

We used RNA microarrays to measure differential gene expression among MZ twins discordant for SAID and unrelated, healthy controls matched on age, gender, and ethnicity. Gene expression levels may be influenced by subjects' genetic background, age, gender, and environmental exposures as well as by many experimental variables relating to clinical sampling, processing, and data analysis. Despite these potential confounders, an array-based study of peripheral blood cells from MZ twins demonstrated significantly less variation in the proportion of differentially expressed genes among twins affected with an SAID (<2%) compared with unrelated controls (>14%) [1]. Interestingly, MZ twin-derived lymphoblastoid cell lines exhibit near identical patterns of monoallelic expression from otherwise heterozygous, polymorphic loci, suggesting, at least in part, a role for shared genetic controls of transcription or genetic imprinting or both [30]. Unexpectedly, we did not detect any significant differences in gene expression among disease-discordant MZ twins in our study after correction for multiple comparisons. Similar profiles of gene expression were described for array analyses examining eight pairs of MZ twins discordant for multiple sclerosis, in which investigators found only a single gene, IFN alpha-inducible protein GIP3, differentially expressed in over half of the eight twin pairs [5]. In contrast, larger numbers of differentially expressed genes were identified from lymphoblastoid cell lines derived from 11 MZ twin pairs discordant for RA, although this analysis was not adjusted for multiple comparisons [31]. Curiously, similar patterns of differential gene expression have been observed among SAID patients and their healthy, first-degree relatives [2, 23]. Collectively, these data support a role for both genetic and other non-genetic variables, including perhaps environmental exposures influencing disease development. Also, small changes in levels of differential gene expression (generally 1.2- to 2.5-fold variations) may not be statistically observable in studies of smaller numbers of twin pairs, including our present study.

For 84 out of 104 probes (81%) with statistically significant differential expression between twins with an SAID and unrelated, matched controls, we observed that gene expression values of unaffected twins are positioned between affected twins and unrelated controls, suggesting that unaffected twins may represent an intermediate or transitional state of disease development. Together, our results and previous studies of MZ twins argue for the importance of shared genetic determinants in regulating gene expression profiles. However, additional factors, including epigenetic modifications that accumulate in an age-dependent fashion (that is, epigenetic drift) as well as accumulating environmental exposures, may also influence asymmetric patterns of disease development among MZ twins [32, 33]. Differential patterns of gene expression between the affected probands and unrelated controls could reflect either acquired modifications that were present before the onset of disease and could (to some extent) have conferred susceptibility to developing an SAID or phenotypic features of the syndrome itself, having less to do with disease etiology.

Heterogeneous phenotypes characterize the spectrum of clinical entities comprising SAID (for example, SLE, RA, and IIM). However, as a group, these diseases share many clinical, laboratory, genetic, autoantibody, and possibly pathogenetic features [34]. In fact, many patients present clinically with overlap syndromes sharing diagnostic criteria for multiple autoimmune diseases. Previous microarray studies of peripheral blood cells isolated from patients with four different autoimmune diagnoses (SLE, RA, multiple sclerosis, and insulin-dependent diabetes mellitus) were found to have similar profiles of differential gene expression relative to unrelated, healthy controls [4, 6]. Moreover, a meta-analysis of gene expression array profiles demonstrated high concordance between SLE and RA, further emphasizing disease similarities [7]. Although our power to detect differences was limited, our data indirectly support these conclusions in that we were unable to detect significant differences in gene expression among SAID subtypes (six with (J)SLE, six with (J)RA, and eight with JIIM) by using three independent statistical analyses (ANOVA, mixed-effects models, and hierarchical clustering). Together, these data suggest that shared alterations in gene expression may underlie similar pathogenic mechanisms among different SAIDs.

Our findings have corroborated previous studies identifying upregulation of type I IFN response genes observed predominantly in SLE but also in RA and IIM [7, 10, 12, 14, 16, 19, 35, 36]. In fact, altered expression of IFN-inducible genes has been correlated with disease activity in SLE and IIM (DM and polymyositis) patients undergoing therapy [13, 17, 36]. We likewise observed upregulation of several SLE-associated IFN response genes, including OASL (2.2-fold), PLSCR1 (2.3-fold), EIF2AK2 (2.4-fold), and IFI27, the last of which exhibited the highest degree of increased gene expression (7.2-fold) in our study. Moreover, our findings are consistent with a report of IFN response-associated gene expression in DM whereby factors TNFAIP6, TNFSF10, OASF, PLSCR1, EIF2AK2, and IFI27 were elevated in peripheral blood [16].

Our study has several limitations, including small sample sizes and limited statistical power, resulting from the challenges of identifying and recruiting qualified MZ twins discordant for SAID. Studies of MZ twins, however, help to mitigate confounding factors associated with genetic polymorphisms in studies of unrelated human subjects. Variations in peripheral blood cell composition, disease activity, and effects of immunosuppressive therapies may also present difficulties; however, statistical evaluations of these variables did not reveal evidence of effects on gene expression patterns. Methodological and other technical variables were evaluated in part by the use of independent RQ-PCR assays, which corroborated our microarray data for the majority of genes studied. Other limitations included the phenotypic heterogeneity of the SAIDs studied and variations in environmental exposure histories. Despite these variables, many of the differentially expressed genes and candidate molecular pathways identified in our study are consistent with findings reported in other studies of SAID [7, 1214, 16, 36]. Functional analysis of genes found differentially expressed in our SAID probands identified multiple immunoregulatory (NF-κB, IL-6, and apoptosis) and proinflammatory (acute-phase response and Toll-like receptor) pathways. Moreover, several of these genes (for example, MAP2K6, EIF2AK2, and IL1RN) have coordinated functions in the integration of multiple, non-mutually exclusive regulatory pathways that together may contribute to common features of the SAID phenotype. The functional implications of each of these pathways in autoimmune disease were reviewed recently [37].


Gene expression profiling of biofluids and tissues from SAID patients and matched controls has provided a wealth of information into possible mechanisms of disease development, chronicity, and therapeutic response. Despite the variability of data anticipated with human clinical studies, some meaningful consensus has emerged [4, 8]. As expected, we observed different gene expression profiles between disease probands and unrelated controls. Our identification of an SAID-associated profile of genes influencing multiple molecular pathways, including the immune response, cellular signaling, inflammation, transcriptional/translational, and other regulatory controls, is consistent with GWASs demonstrating large numbers of genetic susceptibility loci, each of which contributes only a modest degree of overall risk [26, 28, 29]. Moreover, the lack of statistically significant differences in gene expression detected between affected and unaffected twins emphasizes the importance of genetic susceptibility to disease whereby unaffected twins may represent a transitional state between health and disease. Our report of shared profiles of altered gene expression among SAID patients with different clinical phenotypes is consistent with the hypothesis that many SAIDs have common features of disease pathogenesis. The identification of relevant genes whose products regulate and integrate multiple physiologic pathways might permit the development of targeted therapeutics benefiting a broader spectrum of patients with multiple SAID phenotypes.



analysis of variance


unrelated, matched control




false discovery rate


glyceraldehyde-3-phosphate dehydrogenase


Gene Expression Omnibus


genome-wide association study




idiopathic inflammatory myopathy




ingenuity pathways analysis


juvenile dermatomyositis


juvenile polymyositis


juvenile rheumatoid arthritis




nuclear factor-kappa-B


National Institutes of Health




principal component analysis


rheumatoid arthritis


relative quantitation


relative quantitation polymerase chain reaction


real-time polymerase chain reaction


systemic autoimmune disease


systemic lupus erythematosus


unaffected twin.


  1. Sharma A, Sharma VK, Horn-Saban S, Lancet D, Ramachandran S, Brahmachari SK: Assessing natural variations in gene expression in humans by comparing with monozygotic twins using microarrays. Physiol Genomics. 2005, 21: 117-123. 10.1152/physiolgenomics.00228.2003.

    Article  CAS  PubMed  Google Scholar 

  2. Maas K, Chen H, Shyr Y, Olsen NJ, Aune T: Shared gene expression profiles in individuals with autoimmune disease and unaffected first-degree relatives of individuals with autoimmune disease. Hum Mol Genet. 2005, 14: 1305-1314. 10.1093/hmg/ddi141.

    Article  CAS  PubMed  Google Scholar 

  3. Hemminki K, Li X, Sundquist K, Sundquist J: Shared familial aggregation of susceptibility to autoimmune diseases. Arthritis Rheum. 2009, 60: 2845-2847. 10.1002/art.24749.

    Article  PubMed  Google Scholar 

  4. Olsen NJ, Moore JH, Aune TM: Gene expression signatures for autoimmune disease in peripheral blood mononuclear cells. Arthritis Res Ther. 2004, 6: 120-128. 10.1186/ar1190.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  5. Sarkijarvi S, Kuusisto H, Paalavuo R, Levula M, Airla N, Lehtimaki T, Kaprio J, Koskenvuo M, Elovaara I: Gene expression profiles in Finnish twins with multiple sclerosis. BMC Med Genet. 2006, 7: 11-

    Article  PubMed Central  PubMed  Google Scholar 

  6. Liu Z, Maas K, Aune TM: Identification of gene expression signatures in autoimmune disease without the influence of familial resemblance. Hum Mol Genet. 2006, 15: 501-509.

    Article  CAS  PubMed  Google Scholar 

  7. Silva GL, Junta CM, Mello SS, Garcia PS, Rassi DM, Sakamoto-Hojo ET, Donadi EA, Passos GA: Profiling meta-analysis reveals primarily gene coexpression concordance between systemic lupus erythematosus and rheumatoid arthritis. Ann N Y Acad Sci. 2007, 1110: 33-46. 10.1196/annals.1423.005.

    Article  CAS  PubMed  Google Scholar 

  8. van Baarsen LG, Bos CL, van der Pouw Kraan TC, Verweij CL: Transcription profiling of rheumatic diseases. Arthritis Res Ther. 2009, 11: 207-10.1186/ar2557.

    Article  PubMed Central  PubMed  Google Scholar 

  9. Junta CM, Sandrin-Garcia P, Fachin-Saltoratto AL, Mello SS, Oliveira RD, Rassi DM, Giuliatti S, Sakamoto-Hojo ET, Louzada-Junior P, Donadi EA, Passos GA: Differential gene expression of peripheral blood mononuclear cells from rheumatoid arthritis patients may discriminate immunogenetic, pathogenic and treatment features. Immunology. 2009, 127: 365-372. 10.1111/j.1365-2567.2008.03005.x.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  10. Ishii T, Onda H, Tanigawa A, Ohshima S, Fujiwara H, Mima T, Katada Y, Deguchi H, Suemura M, Miyake T, Miyatake K, Kawase I, Zhao H, Tomiyama Y, Saeki Y, Nojima H: Isolation and expression profiling of genes upregulated in the peripheral blood cells of systemic lupus erythematosus patients. DNA Res. 2005, 12: 429-439.

    Article  CAS  PubMed  Google Scholar 

  11. Mandel M, Achiron A: Gene expression studies in systemic lupus erythematosus. Lupus. 2006, 15: 451-456. 10.1191/0961203306lu2332oa.

    Article  CAS  PubMed  Google Scholar 

  12. Nzeusseu Toukap A, Galant C, Theate I, Maudoux AL, Lories RJ, Houssiau FA, Lauwerys BR: Identification of distinct gene expression profiles in the synovium of patients with systemic lupus erythematosus. Arthritis Rheum. 2007, 56: 1579-1588. 10.1002/art.22578.

    Article  CAS  PubMed  Google Scholar 

  13. Nikpour M, Dempsey AA, Urowitz MB, Gladman DD, Barnes DA: Association of a gene expression profile from whole blood with disease activity in systemic lupus erythaematosus. Ann Rheum Dis. 2008, 67: 1069-1075. 10.1136/ard.2007.074765.

    Article  CAS  PubMed  Google Scholar 

  14. van der Pouw Kraan TC, Wijbrandts CA, van Baarsen LG, Voskuyl AE, Rustenburg F, Baggen JM, Ibrahim SM, Fero M, Dijkmans BA, Tak PP, Verweij CL: Rheumatoid arthritis subtypes identified by genomic profiling of peripheral blood cells: assignment of a type I interferon signature in a subpopulation of patients. Ann Rheum Dis. 2007, 66: 1008-1014. 10.1136/ard.2006.063412.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  15. van der Pouw Kraan TC, van Baarsen LG, Wijbrandts CA, Voskuyl AE, Rustenburg F, Baggen JM, Dijkmans BA, Tak PP, Verweij CL: Expression of a pathogen-response program in peripheral blood cells defines a subgroup of rheumatoid arthritis patients. Genes Immun. 2008, 9: 16-22. 10.1038/sj.gene.6364438.

    Article  CAS  PubMed  Google Scholar 

  16. Walsh RJ, Kong SW, Yao Y, Jallal B, Kiener PA, Pinkus JL, Beggs AH, Amato AA, Greenberg SA: Type I interferon-inducible gene expression in blood is present and reflects disease activity in dermatomyositis and polymyositis. Arthritis Rheum. 2007, 56: 3784-3792. 10.1002/art.22928.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  17. Chaussabel D, Quinn C, Shen J, Patel P, Glaser C, Baldwin N, Stichweh D, Blankenship D, Li L, Munagala I, Bennett L, Allantaz F, Mejias A, Ardura M, Kaizer E, Monnet L, Allman W, Randall H, Johnson D, Lanier A, Punaro M, Wittkowski KM, White P, Fay J, Klintmalm G, Ramilo O, Palucka AK, Banchereau J, Pascual V: A modular analysis framework for blood genomics studies: application to systemic lupus erythematosus. Immunity. 2008, 29: 150-164. 10.1016/j.immuni.2008.05.012.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  18. Raju R, Dalakas MC: Gene expression profile in the muscles of patients with inflammatory myopathies: effect of therapy with IVIg and biological validation of clinically relevant genes. Brain. 2005, 128: 1887-1896. 10.1093/brain/awh518.

    Article  PubMed  Google Scholar 

  19. Tezak Z, Hoffman EP, Lutz JL, Fedczyna TO, Stephan D, Bremer EG, Krasnoselska-Riz I, Kumar A, Pachman LM: Gene expression profiling in DQA1*0501+ children with untreated dermatomyositis: a novel model of pathogenesis. J Immunol. 2002, 168: 4154-4163.

    Article  CAS  PubMed  Google Scholar 

  20. Huber R, Hummert C, Gausmann U, Pohlers D, Koczan D, Guthke R, Kinne RW: Identification of intra-group, inter-individual, and gene-specific variances in mRNA expression profiles in the rheumatoid arthritis synovial membrane. Arthritis Res Ther. 2008, 10: R98-10.1186/ar2485.

    Article  PubMed Central  PubMed  Google Scholar 

  21. Edgar R, Domrachev M, Lash AE: Gene Expression Omnibus: NCBI gene expression and hybridization array data repository. Nucleic Acids Res. 2002, 30: 207-210. 10.1093/nar/30.1.207.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  22. Hochberg Y, Benjamini Y: More powerful procedures for multiple significance testing. Stat Med. 1990, 9: 811-818. 10.1002/sim.4780090710.

    Article  CAS  PubMed  Google Scholar 

  23. Prahalad S, Shear ES, Thompson SD, Giannini EH, Glass DN: Increased prevalence of familial autoimmunity in simplex and multiplex families with juvenile rheumatoid arthritis. Arthritis Rheum. 2002, 46: 1851-1856. 10.1002/art.10370.

    Article  PubMed  Google Scholar 

  24. Alarcon-Segovia D, Alarcon-Riquelme ME, Cardiel MH, Caeiro F, Massardo L, Villa AR, Pons-Estel BA: Familial aggregation of systemic lupus erythematosus, rheumatoid arthritis, and other autoimmune diseases in 1,177 lupus patients from the GLADEL cohort. Arthritis Rheum. 2005, 52: 1138-1147. 10.1002/art.20999.

    Article  PubMed  Google Scholar 

  25. O'Hanlon TP, Carrick DM, Targoff IN, Arnett FC, Reveille JD, Carrington M, Gao X, Oddis CV, Morel PA, Malley JD, Malley K, Shamim EA, Rider LG, Chanock SJ, Foster CB, Bunch T, Blackshear PJ, Plotz PH, Love LA, Miller FW: Immunogenetic risk and protective factors for the idiopathic inflammatory myopathies: distinct HLA-A, -B, -Cw, -DRB1, and -DQA1 allelic profiles distinguish European American patients with different myositis autoantibodies. Medicine (Baltimore). 2006, 85: 111-127. 10.1097/

    Article  Google Scholar 

  26. Gregersen PK, Olsson LM: Recent advances in the genetics of autoimmune disease. Annu Rev Immunol. 2009, 27: 363-391. 10.1146/annurev.immunol.021908.132653.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  27. Lettre G, Rioux JD: Autoimmune diseases: insights from genome-wide association studies. Hum Mol Genet. 2008, 17: R116-121. 10.1093/hmg/ddn246.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  28. Nica AC, Dermitzakis ET: Using gene expression to investigate the genetic basis of complex disorders. Hum Mol Genet. 2008, 17: R129-134. 10.1093/hmg/ddn285.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  29. Grant SF, Hakonarson H: Microarray technology and applications in the arena of genome-wide association. Clin Chem. 2008, 54: 1116-1124. 10.1373/clinchem.2008.105395.

    Article  CAS  PubMed  Google Scholar 

  30. Cheung VG, Bruzel A, Burdick JT, Morley M, Devlin JL, Spielman RS: Monozygotic twins reveal germline contribution to allelic expression differences. Am J Hum Genet. 2008, 82: 1357-1360. 10.1016/j.ajhg.2008.05.003.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  31. Haas CS, Creighton CJ, Pi X, Maine I, Koch AE, Haines GK, Ling S, Chinnaiyan AM, Holoshitz J: Identification of genes modulated in rheumatoid arthritis using complementary DNA microarray analysis of lymphoblastoid B cell lines from disease-discordant monozygotic twins. Arthritis Rheum. 2006, 54: 2047-2060. 10.1002/art.21953.

    Article  CAS  PubMed  Google Scholar 

  32. Petronis A: Epigenetics and twins: three variations on the theme. Trends Genet. 2006, 22: 347-350. 10.1016/j.tig.2006.04.010.

    Article  CAS  PubMed  Google Scholar 

  33. Javierre BM, Fernandez AF, Richter J, Al-Shahrour F, Martin-Subero JI, Rodriguez-Ubreva J, Berdasco M, Fraga MF, O'Hanlon TP, Rider LG, Jacinto FV, Lopez-Longo FJ, Dopazo J, Forn M, Peinado MA, Carreño L, Sawalha AH, Harley JB, Siebert R, Esteller M, Miller FW, Ballestar E: Changes in the pattern of DNA methylation associate with twin discordance in systemic lupus erythematosus. Genome Res. 2010, 20: 170-179. 10.1101/gr.100289.109.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  34. Koopman WJ, Moreland LW, (Eds): Arthritis and Allied Conditions: A Textbook of Rheumatology. 2005, Philadelphia, PA: Lippincott Williams & Wilkins, 15

    Google Scholar 

  35. Zhou X, Dimachkie MM, Xiong M, Tan FK, Arnett FC: cDNA microarrays reveal distinct gene expression clusters in idiopathic inflammatory myopathies. Med Sci Monit. 2004, 10: 191-197.

    Google Scholar 

  36. Greenberg SA: A gene expression approach to study perturbed pathways in myositis. Curr Opin Rheumatol. 2007, 19: 536-541. 10.1097/BOR.0b013e3282efe261.

    Article  PubMed  Google Scholar 

  37. Pascual V, Chaussabel D, Banchereau J: A genomic approach to human autoimmune diseases. Annu Rev Immunol. 2010, 28: 535-571. 10.1146/annurev-immunol-030409-101221.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

Download references


The authors thank Robert Colbert and Nina Raben for their critical review of the manuscript. This work was supported by the intramural research program of the National Institute of Environmental Health Sciences. The funding sponsors had no role in the design, conduct, or interpretation of the study. The content of this publication does not necessarily reflect the views or policies of the US Department of Health and Human Services, nor does the mention of trade names, commercial products, or organizations imply endorsement by the US Government.

Author information

Authors and Affiliations


Corresponding author

Correspondence to Terrance P O'Hanlon.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

TPO'H contributed to sample preparation, RQ-PCR studies, data and bioinformatic analyses, and manuscript preparation. LGR contributed to patient recruitment, clinical assessments and data analyses, and manuscript editing. LG contributed to sample preparation, RQ-PCR studies, data analysis, and manuscript editing. RF contributed to microarray analyses and data processing, bioinformatic analyses, and manuscript preparation and editing. RSP contributed to microarray analyses and data processing, bioinformatic analyses, and manuscript editing. DMU and CRW contributed to statistical analyses and manuscript editing. RRS contributed to data processing and analysis and to manuscript editing. DM contributed to data processing and analysis. MFG contributed to patient recruitment, clinical assessments, and manuscript editing. FWM contributed to study design, patient recruitment, clinical assessments, data analyses, and manuscript preparation and editing. All authors read and approved the final manuscript.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Rights and permissions

Reprints and permissions

About this article

Cite this article

O'Hanlon, T.P., Rider, L.G., Gan, L. et al. Gene expression profiles from discordant monozygotic twins suggest that molecular pathways are shared among multiple systemic autoimmune diseases. Arthritis Res Ther 13, R69 (2011).

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: