Identification of blood biomarkers of rheumatoid arthritis by transcript profiling of peripheral blood mononuclear cells from the rat collagen-induced arthritis model
Arthritis Research & Therapy volume 8, Article number: R28 (2006)
Rheumatoid arthritis (RA) is a chronic debilitating autoimmune disease that results in joint destruction and subsequent loss of function. To better understand its pathogenesis and to facilitate the search for novel RA therapeutics, we profiled the rat model of collagen-induced arthritis (CIA) to discover and characterize blood biomarkers for RA. Peripheral blood mononuclear cells (PBMCs) were purified using a Ficoll gradient at various time points after type II collagen immunization for RNA preparation. Total RNA was processed for a microarray analysis using Affymetrix GeneChip technology. Statistical comparison analyses identified differentially expressed genes that distinguished CIA from control rats. Clustering analyses indicated that gene expression patterns correlated with laboratory indices of disease progression. A set of 28 probe sets showed significant differences in expression between blood from arthritic rats and that from controls at the earliest time after induction, and the difference persisted for the entire time course. Gene Ontology comparison of the present study with previously published murine microarray studies showed conserved Biological Processes during disease induction between the local joint and PBMC responses. Genes known to be involved in autoimmune response and arthritis, such as those encoding Galectin-3, Versican, and Socs3, were identified and validated by quantitative TaqMan RT-PCR analysis using independent blood samples. Finally, immunoblot analysis confirmed that Galectin-3 was secreted over time in plasma as well as in supernatant of cultured tissue synoviocytes of the arthritic rats, which is consistent with disease progression. Our data indicate that gene expression in PBMCs from the CIA model can be utilized to identify candidate blood biomarkers for RA.
Rheumatoid arthritis (RA) is a chronic autoimmune disease of unknown etiology that affects 0.5–1% of the population . It is a polyarthritis characterized by inflammation, altered humoral and cellular immune responses, and synovial hyperplasia, leading to destruction and subsequent loss of function of multiple joints [1–4]. Although the exact pathogenesis of RA is not fully understood, the immune and inflammatory systems are intimately linked. Studies on affected joints focusing on cartilage, bone, and synovial tissues have yielded important insights into the mechanisms of disease initiation and progression. Initially, T cell recruitment and recognition of autologous or cross-reacting antigens in the joint produce a variety of mediators, some of which facilitate the development of autoantibodies that are detectable in the serum of RA patients . The ensuing inflammatory responses, induced by tumor necrosis factor (TNF)-α and other proinflammatory cytokines, lead to synovial fibroblast hyperplasia, destruction of the extracellular matrix, and eventual damage to the affected joints [5, 6]. Although there have been many studies of cells within the arthritic joint, the responses of the peripheral blood leukocytes are not well understood. An examination of the circulating lymphocytes may provide an important alternative perspective of the processes that underlie RA and complement local characterization of affected joints .
Circulating leukocytes provide an important source for biomarker discovery for RA. Emerging high content approaches such as genomics and proteomics have radically changed the ways in which biomarkers are being studied [8–10]. The genomic approaches have been used to elucidate the pathogenesis of inflammatory diseases, including RA, and to identify novel drug targets for RA treatment [3, 11–15]. In contrast to target tissue biopsy based approaches, which are often limited by restricted access to target tissues, profiling peripheral blood cells has emerged as an attractive biomarker discovery strategy [10, 16–22]. Another added advantage to analyzing peripheral blood cells is the fact that blood is a highly dynamic environment, communicating with practically every tissue in the body, and is thus proposed as a 'sentinel tissue' that reflects disease progression in the body [21, 23]. Profiling peripheral blood cells has indeed been used to elucidate autoimmune diseases [7, 24].
The rat model of collagen-induced arthritis (CIA) has many similarities to RA . In this model (also demonstrable in mice and monkeys), immunization with type II collagen (CII) – the collagen found in joint cartilage – induces T cell activation, anti-CII autoantibody production, and inflammation and joint destruction similar to that observed in human RA [25, 26]. Although there are clearly differences between RA and CIA, changes in peripheral blood gene expression during the development of CIA may suggest potential novel biomarkers for RA. This could be of value both in monitoring the effects of drugs on disease progression and in discovering potential biomarkers, particularly for individuals with early RA. The latter is a major problem in RA biomarker identification efforts because human studies are often limited by late diagnosis relative to the early disease onset. Studying CIA with gradual induction of arthritis could potentially reveal early biomarkers for RA. Moreover, gene expression profiling in animal models holds great promise for our understanding of human pathogenesis. For example, profiling gene expression in a rat model of inflammation using SAGE (serial analysis of gene expression) has provided novel insights into mast cell activation .
In the present study, we profiled gene expression in rat peripheral blood mononuclear cells (PBMCs) during the development of CIA. We established the method for blood collection, cell fractionation, RNA isolation, and microarray analysis using the Affymetrix GeneChip technology (Affymetrix, Santa Clara, CA, USA). We identified a large number of genes that were differentially expressed between blood from control and arthritic animals. The gene expression signature in blood appeared to correlate with laboratory indices of disease induction. Using bioinformatics and statistical analyses, we identified a subset of putative biomarkers, which were subsequently validated using TaqMan RT-PCR and immunoblot analyses.
Materials and methods
Rat collagen-induced arthritis model, blood collection, and peripheral blood mononuclear cell isolation
The protocol for the in vivo studies was approved by the Lilly Institutional Animal Care and Use Committee. Adult (approximately 8 weeks old) female Lewis rats weighing approximately 150 g were obtained from Charles River (Wilmington, MA, USA), housed under standard conditions, and given free access to food and water. Animals were acclimated to the holding room for at least 7 days before initiation of the studies. For the induction of CIA, CII (Elastin Products Company, Owensville, MO, USA) was dissolved in sterilized 0.01 mol/l acetic acid (Sigma-Aldrich, St. Louis, MO, USA) to a final concentration of 2 mg/ml. The mixture was stirred at 4°C overnight until the CII was completely dissolved. CII (2 mg/ml) and incomplete Freund's adjuvant were homogenized at a 1:1 ratio using a PowerGen 125 (Fisher Scientific, Pittsburgh, PA, USA). Each rat was injected intradermally at multiple sites on the back with a total of 0.3 ml of the emulsion (day 0). Seven days later (day 7) this immunization protocol was repeated. Induction and severity of arthritis were determined by change in ankle diameter, measured using calipers. Based on previous experience, arthritis (as determined by the first signs of redness or swelling of the ankle joints) is observed approximately 12 days after the first CII immunization. By day 21 the inflammatory response in the ankles has reached its peak, and by day 28 there is significant joint pathology. For these reasons, samples were collected on day 0 (baseline), and on days 10, 21, and 28. Ten rats were sampled at each time point. We also included non-immunized animals as negative controls on days 10, 21, and 28. Because a few samples were lost during sample processing or raw chip data quality assurance, the actual numbers of chips that were statistically analyzed were (respectively) 10, 5, 4, and 5 for control rats on days 0, 10, 21, and 28; and 9, 2, and 8 for arthritic rats on days 10, 21, and 28.
For gene expression analysis, on days 0, 10, 21, and 28, a volume of 3–5 ml blood from individual animals at time of sacrifice was collected by cardiac puncture into heparinized vacutainer tubes (Becton Dickinson, San Jose, CA, USA). Leukocyte counts were determined using a Hemavet 950 (Drew Scientific, Oxford, CT, USA). For PBMC isolation, blood was centrifuged at 1500 g for 20 minutes to remove the plasma. The cell pellet was resuspended in Hanks' balanced salt solution (Gibco BRL/Invitrogen, Carlsbad, CA, USA) to the original volume and the cell suspension was carefully layered over the top of 5 ml of Lympholyte-Rat (Cedarlane Labs, Hornby, Ontario, Canada) in a 15 ml Falcon tube. The tubes were centrifuged for 40 minutes at 1500 g and the white cell layer was collected using a Pasteur pipette. PBMCs were rinsed twice with cold Hanks' balanced salt solution and stored in RNAlater (Ambion Inc., Austin, TX, USA) until RNA isolation.
RNA isolation and microarray experiments
The RiboPure-Blood Kit (Ambion Inc., Austin, TX, USA) was used for isolation of high quality total RNA from PBMCs. After removing RNAlater by centrifugation, blood cell pellets were lysed in lysis buffer with sodium acetate solution, in accordance with the manufacturer's instructions. RNA was isolated by acid-phenol:chloroform extraction and further purified on a column with a glass fiber filter. RNA was then eluted in RNase-free water. Samples were run on an RNA 6000 Nano Gel System (Agilent Technologies Inc., Palo Alto, CA, USA) using an Agilent 2100 Bioanalyzer (Agilent) for RNA quality determination. RNA was further purified by using the RNeasy spin column (QIAGEN Inc., Valencia, CA, USA), and then cDNA was generated and labeled for Affymetrix GeneChip according to the standard Affymetrix approach and as previously described [28, 29]. Two micrograms of total RNA was used per labeling reaction. cDNA and labeled in vitro transcription product were purified using the GeneChip Sample Cleanup Module (Affymetrix). We obtained an average in vitro transcription product yield of about 26.8 ± 9.7 μg per 2 μg input RNA, which is sufficient for chip hybridization. Biotin labeled RNA was fragmented and hybridized to rat genome RAE230A chips. Chip processing, image capturing, and raw data analyses were performed using the Affymetrix Microarray Suite MAS5. Probe set signal intensities of each hybridized gene chip were extracted using MAS5 and were normalized using all probe sets so that the overall 2% trimmed mean of each chip equaled 1,500. Chip performance of both control and arthritic samples met standard quality assurance criteria. The chips had an average background of 61.3 ± 8.2, a Raw Q of 2.5 ± 0.4, and percent present call of 46.8 ± 3.3%.
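The per-chip scaling described above can be illustrated with a minimal sketch. It is not the MAS5 implementation: it assumes that "2% trimmed mean" means discarding 2% of the values from each tail before averaging, and the function names are hypothetical.

```python
def trimmed_mean(values, trim_fraction=0.02):
    """Mean after dropping trim_fraction of the values from each tail.

    Assumption: trimming is symmetric (2% lowest and 2% highest by default).
    """
    ordered = sorted(values)
    cut = int(len(ordered) * trim_fraction)
    kept = ordered[cut:len(ordered) - cut] if cut else ordered
    return sum(kept) / len(kept)


def scale_chip(signals, target=1500.0, trim_fraction=0.02):
    """Rescale one chip so that its trimmed mean equals the target value."""
    factor = target / trimmed_mean(signals, trim_fraction)
    return [s * factor for s in signals]


if __name__ == "__main__":
    toy_chip = list(range(1, 101))          # illustrative signals, not real data
    scaled = scale_chip(toy_chip)
    print(round(trimmed_mean(scaled), 6))
```

Because every probe set on a chip is multiplied by the same factor, the scaling changes the overall intensity level of the chip but not the ranking of probe sets within it.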
Statistical analysis to identify differentially expressed genes
The signal intensity data were fitted to an analysis of variance (ANOVA) model to compare the CIA treated samples with control samples at each time point. For a particular probe set, let Yijk be the normalized signal of sample k in treatment j at time i (specifically, i = 1, 2, 3, and 4 for days 0, 10, 21, and 28, respectively; j = 1 and 2 for control and CII injected rats, respectively; and k = 1 ... 10 for rats in each treatment group at each time point). The data were fitted to the following statistical model:
$$Y_{ijk} = \mu + \beta_i + \tau_j + (\beta\tau)_{ij} + \varepsilon_{ijk}, \qquad \varepsilon_{ijk} \sim N(0, \sigma^2)$$
This ANOVA model uses data from all the samples for each probe set to estimate the sample variance accurately and thereby achieve robust hypothesis testing. It accounts for the time effects of sample collection for both CIA and control animals when identifying changes in gene expression after CII injection. This model allows identification of gene expression changes between CIA and control samples at each matched time point, as well as gene expression changes over time in the control samples. The gene expression fold change is the ratio of the average signals of samples in the comparison (for example, treated/control); if the fold change is less than 1, then the ratio is reversed and a '-' added (for example, minus control/treated). Data from each probe set were fitted to the above model independently, as is done in other studies [30, 31].
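The signed fold change convention described above can be written as a short function. This is a minimal sketch; the function name and the check for positive mean signals are illustrative assumptions.

```python
def signed_fold_change(mean_treated, mean_control):
    """Fold change as treated/control, reported as a negative reversed ratio
    (minus control/treated) when treated/control is below 1."""
    if mean_treated <= 0 or mean_control <= 0:
        raise ValueError("mean signals must be positive")
    ratio = mean_treated / mean_control
    return ratio if ratio >= 1 else -(mean_control / mean_treated)


if __name__ == "__main__":
    print(signed_fold_change(3000.0, 1500.0))   # twofold increase
    print(signed_fold_change(1500.0, 3000.0))   # twofold decrease
```

With this convention the magnitude is always at least 1, so an increase and a decrease of the same size have the same absolute value and differ only in sign.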
To control the false positive rate when testing the expression changes of thousands of genes simultaneously, the false discovery rate (fdrate or FDR) was estimated using an algorithm derived by Benjamini and Hochberg . FDR estimates the false positive rate of a 'significant' gene list. Suppose that Pi (i = 1, 2 ... m) are the P values resulting from testing m expression changes. Sort Pi from the smallest to the largest, and let P(i) be the ith sorted P value and i its rank. Then, the FDR for each sorted P value was calculated by multiplying the P value by m/i, and monotonizing all of the FDRs from the largest to the smallest, that is, $\mathrm{FDR}_{(i)} = \min_{j \ge i} \left( m P_{(j)} / j \right)$.
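The calculation described in this paragraph (multiply each sorted P value by m/i, then monotonize from the largest rank downward) can be sketched as follows. The function name is hypothetical, and capping the estimates at 1 is an added assumption that the text does not state.

```python
def benjamini_hochberg_fdr(p_values):
    """Return an FDR estimate for each P value, in the original input order."""
    m = len(p_values)
    # Indices of the P values, ordered from smallest to largest P value.
    order = sorted(range(m), key=lambda idx: p_values[idx])
    fdr = [0.0] * m
    running_min = 1.0  # assumption: estimates are capped at 1
    # Walk from the largest rank to the smallest, keeping a running minimum
    # so that the estimates never increase as the P values decrease.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, p_values[idx] * m / rank)
        fdr[idx] = running_min
    return fdr


if __name__ == "__main__":
    toy_p = [0.01, 0.04, 0.03, 0.005]   # illustrative values, not study data
    print(benjamini_hochberg_fdr(toy_p))
```

The monotonizing step matters because the raw products m·P(i)/i are not guaranteed to increase with rank; without it, a probe set with a smaller P value could receive a larger FDR estimate than one with a larger P value.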
Clustered correlation analysis
Cluster correlation analysis was performed with an R script written in-house, in accordance with the method proposed by Weinstein and coworkers .
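The in-house R script is not reproduced here. As a rough illustration of the correlation step only, the sketch below computes a correlation coefficient between each gene's expression profile and each laboratory index across the same samples. The use of Pearson correlation, the function names, and the toy values are all assumptions, and the clustering of the resulting matrix is omitted.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    if sd_x == 0 or sd_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / (sd_x * sd_y)


def correlation_matrix(expression, lab_indices):
    """Map each gene to its correlation with each laboratory index."""
    return {
        gene: {name: pearson(profile, values)
               for name, values in lab_indices.items()}
        for gene, profile in expression.items()
    }


if __name__ == "__main__":
    # Hypothetical toy values for four samples; not data from the study.
    expression = {"gene_a": [1.0, 2.0, 3.0, 4.0], "gene_b": [4.0, 3.0, 2.0, 1.0]}
    lab_indices = {"index_x": [2.0, 4.0, 6.0, 8.0]}
    print(correlation_matrix(expression, lab_indices))
```

Rows (genes) and columns (laboratory indices) of such a matrix could then be ordered by hierarchical clustering so that genes with similar correlation profiles sit next to each other.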
Ortholog mapping and Gene Ontology analyses
GenBank accessions or gene identifiers were retrieved from published papers or online supplementary materials, and their rat orthologs were obtained by querying the NCBI HomoloGene database . The Gene Ontology (GO) analysis was carried out using GoMiner, developed by Weinstein and colleagues . Briefly, retrieved gene symbols were input into GoMiner, which maps them onto the GO tree, in particular the ontology Biological Process, using organism-specific information provided by the NCBI GoMiner server. Percentages of differentially expressed genes were calculated for 10 selected entries within the ontology Biological Process at the third or fourth GO level.
Quantitative real-time RT-PCR validation
RNA from an independent CIA life phase study was used to validate microarray data. Before cDNA synthesis, RNA samples were DNase treated to remove genomic DNA contamination by using Ambion's DNA-free Kit (Ambion Inc., Austin, TX, USA), in accordance with the manufacturer's instructions. cDNA was prepared from total RNA using Superscript III (Invitrogen, Carlsbad, CA, USA) with random primers as described by the manufacturer. Real-time PCR was performed on an ABI 7900HT from Applied Biosystems (ABI, Foster City, CA, USA) with gene expression assays or with primers and probes from Biosource International (Camarillo, CA, USA). Primers and probes were designed using Primer Express (ABI). Briefly, cDNA templates for real-time PCR were prepared by diluting 1:100 with 10 mmol/l Tris (pH 7.5). The 20 μl TaqMan reaction consisted of 1 × Universal Master Mix (ABI), 1 × Gene Expression Assay (ABI), and 4 μl diluted cDNA. TaqMan reactions for genes that were assayed with primers and probes consisted of 1 × Universal Master Mix (ABI), 0.8 μmol/l forward and reverse primers, 0.2 μmol/l probe, and 4 μl diluted cDNA in a final volume of 20 μl.
Five replicates of each RT-PCR reaction were assembled in 384-well plates on a Tecan Genesis 150 (Maennedorf, Switzerland) liquid handling robot. Each plate included no-RT controls for each sample and a no-template control. Raw data were analyzed using a macro created in Microsoft Excel. Briefly, the high and low values from each of the five replicates were discarded and the remaining three values averaged. The average values were normalized to 18S rRNA relative expression values. Data analysis was conducted in JMP 5.1.1 (SAS Institute, Cary, NC, USA). The best Box-Cox transformation was used in order to fit the model. To compare the group means with the control group, the data for the different time points were tested using Dunnett's test. Differences were regarded as significant at the conventional level (α = 0.05).
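The replicate handling described above (discard the highest and lowest of five values, average the remaining three, then normalize to 18S rRNA) can be sketched as follows. The Excel macro itself is not available, so the function names are hypothetical, and treating the normalization as a simple ratio of the two averaged values is an assumption.

```python
def trimmed_replicate_mean(replicates):
    """Average of the middle three values of exactly five replicates."""
    if len(replicates) != 5:
        raise ValueError("expected exactly five replicate values")
    middle = sorted(replicates)[1:-1]   # drop the lowest and highest value
    return sum(middle) / len(middle)


def normalize_to_reference(target_replicates, reference_replicates):
    """Assumption: normalization is the target average divided by the
    reference (for example, 18S rRNA) average."""
    return (trimmed_replicate_mean(target_replicates)
            / trimmed_replicate_mean(reference_replicates))


if __name__ == "__main__":
    # Hypothetical relative expression values; not data from the study.
    target = [2.0, 4.0, 6.0, 8.0, 10.0]
    reference = [1.0, 2.0, 3.0, 4.0, 5.0]
    print(normalize_to_reference(target, reference))
```

Dropping the extremes before averaging limits the influence of a single failed or outlying well on the value carried forward into the statistical analysis.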
Gene expression assays (ABI) were included for the following genes: Galectin-3 (Lgals3, Rn_00582910_m1) and Cish3 (Rn00585674_s1). Primers and probes for Versican (Cspg2) and IL-6 were purchased from Biosource International. Sequences for the Cspg2 primers were as follows: forward, 5'-CGCCTAAGACACTACGTATGCTTGT-3'; reverse, 5'-TTGGTCCTATGTTGACTGTTTCTCA-3'; and probe, 5'-AGCATAGTCATTCCCTCTAAGCCAAAGAAGGTTC-3', labeled with 6-FAM and BHQ-1. Sequences for the IL-6 primers were as follows: forward, 5'-CATAGTCGTGCCTGTGTGCTTAG-3'; reverse, 5'-AGGTCTCGTTTATTAAAGCAGAACAAG-3'; and probe, 5'-TTTCCTCCTGACAACGCTGCTGGG-3', labeled with 6-FAM and BHQ-1.
Synovial tissue culture and Western blot analysis for Galectin-3
Synovial tissue from the arthritic rats at different times after CII immunization was dissected and collected in the collecting medium (Dulbecco's modified Eagle's medium + 0.5% penicillin/streptomycin and antimycotics; Gibco-BRL/Invitrogen). The tissue was washed twice with the collecting medium and once with the culture medium (Dulbecco's modified Eagle's medium + 10% heat inactivated fetal calf serum and 1% penicillin/streptomycin; Gibco-BRL/Invitrogen). The synovial tissue was then placed immediately into a 24-well tissue culture plate (two pieces of synovium in 1 ml medium per well) with culture medium, and cultured in 5% carbon dioxide at 37°C for 48 hours. The culture plate was centrifuged at 1500 rpm for 10 minutes at 4°C. The supernatant was collected and stored at -80°C until the assay.
Plasma or supernatant from cultured tissue synoviocytes of the CIA rats was subjected to Western blotting using NuPage 4–12% Bis-Tris gels, MOPS running buffer, transfer buffer, and 0.2 μm PVDF membrane (Invitrogen), in accordance with the manufacturer's protocol. Monoclonal antibody to Galectin-3 (A3A12; cat. no. 804-284-C100) was purchased from Alexis Biochemicals (San Diego, CA, USA). Recombinant mouse Galectin-3 protein (cat. no. 1197-GA; R&D Systems, Minneapolis, MN, USA) was used as a positive control. The blots were developed using SuperSignal West Femto Maximum Sensitivity Substrate from Pierce (Rockford, IL, USA).
Gene expression profiling in peripheral blood mononuclear cells in the collagen-induced arthritis model
To identify putative biomarkers for arthritis, we surveyed global gene expression profiles of PBMCs in a rat CIA model using DNA microarray technology. We assayed PBMCs from animals sacrificed at days 10, 21, and 28 after the first CII immunization and day 0 naïve rats. These time points were chosen based on the pathological development of disease in this model. Changes in ankle diameter (a measure of inflammation) in the different groups are presented in Figure 1.
We applied statistical analyses to examine the difference in gene expression between the control and arthritic rat blood samples. We considered an FDR of 0.05 or less to be significant (that is, of the selected 'significant' probe set list, 95% are expected to be real positives). We further trimmed the probe set list by applying empirical criteria of a fold change of at least 1.4 (increase or decrease) and a mean signal difference of at least 250, in order to reduce errors pertaining to low-level expression close to noise level. In addition, in this experiment we had time-matched naïve control samples at each time point, so we could assess the gene expression changes over time in the control animals, or basal expression variation.
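The three selection criteria in this paragraph can be combined into a single filter. The sketch below is illustrative only: the function name and argument names are hypothetical, the FDR threshold is treated as inclusive, and the fold change is assumed to follow the signed convention from the Methods (a negative value denotes a decrease).

```python
def passes_selection(fdr, fold_change, mean_treated, mean_control,
                     max_fdr=0.05, min_fold=1.4, min_signal_diff=250.0):
    """Return True if a probe set meets all three selection criteria."""
    significant = fdr <= max_fdr
    # abs() handles decreases, which are reported as negative fold changes.
    large_change = abs(fold_change) >= min_fold
    above_noise = abs(mean_treated - mean_control) >= min_signal_diff
    return significant and large_change and above_noise


if __name__ == "__main__":
    # Hypothetical probe set values; not data from the study.
    print(passes_selection(0.01, 2.0, 3000.0, 1500.0))
    print(passes_selection(0.01, 2.0, 300.0, 150.0))
```

The second call shows why the signal difference criterion is useful: a twofold change between two low signals passes the FDR and fold change tests but is rejected because the absolute difference (150) is close to noise level.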
The control animals at each time point were compared with day 0 control animals. We observed a considerable amount of basal gene expression change, which could be attributable to biologic fluctuation or technical variation. Because we were interested in biomarkers, we focused our analysis on genes with large expression changes after CIA induction but that were relatively stable in the control animals. Thus, we excluded genes that had a large basal expression fluctuation. After excluding the 'fluctuating' probe sets from our significant gene lists, we identified a total of 998 nonredundant probe sets, including 714 known genes, that changed significantly at one or more time points. The number of significantly changed probe sets is plotted as a function of time after CII immunization in Figure 2a. The probe sets and associated annotations are summarized in Additional file 1 for each of the three time points. Venn logic analysis of the 998 probe sets, showing the distribution of these genes with respect to time, is shown in Figure 2b. We observed a notable number of overlapping probe sets between day 10 and day 21, but substantially fewer genes were identified for day 28 samples. Nevertheless, almost half (28 out of 58 probe sets) of the day 28 probe sets overlapped with day 10 and day 21. As an initial effort, we focused on genes whose expression changed significantly at all three time points – a list of 28 probe sets that might have a wider time window for assay development. Because of probe set redundancy for Versican/Cspg2, the 28 probe sets actually represented 20 unique known genes and six expressed sequence tags. These 28 probe sets are summarized in Table 1.
Correlation of gene expression pattern with laboratory indices for disease progression
We next explored the hypothesis that differences in gene expression between the arthritic and the control rat peripheral blood reflect pathological progression in the CIA model. Shown in Figure 3a is a hierarchical clustering analysis using the nonredundant 998 differentially expressed genes (DEGs) identified from the ANOVA analysis. Expression of these 998 probe sets in the arthritic rats was clearly distinct from that in control rats. We next clustered the samples using the normalized laboratory indices, including blood cell counts and paw size measurements. The animals were grouped in a manner similar to gene expression clustering (Figure 3b). The total white blood cell counts, percentage of lymphocytes, and percentage and total neutrophil counts in arthritic animals were different from those in controls over time. We then performed statistical analysis by fitting the laboratory indices to an ANOVA model similar to that used for gene expression analysis over the three time points (days 10, 21, and 28). The test showed that the differences between CIA and control animals over the three time points were significant for most of these laboratory measurements. The P value for each measurement is shown in Figure 3b.
In an attempt to explore the possible correlation between gene expression pattern and laboratory indices of disease progression, we integrated the gene expression data with the laboratory indices using clustered correlation analysis . The results are shown in Figure 4a. Details regarding the correlation between each of the 998 DEGs and laboratory indices are summarized in Additional file 2. Remarkably, the 28 probe sets we identified using ANOVA test and Venn logic analysis were among the genes that best correlated with laboratory indices. The gene that exhibited the strongest correlation with total white cell, and total and percentage neutrophil counts was Versican, whereas the gene that negatively correlated with percentage lymphocyte count the best was GIIg15b. Both genes are among the 28 probe sets identified (Table 1). Concordant change between Versican and neutrophil count is shown in Figure 4b as a representative example of the agreement between gene expression and laboratory measurements. Taken together, these data suggest that the gene expression pattern overall correlates with laboratory indices of disease progression.
Comparison of the present study with published microarray studies in murine rheumatoid arthritis models
We compared our results with the findings of four previous studies conducted in murine autoimmune arthritis models [11, 13–15] in order to better appreciate gene expression in PBMCs in the rat CIA model. We retrieved the reported DEGs from these published studies. Comparisons were made at two levels. First, we compared differentially expressed rat and mouse ortholog genes, which originated from a common ancestor gene and are assumed to perform similar biological functions in the two species . Of 714 DEGs identified from our study, 70 genes were also identified by at least one other study. Nine of them were identified by at least three studies, including Socs3/Cish3, S100a8, Ptpns1, Lst1, Ctsk, Cd14, Csrp3, App, and Bzrp. Although ortholog gene comparison is relatively easy to interpret, it may not be ideal because the different studies were conducted under different conditions, for example using different chip platforms. Thus, we also compared our study with the other four studies in terms of the Biological Processes (GO ontology) in which the identified DEGs were involved. Each list of DEGs identified by the different studies was mapped onto the Biological Process GO tree using GoMiner . Percentages of DEGs in each GO category at the third and fourth levels were calculated. Figure 5 shows the percentages of the top 10 Biological Processes in the five studies. Although gene–gene comparison shows relatively little overlap, comparison at the level of Biological Processes revealed much greater consistency. For example, the most important Biological Processes include metabolism, cell communication, localization, and transport. Heterogeneous response was only observed in the category of response to stimulus.
Functional relevance and validation of putative biomarker candidates
Regulated cytokine expression was reported to be associated with local joints during the development of RA . We surveyed our data for cytokine expression. The expression of cytokine-related probe sets defined by GO is summarized in Additional file 3. Our data indicated that a few cytokines were differentially regulated between arthritic rats and the controls. For example, expression of IL-1β and its type II receptor was significantly upregulated at days 10 and 21, but not at day 28. Our data revealed the involvement of interferon-γ, TNF-α, and transforming growth factor-β signaling pathways during arthritis development in the CIA model, which is consistent with previous studies.
We focused our initial experimental characterization and validation on three genes: Galectin-3, Versican, and Socs3. They were previously implicated in RA and other immune and inflammatory disorders [24, 36–38]. As shown in Figure 6, all three genes were expressed to significantly greater extents in the arthritic animals than in the controls at all three time points, correlating with inflammation and immune responses. To validate our microarray findings, we performed real-time RT-PCR on the three identified candidate biomarker genes using a separate animal cohort with more defined time points to increase validity. The results are shown in Figure 7. The numbers of samples assayed for a given gene at each time point are marked on the histogram. The expression of Galectin-3, Socs3, and Versican over time in the CIA model, as revealed by RT-PCR, agreed well with the microarray data. In contrast IL-6, which is an acute response cytokine  and was not identified as a significantly changed gene in our microarray study, did not exhibit significant difference in expression over time by the RT-PCR analysis.
Immunoblot analysis of Galectin-3 expression in collagen-induced arthritis rat cultured synoviocytes and plasma
We examined whether the difference in gene expression observed at the mRNA level in PBMCs could be extended to the protein level. We performed Western blot analysis on Galectin-3 using cultured tissue synoviocytes or plasma from the CIA animal cohort that was used for PCR validation. Because Galectin-3 is a secreted protein , we first attempted to detect it in the supernatant of cultured tissue synoviocytes. A recombinant mouse Galectin-3 was used as a positive control for the anti-Galectin-3 antibody used in our study. Although the predicted molecular weight of mouse Galectin-3 is 27.3 kDa, the recombinant protein appeared to have a greater molecular mass on the Western blot (Figure 8a). Importantly, a corresponding band was detected in the cell supernatant samples collected at days 17, 22 and 25, but not at the earlier time points. A similar protein expression profile for Galectin-3 was detected in plasma (Figure 8b), further supporting our RNA expression results and the feasibility of developing Galectin-3 as a blood biomarker-based standard protein assay for preclinical and clinical studies.
Biomarkers for RA are much needed if we are to understand and measure disease progression, and to facilitate the development of novel treatments for RA. In the present study we described a noninvasive strategy to discover RA biomarkers by transcript profiling of peripheral circulating lymphocytes. As an initial proof-of-concept, we demonstrated the feasibility of such technology by successfully profiling PBMCs in a rat CIA model. We characterized differential gene expression between the normal and arthritic animals, and demonstrated that gene expression in PBMCs could serve as a surrogate that is indicative of disease progression.
We used the combination of statistical ANOVA analysis with clustered correlation and biologic relevance analysis to select a workable number of genes as potential biomarker candidates and to assess the specificity of these marker candidates. We were able to confirm elevated Galectin-3 protein expression in the CIA plasma and cultured synovial tissue . Interestingly, Galectin-3 and its binding protein, but not Galectin-1, were reported to be elevated in RA but not in osteoarthritis . In our study, Galectin-1 was not shown to be elevated in arthritic rat blood either. Thus, blood expression of Galectin-3 is likely to be specific to RA. Socs3 might also be specific to RA . In contrast, Versican/CSPG2 is implicated in osteoarthritic cartilage . Although it was also reported to be over-expressed in PBMCs from RA patients [7, 24, 39], we speculate that Versican might be involved more in the inflammation responses linked to bone erosion.
The genes we identified also exhibited strong correlation with phenotypic measurements, as demonstrated by the clustered correlation analysis. Versican is the gene exhibiting the strongest correlation with the characteristic measurements, particularly neutrophil count, in the CIA model. Moreover, members of the Galectin family and its binding proteins, Socs3, and Versican are all found to be present in human blood (Shou and coworkers, unpublished data). In the future, it will be of great interest to extend these findings to clinical human blood and explore the possibility that these markers could be used to aid preclinical and clinical studies.
The differences between arthritic and control rat blood could result from induction or suppression of gene expression, or could be due to cell type specific gene expression in cell populations recruited to the blood during the development of disease  – two alternatives that are very challenging to distinguish. Our cell counting data indicate that total white cell and neutrophil counts, among other parameters, are significantly different between arthritic and control rat blood. Hence, differences in composition or activation state between different types of lymphocytes should contribute to and reflect the differential gene expression that we observed. Our analysis of the correlation between gene expression and laboratory indices might potentially reveal some insights regarding cell type specific gene expression. In the future, it will be of interest to explore further differential cell recruitment and its contribution to gene expression and RA pathogenesis. Additional cell fractionation and small quantity RNA labeling technologies [41, 42] will need to be developed to address this issue. Another future direction in evaluating our candidate markers is to monitor the expression of these genes when effective experimental drugs are administered to CIA rats. It will be important to establish the association between drug effects on inflammation or bone erosion and the expression of the marker genes; this may improve our understanding of drug pharmacokinetics/pharmacodynamics, and facilitate assessment of new compounds for RA treatment, ultimately in a clinical setting.
Major advantages of using peripheral blood cells instead of local joint tissue to seek biomarkers include the noninvasive nature of the former approach and the associated ease of preclinical and clinical development [10, 20]. Moreover, blood is a highly dynamic system, in which blood cells have a rapid natural turnover (blood cell turnover is estimated at 1 trillion cells/day). Because the leukocytes interact and communicate with practically every tissue, they bear rich information regarding inflammation and immune responses. Thus, blood, increasingly recognized as a sentinel tissue, is uniquely suited to the study of systemic responses during disease progression. For example, expression in blood of tissue-specific cardiac genes was reported to permit distinction between patients with coronary artery disease and normal control individuals. This strategy has also been successfully applied to the study of cancer biology [17, 19], autoimmune disease [7, 13, 24], cardiovascular disease, kidney disease, post-traumatic stress disorder, and psychiatric disorders. Gene expression profiling in peripheral blood therefore holds great promise for clinical development.
In the present study, we demonstrated that gene expression in PBMCs from rats with CIA could distinguish arthritic samples from normal control samples, and that gene expression in PBMCs can indeed serve as a potential candidate biomarker of disease progression. Interestingly, some genes that we identified in PBMCs have also been reported to exhibit altered expression in local joints, suggesting conservation between PBMCs and the local joint tissue in terms of their responsiveness to collagen-induced immunity. The contribution of the genes expressed in PBMCs per se to disease progression in CIA and the relevance of these genes to RA are not clear and warrant future investigation. Nevertheless, the present study provides additional evidence supporting the 'sentinel' hypothesis.
A number of genomics studies were previously performed to study RA pathogenesis in murine models [11, 13–15, 45] or human patients [3, 46], with a major focus on local joint tissues. We compared our PBMC profiling findings with those of four published local joint profiling studies using murine models of RA. However, we only identified a limited number of individual genes exhibiting consensus. The observed discrepancy may have multiple causes. First, gene expression in arthritic animal blood is expected to differ substantially from local arthritic joint responses. Second, the difference in technical platforms (for example, spotted array versus the Affymetrix GeneChip, different array versions, and differences in sample preparation and analysis methods) used in these studies may contribute significantly to the difference in DEGs identified. Third, there is only a small portion of the annotated probe sets for which rat orthologs have been identified. Finally, the inherent difference between the murine and rat models of RA may also contribute to the difference in gene expression.
We were able to confirm some known RA-related genes in the present study, such as Stat3, Bst1 (bone marrow stromal cell antigen 1), Ptgs2 (prostaglandin G/H synthase 2), S100a9 (S100 calcium binding protein A9) and Ets1 (Ets avian erythroblastosis virus E2 oncogene homolog 1), in addition to chondroitin sulfate proteoglycan 2, Galectin-3 and Socs3 (suppressor of cytokine signaling 3). However, we failed to identify some other previously reported RA-related genes such as CD36, CD44, STAT5b (signal transducer activator transcription 5b), IL-1Ra, follistatin-like genes, IL-13 receptor α, and CCL27 (CC chemokine ligand 27), among others. Interestingly, we identified an IL-1 decoy receptor that antagonizes IL-1 signaling similarly to IL-1Ra, which is known to be involved in RA, suggesting that consensus could be reached at the gene function level as opposed to the individual gene level. We thus compared our data at a higher level by examining the GO-defined Biological Processes represented by the DEGs. We observed a significant degree of agreement between our study and the four previously published ones (Figure 5). The consensus suggested conservation of Biological Processes involved in arthritis development between the local joints and PBMCs, as well as between murine and rat RA models.
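The contrast between gene-level and process-level consensus can be made concrete with a small sketch. It is illustrative only: the gene-to-process annotation, the process labels (which are not real GO identifiers), the placeholder genes, and the use of a Jaccard index are all assumptions introduced here, not the comparison method used in the study.

```python
def jaccard(a, b):
    """Jaccard index: size of the intersection divided by size of the union."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def processes_for(genes, annotation):
    """Collect every process label annotated to any gene in the list."""
    terms = set()
    for gene in genes:
        terms |= annotation.get(gene, set())
    return terms


# Hypothetical annotation with toy process labels (not real GO terms).
annotation = {
    "IL-1 decoy receptor": {"toy: regulation of IL-1 signaling"},
    "IL-1Ra": {"toy: regulation of IL-1 signaling"},
    "gene_P": {"toy: process 2"},  # placeholder gene, PBMC list only
    "gene_J": {"toy: process 3"},  # placeholder gene, joint list only
}

# Toy DEG lists; only the IL-1Ra / IL-1 decoy receptor pairing echoes the text.
pbmc_degs = {"IL-1 decoy receptor", "gene_P"}
joint_degs = {"IL-1Ra", "gene_J"}

gene_overlap = jaccard(pbmc_degs, joint_degs)
process_overlap = jaccard(processes_for(pbmc_degs, annotation),
                          processes_for(joint_degs, annotation))

print(f"gene-level overlap:    {gene_overlap:.2f}")
print(f"process-level overlap: {process_overlap:.2f}")
```

In this toy case the two lists share no gene, yet they share a process, which is the pattern the IL-1Ra and IL-1 decoy receptor observation points to.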
The CIA model is a highly dynamic model, with time-dependent disease progression. A survey of DEGs identified at various time points can help to improve our understanding of disease development and facilitate biomarker identification. Genes identified at early time points would presumably be informative with respect to early signaling cascades during disease onset. For example, Tnfrsf1b was found to be upregulated in day 10 arthritic rat PBMCs, but not at later time points. Tnfrsf1b encodes a protein with strong similarity to TNF receptor 1b, which induces T cell proliferation and apoptosis. Our data support the involvement of TNF signaling events in the early autoimmune response. Swollen joints are among the important characteristics of arthritis [47–49]. However, paw size measurement only revealed moderate correlation with gene expression in PBMCs (Figure 4). The findings regarding gene expression and correlation with laboratory indices indicate that differences in gene expression in PBMCs between the arthritic rats and control rats are evident even before joint swelling, and thus might be indicative of the early onset of disease. The details of differentially expressed probe sets at different time points are described in Additional file 1. Further characterization of the genes novel to arthritis will advance our understanding of RA and facilitate identification of novel biomarkers.
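Separating probe sets that are differential only at the earliest time point from those that persist across the time course amounts to simple set operations on per-time-point DEG lists. The sketch below assumes such lists are already available; the later day labels and the placeholder probe names are hypothetical, and only the placement of Tnfrsf1b at day 10 follows the text.

```python
def classify_by_time(degs_by_day):
    """Return (persistent, early_only) sets from a {day: DEG set} mapping.

    persistent: differential at every sampled time point.
    early_only: differential at the earliest time point and at no later one.
    """
    if not degs_by_day:
        raise ValueError("at least one time point is required")
    days = sorted(degs_by_day)
    per_day = [set(degs_by_day[day]) for day in days]
    persistent = set.intersection(*per_day)
    later = set().union(*per_day[1:])
    early_only = per_day[0] - later
    return persistent, early_only


# Toy lists; days 21 and 28 and the probe_* names are hypothetical.
degs_by_day = {
    10: {"Tnfrsf1b", "probe_1", "probe_2"},
    21: {"probe_1", "probe_3"},
    28: {"probe_1", "probe_3"},
}

persistent, early_only = classify_by_time(degs_by_day)
print("persistent:", sorted(persistent))
print("early only:", sorted(early_only))
```

Persistent probe sets are the natural candidates for markers that track the whole disease course, whereas early-only probe sets point to onset-specific signaling.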
We established a noninvasive strategy to identify biomarkers by gene expression profiling in PBMCs in an experimental model of RA. We characterized the differential gene expression between the normal and arthritic animals, and demonstrated that gene expression in peripheral blood correlated with laboratory indices of disease progression. Potential biomarker candidates were further validated in independent samples using real-time RT-PCR analysis. Finally, Galectin-3 protein was detected by immunoblot analysis in plasma from CIA rats as well as in supernatants from cultured arthritic rat synovial tissue. Further characterization of the genes novel to arthritis will advance our understanding of RA and facilitate the identification of novel biomarkers.
- ANOVA: analysis of variance
- CII: type II collagen
- DEG: differentially expressed gene
- FDR: false discovery rate
- PBMC: peripheral blood mononuclear cell
- RT-PCR: reverse transcriptase polymerase chain reaction
- TNF: tumor necrosis factor
Smolen JS, Steiner G: Therapeutic strategies for rheumatoid arthritis. Nat Rev Drug Discov. 2003, 2: 473-488. 10.1038/nrd1109.
Gay S, Gay RE, Koopman WJ: Molecular and cellular mechanisms of joint destruction in rheumatoid arthritis: two cellular mechanisms explain joint destruction?. Ann Rheum Dis. 1993, S39-S47.
Neumann E, Kullmann F, Judex M, Justen HP, Wessinghage D, Gay S, Scholmerich J, Muller-Ladner U: Identification of differentially expressed genes in rheumatoid arthritis by a combination of complementary DNA array and RNA arbitrarily primed-polymerase chain reaction. Arthritis Rheum. 2002, 46: 52-63. 10.1002/1529-0131(200201)46:1<52::AID-ART10048>3.0.CO;2-1.
Feldmann M: Pathogenesis of arthritis: recent research progress. Nat Immunol. 2001, 2: 771-773. 10.1038/ni0901-771.
Choy EH, Panayi GS: Cytokine pathways and joint inflammation in rheumatoid arthritis. N Engl J Med. 2001, 344: 907-916. 10.1056/NEJM200103223441207.
Lorenz HM, Herrmann M, Kalden JR: The pathogenesis of autoimmune diseases. Scand J Clin Lab Invest Suppl. 2001, 235: 16-26. 10.1080/003655101753352004.
Olsen NJ, Moore JH, Aune TM: Gene expression signatures for autoimmune disease in peripheral blood mononuclear cells. Arthritis Res Ther. 2004, 6: 120-128. 10.1186/ar1190.
Ideker T, Galitski T, Hood L: A new approach to decoding life: systems biology. Annu Rev Genomics Hum Genet. 2001, 2: 343-372. 10.1146/annurev.genom.2.1.343.
Hood L, Heath JR, Phelps ME, Lin B: Systems biology and new technologies enable predictive and preventative medicine. Science. 2004, 306: 640-643. 10.1126/science.1104635.
Frank R, Hargreaves R: Clinical biomarkers in drug discovery and development. Nat Rev Drug Discov. 2003, 2: 566-580. 10.1038/nrd1130.
Gierer P, Ibrahim S, Mittlmeier T, Koczan D, Moeller S, Landes J, Gradl G, Vollmar B: Gene expression profile and synovial microcirculation at early stages of collagen-induced arthritis. Arthritis Res Ther. 2005, 7: R868-R876. 10.1186/ar1754.
Heller RA, Schena M, Chai A, Shalon D, Bedilion T, Gilmore J, Woolley DE, Davis RW: Discovery and analysis of inflammatory disease-related genes using cDNA microarrays. Proc Natl Acad Sci USA. 1997, 94: 2150-2155. 10.1073/pnas.94.6.2150.
Adarichev VA, Vermes C, Hanyecz A, Mikecz K, Bremer EG, Glant TT: Gene expression profiling in murine autoimmune arthritis during the initiation and progression of joint inflammation. Arthritis Res Ther. 2005, 7: R196-R207. 10.1186/ar1472.
Ibrahim SM, Koczan D, Thiesen HJ: Gene-expression profile of collagen-induced arthritis. J Autoimmun. 2002, 18: 159-167. 10.1006/jaut.2001.0580.
Thornton S, Sowders D, Aronow B, Witte DP, Brunner HI, Giannini EH, Hirsch R: DNA microarray analysis reveals novel gene expression profiles in collagen-induced arthritis. Clin Immunol. 2002, 105: 155-168. 10.1006/clim.2002.5227.
Amundson SA, Grace MB, McLeland CB, Epperly MW, Yeager A, Zhan Q, Greenberger JS, Fornace AJ: Human in vivo radiation-induced biomarkers: gene expression changes in radiotherapy patients. Cancer Res. 2004, 64: 6368-6371. 10.1158/0008-5472.CAN-04-1883.
Xu T, Shu CT, Purdom E, Dang D, Ilsley D, Guo Y, Weber J, Holmes SP, Lee PP: Microarray analysis reveals differences in gene expression of circulating CD8+ T cells in melanoma patients and healthy donors. Cancer Res. 2004, 64: 3661-3667. 10.1158/0008-5472.CAN-03-3396.
Alcorta D, Preston G, Munger W, Sullivan P, Yang JJ, Waga I, Jennette JC, Falk R: Microarray studies of gene expression in circulating leukocytes in kidney diseases. Exp Nephrol. 2002, 10: 139-149. 10.1159/000049909.
Twine NC, Stover JA, Marshall B, Dukart G, Hidalgo M, Stadler W, Logan T, Dutcher J, Hudes G, Dorner AJ, et al: Disease-associated expression profiles in peripheral blood mononuclear cells from patients with advanced renal cell carcinoma. Cancer Res. 2003, 63: 6069-6075.
Fan H, Hegde PS: The transcriptome in blood: challenges and solutions for robust expression profiling. Curr Mol Med. 2005, 5: 3-10. 10.2174/1566524053152861.
Ogawa M: Differentiation and proliferation of hematopoietic stem cells. Blood. 1993, 81: 2844-2853.
Tsuang MT, Nossova N, Yager T, Tsuang MM, Guo SC, Shyu KG, Glatt SJ, Liew CC: Assessing the validity of blood-based gene expression profiles for the classification of schizophrenia and bipolar disorder: a preliminary report. Am J Med Genet B Neuropsychiatr Genet. 2005, 133: 1-5.
Ma J, Liew CC: Gene profiling identifies secreted protein transcripts from peripheral blood cells in coronary artery disease. J Mol Cell Cardiol. 2003, 35: 993-998. 10.1016/S0022-2828(03)00179-2.
Olsen N, Sokka T, Seehorn CL, Kraft B, Maas K, Moore J, Aune TM: A gene expression signature for recent onset rheumatoid arthritis in peripheral blood mononuclear cells. Ann Rheum Dis. 2004, 63: 1387-1392. 10.1136/ard.2003.017194.
Trentham DE, Townes AS, Kang AH: Autoimmunity to type II collagen an experimental model of arthritis. J Exp Med. 1977, 146: 857-868. 10.1084/jem.146.3.857.
Williams RO: Collagen-induced arthritis as a model for rheumatoid arthritis. Methods Mol Med. 2004, 98: 207-216.
Chen H, Centola M, Altschul SF, Metzger H: Characterization of gene expression in resting and activated mast cells. J Exp Med. 1998, 188: 1657-1668. 10.1084/jem.188.9.1657.
Shou J, Soriano R, Hayward SW, Cunha GR, Williams PM, Gao WQ: Expression profiling of a human cell line model of prostatic cancer reveals a direct involvement of interferon signaling in prostate tumor progression. Proc Natl Acad Sci USA. 2002, 99: 2830-2835. 10.1073/pnas.052705299.
Onyia JE, Helvering LM, Gelbert L, Wei T, Huang S, Chen P, Dow ER, Maran A, Zhang M, Lotinun S, et al: Molecular profile of catabolic versus anabolic treatment regimens of parathyroid hormone (PTH) in rat bone: an analysis by DNA microarray. J Cell Biochem. 2005, 95: 403-418. 10.1002/jcb.20438.
Jin W, Riley RM, Wolfinger RD, White KP, Passador-Gurgel G, Gibson G: The contributions of sex, genotype and age to transcriptional variance in Drosophila melanogaster. Nat Genet. 2001, 29: 389-395. 10.1038/ng766.
Chen JJ, Delongchamp RR, Tsai CA, Hsueh HM, Sistare F, Thompson KL, Desai VG, Fuscoe JC: Analysis of variance components in gene expression data. Bioinformatics. 2004, 20: 1436-1446. 10.1093/bioinformatics/bth118.
Benjamini Y, Hochberg Y: Controlling the false discovery rate: a practical and powerful approach to multiple testing. J Roy Stat Soc. 1995, 57: 289-300.
Weinstein JN, Myers TG, O'Connor PM, Friend SH, Fornace AJ, Kohn KW, Fojo T, Bates SE, Rubinstein LV, Anderson NL, et al: An information-intensive approach to the molecular pharmacology of cancer. Science. 1997, 275: 343-349. 10.1126/science.275.5298.343.
Wheeler DL, Church DM, Lash AE, Leipe DD, Madden TL, Pontius JU, Schuler GD, Schriml LM, Tatusova TA, Wagner L, et al: Database resources of the National Center for Biotechnology Information. Nucleic Acids Res. 2001, 29: 11-16. 10.1093/nar/29.1.11.
Zeeberg BR, Feng W, Wang G, Wang MD, Fojo AT, Sunshine M, Narasimhan S, Kane DW, Reinhold WC, Lababidi S, et al: GoMiner: a resource for biological interpretation of genomic and proteomic data. Genome Biol. 2003, 4: R28. 10.1186/gb-2003-4-4-r28.
Ohshima S, Kuchen S, Seemayer CA, Kyburz D, Hirt A, Klinzing S, Michel BA, Gay RE, Liu FT, Gay S, et al: Galectin 3 and its binding protein in rheumatoid arthritis. Arthritis Rheum. 2003, 48: 2788-2795. 10.1002/art.11287.
Nishida Y, Shinomura T, Iwata H, Miura T, Kimata K: Abnormal occurrence of a large chondroitin sulfate proteoglycan, PG-M/versican in osteoarthritic cartilage. Osteoarthritis Cartilage. 1994, 2: 43-49. 10.1016/S1063-4584(05)80005-6.
Shouda T, Yoshida T, Hanada T, Wakioka T, Oishi M, Miyoshi K, Komiya S, Kosai K, Hanakawa Y, Hashimoto K, et al: Induction of the cytokine signal regulator SOCS3/CIS3 as a therapeutic strategy for treating inflammatory arthritis. J Clin Invest. 2001, 108: 1781-1788. 10.1172/JCI200113568.
Moore JH, Parker JS, Olsen NJ, Aune TM: Symbolic discriminant analysis of microarray data in autoimmune disease. Genet Epidemiol. 2002, 23: 57-69. 10.1002/gepi.1117.
Gregersen PK, Behrens TW: Fine mapping the phenotype in autoimmune disease: the promise and pitfalls of DNA microarray technologies. Genes Immun. 2003, 4: 175-176. 10.1038/sj.gene.6363976.
Glanzer JG, Eberwine JH: Expression profiling of small cellular samples in cancer: less is more. Br J Cancer. 2004, 90: 1111-1114. 10.1038/sj.bjc.6601668.
Shou J, Qian HR, Lin X, Stewart T, Onyia JE, Gelbert LM: Optimization and validation of small quantity RNA profiling for identifying TNF responses in cultured human vascular endothelial cells. J Pharmacol Toxicol Methods. 2005.
Bull TM, Coldren CD, Moore M, Sotto-Santiago SM, Pham DV, Nana-Sinkam SP, Voelkel NF, Geraci MW: Gene microarray analysis of peripheral blood cells in pulmonary arterial hypertension. Am J Respir Crit Care Med. 2004, 170: 911-919. 10.1164/rccm.200312-1686OC.
Segman RH, Shefi N, Goltser-Dubner T, Friedman N, Kaminski N, Shalev AY: Peripheral blood mononuclear cell gene expression profiles identify emergent post-traumatic stress disorder among trauma survivors. Mol Psychiatry. 2005, 10: 500-513. 10.1038/sj.mp.4001636.
Rioja I, Clayton CL, Graham SJ, Life PF, Dickson MC: Gene expression profiles in the rat streptococcal cell wall-induced arthritis model identified using microarray analysis. Arthritis Res Ther. 2005, 7: R101-R117. 10.1186/ar1458.
Devauchelle V, Marion S, Cagnard N, Mistou S, Falgarone G, Breban M, Letourneur F, Pitaval A, Alibert O, Lucchesi C, et al: DNA microarray allows molecular profiling of rheumatoid arthritis and identification of pathophysiological targets. Genes Immun. 2004, 5: 597-608. 10.1038/sj.gene.6364132.
Paulus HE, Oh M, Sharp JT, Gold RH, Wong WK, Park GS, Bulpitt KJ: Classifying structural joint damage in rheumatoid arthritis as progressive or nonprogressive using a composite definition of joint radiographic change: a preliminary proposal. Arthritis Rheum. 2004, 50: 1083-1096. 10.1002/art.20270.
Pincus T, Amara I, Koch GG: Continuous indices of core data set measures in rheumatoid arthritis clinical trials: lower responses to placebo than seen with categorical responses with the American College of Rheumatology 20% criteria. Arthritis Rheum. 2005, 52: 1031-1036. 10.1002/art.20995.
Pincus T, Sokka T: Uniform databases in early arthritis: specific measures to complement classification criteria and indices of clinical change. Clin Exp Rheumatol. 2003, S79-S88.
We wish to thank Lawrence Gelbert, Kevin Duffin, Peter Mitchell, Mark Rekhter, and members of the functional genomics group for helpful discussion, and the members of the Shou laboratory for critical reading of the manuscript. We also acknowledge the support from the bioinformatics/IT group. We should also like to thank the referees for their constructive comments.
The authors declare that they have no competing interests.
JS, HRQ, SLT, NWR, JAW and JEO participated in study design. CMB and LL carried out the life phase animal experiments and blood collection. JS performed the microarray study and drafted the manuscript. HRQ performed the statistical analysis. TW performed the bioinformatics analysis. SL, DP, PJS, and LL were involved in PCR validation and analysis. XYCC and SLT contributed to the Western blot analysis. JS, HRQ, TW, SLT, JAW, and JEO contributed to data interpretation and participated in writing the manuscript. All authors read and approved the final text before submission of the manuscript.
Electronic supplementary material
Additional File 1: An Excel file containing a list of probe sets that are differentially expressed by CIA and control rat PBMCs. (XLS 393 KB)
Additional File 2: An Excel file showing correlations between the 998 differentially expressed probe sets and laboratory indices for disease progression. (XLS 551 KB)
Additional File 3: An Excel file showing differentially expressed cytokine related probe sets between CIA and control rats. (XLS 20 KB)
About this article
Cite this article
Shou, J., Bull, C.M., Li, L. et al. Identification of blood biomarkers of rheumatoid arthritis by transcript profiling of peripheral blood mononuclear cells from the rat collagen-induced arthritis model. Arthritis Res Ther 8, R28 (2006). https://doi.org/10.1186/ar1883
- Rheumatoid Arthritis
- Gene Ontology
- Laboratory Index
- Arthritic Animal
- Cell Type Specific Gene Expression