Mass spectrometry-based proteomics identify novel serum osteoarthritis biomarkers

Tardif, Ginette; Paré, Frédéric; Gotti, Clarisse; Roux-Dalvai, Florence; Droit, Arnaud; Zhai, Guangju; Sun, Guang; Fahmi, Hassan; Pelletier, Jean-Pierre; Martel-Pelletier, Johanne

doi:10.1186/s13075-022-02801-1

Research article
Open access
Published: 23 May 2022

Ginette Tardif¹,
Frédéric Paré¹,
Clarisse Gotti²,
Florence Roux-Dalvai²,
Arnaud Droit²,
Guangju Zhai³,
Guang Sun⁴,
Hassan Fahmi¹,
Jean-Pierre Pelletier¹ &
Johanne Martel-Pelletier ORCID: orcid.org/0000-0003-2618-383X¹

Arthritis Research & Therapy volume 24, Article number: 120 (2022)

Abstract

Background

Osteoarthritis (OA) is a slowly developing and debilitating disease, and there are no validated specific biomarkers for its early detection. To improve therapeutic approaches, identification of specific molecules/biomarkers enabling early determination of this disease is needed. This study aimed at identifying, with the use of proteomics/mass spectrometry, novel OA-specific serum biomarkers. As obesity is a major risk factor for OA, we discriminated obesity-regulated proteins to target only OA-specific proteins as biomarkers.

Methods

Serum from the Osteoarthritis Initiative cohort was used and divided into 3 groups: controls (n=8), OA-obese (n=10) and OA-non-obese (n=10). Proteins were identified and quantified from the liquid chromatography–tandem mass spectrometry analyses using MaxQuant software. Statistical analysis used the Limma test followed by the Benjamini-Hochberg method. To compare the proteomic profiles, the multivariate unsupervised principal component analysis (PCA) followed by the pairwise comparison was used. To select the most predictive/discriminative features, the supervised linear classification model sparse partial least squares regression discriminant analysis (sPLS-DA) was employed. Validation of three differential proteins was performed with protein-specific assays using plasma from a Newfoundland and Labrador cohort that included participants of the Newfoundland Osteoarthritis Study.

Results

In total, 509 proteins were identified, and 279 proteins were quantified. PCA-pairwise differential comparisons between the 3 groups revealed that 8 proteins were differentially regulated between the OA-obese and/or OA-non-obese and controls. Further experiments using the sPLS-DA revealed two components discriminating OA from controls (component 1, 9 proteins), and OA-obese from OA-non-obese (component 2, 23 proteins). Proteins from component 2 were considered related to obesity. In component 1, compared to controls, 7 proteins were significantly upregulated in both OA groups and 2 in the OA-obese only. Among the proteins upregulated in both OA groups, some would not be suitable on their own as specific OA biomarkers due to their rather non-specific role or their strong link to other pathological conditions. Altogether, data revealed that the protein CRTAC1 appears to be a strong OA biomarker candidate. Other potential new biomarker candidates are the proteins FBN1, VDBP, and possibly SERPINF1. Validation experiments revealed statistical differences between controls and OA for FBN1 (p=0.044) and VDBP (p=0.022), and a trend for SERPINF1 (p=0.064).

Conclusion

Our study suggests that 4 proteins, CRTAC1, FBN1, VDBP, and possibly SERPINF1, warrant further investigation as potential new biomarker candidates for the whole OA population.

Introduction

Osteoarthritis (OA), the most common musculoskeletal disorder, is a multifactorial disease irreversibly affecting several joint tissues, the knee being the most prevalent [1]. OA is a major cause of pain, disability, and comorbidities, and about 30% of the worldwide population aged 50 years and older suffer from this disease [2, 3]. OA progression is influenced by numerous factors including age, gender, obesity (major risk factors), and inflammatory mediators, to name a few.

At present, there are no treatments to cure this disease; the current ones only target symptomatic relief. This is related, in part, to the inability to diagnose OA at an early stage, as the existing methods are not sensitive enough. Early and specific OA diagnosis would allow early and targeted treatments/interventions to prevent or delay not only the progression of the disease but also surgery such as joint replacement. This would result in less pain and a better quality of life for patients, in addition to reducing the substantial societal economic burden [4,5,6].

Because the alteration of the articular tissues develops over a few years, the identification of specific molecules/biomarkers that would enable OA early determination is proving to be a challenging task. To date, there are no regulatory agency-approved biomarkers, as none has yet reached the required specificity, sensitivity, and reliability.

Over the years, several approaches, such as genomics, antibody signature, and metabolomics, have been used to identify biochemical and physiological aspects of OA [7,8,9,10]. Another interesting avenue in the search for biomarkers is proteomics. Compared to metabolomics and genomics, the proteomic approach has the advantage of reflecting the patient’s condition at a specific time as well as being more stable than metabolites.

Proteomics using liquid chromatography–tandem mass spectrometry (LC-MS/MS) can identify and quantify thousands of proteins in a single analysis using a relatively small sample amount, which is ideal for the high throughput analysis of a high dynamic range sample such as serum [11]. Such a proteomic approach has been used to identify specific diagnostic markers of many pathologies such as cancer, cardiovascular, liver, and kidney diseases [12,13,14,15,16], as well as some arthritic diseases [8, 17,18,19,20,21,22,23,24], to name a few. LC-MS/MS has been used to monitor the individual proteomes of healthy or OA joint tissues (cartilage, meniscus, synovial membrane), cells (chondrocyte, synoviocyte), and fluids (serum/plasma, synovial fluid, urine) [19, 25,26,27,28,29]. Several proteins that may relate to OA pathological mechanisms have been found but, as mentioned above, no molecule has been validated as a specific marker for OA patients, not to mention the early stages of this disease. This could be due in part to the non-specificity of the molecules, which are related more to pathological conditions other than OA, including obesity [30,31,32,33,34].

Therefore, there is an urgent need to identify novel and specific biomarkers that will prove to be both efficient and sensitive enough to be used for OA early diagnosis. The objective of this study was to identify, with the use of LC-MS/MS, novel OA-specific serum biomarkers.

Material and methods

Study participants

Participants were selected from the control and progressor subcohorts of the Osteoarthritis Initiative (OAI) database. The individuals in the progressor cohort had symptomatic radiographic OA as described (https://oai.nih.gov). Serum samples were from 8 controls and 20 OA, the latter equally divided into OA-obese (n=10; body mass index ≥30 kg/m²) and OA-non-obese (n=10; BMI <30 kg/m²).

For validation purposes, fasting plasma samples were derived from the Newfoundland and Labrador cohort in which the controls were from the Complex Diseases in Newfoundland population: Environment and Genetics (CODING) [35] and the OA samples from the Newfoundland Osteoarthritis Study (NFOAS; https://www.med.mun.ca/NFOAS/Home.aspx) [36]. Plasma samples were from 20 controls and 20 OA, the latter equally divided into OA-obese (n=10) and OA-non-obese (n=10).

The characteristics of the selected individuals are listed in Table 1 (OAI) and Table 2 (CODING and NFOAS). For the OAI, the demographic, clinical, and radiographic data were obtained from the OAI database (https://oai.nih.gov).

Table 1 Osteoarthritis Initiative (OAI) participant characteristics

Table 2 Complex Diseases in Newfoundland population: Environment and Genetics (CODING) (control) and the Newfoundland Osteoarthritis Study (NFOAS) (OA) participant characteristics

All participants had provided written informed consent for their participation. For the OAI cohort, the ethics approval was obtained by each of the OAI clinical sites (University of Maryland Baltimore Institutional Review Board, Ohio State University’s Biomedical Sciences Institutional Review Board, University of Pittsburgh Institutional Review Board, and Memorial Hospital of Rhode Island Institutional Review Board) and the OAI coordinating center (Committee on Human Research at the University of California, San Francisco, CA, USA). For the CODING and NFOAS cohorts, the ethics approval was obtained from the Health Research Ethics Board of Newfoundland and Labrador.

The Institutional Ethics Committee Board of the University of Montreal Hospital Research Centre approved the use of the human serum/plasma.

Serum/plasma samples

Serum/plasma samples were obtained from the OAI (refer to the OAI operations manual detailing specimen collection and processing methods [https://oai.nih.gov]) and the CODING/NFOAS, as previously described [36, 37]. The specimens were collected after an overnight fast using a uniform protocol. For the plasma, blood was collected and plasma separated from the red cells immediately after collection by centrifugation (20,000 rpm for 10 min). Upon reception, samples for both cohorts were aliquoted, stored frozen at −80°C, and thawed at 4°C just before use.

Mass spectrometry

Preparation of serum samples

Data for the samples (non-depleted and depleted) were both acquired in Data Dependent Acquisition mode and analyzed with MaxQuant software, version 1.6.7 [38], as previously described [39].

The non-depleted samples were randomized before analysis. One microliter of each serum sample was diluted in 24 μl of sodium deoxycholate (SDC) buffer consisting of 1% deoxycholate/10 mM Tris (2-carboxyethyl)phosphine/40 mM chloroacetamide/100 mM Tris pH 8.5, heated for 10 min at 95°C, followed by treatment with a mixture of trypsin and Lys-C (Promega, Madison, WI, USA) (0.66 μg of each enzyme) for 1 h at 37°C. The digestion was stopped with 5 μl 50% formic acid causing the precipitation of deoxycholate. The samples were then centrifuged at 16,000g for 15 min at 4°C.

The peptides contained in the supernatant were purified on StageTips C18 Empore (3M, St-Paul, MN, USA) according to Rappsilber et al. [40]. Finally, the peptides were vacuum dried and stored at −20°C prior to mass spectrometry analysis.

High-abundance protein depletion for building a matching library

To increase the number of peptide/protein identifications in the final analysis, a matching library was prepared for use with the MaxQuant software, as described by Geyer et al. [41]. This used a depleted serum. By adding a library of depleted serum to the analysis, this strategy took advantage of the “match between runs” function of the MaxQuant software, where peptides identified by MS/MS in the library can be matched to the non-depleted samples to recover their quantification even without MS/MS. This library was obtained by pooling 2 μl of each patient’s serum sample, which was then depleted of high-abundance proteins using the Seppro IgY14 Spin Column kit according to the manufacturer’s protocol (Sigma-Aldrich, St Louis, MO, USA). The flow-through was collected, and the proteins were precipitated with the addition of 5 volumes of ice-cold acetone and incubated overnight at −20°C. After centrifugation at 10,000g for 10 min, the pellet was resuspended in 120 μl of SDC buffer and heated at 95°C for 10 min. After cooling, the pooled sample was digested with trypsin and Lys-C, each at a 1:100 enzyme:protein ratio, with the protein amount determined by a Bradford protein assay. The resulting peptides were purified on an Oasis HLB Cartridge (Waters) according to the manufacturer’s procedure. The peptides were then fractionated by high pH reversed-phase peptide chromatography according to Yang et al. [42]. The 12 resulting fractions were vacuum dried and stored at −20°C prior to mass spectrometry analysis.

Liquid chromatography (LC)-MSMS analysis

Both non-depleted samples and fractions of the depleted pool were analyzed, as previously described [43]. In brief, samples or fractions were resuspended with 30 μl 2% acetonitrile/0.05% trifluoroacetic acid. Protein concentration was determined at 205 nm using a NanoDrop 2000 spectrophotometer (Thermo Scientific, Waltham, MA, USA); the protein concentration was adjusted to 0.2 μg/μl. Five microliters of the resuspended peptide digestion (equivalent to 1 μg peptides) was injected on a nanoflow liquid chromatography/MSMS (nanoflow LC-tandem MS). The experiments were performed with a Dionex UltiMate 3000 nanoRSLC chromatography system (Thermo Fisher Scientific/Dionex Softron GmbH, Germering, Germany) connected to an Orbitrap Fusion Tribrid ETD mass spectrometer (Thermo Fisher Scientific, San Jose, CA, USA) equipped with a nano electrospray ion source. Peptides were trapped at 20 μl/min in a loading solvent (2% acetonitrile, 0.05% trifluoroacetic acid [44]) on a 5-mm length, 300 μm Internal Diameter (I.D.), 5 μm particles Acclaim™ PepMap™ 100 pre-column cartridge (Thermo Fisher Scientific/Dionex Softron GmbH) for 5 min. Then, the pre-column was switched online with a 500-mm length, 75 μm I.D., 3 μm particles Acclaim™ PepMap™ 100 C18 analytical column (Thermo Fisher Scientific/Dionex Softron GmbH), and the peptides were eluted with a linear gradient from 5 to 40% solvent B (A: 0.1% formic acid; B: 80% acetonitrile, 0.1% formic acid) for 90 min, at 300 nl/min. Mass spectra were acquired using a Data Dependent Acquisition mode (Thermo XCalibur software, version 4.3). Full scan mass spectra (350 to 1800 m/z) were acquired in the orbitrap using an automatic gain control (AGC) target of 4e5, a maximum injection time of 50 ms, and a resolution of 120,000. Internal calibration using lock mass on the m/z 445.12003 siloxane ion was used.
Each MS scan was followed by the acquisition of fragmentation MSMS spectra of the most intense ions for a total cycle time of 3 s (highest speed mode). The selected ions were isolated using the quadrupole analyzer in a window of 1.6 m/z and fragmented by higher energy collision-induced dissociation (HCD) with 35% of collision energy. The resulting fragments were detected by the linear ion trap at a rapid scan rate with an AGC target of 1e4 and a maximum injection time of 50 ms. Dynamic exclusion of previously fragmented peptides was set for a period of 20 s and a tolerance of 10 ppm.

Database searching and label-free quantification

Spectra were searched against a human proteins database (Uniprot Homo sapiens Reference Proteome – UP000005640 – 74435 entries - 21.04.2019) using the Andromeda module of the MaxQuant software [39]. In brief, the trypsin/P enzyme parameter was selected with two possible missed cleavages. Carbamidomethylation of cysteines was set as a fixed modification, and methionine oxidation and deamidation of glutamine and asparagine were set as variable modifications. Mass search tolerances were 5 ppm and 0.5 Dalton for MS and MS/MS, respectively. For protein validation, a maximum false discovery rate of 1% at peptide and protein levels was used based on a target/decoy search. MaxQuant was also used for label-free quantification with a minimum ratio count of 1. The “match between runs” algorithm was used with 20 min as alignment time window and 0.7 min as match time window values to enable a peptide MS1 signal match between the matching library consisting of fractions of depleted samples and the non-depleted serum samples. Only unique and razor peptides were used for quantification. All other parameters were set at default values.
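To make the two search tolerances concrete, the following minimal Python sketch shows how a relative tolerance (ppm) and an absolute tolerance (Dalton) can be checked for an observed value against a theoretical one. It is an illustration only and not the Andromeda matching code; the function names and the example values are hypothetical.

```python
def ppm_error(observed, theoretical):
    """Signed relative error in parts per million."""
    return (observed - theoretical) / theoretical * 1e6


def within_ppm(observed, theoretical, tol_ppm=5.0):
    """Relative tolerance, as used here for the MS (precursor) level."""
    return abs(ppm_error(observed, theoretical)) <= tol_ppm


def within_da(observed, theoretical, tol_da=0.5):
    """Absolute tolerance in Dalton, as used here for the MS/MS level."""
    return abs(observed - theoretical) <= tol_da


# Hypothetical values: a 0.002 deviation at 500 is about 4 ppm.
print(within_ppm(500.002, 500.0))  # inside a 5 ppm window
print(within_ppm(500.003, 500.0))  # about 6 ppm, outside the window
print(within_da(300.4, 300.0))     # inside a 0.5 Da window
```

A relative tolerance scales with the value being matched, which is why it suits the high-resolution orbitrap scans, whereas the lower-resolution ion trap fragments are matched with a fixed absolute window.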

Protein assays

The proteins tested were fibrillin-1 (FBN1), vitamin D-binding protein (VDBP), and SERPINF1. They were measured with specific assays according to the manufacturers’ specifications. FBN1 was quantitated by ELISA (dilution 1:5; #MBS3804755, MyBiosource, San Diego, CA, USA), VDBP with a Multiplex assay (dilution 1:10000; #HCCBP2MAG-58K, EMD Millipore Corporation, Billerica, MA, USA), and SERPINF1 by Luminex assay (dilution 1:4000; #LXSAHM-01, R&D systems, Minneapolis, MN, USA). Protein quantification was performed using the LiquiChip 200 apparatus, and the data analysis was performed with the LiquiChip Analyzer software (Qiagen, Toronto, ON, Canada). For each biomarker, an 8-point standard curve and appropriate controls were included, and samples were run in duplicate. The minimum detectable doses were 0.312 ng/ml for FBN1, 0.58 ng/ml for VDBP, and 3.66 pg/ml for SERPINF1.

Data treatment and statistical analysis

The proteinGroups.txt file generated by MaxQuant was used in R software, version 3.4 [45]. The intensity values of each peptide in each non-depleted serum sample were normalized using the median of all intensity values in each sample (normalization by column). For each comparison, only peptides having at least 60% of non-missing values across all the non-depleted samples were considered as quantifiable. Missing values remaining after this filtering were imputed using a noise value calculated as the first centile of all intensity values per sample (calculation per column), as previously described [46]. Only proteins with at least two quantified peptides were kept for further analysis.
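The per-sample normalization and noise imputation described above can be sketched as follows. This is a minimal illustration in Python rather than the R code used in the study. It assumes that normalization divides each intensity by the sample median (the text does not say whether division or subtraction was used) and that the first centile is obtained by linear interpolation; the function names are hypothetical, and missing values are represented by None.

```python
import statistics


def median_normalize(column):
    """Divide every observed intensity of one sample by that sample's median.

    Assumption: normalization is a division by the median.
    """
    observed = [v for v in column if v is not None]
    med = statistics.median(observed)
    return [None if v is None else v / med for v in column]


def first_percentile(values):
    """First centile, using linear interpolation between sorted values."""
    s = sorted(values)
    pos = 0.01 * (len(s) - 1)
    lo = int(pos)
    hi = min(lo + 1, len(s) - 1)
    return s[lo] + (s[hi] - s[lo]) * (pos - lo)


def impute_noise(column):
    """Replace missing values of one sample by its first-centile intensity."""
    observed = [v for v in column if v is not None]
    noise = first_percentile(observed)
    return [noise if v is None else v for v in column]


# Toy sample (one column) with one missing value.
sample = [2.0, 4.0, None, 8.0]
normalized = median_normalize(sample)
print(impute_noise(normalized))
```

Imputing with a low centile of the sample treats a missing value as a signal near the noise floor rather than as an average-intensity measurement.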

For the analysis of differential expression between two groups, a protein ratio was calculated using the average of protein intensities in all samples of the same group. These ratios were then converted into z-scores (z = (x-μ)/σ where x = log₂(ratio); μ = average of all log₂(ratios); σ = standard deviation of all log₂(ratios)) for data centering. Statistical analysis was performed using the Limma Bioconductor package [47] to define the probability of variation (p-value) of each protein between two groups. This method was preferred to the usual Student t-test as it has been shown to be less sensitive to the number of biological replicates. This was followed by the Benjamini-Hochberg method to adjust for multiple comparisons (q-value). Proteins with a q-value < 0.050 and an absolute z-score > 1.96 were considered significantly different.
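The z-score conversion and the Benjamini-Hochberg adjustment can be illustrated with a short Python sketch. It does not reproduce the Limma moderated test; the p-values are simply taken as inputs. The use of the sample standard deviation and all function names are assumptions made for illustration.

```python
import math
import statistics


def log2_ratio_zscores(ratios):
    """z = (x - mu) / sigma with x = log2(ratio), over all protein ratios."""
    logs = [math.log2(r) for r in ratios]
    mu = statistics.mean(logs)
    sigma = statistics.stdev(logs)  # assumption: sample standard deviation
    return [(x - mu) / sigma for x in logs]


def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values (q-values), input order kept."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    q = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotone q-values.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        q[i] = running_min
    return q


def is_significant(q, z, q_cut=0.050, z_cut=1.96):
    """Both criteria from the text must hold."""
    return q < q_cut and abs(z) > z_cut


print(log2_ratio_zscores([0.5, 1.0, 2.0]))
print(benjamini_hochberg([0.01, 0.04, 0.03, 0.005]))
```

Requiring both thresholds means that a protein with a small q-value but a ratio close to the center of the distribution is not called differential.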

Further, two multivariate methods were used through the MixOmics R package [48]. First, to compare the proteomic profiles, the multivariate unsupervised principal component analysis (PCA) [49] followed by the pairwise comparison was used. PCA clusters the samples by reducing the dimension of the expression data with minimum information loss, which helps visualize the similarities between their proteomic profiles. It is a linear dimension-reduction method whose loadings provide a relative weighting of the protein importance. Second, to select the most predictive/discriminative features, the supervised classification model sparse partial least squares regression discriminant analysis (sPLS-DA) [50] was used. This method is a linear classification model enabling discriminative variable selection that could predict the outcome. It seeks the components that best separate the samples. Moreover, this method provides a graphical representation of the components and proteins that assists in the interpretation of the results. The number of components and variables was defined after a tuning step to optimize the distinction between the three groups (control, OA-obese, OA-non-obese).
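As an illustration of what a principal component is, the sketch below extracts the first component of a small samples-by-proteins matrix by power iteration on the covariance matrix. It is a teaching example in plain Python and not the MixOmics implementation used in the study; the function names and the toy matrix are hypothetical, and sPLS-DA is not covered.

```python
import math


def center_columns(rows):
    """Subtract the column mean from every value (rows = samples)."""
    n, p = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(p)]
    return [[r[j] - means[j] for j in range(p)] for r in rows]


def covariance(centered):
    """Sample covariance matrix of already centered data."""
    n, p = len(centered), len(centered[0])
    return [[sum(r[i] * r[j] for r in centered) / (n - 1) for j in range(p)]
            for i in range(p)]


def first_component(rows, iterations=200):
    """Return (loadings, scores) of the first principal component."""
    centered = center_columns(rows)
    cov = covariance(centered)
    p = len(cov)
    v = [1.0 / math.sqrt(p)] * p  # starting vector
    for _ in range(iterations):
        w = [sum(cov[i][j] * v[j] for j in range(p)) for i in range(p)]
        norm = math.sqrt(sum(x * x for x in w))
        if norm == 0.0:
            # Happens if the start vector is orthogonal to all variance.
            raise ValueError("power iteration collapsed; change the start vector")
        v = [x / norm for x in w]
    scores = [sum(r[j] * v[j] for j in range(p)) for r in centered]
    return v, scores


# Toy matrix: 3 samples, 2 perfectly correlated proteins.
loadings, scores = first_component([[1.0, 1.0], [2.0, 2.0], [3.0, 3.0]])
print(loadings, scores)
```

The loadings give the weight of each protein on the component, and the scores are the sample coordinates that would be plotted on the first PCA axis.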

For the validation experiments, the differences between groups were assessed using the Student t-test. A value of p≤0.050 was considered statistically significant. Statistical analysis was performed using the GraphPad Prism 8 (San Diego, CA, USA).
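For reference, the statistic behind a two-sample Student t-test can be sketched in a few lines of Python. This assumes the classical equal-variance (pooled) form; the text does not state which variant was selected in GraphPad Prism, and the conversion of t into a p-value, which requires the t distribution, is left out. The function name and the toy data are hypothetical.

```python
import math
import statistics


def student_t(a, b):
    """Pooled-variance t statistic and degrees of freedom for two groups."""
    na, nb = len(a), len(b)
    va, vb = statistics.variance(a), statistics.variance(b)
    df = na + nb - 2
    pooled = ((na - 1) * va + (nb - 1) * vb) / df
    se = math.sqrt(pooled * (1 / na + 1 / nb))
    t = (statistics.mean(a) - statistics.mean(b)) / se
    return t, df


# Toy concentrations for two groups (not study data).
t, df = student_t([1.0, 2.0, 3.0], [2.0, 3.0, 4.0])
print(t, df)
```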

Results

Subject characteristics

Table 1 shows the characteristics of the participants from the OAI cohort comparing control, OA, OA-obese, and OA-non-obese individuals. The obese/non-obese division was performed in an attempt to single out proteins specific to obesity rather than to OA. Compared to controls, OA patients were older (p=0.037) and had higher BMI (p=0.011), Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC) scores (p≤0.0003), Kellgren-Lawrence grades (p<0.0001), and smaller medial joint space width (p<0.0001). Comparison of OA-obese with OA-non-obese showed only, as expected, that the former had a higher BMI (p=0.047). When each of the two OA subgroups was compared to controls, data were comparable to those of the total OA group, but the OA-non-obese were slightly older (p=0.023) and had a BMI similar to that of controls.

Table 2 shows that none of the participant characteristics differed between the CODING (controls) and NFOAS (OA) cohorts. Compared to the OAI controls, the CODING participants were older (p<0.0001) and had a higher BMI (p=0.002), and OA participants from the NFOAS had higher WOMAC scores (p<0.0001) than those from the OAI.

Quantitative proteomic analysis

Principal component analysis (PCA)

A shotgun proteomic analysis was performed on the non-depleted individual serum samples. Five hundred and nine (509) proteins could be identified in at least one individual sample. As mentioned above, in addition to the non-depleted, we added a depleted serum library for the database searching and quantification. Such an addition boosted the protein identification by 28%. Two hundred and seventy-nine (279) proteins (Table S1) were quantified after filtering for proteins having at least 60% of non-missing values in at least one of the two compared conditions and having two quantified peptides or more to retain only high-quality protein measurements. This data was used to explore the global proteomic profile of each sample and group through a PCA analysis. This unsupervised multivariate method (Fig. 1) generates principal component axes that best explain the variability in the data without knowing the group of the sample. The data showed that the three groups (control, OA-obese, OA-non-obese) could not be clearly distinguished based on their global proteomic profile suggesting that the differences between the groups might be from low variations in protein expression and/or variations on a small number of protein species.
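The filtering rule stated above (at least 60% of non-missing values in at least one of the two compared conditions, and two or more quantified peptides) can be written as a small predicate. This is a hedged Python sketch; the function names are hypothetical, and missing values are represented by None.

```python
def fraction_observed(values):
    """Share of non-missing intensities in one group of samples."""
    return sum(v is not None for v in values) / len(values)


def is_quantifiable(group_a, group_b, n_peptides,
                    min_fraction=0.6, min_peptides=2):
    """Keep a protein if either group is complete enough and it has enough peptides."""
    enough_values = (fraction_observed(group_a) >= min_fraction
                     or fraction_observed(group_b) >= min_fraction)
    return enough_values and n_peptides >= min_peptides


# Toy protein: sparse in group A, 3 of 5 values observed in group B.
sparse = [1.0, None, None, None, None]
dense = [1.0, 2.0, 3.0, None, None]
print(is_quantifiable(sparse, dense, n_peptides=2))
```

Applying the completeness criterion to either condition keeps proteins that are well measured in one group but largely absent in the other, which are often the most interesting ones in a differential comparison.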

Pairwise differential expression analysis

To unveil the small differences between the groups, pairwise differential analyses were performed using the protein quantitative value. Comparisons were made between controls and each of OA-non-obese and OA-obese, as well as between OA-obese and OA-non-obese. For each comparison, protein ratios were calculated between the two groups and converted into z-scores for data centering. Statistical analysis was performed with the Limma method. Table S2 lists the normalized intensity values, means, ratios, and z-scores for the 12 proteins that were found significantly differentially expressed in at least one of the three pairwise comparisons, and Table 3 summarizes the data. Of note, differential expression could not be assessed for FBN1, comparing OA-obese with controls, and for lysine-specific demethylase 4C/4E/4B (KDM4C/4B/4E), comparing OA-non-obese with controls, as these proteins could not be quantitated accurately due to missing values in the OA-obese and OA-non-obese groups, respectively (Table S2 and Table 3).

Table 3 Principal component analysis-pairwise differential expression

For these 12 proteins, pairwise comparison revealed that 8 were differentially regulated between OA-obese and controls, and also 8 between OA-non-obese and controls, with some proteins common to both comparisons (Table 3, Fig. 2A, B). No protein was found differentially regulated between the two OA subgroups (Fig. 2C). One may also note that, in Fig. 2A, B, the ratio distribution is not centered when OA-obese and OA-non-obese are compared to controls. In the latter, there are slightly fewer quantified proteins; however, the overall intensity is somewhat strong. Although this cannot be explained at present, to overcome this issue, we centered the data by calculating a z-score and considered proteins as regulated or not between two conditions based on both their q-value and z-score.

Compared to controls, data revealed that in the OA-obese group (Fig. 2A, Table 3), CRP, CRTAC1, LYZ, PTGDS, IGHD, and KDM4C/4B/4E were all upregulated, whereas ACTA1/ACTC1/ACTG2/ACTA2 and ADIPOQ were downregulated. In the OA-non-obese/control comparison (Fig. 2B), CRP, CRTAC1, LYZ, PTGDS, FBN1, IGHV3-35, KHSRP, and S100A9 were all upregulated in the OA-non-obese; PTGDS was included in the upregulated proteins as the q-value (q=0.054) showed a strong trend towards significance.

Sparse partial least squares regression discriminant analysis (sPLS-DA)

The pairwise comparison between the two OA subgroups did not reveal proteins that were significantly different and that could be related to the obesity condition. To mine deeper into the data and unveil proteins related to obesity, not specific necessarily to OA, we performed another multivariate analysis, the sPLS-DA. This supervised analysis enabled the selection of the most discriminative proteins in the data to classify the samples [50].

Data revealed that a very good classification (area under the curve [AUC] >95%) was obtained with two components. Component 1 (9 proteins; Fig. 3) comprised proteins discriminating the two OA groups from the controls, and component 2 (23 proteins; Fig. 4) discriminated the OA-non-obese from the OA-obese. In a given component, each protein does contribute in combination but not equally to the discrimination process, i.e., when a protein is removed from a component, the discriminatory strength of the component is altered.
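To clarify what an AUC above 95% means, the sketch below computes the area under the ROC curve for a binary case as the proportion of (positive, negative) sample pairs that a score ranks correctly, with ties counted as half. It is a simplified two-class illustration in Python and not the three-group evaluation performed with sPLS-DA; the function name and the toy scores are hypothetical.

```python
def auc(scores_pos, scores_neg):
    """AUC as the probability that a positive outscores a negative."""
    wins = 0.0
    for p in scores_pos:
        for n in scores_neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(scores_pos) * len(scores_neg))


# Toy component scores (not study data): one OA sample falls below a control.
oa_scores = [0.9, 0.8, 0.4]
control_scores = [0.5, 0.3]
print(auc(oa_scores, control_scores))
```

An AUC of 1 corresponds to perfect separation and 0.5 to a ranking no better than chance.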

Figure 3A illustrates a clear separation of the control group from the two OA subgroups, which is particularly visible in component 1. Figure 3B shows the contribution of each of the 9 proteins comprised in component 1 listed by order of importance—CRTAC1, GC, C1R, SERPINF1, PROS1, SEPP1, C1QC, ITIH4, and APCS. Of note, CRTAC1, which was found to contribute the most, was also identified previously in the pairwise analysis as upregulated in both OA-obese and OA-non-obese compared to controls (Fig. 2, Table 3). Figure 3C shows the intensities of the 9 proteins contributing to component 1 for each group and their comparisons between the groups. Compared to controls, both OA groups were upregulated for all 9 proteins and statistical difference was reached for all in the OA-non-obese. Although values of both OA-obese and OA-non-obese were relatively similar for all the 9 proteins, comparison between the OA-obese with controls showed that the proteins PROS1 and SEPP1 did not reach statistical difference.

Component 2 is a group of 23 proteins that discriminates OA-obese from OA-non-obese. Figure 4 shows the contribution value of each of these proteins. Importantly, none of the 23 proteins was found in component 1, which discriminates OA from controls, and only the protein ADIPOQ (with a very low contribution) was previously identified in the pairwise comparison, as downregulated in OA-obese compared to controls (Fig. 2A, Table 3 and Table S2).

Some of the proteins of component 2 were involved in the coagulation/fibrinolysis pathways or lipid metabolism. Also listed are some immunoglobulins, mostly light chains (lambda and kappa variable). Regarding the contribution of each protein to component 2, APOC1 and SERPINC1 were the proteins with the strongest contribution in the OA-non-obese group, while HPR, IGKV3-15, and APOL1 led in the OA-obese group.

Protein validation

To complement this work, three proteins (FBN1, VDBP, and SERPINF1) were compared between controls and OA using plasma from another cohort (CODING and NFOAS). Data showed that statistical difference was reached when OA was compared to controls for FBN1 (p=0.044) and VDBP (p=0.022), with a trend toward significance for SERPINF1 (p=0.064) (Fig. S1). Of note, no difference was obtained when OA-obese and OA-non-obese were compared for any of the three proteins studied (p=0.656, p=0.104, and p=0.315, respectively) (Fig. S1), suggesting that these proteins are not likely obesity-regulated.

Discussion

The search for a reliable biomarker in OA is an active field of investigation. Our study identified the proteins CRTAC1, FBN1, VDBP, and possibly SERPINF1 as potential new and OA-specific serum biomarkers.

To gather the most information about the proteomic analysis performed on our serum samples, we first assessed their proteomic profile through an unsupervised PCA analysis, then two methodologies were used to recover the most discriminative proteins involved in each group: a pairwise differential analysis based on the Limma (Student derived) statistical test and a supervised sPLS-DA analysis. The latter enabled us to find proteins discriminating OA-obese from OA-non-obese groups, which was not possible with the pairwise comparisons.

The PCA data revealed that controls can be partially discriminated from OA patients based on their global proteomic profile, while OA-obese and OA-non-obese patients cannot be differentiated. This was confirmed by pairwise differential expression analyses, which revealed that CRP, LYZ, CRTAC1, and PTGDS were all upregulated in OA individuals compared to controls. These proteins are not likely obesity-regulated as they were significantly higher than the controls in both OA-obese and OA-non-obese in addition to not being found differently regulated in the sPLS-DA component 2, which evaluates proteins between the two OA groups.

CRTAC1 appears to be a strong OA biomarker candidate as it is the only protein identified in both pairwise (increased intensity levels in OA compared to controls) and sPLS-DA (highest contribution in component 1) analyses. However, very little is known about this protein and its role, not only in OA but also in normal human physiology. Two splice variants of this gene have been reported and, in regard to articular tissues, CRTAC1-A is the predominant form in cartilage [51]. In OA knees, studies have reported that it is a glycosylated extracellular molecule found in the inter-territorial matrix of the deep zone of the cartilage as well as in synovial fluid and serum [21, 26, 51]. It is upregulated in late-stage OA cartilage compared to healthy or early OA cartilage [52, 53]. While the present work was in preparation, a proteomic study done on an Icelandic population (n=39,155 including 12,178 OA) reported that CRTAC1 was the protein most strongly associated (among 4792 proteins studied) with OA diagnosis and progression to joint replacement [54], which corroborates our finding. That study asserts that CRTAC1 is a strong and promising biomarker candidate for OA.

FBN1 is an extracellular matrix protein that assembles into microfibrils to form the template for elastic fiber formation. In the pairwise analysis, data showed its upregulation in OA-non-obese compared to controls. Unfortunately, in this analysis the protein could not be assessed in the OA-obese as it had too many missing values to assign a final score. However, in the sPLS-DA component 2, this protein did not discriminate OA-non-obese from OA-obese. Complementary experiments confirmed the statistical difference of this protein between OA and controls; in addition, no difference was found between OA-non-obese and OA-obese, suggesting that it is not likely regulated by obesity factors. FBN1 was previously identified in the synovial fluids of OA patients, but no comparison with controls was done [55]. There are three isoforms of FBN, FBN1 being the most abundant in adult tissues [56]. Related to OA, FBN1 has been reported to sequester a key factor involved in the disease’s cartilage and bone, the latent TGF-β1 complex, regulating its bioavailability [57,58,59]. In addition, FBN1 was found associated with two other musculoskeletal diseases, systemic sclerosis and Marfan syndrome [60,61,62,63]. FBN1 would be an interesting molecule for further analysis as a potential OA biomarker.

Other proteins were found upregulated in OA compared to controls: CRP, LYZ, and PTGDS. However, on their own they would not be suitable choices as specific OA biomarkers because of their rather non-specific role (CRP, a general marker of inflammation [64, 65]; LYZ, an antibacterial role) or their strong link to other pathological conditions (PTGDS) [66,67,68,69,70]. Nonetheless, it is worth mentioning that a ratio of serum CRP to another molecule (monocyte chemoattractant protein-1 [MCP-1]) was suggested as an OA biomarker. This ratio has been found to be associated with OA symptoms and, in combination with other factors, to predict which OA individuals will show structural degenerative progression of the knee [37, 71]. Furthermore, CRP is also known to activate the classical complement pathway by binding to C1q [72]. Although we did not identify C1q in the PCA-pairwise analysis, it was found to be a contributor to component 1 in the sPLS-DA analysis.

Several other proteins showed differential regulation in the pairwise analysis but are likely obesity-related, and thus not specific to the whole OA population. These included IGHD and KDM4C, which were upregulated in OA-obese, and KHSRP, S100A9, and IGHV3-35, which were upregulated in OA-non-obese, whereas ADIPOQ and ACT were downregulated only in OA-obese.

The sPLS-DA complemented the differential expression findings and further identified proteins that discriminated both OA-obese and OA-non-obese from controls (component 1), as well as OA-obese from OA-non-obese individuals (component 2). This analysis offers insight into which proteins contribute to the discrimination of given groups and how important each contribution is.
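To make the notion of a protein's "contribution" to a component more concrete, the following is a minimal sketch of the soft-thresholding idea behind a sparse first component for a two-group outcome. It is a simplified illustration only, not the mixOmics implementation used in the study: the function names, the `keep` parameter, and the intensity values are all hypothetical, and the data are synthetic.

```python
import math


def center_columns(X):
    """Subtract each column mean so that features are centered."""
    n = len(X)
    means = [sum(col) / n for col in zip(*X)]
    return [[v - m for v, m in zip(row, means)] for row in X]


def soft_threshold(values, lam):
    """Shrink each value toward zero by lam; small values become exactly zero."""
    return [math.copysign(max(abs(v) - lam, 0.0), v) for v in values]


def sparse_first_component(X, y, keep):
    """Return unit-norm loadings with at most `keep` non-zero features.

    X: list of samples, each a list of feature intensities (synthetic here).
    y: list of 0/1 group labels (e.g. 0 = control, 1 = OA).
    """
    Xc = center_columns(X)
    y_mean = sum(y) / len(y)
    yc = [v - y_mean for v in y]
    # Covariance-like score between each feature and the group indicator.
    cov = [sum(row[j] * yi for row, yi in zip(Xc, yc)) for j in range(len(X[0]))]
    # Use the (keep+1)-th largest absolute score as the threshold,
    # so that only the top `keep` features survive (fewer if scores tie).
    ranked = sorted((abs(c) for c in cov), reverse=True)
    lam = ranked[keep] if keep < len(ranked) else 0.0
    w = soft_threshold(cov, lam)
    norm = math.sqrt(sum(v * v for v in w))
    return [v / norm for v in w] if norm > 0 else w


# Synthetic log-intensities for four hypothetical proteins (columns).
feature_names = ["protein_A", "protein_B", "protein_C", "protein_D"]
X = [
    [1.0, 2.0, 5.0, 3.0],  # controls
    [1.2, 2.1, 4.8, 3.1],
    [0.9, 1.9, 5.1, 2.9],
    [3.0, 2.6, 5.0, 3.0],  # OA
    [3.2, 2.4, 4.9, 3.1],
    [2.9, 2.5, 5.2, 2.9],
]
y = [0, 0, 0, 1, 1, 1]

loadings = sparse_first_component(X, y, keep=2)
for name, w in zip(feature_names, loadings):
    print(f"{name}: {w:.3f}")
```

In this toy example, the absolute size of each non-zero loading plays the role of the "contribution" discussed above, and features shrunk to zero are not selected for the component.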

Several proteins comprising component 1 (OA vs. controls) are molecules for which there are few or no reports of an association with OA, and they thus offer novel potential candidates for OA biomarker research. The sPLS-DA revealed that the abovementioned CRTAC1 protein contributed the most towards the discrimination of OA and controls. The second contributor was VDBP, and validation experiments demonstrated a significant difference between OA and controls and, as for FBN1, none between the OA subgroups. VDBP is a multifunctional protein that not only binds to vitamin D but also has several other physiological functions such as actin scavenging, binding of fatty acids, and chemotaxis [73]. There has been only one OA study, which showed increased levels of VDBP and vitamin D receptors in muscles from patients with end-stage knee OA compared to controls [74]. As knee muscles are gaining great interest regarding their impact on OA progression, this protein should be studied further as an OA biomarker.

Two other proteins in component 1, C1R and C1QC, are directly involved in the first step of the classical complement cascade. Of note, the contribution of C1QC comes from the OA-obese individuals and is thus probably related to obesity. C1 proteases can also cleave non-complement proteins including the LDL receptor-related protein 6, IGFBP5, and nucleolin [75]. The presence of complement proteins in this list was not unexpected, as previous studies reported the activation of the complement cascade in OA [37, 76, 77]. As complement proteins are activated in various diseases as well as in general inflammation processes, the abovementioned proteins would not be very useful as specific OA markers. It has previously been reported that one of the complement proteins, as with CRP, could be of use as a biomarker for OA cartilage degradation in OA-obese individuals when employed in a ratio with another molecule. Specifically, the adipokine adipsin, a component of the alternative complement pathway, when combined as a ratio with MCP-1, was found to be strongly associated with knee cartilage volume loss in OA-obese individuals [37].

SERPINF1, as its name indicates, belongs to the serpin family, but does not display the serine protease inhibitory activity shown by many of its family members. The SERPINF1 gene codes for the pigment epithelium-derived factor (PEDF), which was found to exacerbate joint cartilage damage in mice in an in vivo model of inflammatory joint destruction (monosodium iodoacetate) [78]. However, PEDF production in the joint is somewhat controversial, as it was found upregulated in human OA cartilage in two studies [78, 79], while another showed no expression in articular chondrocytes but an upregulation in osteophytic chondrocytes [80]. With regard to musculoskeletal disease, the heritable disorder osteogenesis imperfecta, characterized by bone fragility and low bone mass, is caused by mutations in the SERPINF1 gene [81, 82]. Validation experiments showed a numerical trend toward significance when OA was compared to controls. This protein therefore needs more support as a potential OA biomarker, and further analysis is suggested.

The other, lesser-contributing proteins in the sPLS-DA component 1 included PROS1, a vitamin K-dependent plasma protein that functions as a cofactor for the anticoagulant protease (activated protein C) in the degradation of coagulation factors Va and VIIIa; SEPP1, a selenoprotein implicated as an extracellular antioxidant and in the transport of selenium to extra-hepatic tissues; ITIH4, a member of the serine protease inhibitor family with diverse functions, including acting as a matrix-stabilizing molecule [83]; and APCS (amyloid P component serum), a glycoprotein capable of binding to apoptotic cells at an early stage and associated with the innate immune system. As for C1QC, the contribution of APCS in component 1 comes from the OA-obese individuals and is thus probably related to obesity. Although these proteins have not been specifically studied with respect to their role in OA, some have been associated with other arthritis pathologies including rheumatoid arthritis [84, 85], lupus [86], Kashin-Beck disease [87], and ankylosing spondylitis [88].

Data from sPLS-DA’s component 2 offered important information on proteins differentially regulated between OA-obese and OA-non-obese, which are potentially related to obesity. Obesity is a well-known and major risk factor for OA, but not all OA patients are obese. Thus, in the search for a specific OA biomarker, it is important to focus on molecules that are regulated in the general OA population and to avoid proteins related to other pathological conditions such as obesity. Among the 23 proteins identified in component 2, none were found in component 1 (discriminating OA from controls), and thus they are mostly related to conditions other than OA. Several of these proteins are involved in lipid metabolism: apolipoproteins A1, C1, and L1; paraoxonase 1 (PON1), which binds to HDL; HPR, which is known to associate with APOL1-containing HDL; and ADIPOQ, an adipokine involved in the control of fat metabolism and insulin sensitivity, which is also listed in the PCA analysis. Notably, a number of apolipoproteins and serpins identified as part of component 2 are among the highest contributors, and a number of those proteins have been studied as to their presence/levels in OA [19, 89,90,91,92,93,94]. Some others, such as SERPINC1, coagulation factor XII (involved in contact activation pathways), and protein C (PROC), are involved in the coagulation/fibrinolysis pathways, which are known to be activated in OA as well as in obesity [95,96,97,98,99,100]. However, it cannot be ascertained whether these pathways are specific to OA or rather to obesity. It is our opinion that the use of those proteins in the search for specific OA biomarkers should not be pursued. Nevertheless, in the OA-obese subgroup, some of these proteins, including APOA1 and SERPINC1, would be worth studying, as they may amplify/accelerate the OA process in these individuals and thus could serve as therapeutic targets.

Although our study has identified potential biomarkers, it has limitations. First, the cohorts used (proteomics: OAI; validation: CODING and NFOAS) included individuals from the USA and Canada, respectively. Validation of our results in cohorts from other countries would be required to determine whether those proteins could indeed be further studied as biomarkers. Second, stratification by sex could also be performed, as it is well known that there are sex-specific differences in OA [101,102,103]. In this study, we could not perform such a stratification, as we had a relatively modest sample size, which was limited by the methodology used. A technique allowing a greater sample size should permit it. Third, despite data filtering, some of the proteins (for example CRP, KDM4C, FBN1, and actin) showed a high number of imputed noise values across the whole dataset (Table S1), which might have biased the reported fold changes. However, as some of the proteins selected as new potential biomarkers for the entire OA population, including FBN1, were further validated using samples from an external cohort, the risk of reporting wrong biomarkers is reduced. Moreover, the use of a larger cohort combined with other proteomic analysis strategies could confirm our findings.
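The third limitation can be illustrated with a small numerical sketch of how imputing missing intensities with a low "noise" value can shift a fold change. Everything here is an assumption made for illustration: the intensity values are synthetic, the constant `NOISE_LEVEL` is hypothetical, and replacing missing values with a single constant is a simplification that may differ from the imputation strategy actually used in the study.

```python
def mean(values):
    return sum(values) / len(values)


def log2_fold_change(case, control, impute_value=None):
    """Difference of mean log2 intensities (case minus control).

    Missing measurements are represented by None. When impute_value is None,
    missing values are dropped; otherwise they are replaced by impute_value.
    A group with no observed values is not handled in this sketch.
    """
    def prepare(values):
        if impute_value is None:
            return [v for v in values if v is not None]
        return [impute_value if v is None else v for v in values]

    return mean(prepare(case)) - mean(prepare(control))


# Synthetic log2 intensities for one hypothetical protein.
control = [20.0, None, None, 20.4]  # two missing values in controls
oa = [21.0, 21.2, 20.8, 21.0]       # fully observed in OA

NOISE_LEVEL = 15.0  # hypothetical low value standing in for imputed noise

fc_observed = log2_fold_change(oa, control)
fc_imputed = log2_fold_change(oa, control, impute_value=NOISE_LEVEL)

print(f"log2 fold change, observed values only: {fc_observed:.2f}")
print(f"log2 fold change, with imputed noise:   {fc_imputed:.2f}")
```

With these toy numbers, the imputed values pull the control mean down and enlarge the apparent fold change, which is why proteins with many imputed values benefit from validation with an independent assay.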

Conclusion

In OA, current diagnoses are not sensitive enough to identify the disease in the early stages. To improve therapeutic approaches for the prevention or delay of the progression of this disease, the identification of specific molecules/biomarkers enabling early determination of this disease is needed. At present, there are no such validated specific serum biochemical markers. As a novel contribution, by using proteomics/mass spectrometry and targeting disease-specific proteins, we identified four potential new serum biomarker candidates for the entire OA population: CRTAC1, FBN1, VDBP, and possibly SERPINF1.

Availability of data and materials

Data from the Osteoarthritis Initiative (OAI) cohort are publicly available (https://data-archive.nimh.nih.gov/oai/). All mass spectrometry data (raw files and MaxQuant search result files) are publicly available in the ProteomeXchange repository (www.proteomexchange.org) with the identifier PXD032112. Additional data may be obtained upon reasonable request to JMP, provided the request is evaluated as scientifically relevant and pertinent.

Abbreviations

ACT:: Actins
ADIPOQ:: Adiponectin
A0A0G2JMB2:: Ig alpha-2 chain C region (fragment)
APCS:: Serum amyloid P
APOA1:: Apolipoprotein A-I
APOC1:: Apolipoprotein C-I
APOL1:: Apolipoprotein L1
C1QC:: Complement C1q subcomponent subunit C
C1R:: Complement C1r subcomponent
CFHR4:: Complement factor H-related protein 4
CRP:: C-reactive protein
CRTAC1:: Cartilage acidic protein 1
DBH:: Dopamine beta-hydroxylase
FBN1:: Fibrillin 1
FETUB:: Fetuin-B
F12:: Coagulation factor XII
GC (VDBP):: Vitamin D-binding protein
GP1BA:: Platelet glycoprotein Ib alpha chain;Glycocalicin
GPX3:: Glutathione peroxidase;Glutathione peroxidase 3
HPR:: Haptoglobin-related protein
Ig:: Immunoglobulin
IGHD:: Ig delta chain C region
IGFALS:: Insulin-like growth factor-binding protein complex acid labile subunit
IGLV2-14:: Ig lambda chain V-II region TOG
IGLV4-69:: Ig lambda variable 4-69
IGKV2-24:: Ig kappa variable 2-24
IGLV2-23:: Ig lambda chain V-II region NEI
IGKV3-15:: Ig kappa chain V-III region POM
IGLV3-21:: Ig lambda chain V-III region LOI
IGHV3-35:: Ig heavy variable 3-35
IGHV3OR16-12:: Ig Heavy Variable 3/OR16-12 (Non-Functional)
ITIH4:: Inter-Alpha-Trypsin Inhibitor Heavy Chain 4
KDM4C/4B/4E:: Lysine-specific demethylase 4C/4B/4E
KHSRP:: KH-Type Splicing Regulatory Protein
LYZ:: Lysozyme
MCP-1:: Monocyte chemoattractant protein-1
MS:: Mass spectrometry
OA:: Osteoarthritis
PCA:: Principal component analysis
PON1:: Serum paraoxonase/arylesterase 1
PROC:: Vitamin K-dependent protein C
PROS1:: Vitamin K-dependent protein S
PTGDS:: Prostaglandin-H2 D-isomerase
SEPP1:: Selenoprotein P
SERPINA6:: Corticosteroid-binding globulin
SERPINA3:: Alpha-1-antichymotrypsin
SERPINC1:: Antithrombin-III
SERPINF1:: Pigment epithelium-derived factor
sPLS-DA:: Sparse partial least squares regression discriminant analysis
S100A9:: S100 Calcium Binding Protein A9

References

Felson DT, Naimark A, Anderson J, Kazis L, Castelli W, Meenan RF. The prevalence of knee osteoarthritis in the elderly. The Framingham Osteoarthritis Study. Arthritis Rheum. 1987;30:914–8.
Cross M, Smith E, Hoy D, Nolte S, Ackerman I, Fransen M, et al. The global burden of hip and knee osteoarthritis: estimates from the global burden of disease 2010 study. Ann Rheum Dis. 2014;73(7):1323–30.
Yoshimura N, Muraki S, Nakamura K, Tanaka S. Epidemiology of the locomotive syndrome: The research on osteoarthritis/osteoporosis against disability study 2005-2015. Mod Rheumatol. 2017;27(1):1–7.
Neogi T. The epidemiology and impact of pain in osteoarthritis. Osteoarthritis Cartilage. 2013;21(9):1145–53.
Kingsbury SR, Gross HJ, Isherwood G, Conaghan PG. Osteoarthritis in Europe: impact on health status, work productivity and use of pharmacotherapies in five European countries. Rheumatology (Oxford). 2014;53(5):937–47.
Xie F, Kovic B, Jin X, He X, Wang M, Silvestre C. Economic and humanistic burden of osteoarthritis: a systematic review of large sample studies. Pharmacoeconomics. 2016;34(11):1087–100.
Heard BJ, Rosvold JM, Fritzler MJ, El-Gabalawy H, Wiley JP, Krawetz RJ. A computational method to differentiate normal individuals, osteoarthritis and rheumatoid arthritis patients using serum biomarkers. J R Soc Interface. 2014;11(97):20140428.
Lourido L, Ayoglu B, Fernandez-Tajes J, Oreiro N, Henjes F, Hellstrom C, et al. Discovery of circulating proteins associated to knee radiographic osteoarthritis. Sci Rep. 2017;7(1):137.
Camacho-Encina M, Balboa-Barreiro V, Rego-Perez I, Picchi F, VanDuin J, Qiu J, et al. Discovery of an autoantibody signature for the early diagnosis of knee osteoarthritis: data from the Osteoarthritis Initiative. Ann Rheum Dis. 2019;78(12):1699–705.
Carlson AK, Rawle RA, Wallace CW, Brooks EG, Adams E, Greenwood MC, et al. Characterization of synovial fluid metabolomic phenotypes of cartilage morphological changes associated with osteoarthritis. Osteoarthritis Cartilage. 2019;27(8):1174–84.
Gharbi M, Deberg M, Henrotin Y. Application for proteomic techniques in studying osteoarthritis: a review. Front Physiol. 2011;2:90.
Saleem S, Tariq S, Aleem I, Sadr-Ul S, Tahseen M, Atiq A, et al. Proteomics analysis of colon cancer progression. Clin Proteomics. 2019;16:44.
Borne Y, Fagerberg B, Sallsten G, Hedblad B, Persson M, Melander O, et al. Biomarkers of blood cadmium and incidence of cardiovascular events in non-smokers: results from a population-based proteomics study. Clin Proteomics. 2019;16:21.
Niu L, Geyer PE, Wewer Albrechtsen NJ, Gluud LL, Santos A, Doll S, et al. Plasma proteome profiling discovers novel proteins associated with non-alcoholic fatty liver disease. Mol Syst Biol. 2019;15(3):e8793.
Pena MJ, Mischak H, Heerspink HJ. Proteomics for prediction of disease progression and response to therapy in diabetic kidney disease. Diabetologia. 2016;59(9):1819–31.
Hanash S. Progress in mining the human proteome for disease applications. OMICS. 2011;15(3):133–9.
Gobezie R, Kho A, Krastins B, Sarracino DA, Thornhill TS, Chase M, et al. High abundance synovial fluid proteome: distinct profiles in health and osteoarthritis. Arthritis Res Ther. 2007;9(2):R36.
Fischer R, Trudgian DC, Wright C, Thomas G, Bradbury LA, Brown MA, et al. Discovery of candidate serum proteomic and metabolomic biomarkers in ankylosing spondylitis. Mol Cell Proteomics. 2012;11(2):M111 013904.
Takinami Y, Yoshimatsu S, Uchiumi T, Toyosaki-Maeda T, Morita A, Ishihara T, et al. Identification of potential prognostic markers for knee osteoarthritis by serum proteomic analysis. Biomark Insights. 2013;8:85–95.
Wanner J, Subbaiah R, Skomorovska-Prokvolit Y, Shishani Y, Boilard E, Mohan S, et al. Proteomic profiling and functional characterization of early and late shoulder osteoarthritis. Arthritis Res Ther. 2013;15(6):R180.
Ritter SY, Collins J, Krastins B, Sarracino D, Lopez M, Losina E, et al. Mass spectrometry assays of plasma biomarkers to predict radiographic progression of knee osteoarthritis. Arthritis Res Ther. 2014;16(5):456.
Sierra-Sanchez A, Garrido-Martin D, Lourido L, Gonzalez-Gonzalez M, Diez P, Ruiz-Romero C, et al. Screening and validation of novel biomarkers in osteoarticular pathologies by comprehensive combination of protein array technologies. J Proteome Res. 2017;16(5):1890–9.
Malekzadeh A, Leurs C, van Wieringen W, Steenwijk MD, Schoonheim MM, Amann M, et al. Plasma proteome in multiple sclerosis disease progression. Ann Clin Transl Neurol. 2019;6(9):1582–94.
Mun S, Lee J, Park A, Kim HJ, Lee YJ, Son H, et al. Proteomics approach for the discovery of rheumatoid arthritis biomarkers using mass spectrometry. Int J Mol Sci. 2019;20(18):4368.
Fernandez-Puente P, Mateos J, Fernandez-Costa C, Oreiro N, Fernandez-Lopez C, Ruiz-Romero C, et al. Identification of a panel of novel serum osteoarthritis biomarkers. J Proteome Res. 2011;10(11):5095–101.
Ritter SY, Subbaiah R, Bebek G, Crish J, Scanzello CR, Krastins B, et al. Proteomic analysis of synovial fluid from the osteoarthritic knee: comparison with transcriptome analyses of joint tissues. Arthritis Rheum. 2013;65(4):981–92.
Steinberg J, Ritchie GRS, Roumeliotis TI, Jayasuriya RL, Clark MJ, Brooks RA, et al. Integrative epigenomics, transcriptomics and proteomics of patient chondrocytes reveal genes and pathways involved in osteoarthritis. Sci Rep. 2017;7(1):8935.
Hsueh MF, Khabut A, Kjellstrom S, Onnerfjord P, Kraus VB. Elucidating the molecular composition of cartilage by proteomics. J Proteome Res. 2016;15(2):374–88.
Folkesson E, Turkiewicz A, Englund M, Onnerfjord P. Differential protein expression in human knee articular cartilage and medial meniscus using two different proteomic methods: a pilot analysis. BMC Musculoskelet Disord. 2018;19(1):416.
Coggon D, Reading I, Croft P, McLaren M, Barrett D, Cooper C. Knee osteoarthritis and obesity. Int J Obes Relat Metab Disord. 2001;25(5):622–7.
Johnson VL, Hunter DJ. The epidemiology of osteoarthritis. Best Pract Res Clin Rheumatol. 2014;28(1):5–15.
Thijssen E, van Caam A, van der Kraan PM. Obesity and osteoarthritis, more than just wear and tear: pivotal roles for inflamed adipose tissue and dyslipidaemia in obesity-induced osteoarthritis. Rheumatology (Oxford). 2015;54(4):588–600.
Berenbaum F, Wallace IJ, Lieberman DE, Felson DT. Modern-day environmental factors in the pathogenesis of osteoarthritis. Nat Rev Rheumatol. 2018;14(11):674–81.
Misra D, Fielding RA, Felson DT, Niu J, Brown C, Nevitt M, et al. Risk of knee osteoarthritis with obesity, sarcopenic obesity, and sarcopenia. Arthritis Rheumatol. 2019;71(2):232–7.
Fontaine-Bisson B, Thorburn J, Gregory A, Zhang H, Sun G. Melanin-concentrating hormone receptor 1 polymorphisms are associated with components of energy balance in the Complex Diseases in the Newfoundland Population: Environment and Genetics (CODING) study. Am J Clin Nutr. 2014;99(2):384–91.
Werdyani S, Liu M, Zhang H, Sun G, Furey A, Randell EW, et al. Endotypes of primary osteoarthritis identified by plasma metabolomics analysis. Rheumatology (Oxford). 2021;60(6):2735–44.
Martel-Pelletier J, Tardif G, Rousseau Trepanier J, Abram F, Dorais M, Raynauld JP, et al. The ratio adipsin/MCP-1 is strongly associated with structural changes and CRP/MCP-1 with symptoms in obese knee osteoarthritis subjects: data from the Osteoarthritis Initiative. Osteoarthritis Cartilage. 2019;28(8):1163–73.
Cox J, Hein MY, Luber CA, Paron I, Nagaraj N, Mann M. Accurate proteome-wide label-free quantification by delayed normalization and maximal peptide ratio extraction, termed MaxLFQ. Mol Cell Proteomics. 2014;13(9):2513–26.
Sheta R, Roux-Dalvai F, Woo CM, Fournier F, Bourassa S, Bertozzi CR, et al. Proteomic dataset for altered glycoprotein expression upon GALNT3 knockdown in ovarian cancer cells. Data Brief. 2016;8:342–9.
Rappsilber J, Mann M, Ishihama Y. Protocol for micro-purification, enrichment, pre-fractionation and storage of peptides for proteomics using StageTips. Nat Protoc. 2007;2(8):1896–906.
Geyer PE, Kulak NA, Pichler G, Holdt LM, Teupser D, Mann M. Plasma proteome profiling to assess human health and disease. Cell Syst. 2016;2(3):185–95.
Yang F, Shen Y, Camp DG 2nd, Smith RD. High-pH reversed-phase chromatography with fraction concatenation for 2D proteomic analysis. Expert Rev Proteomics. 2012;9(2):129–34.
Sheta R, Woo CM, Roux-Dalvai F, Fournier F, Bourassa S, Droit A, et al. A metabolic labeling approach for glycoproteomic analysis reveals altered glycoprotein expression upon GALNT3 knockdown in ovarian cancer cells. J Proteomics. 2016;145:91–102.
Adamczyk L, Adkins JK, Agakishiev G, Aggarwal MM, Ahammed Z, Alekseev I, et al. Beam-energy dependence of the directed flow of protons, antiprotons, and pions in Au+Au collisions. Phys Rev Lett. 2014;112(16):162301.
R Core Team. R: a language and environment for statistical computing. Vienna: R Foundation for Statistical Computing; 2017. https://www.R-project.org/.
Lazar C, Gatto L, Ferro M, Bruley C, Burger T. Accounting for the multiple natures of missing values in label-free quantitative proteomics data sets to compare imputation strategies. J Proteome Res. 2016;15(4):1116–25.
Ritchie ME, Phipson B, Wu D, Hu Y, Law CW, Shi W, et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 2015;43(7):e47.
Rohart F, Gautier B, Singh A, Cao K-AL. mixOmics: An R package for ‘omics feature selection and multiple data integration. PLoS Comput Biol. 2017;13(11):e1005752.
Jolliffe IT, Cadima J. Principal component analysis: a review and recent developments. Philos Trans Ser A Math Phys Eng Sci. 2016;374(2065):20150202.
Le Cao KA, Boitard S, Besse P. Sparse PLS discriminant analysis: biologically relevant feature selection and graphical displays for multiclass problems. BMC Bioinformatics. 2011;12:253.
Steck E, Braun J, Pelttari K, Kadel S, Kalbacher H, Richter W. Chondrocyte secreted CRTAC1: a glycosylated extracellular matrix molecule of human articular cartilage. Matrix Biol. 2007;26(1):30–41.
Ijiri K, Zerbini LF, Peng H, Otu HH, Tsuchimochi K, Otero M, et al. Differential expression of GADD45beta in normal and osteoarthritic cartilage: potential role in homeostasis of articular chondrocytes. Arthritis Rheum. 2008;58(7):2075–87.
Aigner T, Fundel K, Saas J, Gebhard PM, Haag J, Weiss T, et al. Large-scale gene expression profiling reveals major pathogenetic pathways of cartilage degeneration in osteoarthritis. Arthritis Rheum. 2006;54(11):3533–44.
Styrkarsdottir U, Lund SH, Saevarsdottir S, Magnusson MI, Gunnarsdottir K, Norddahl GL, et al. The CRTAC1 protein in plasma associates with osteoarthritis and predicts progression to joint replacements: a large-scale proteomics scan in Iceland. Arthritis Rheumatol. 2021;73(11):2025–34.
Balakrishnan L, Nirujogi RS, Ahmad S, Bhattacharjee M, Manda SS, Renuse S, et al. Proteomic analysis of human osteoarthritis synovial fluid. Clin Proteomics. 2014;11(1):6.
Thomson J, Singh M, Eckersley A, Cain SA, Sherratt MJ, Baldock C. Fibrillin microfibrils and elastic fibre proteins: functional interactions and extracellular regulation of growth factors. Semin Cell Dev Biol. 2019;89:109–17.
Chaudhry SS, Cain SA, Morgan A, Dallas SL, Shuttleworth CA, Kielty CM. Fibrillin-1 regulates the bioavailability of TGFbeta1. J Cell Biol. 2007;176(3):355–67.
Nistala H, Lee-Arteaga S, Smaldone S, Siciliano G, Carta L, Ono RN, et al. Fibrillin-1 and -2 differentially modulate endogenous TGF-beta and BMP bioavailability during bone formation. J Cell Biol. 2010;190(6):1107–21.
Sengle G, Tsutsui K, Keene DR, Tufa SF, Carlson EJ, Charbonneau NL, et al. Microenvironmental regulation by fibrillin-1. PLoS Genet. 2012;8(1):e1002425.
Lee B, Godfrey M, Vitale E, Hori H, Mattei MG, Sarfarazi M, et al. Linkage of Marfan syndrome and a phenotypically related disorder to two different fibrillin genes. Nature. 1991;352(6333):330–4.
Ramirez F, Pereira L, Zhang H, Lee B. The fibrillin-Marfan syndrome connection. BioEssays. 1993;15(9):589–94.
Tan FK, Arnett FC, Antohi S, Saito S, Mirarchi A, Spiera H, et al. Autoantibodies to the extracellular matrix microfibrillar protein, fibrillin-1, in patients with scleroderma and other connective tissue diseases. J Immunol. 1999;163(2):1066–72.
Villano M, Borghini A, Manetti M, Gabbrielli E, Rossi A, Sestini P, et al. Systemic sclerosis sera affect fibrillin-1 deposition by dermal blood microvascular endothelial cells: therapeutic implications of cyclophosphamide. Arthritis Res Ther. 2013;15(4):R90.
Bray C, Bell LN, Liang H, Haykal R, Kaiksow F, Mazza JJ, et al. Erythrocyte sedimentation rate and C-reactive protein measurements and their relevance in clinical medicine. WMJ. 2016;115(6):317–21.
Soeki T, Sata M. Inflammatory biomarkers and atherosclerosis. Int Heart J. 2016;57(2):134–9.
Harrington MG, Fonteh AN, Biringer RG, AF RH, Cowan RP. Prostaglandin D synthase isoforms from cerebrospinal fluid vary with brain pathology. Dis Markers. 2006;22(1-2):73–81.
Cheung CL, Cheung TT, Lam KS, Cheung BM. Reduced serum beta-trace protein is associated with metabolic syndrome. Atherosclerosis. 2013;227(2):404–7.
White CA, Ghazan-Shahi S, Adams MA. beta-Trace protein: a marker of GFR and other biological pathways. Am J Kidney Dis. 2015;65(1):131–46.
Alves MR, Do Amaral NS, Marchi FA, Silva FIB, Da Costa A, Carvalho KC, et al. Prostaglandin D2 expression is prognostic in high-grade serous ovarian cancer. Oncol Rep. 2019;41(4):2254–64.
Choi DJ, An J, Jou I, Park SM, Joe EH. A Parkinson's disease gene, DJ-1, regulates anti-inflammatory roles of astrocytes through prostaglandin D2 synthase expression. Neurobiol Dis. 2019;127:482–91.
Bonakdari H, Jamshidi A, Pelletier JP, Abram F, Tardif G, Martel-Pelletier J. A warning machine learning algorithm for early knee osteoarthritis structural progressor patient screening. Ther Adv Musculoskel Dis. 2021;13:1–16.
Haapasalo K, Meri S. Regulation of the complement system by pentraxins. Front Immunol. 2019;10:1750.
Delanghe JR, Speeckaert R, Speeckaert MM. Behind the scenes of vitamin D binding protein: more than vitamin D binding. Best Pract Res Clin Endocrinol Metab. 2015;29(5):773–86.
Brennan-Speranza TC, Mor D, Mason RS, Bartlett JR, Duque G, Levinger I, et al. Skeletal muscle vitamin D in patients with end stage osteoarthritis of the knee. J Steroid Biochem Mol Biol. 2017;173:180–4.
Lu J, Kishore U. C1 complex: an adaptable proteolytic module for complement and non-complement functions. Front Immunol. 2017;8:592.
Struglics A, Okroj M, Sward P, Frobell R, Saxne T, Lohmander LS, et al. The complement system is activated in synovial fluid from subjects with knee injury and from patients with osteoarthritis. Arthritis Res Ther. 2016;18(1):223.
Wang Q, Rozelle AL, Lepus CM, Scanzello CR, Song JJ, Larsen DM, et al. Identification of a central role for complement in osteoarthritis. Nat Med. 2011;17(12):1674–9.
Nakamura DS, Hollander JM, Uchimura T, Nielsen HC, Zeng L. Pigment Epithelium-Derived Factor (PEDF) mediates cartilage matrix loss in an age-dependent manner under inflammatory conditions. BMC Musculoskelet Disord. 2017;18(1):39.
Pfander D, Grimmer C, Aigner T, Swoboda B, Schmidt R, Cramer T. Pigment epithelium derived factor--the product of the EPC-1 gene--is expressed by articular chondrocytes and up regulated in osteoarthritis. Ann Rheum Dis. 2006;65(7):965–7.
Klinger P, Beyer C, Ekici AB, Carl HD, Schett G, Swoboda B, et al. The transient chondrocyte phenotype in human osteophytic cartilage: a role of pigment epithelium-derived factor? Cartilage. 2013;4(3):249–55.
Becker J, Semler O, Gilissen C, Li Y, Bolz HJ, Giunta C, et al. Exome sequencing identifies truncating mutations in human SERPINF1 in autosomal-recessive osteogenesis imperfecta. Am J Hum Genet. 2011;88(3):362–71.
Homan EP, Rauch F, Grafe I, Lietman C, Doll JA, Dawson B, et al. Mutations in SERPINF1 cause osteogenesis imperfecta type VI. J Bone Miner Res. 2011;26(12):2798–803.
Zhuo L, Hascall VC, Kimata K. Inter-alpha-trypsin inhibitor, a covalent protein-glycosaminoglycan-protein complex. J Biol Chem. 2004;279(37):38079–82.
Kawaguchi H, Matsumoto I, Osada A, Kurata I, Ebe H, Tanaka Y, et al. Identification of novel biomarker as citrullinated inter-alpha-trypsin inhibitor heavy chain 4, specifically increased in sera with experimental and rheumatoid arthritis. Arthritis Res Ther. 2018;20(1):66.
Pagani S, Bellan M, Mauro D, Castello LM, Avanzi GC, Lewis MJ, et al. New insights into the role of Tyro3, Axl, and Mer receptors in rheumatoid arthritis. Dis Markers. 2020;2020:1614627.
Recarte-Pelz P, Tassies D, Espinosa G, Hurtado B, Sala N, Cervera R, et al. Vitamin K-dependent proteins GAS6 and Protein S and TAM receptors in patients of systemic lupus erythematosus: correlation with common genetic variants and disease activity. Arthritis Res Ther. 2013;15(2):R41.
Sun W, Wang X, Zou X, Song R, Du X, Hu J, et al. Selenoprotein P gene r25191g/a polymorphism and quantification of selenoprotein P mRNA level in patients with Kashin-Beck disease. Br J Nutr. 2010;104(9):1283–7.
Lee JH, Jung JH, Kim J, Baek WK, Rhee J, Kim TH, et al. Proteomic analysis of human synovial fluid reveals potential diagnostic biomarkers for ankylosing spondylitis. Clin Proteomics. 2020;17:20.
Sanchez-Enriquez S, Torres-Carrillo NM, Vazquez-Del Mercado M, Salgado-Goytia L, Rangel-Villalobos H, Munoz-Valle JF. Increase levels of apo-A1 and apo B are associated in knee osteoarthritis: lack of association with VEGF-460 T/C and +405 C/G polymorphisms. Rheumatol Int. 2008;29(1):63–8.
Oliviero F, Sfriso P, Baldo G, Dayer JM, Giunco S, Scanu A, et al. Apolipoprotein A-I and cholesterol in synovial fluid of patients with rheumatoid arthritis, psoriatic arthritis and osteoarthritis. Clin Exp Rheumatol. 2009;27(1):79–83.
Lu M, Lu Q, Zhang Y, Tian G. ApoB/apoA1 is an effective predictor of coronary heart disease risk in overweight and obesity. J Biomed Res. 2011;25(4):266–73.
Ruan X, Li Z, Zhang Y, Yang L, Pan Y, Wang Z, et al. Apolipoprotein A-I possesses an anti-obesity effect associated with increase of energy expenditure and up-regulation of UCP1 in brown fat. J Cell Mol Med. 2011;15(4):763–72.
de Seny D, Cobraiville G, Charlier E, Neuville S, Lutteri L, Le Goff C, et al. Apolipoprotein-A1 as a damage-associated molecular patterns protein in osteoarthritis: ex vivo and in vitro pro-inflammatory properties. PLoS One. 2015;10(4):e0122904.
Yanagisawa A, Ueda M, Sueyoshi T, Nakamura E, Tasaki M, Suenaga G, et al. Knee osteoarthritis associated with different kinds of amyloid deposits and the impact of aging on type of amyloid. Amyloid. 2016;23(1):26–32.
Ghosh P, Cheras PA. Vascular mechanisms in osteoarthritis. Best Pract Res Clin Rheumatol. 2001;15(5):693–709.
So AK, Varisco PA, Kemkes-Matthes B, Herkenne-Morard C, Chobaz-Peclat V, Gerster JC, et al. Arthritis is linked to local and systemic activation of coagulation and fibrinolysis pathways. J Thromb Haemost. 2003;1(12):2510–5.
Kaye SM, Pietilainen KH, Kotronen A, Joutsi-Korhonen L, Kaprio J, Yki-Jarvinen H, et al. Obesity-related derangements of coagulation and fibrinolysis: a study of obesity-discordant monozygotic twin pairs. Obesity. 2012;20(1):88–94.
Blokhin IO, Lentz SR. Mechanisms of thrombosis in obesity. Curr Opin Hematol. 2013;20(5):437–44.
Samad F, Ruf W. Inflammation, obesity, and thrombosis. Blood. 2013;122(20):3415–22.
Vilahur G, Ben-Aicha S, Badimon L. New insights into the role of adipose tissue in thrombosis. Cardiovasc Res. 2017;113(9):1046–54.
Boyan BD, Hart DA, Enoka RM, Nicolella DP, Resnick E, Berkley KJ, et al. Hormonal modulation of connective tissue homeostasis and sex differences in risk for osteoarthritis of the knee. Biol Sex Differ. 2013;4(1):3.
Boyan BD, Tosi LL, Coutts RD, Enoka RM, Hart DA, Nicolella DP, et al. Addressing the gaps: sex differences in osteoarthritis of the knee. Biol Sex Differ. 2013;4(1):4.
Pan Q, O'Connor MI, Coutts RD, Hyzy SL, Olivares-Navarrete R, Schwartz Z, et al. Characterization of osteoarthritic human knees indicates potential sex differences. Biol Sex Differ. 2016;7:27.

Acknowledgements

The authors would like to thank the OAI participants and OAI Coordinating Center for their work in generating the clinical data of the OAI cohorts and for making them publicly available. Mass spectrometry experiments were performed by the proteomics platform of the CHU de Québec Research Center, Québec, Canada. We also thank Santa Fiori for her assistance with the manuscript preparation.

Funding

This study was supported in part by grants from the Chair in Osteoarthritis of the University of Montreal, the Osteoarthritis Research Unit of the University of Montreal Hospital Research Centre, and The Arthritis Society. No funding bodies had any role in the study design; collection, analysis, and interpretation of data; writing of the manuscript; and decision to publish the manuscript.

Author information

Authors and Affiliations

Osteoarthritis Research Unit, University of Montreal Hospital Research Centre (CRCHUM), 900 Saint-Denis, Suite R11.412B, Montreal, QC, H2X 0A9, Canada
Ginette Tardif, Frédéric Paré, Hassan Fahmi, Jean-Pierre Pelletier & Johanne Martel-Pelletier
CHU de Québec Research Center, Laval University, Quebec, QC, G1V 4G2, Canada
Clarisse Gotti, Florence Roux-Dalvai & Arnaud Droit
Division of Biomedical Sciences (Genetics), Memorial University of Newfoundland, St. John’s, NL, A1B 3V6, Canada
Guangju Zhai
Discipline of Medicine, Memorial University of Newfoundland, St. John’s, NL, A1B 3V6, Canada
Guang Sun

Contributions

JMP and JPP designed the study. AD, CG, FP, FRD, GZ, GS, GT, HF, and JMP acquired and analyzed the data. CG, FP, FRD, GT, JMP, and JPP interpreted the data. GT, FRD, and JMP were involved in drafting the first version of the article. All authors were involved in revising it critically for intellectual content. All authors approved the final version.

Corresponding author

Correspondence to Johanne Martel-Pelletier.

Ethics declarations

Ethics approval and consent to participate

The OAI is a public-private partnership comprised of five contracts (N01-AR-2-2258; N01-AR-2-2259; N01-AR-2-2260; N01-AR-2-2261; N01-AR-2-2262) funded by the National Institutes of Health, a branch of the Department of Health and Human Services, in four clinical sites (University of Maryland School of Medicine and Johns Hopkins University, Baltimore, MD; Ohio State University, Columbus, OH; University of Pittsburgh, PA; Memorial Hospital of Rhode Island, Pawtucket, RI) and conducted by the OAI study investigators. Private funding partners include Merck Research Laboratories, Novartis Pharmaceuticals Corporation, GlaxoSmithKline, and Pfizer Inc. Private sector funding for the OAI is managed by the Foundation for the National Institutes of Health, USA. All OAI participants provided written informed consent for participation in the OAI. Ethics approval was obtained by each OAI clinical site (University of Maryland Baltimore Institutional Review Board, Ohio State University’s Biomedical Sciences Institutional Review Board, University of Pittsburgh Institutional Review Board, and Memorial Hospital of Rhode Island Institutional Review Board) and the OAI coordinating center (Committee on Human Research at the University of California, San Francisco, CA, USA [#10-00532]). For the NFOAS cohort, the ethics approval was from the Health Research Ethics Board of Newfoundland and Labrador [HREB #2011.311]. The Institutional Ethics Committee Board of the University of Montreal Hospital Research Centre [#BD04.001] approved the use of the human serum.

Consent for publication

All authors had full access to the data and take responsibility for the integrity of the data and the accuracy of the data analysis.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Table S1. Mass Spectrometry Protein Identification. Table S2. Pairwise differential expression analysis.

Additional file 2: Figure S1. Protein validation in plasma.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

About this article

Cite this article

Tardif, G., Paré, F., Gotti, C. et al. Mass spectrometry-based proteomics identify novel serum osteoarthritis biomarkers. Arthritis Res Ther 24, 120 (2022). https://doi.org/10.1186/s13075-022-02801-1

Received: 20 September 2021
Accepted: 08 May 2022
Published: 23 May 2022
DOI: https://doi.org/10.1186/s13075-022-02801-1
