- Research article
- Open Access
The spectrum of association in HLA region with rheumatoid arthritis in a diverse Asian population: evidence from the MyEIRA case-control study
Arthritis Research & Therapy volume 23, Article number: 46 (2021)
Fine-mapping of human leukocyte antigen (HLA) region for rheumatoid arthritis (RA) risk factors has identified several HLA alleles and its corresponding amino acid residues as independent signals (i.e., HLA-A, HLA-B, HLA-DPB1, and HLA-DQA1 genes), in addition to the well-established genetic factor in HLA-DRB1 gene. However, this was mainly performed in the Caucasian and East Asian populations, and data from different Asian regions is less represented. We aimed to evaluate whether there are independent RA risk variants in both anti-citrullinated protein antibody (ACPA)-positive and ACPA-negative RA patients from the multi-ethnic Malaysian population, using the fine-mapping of HLA region strategy.
We imputed the classical HLA alleles, amino acids, and haplotypes using the Immunochip genotyping data of 1260 RA cases (i.e., 530 Malays, 259 Chinese, 412 Indians, and 59 mixed ethnicities) and 1571 controls (i.e., 981 Malays, 205 Chinese, 297 Indians, and 87 mixed ethnicities) from the Malaysian Epidemiological Investigation of Rheumatoid Arthritis (MyEIRA) population-based case-control study. Stepwise logistic regression was performed to identify the independent genetic risk factors for RA within the HLA region.
We confirmed that the HLA-DRB1 amino acid at position 11 with valine residue conferred the strongest risk effect for ACPA-positive RA (OR = 4.26, 95% CI = 3.30–5.49, PGWAS = 7.22 × 10−29) in the Malays. Our study also revealed that HLA-DRB1 amino acid at position 96 with histidine residue was negatively associated with the risk of developing ACPA-positive RA in the Indians (OR = 0.48, 95% CI = 0.37–0.62, PGWAS = 2.58 × 10−08). Interestingly, we observed that HLA-DQB1*03:02 allele was inversely related to the risk of developing ACPA-positive RA in the Malays (OR = 0.17, 95% CI = 0.09–0.30, PGWAS = 1.60 × 10−09). No association was observed between the HLA variants and risk of developing ACPA-negative RA in any of the three major ethnic groups in Malaysia.
Our results demonstrate that the RA-associated genetic factors in the multi-ethnic Malaysian population are similar to those in the Caucasian population, despite significant differences in the genetic architecture of HLA region across populations. A novel and distinct independent association between the HLA-DQB1*03:02 allele and ACPA-positive RA was observed in the Malays. In common with the Caucasian population, there is little risk from HLA region for ACPA-negative RA.
Extensive genetic studies during the last 40 years have demonstrated substantial contribution from the human leukocyte antigen DR beta chain 1 shared epitope (HLA-DRB1 SE) alleles in RA pathogenesis, specifically for the subtype of RA that is positive for anti-citrullinated protein antibody (ACPA) [1,2,3,4,5,6,7,8,9,10,11]. However, there are differences in the allelic frequency of certain HLA-DRB1 SE alleles across different populations. For instance, HLA-DRB1*04:01 and HLA-DRB1*04:04 alleles are common in RA patients with Caucasian ancestry, while HLA-DRB1*04:05 allele is common in the Asian population [3, 6, 7, 9, 12,13,14,15]. It is also evident that the overall frequency of HLA-DRB1 SE alleles in Asian populations is lower than in Caucasian populations, despite the similar RA prevalence between these populations. The reported population-specific risk allele HLA-DRB1*09:01 in the Japanese and Korean populations suggests genetic factors other than HLA-DRB1 SE are associated with risk of RA development [3, 16].
For the past decade, the cost-effective computational approach to infer HLA alleles using single nucleotide polymorphisms (SNP) genotypes within the HLA region has become a preferable method to study the HLA region in large-scale genetic association studies. This approach has also enabled the integration of functional data from large genomic data to understand the pathogenesis of RA [17, 18]. For instance, the polymorphic amino acid residues at position 11 (e.g., valine, HLA-DRB1 Val11 or leucine, HLA-DRB1 Leu11) within HLA-DRB1 protein explained most of the genetic risk of developing ACPA-positive RA [17, 19,20,21], instead of the amino acid residues previously defined at positions 71 and 74, the conserved amino acid region of the SE alleles . In addition, independent association signals between single amino acid position within other HLA proteins and risk of developing ACPA-positive RA demonstrated the importance of the HLA region in the pathogenesis of RA. For example, HLA-A amino acid residue at position 77 with asparagine residue (i.e., HLA-A Asn77), HLA-B amino acid residue at position 9 with aspartic acid (i.e., HLA-B Asp9), and HLA-DPB1 amino acid residue at position 9 with phenylalanine residue (i.e., HLA-DPB1 Phe9) are associated with risk of developing ACPA-positive RA [17, 21]. Furthermore, these predisposing HLA amino acid variants are located within the HLA molecules’ peptide-binding grooves, suggesting the role of antigen binding and involvement in antigen presentation in the adaptive immune response. A recent study of a Han Chinese population reported that the aspartic acid at position 160 within the HLA-DQA1 protein (i.e., HLA-DQA1 Asp160) was associated with an increased risk of developing ACPA-positive RA, instead of the well-described HLA-DRB1 alleles . Comparative modeling analysis showed that the additional negative charge of HLA-DQA1 Asp160 enhances the interaction between the dimers of the major histocompatibility complex (MHC) class II molecules, which may lead to an increase in T cell activation .
These findings were mainly reported in the Caucasian, African, and East Asian populations, and there is very limited information about RA-associated polymorphic HLA amino acid residues in Southeast Asian populations. There is a need to expand the field of study to multiple genetically dissimilar populations to investigate the implication of HLA risk factors in the disease pathogenesis.
Thus, we fine-mapped the HLA region in the ACPA-positive and ACPA-negative RA subsets from the multi-ethnic Malaysian population . We further investigated the association between different HLA-DRB1 amino acid variants and compared the HLA-DRB1 amino acid haplotypes in different ethnic groups for risk of developing different subsets of RA.
Study design and study population
This study utilized data from the Malaysian Epidemiological Investigation of Rheumatoid Arthritis (MyEIRA), a large population-based case-control study of RA conducted in the multi-ethnic Malaysian population. The study design of MyEIRA has been described elsewhere [9, 24]. Briefly, this study analyzed data from 1260 patients with early RA (i.e., 530 Malays, 259 Chinese, 412 Indians, and 59 mixed ethnicities), and 1571 matched controls (i.e., 981 Malays, 206 Chinese, 297 Indians, and 87 with mixed ethnicities).
The RA cases were identified from nine rheumatology clinics throughout Peninsular Malaysia between 2005 and 2009. All RA cases were diagnosed according to the 1987 revised American College of Rheumatology (ACR) classification of rheumatoid arthritis criteria by rheumatologists. For each RA case, a control was randomly selected from the general population, matched for age, sex, and residential area. All study subjects were unrelated and ethnicity background was self-reported, based on questions about ancestry.
Anti-citrullinated protein antibody measurement
The presence of ACPA in all individuals was assessed using anti-cyclic citrullinated peptide second-generation (anti-CCP2) ELISA kits (Immunoscan RA, Malmö, Sweden). Samples with results > 25 AU/mL were defined as ACPA-positive .
The experimental classical HLA genotyping for HLA-A, HLA-B, HLA-C, HLA-DRB1, and HLA-DQB1 genes was performed previously and described elsewhere [9, 25]. In brief, the HLA genotyping was performed for all DNA samples using the polymerase chain reaction and sequence-specific oligonucleotide probe hybridization (PCR-SSO) method (LABType® HLA test kits, One Lambda Inc., CA, USA) on the Luminex Multi-Analyte Profiling System (xMAP, Luminex Corporation, TX, USA). The HLA typing assignment was accomplished using the HLA Fusion software (version 1.3.0) provided by the manufacturer (One Lambda Inc., CA, USA).
Dense SNP genotyping and quality controls
All individuals were genotyped using the Illumina iSelect HD custom genotyping array designed by the Immunochip Consortium (Immunochip, Illumina, Inc., San Diego, CA, USA). The Immunochip array was custom-designed with a dense coverage of HLA region to perform deep replication of major autoimmune and inflammatory diseases, including RA [23, 26]. The genotyping quality control (QC) was performed using PLINK v1.07 software . The SNPs with call rate less than 99%, minor allele frequency (MAF) less than 0.01, and with significant departure from Hardy-Weinberg equilibrium (HWE) (p< 0.001), in both the RA cases and control groups, were excluded. Individuals with missing genotyping rate higher than 10% were also excluded. Then, a total of 25 individuals from the RA group (i.e., redundant RA and non-RA) were removed, followed by removal of a further 11 individuals (i.e., 6 RA cases and 5 matched normal controls) with missing genotyping rate > 10%, from the subsequent data analysis. A total of 113,576 SNPs in 2795 individuals (i.e., 1229 RA cases and 1566 controls) remained after QC. The individuals with mixed ethnicity parentage background were excluded from further analysis. Thus, the association testing was restricted to study subjects whose parents both came from the same ethnic group, giving a total of 1170 RA and 1479 controls for analysis after QC. The baseline demographic characteristics of the RA cases and controls are shown in Table 1.
Imputation of classical HLA alleles and polymorphic amino acids residues
A total of 6152 SNPs between positions 29 and 34 Mb in the HLA region on chromosome 6 (GRCh37) were extracted from the post-QC Immunochip dataset. Using the extracted SNP genotypes from the HLA region, we imputed the HLA variants (i.e., classical 2-digit and 4-digit HLA alleles, and polymorphic amino acid residues of the HLA genes), along with the SNPs from the Pan-Asian reference panel [18, 19]. The Pan-Asian reference panel comprised 530 unrelated individuals of Asian descent: i.e., Han Chinese (n = 247, 46.6%), Malays (n = 120, 22.6%), Tamil Indians (n = 119, 22.4%), and Japanese (n = 44, 8.3%). The reference panel included a total of 6173 SNPs associated to 94 classical 2-digit HLA alleles, 179 classical 4-digit HLA alleles, and 1799 polymorphic amino acid positions [19, 28]. All the RA cases and controls were imputed together using the SNP2HLA software .
HLA allele imputation accuracy assessment
We assessed the imputation accuracy for each imputed classical HLA allele in HLA-A, HLA-B, HLA-C, HLA-DRB1, and HLA-DQB1 genes using experimental and imputed classical HLA genotype datasets from the normal controls with Malays, Chinese, and Indians. In brief, concordance rate was defined as the count of matched imputed classical HLA allele to the experimental classical HLA allele at the individual level, divided by the total count of observed experimental classical HLA allele within the studied population. Imputation accuracy assessment only considers individuals with available data for both experimental and imputed HLA genotypes. The HLA alleles’ distributions and their allelic frequencies vary in different populations/ethnic groups, so we further assessed the imputation accuracy in the three ethnic groups, i.e., Malays, Chinese, and Indians. Imputation accuracy with concordance rate above 90% was considered as high imputation accuracy threshold in this study.
Association analysis of HLA alleles and amino acid polymorphisms
Referring to the data analysis described elsewhere, the logistic regression model was applied to test for the association between the imputed HLA variants and risk of developing different subsets of RA, separately in the Malay, Chinese, and Indian ethnic groups, with adjustment for age and sex [17, 19, 29]. The imputed HLA variants were defined by including the biallelic SNPs, classical 2-digit HLA alleles, classical 4-digit HLA alleles, and polymorphic HLA amino acid residues [17, 19, 29]. The analyses were conducted in PLINK v1.07 software . The significance threshold of p value (PGWAS) was less than 5 × 10−8 in this study.
We implemented a stepwise logistic regression conditioned by the most associated variants, to search for the independent effects across the HLA region. All variables (i.e., imputed HLA variants) were systematically removed/added to obtain the best fit model based on the PGWAS threshold. The Akaike information criterion (ΔAIC) and the improvement in the Bayesian information criterions (ΔBIC) were also considered to assess the best fit model. A modified version of a public Python 3.0 script (http://trevor-smith.github.io/stepwise-post/), which uses the Statsmodels module , was used in this analysis.
HLA amino acid haplotype analysis
A group of RA-related classical HLA-DRB1 alleles encoding a conserved amino acid sequence (70QRRAA74 or 70KRRAA74 or 70RRRAA74) at positions 70 to 74 in the third hypervariable region of the first domain of DRB1 was defined as shared epitope (SE) . The HLA-DRB1 SE alleles are the most established genetic risk factors for RA [2, 9, 31]. Nevertheless, the recent studies demonstrated that polymorphic HLA-DRB1 amino acid residues at positions 11 and 13 were the top association signals for risk of RA, instead of positions 70–74 [17, 19]. Hence, we aimed to replicate the investigation of HLA-DRB1 amino acid haplotypes and risk for ACPA-positive RA in the Caucasian and East Asian populations [17, 19], for the Malays, Chinese, and Indians.
We constructed the haplotypes manually based on the RA risk HLA-DRB1 haplotype model (i.e., defined by the polymorphic amino acid residues at positions 11, 13, 71, and 74), by filtering the subsets of HLA-DRB1 11-13-71-74 haplotypes in PLINK v1.07 software. First, we assessed the association between these HLA-DRB1 amino acid haplotypes and risk for ACPA-positive RA in all three ethnic groups. Then, we observed the risk effect (expressed as odds ratio, OR) between the published data and findings from the Malay, Chinese, and Indian ethnic groups.
Meta-analysis and comparative analysis with published data
To test the generalizability of the polymorphic amino acid residues at position 11 within the HLA-DRB1 protein and risk of developing ACPA-positive RA in the Malay, Chinese, and Indian ethnic groups, we performed a meta-analysis using the Mantel-Haenszel method, with the random-effect model by means of cumulative OR with 95% confidence interval (95% CI). The heterogeneity between the studied ethnic groups was assessed using the Cochran Q-statistic (P < 0.10 considered significant). In addition, the I2 metric [I2 = (Q − df)/Q] was used to describe the percentage of variation across the different ethnic groups due to heterogeneity. I2 values of 25%, 50%, and 75% were considered as low, moderate, and high estimates, respectively. All analyses were performed in the PLINK v1.07 and Review Manager v5.3 (Copenhagen, The Nordic Cochrane Centre, The Cochrane Collaboration, 2014) software.
We compared the findings from this study with the published RA-associated genetic variants within the HLA region from different populations/ethnic groups (i.e., Caucasian, East Asian, African, and Han Chinese) to investigate the spectrum of association in the HLA region with risk of developing RA [17, 19,20,21,22]. Here, we restricted the selection of published RA-associated HLA variants to those computationally imputed from dense SNP genotypes within the HLA region.
Imputed HLA variants and imputation accuracy assessment
We imputed a total of 3239 markers comprising 90 classical 2-digit HLA alleles, 175 classical 4-digit HLA alleles, 1799 specific HLA amino acid positions, and 1175 SNPs from the Pan-Asian reference panel. Our data demonstrated the overall concordance rate of the classical 2-digit HLA alleles satisfied the suggested concordance rate threshold of 90% for all five HLA genes, while decreased overall rates (ranged between 71.5 and 85.7%) were observed at 4-digit resolution (supplementary Table 1). Notably, the decreased overall concordance rates were attributed to the increased polymorphisms detected in these HLA genes. We further observed the concordance rates varied among the imputed classical HLA alleles at 2-digit and 4-digit resolutions for all HLA genes, where the variations were influenced by the distribution of the common/rare HLA alleles and its allelic frequency varies across different ethnic groups (supplementary Tables 2 and 3).
HLA-DRB1 variants associated with risk of developing RA
The logistic regression analysis demonstrated that the genome-wide significant threshold (PGWAS) for association analysis was satisfied by 15 classical HLA class II alleles, which were located in HLA-DRB1 (n = 6), HLA-DQA1 (n = 4), and HLA-DQB1 (n = 5) genes; 74 amino acid polymorphisms (47.3% in the HLA-DRB1 protein, 28.4% in the HLA-DQB1 protein, 20.3% in the HLA-DQA1 protein, and only 4.1% in the HLA Class I protein); and 128 SNP variants (supplementary Table 4). However, the distribution of these identified significant risk variants varied across the Malay, Chinese, and Indian ethnic groups, with the majority observed in the Malays.
In the Malay ethnic group, the most significant association across all variants tested was observed at HLA-DRB1 Val11 (OR = 4.26, 95% CI = 3.30–5.49, PGWAS = 7.22 × 10−29), followed by HLA-DRB1 amino acid at position 120 with asparagine residue (i.e., HLA-DRB1 Asn120) (OR = 4.23, 95% CI = 3.28–5.45, PGWAS = 8.59 × 10−29), which is in tight linkage disequilibrium (LD, D’ = 1.00) with HLA-DRB1 Val11 (Table 2 and supplementary Table 5). Further peptide alignment analysis of the HLA-DRB1 protein revealed that the HLA-DRB1 Val11 and HLA-DRB1 Asn120 are exclusive characteristics for all the HLA-DRB1*04 and HLA-DRB1*10 alleles, indicating HLA-DRB1 Asn120 is not an independent risk factor for developing ACPA-positive RA among the Malay patients (online HLA alignment database https://www.ebi.ac.uk/cgi-bin/imgt/hla/align.cgi). Our observations implied that we have convincingly replicated the previous published data showing the association of the HLA-DRB1*04:05 allele (i.e., corresponding to HLA-DRB1 Val11 and HLA-DRB1 Asn120) with increased risk of ACPA-positive RA in the Malay ethnic group .
Our findings in the Chinese ethnic group showed that HLA-DRB1*04:05 allele was the top association signal for risk of ACPA-positive RA (OR = 5.22, 95% CI = 2.95–9.25, PGWAS = 1.52 × 10−08) (Table 2), in agreement with the previously published data using the experimental classical HLA genotype dataset . Furthermore, we observed the association between HLA-DRB1 Asn120 and risk of ACPA-positive RA in the Chinese ethnic group; however, the signal was below the suggested PGWAS threshold (OR = 3.05, 95% CI = 2.03–4.58, PGWAS = 8.43 × 10−08). Moreover, HLA-DRB1*04:05 allele is one of the corresponding alleles to HLA-DRB1 Val11 and HLA-DRB1 Asn120, which are in tight LD (D’ = 1.00), based on the online database of HLA peptide sequence (supplementary Table 5, online HLA alignment database https://www.ebi.ac.uk/cgi-bin/ipd/imgt/hla/align.cgi). In view of the evidence for HLA-DRB1 Val11 as a common risk factor for ACPA-positive RA across different populations [17, 19, 20, 22], we tested the association between this variant and risk of ACPA-positive RA in the Chinese ethnic group. Our finding confirmed the increased risk of ACPA-positive RA in the Chinese ethnic group (OR = 2.87, 95% CI = 1.91–4.30, PGWAS = 3.63 × 10−7), although this did not reach genome-wide significance (supplementary Table 5).
Further stratification analysis by ethnicity revealed the strongest association signal at amino acid position 96 within HLA-DRB1 peptide with histidine residue (i.e., HLA-DRB1 His96) among Indian patients with ACPA-positive RA (OR = 0.48, 95% CI = 0.37–0.62, PGWAS = 2.58 × 10−08) (Table 2). The HLA-DRB1 peptide alignment showed that HLA-DRB1 His96 corresponded to specific alleles from HLA-DRB1*03/*07/*08/*09/*11/*12/*13/*14 allele groups (online HLA alignment database https://www.ebi.ac.uk/cgi-bin/ipd/imgt/hla/align.cgi). Of these HLA-DRB1 alleles, the HLA-DRB1*13 allele group was inversely associated with the risk of ACPA-positive RA in the Caucasian, Japanese, and Indian Tamil populations in previous studies [32,33,34]. Although the commonly shared variant of HLA-DRB1 Val11 also increased the risk of ACPA-positive RA among the Indian patients, this association did not reach the genome-wide significant threshold (OR = 1.99, 95% CI = 1.52–2.61, PGWAS = 6.60 × 10−07) (supplementary Table 5).
We did not observe any significant association between the imputed HLA variants and risk of developing ACPA-negative RA in any of the three major ethnic groups (data not shown).
Risk factor independent from HLA-DRB1 in the ACPA-positive RA subset
To look for independent effects across the HLA region, we conducted a stepwise logistic regression. Conditioning by the most associated risk variant, i.e., HLA-DRB1 Val11, with ACPA-positive RA in the Malay ethnic group revealed an inverse association of HLA-DQB1*03:02 allele with risk of developing ACPA-positive RA (OR = 0.17, 95% CI = 0.09–0.30, PGWAS = 1.60 × 10−09) (Fig. 1). No further independent risk variants were detected within the HLA region for ACPA-positive RA in the Malay ethnic group. This finding was confirmed by using the experimental classical HLA genotype dataset that demonstrated an inverse association between HLA-DQB1*03:02 allele and risk of ACPA-positive RA (OR = 0.27, 95% CI = 0.17–0.45, p = 2.30 × 10−07) (supplementary Table 6).
“The stepwise logistic regression demonstrated that the top association signals for the Chinese and Indian ethnic groups were HLA-DRB1*04:05 allele (OR = 3.99, 95% CI = 2.22-7.18, p = 3.90 × 10−06) and HLA-DRB1 His96 (OR = 0.40, 95% CI = 0.30–0.52, p = 5.25 × 10−08), respectively. However, these two variants did not satisfy the genome-wide significant threshold (supplementary figure 1). Conditioning on these top association signals showed that no further independent HLA risk variants were detected for ACPA-positive RA in the Chinese and Indian ethnic groups.”
Comparative analysis for the independent effects of HLA amino acid variants and risk of ACPA-positive RA across different populations
We compared the published independent RA-associated polymorphic HLA amino acid positions across different populations/ethnic groups and the data is presented in Table 3. The HLA-DRB1 Val11 was the most common HLA amino acid variant significantly associated (PGWAS< 5 × 10−08) with increased risk of ACPA-positive RA in all the studied populations included in this study (Table 3), and this association was validated in our study among the Malay ethnic group. Within the same HLA protein, amino acid position 13 with histidine residue was associated with increased risk for ACPA-positive RA in the East Asian and African populations; it was, however, in tight LD with HLA-DRB1 Val11. Furthermore, different amino acid positions, i.e., positions 37, 57, and 74, were reported as RA-associated genetic variants in the ACPA-positive RA from the East Asian population (Table 3). We did not observe the RA-associated polymorphic HLA-DRB1 amino acid at positions 13, 37, 57, and 74 associated with the risk of developing ACPA-positive RA in our study population. However, the independent effect of HLA-DRB1 His96 associated with decreased risk for ACPA-positive RA in the Malaysian Indian patients was not reported in any of the published data from these studied populations.
This observation supported the genetic association of the HLA region to RA and that this is commonly attributed to HLA-DRB1 genes. Furthermore, the observed risk effects of the different amino acids from the same HLA-DRB1 protein suggested that while some may promote the pathogenic process in RA, others may counteract the process.
We further observed that the polymorphic HLA amino acid positions independent of HLA-DRB1 gene were associated with the risk of developing ACPA-positive RA in a population-specific manner. For instance, HLA-A Asn77, HLA-B Asp9, and HLA-DPB1 Phe9 were reported as RA-associated genetic variants in the Caucasian populations, while HLA-DRB1 His13 was RA-associated in the East Asian and African populations. More recently, the HLA-DQA1 Asp160 was reported in Han Chinese to be associated with an increased risk of ACPA-positive RA. However, we did not observe any association for these amino acid variants with ACPA-positive RA in our study population with Malay, Chinese, or Indian origins.
To summarize, these predisposing HLA-specific amino acid positions may exhibit shared-genetic component or population-specific risk signals, suggesting the existence of ethnogenetic heterogeneity in the RA population.
HLA-DRB1 amino acid haplotypes as risk factors for ACPA-positive RA
Of the 16 possible HLA-DRB1 amino acid haplotypes at positions 11, 13, 71, and 74 , we observed only 10, 12, and 12 haplotypes, respectively, in Malay, Chinese, and Indian ethnic groups to be associated with ACPA-positive RA (Table 4). Our findings revealed that the Val11-His13-Arg71-Ala74 haplotype was strongly associated with risk of ACPA-positive RA in the Malay (OR = 5.28, 95% CI = 3.06–9.09, p = 1.22 × 10−09), Chinese (OR = 10.33, 95% CI = 4.39–24.31, p = 9.81 × 10−09), and Indian (OR = 3.84, 95% CI = 3.75–4.75, p = 0.03) populations (Table 4). Meanwhile, we observed the Val11-Phe13-Arg71-Ala74 haplotype was associated with increased risk of ACPA-positive RA in the Malays (OR = 4.35, 95% CI = 2.27–8.32, p = 9.78 × 10−06) and Indians (OR = 2.00, 95% CI = 1.10–3.68, p = 0.03), but not in the Chinese. Interestingly, while Ser11-Ser13-Glx71-Ala74 conferred significant risk for ACPA-positive RA among the Chinese (OR = 12.91, 95% CI = 2.55–65.34, p = 6.98 × 10−04), it demonstrated an inverse association to ACPA-positive RA in the Indian population (OR = 0.40, 95% CI = 0.18–0.86, p = 0.03).
Comparing our findings from the multi-ethnic Malaysian population with the published data from other populations with European and East Asian origins, the Val11-His13-Arg71-Ala74 was the most significant and commonly shared risk factor among the European and Asian populations (Table 4). The decreased risk of ACPA-positive RA associated with the Ser11-Ser13-Glx71-Ala74 haplotype observed in the European and East Asian populations was consistently replicated in the Malaysian Indian ethnic group. In contrast, this haplotype conferred risk for ACPA-positive RA in the Malaysian Chinese ethnic group. Notably, this haplotype is encoded by HLA-DRB1*13:01 and HLA-DRB1*13:03 alleles. It has been previously reported that the HLA-DRB1*13 allele has a dual role: as genetic modulator of ACPA positivity, whereby it was inversely associated with risk of ACPA-positive RA; but also, in combination with HLA-DRB1*03, it decreased the risk of ACPA-negative RA . Our observation in the Malaysian Chinese ethnic group was not in line with the inverse association to ACPA-positive reported in the Caucasian and East Asian populations, suggesting different immune reactions may occur in RA with different ethnicity/population backgrounds.
Amino acid polymorphisms at position 11 within HLA-DRB1 protein and risk of RA
We investigated the frequency of the polymorphic amino acid residues (i.e., valine, serine, proline, leucine, glycine, and aspartic acid) at position 11 in the HLA-DRB1 protein of the Malaysian population with Malay, Chinese, and Indian origins and further compared these frequencies with the published data from Caucasian and East Asian populations . Our data demonstrated that while the frequency of valine residue was higher in RA cases as compared to the normal control group in all the populations, the frequency of serine residue was lower in the RA cases in comparison with the normal controls (Fig. 2). Interestingly, leucine residue, which encodes the classical HLA-DRB1*01 alleles, was commonly found among the individuals of European ancestry (> 10% in both RA cases and control group), but was found in less than 5% of the Malay and Indian ethnic groups, and was absent in the Chinese RA cases and controls. It is noteworthy that the aspartic acid residue was commonly found in the Chinese individuals. This amino acid residue corresponds to the classical HLA-DRB1*09, an allele which was reported as a risk factor for RA development, independent of the HLA-DRB1 SE alleles . The frequencies of proline and glycine residues, which encode the classical HLA-DRB1*15 and HLA-DRB1*07 alleles respectively, were comparable between RA cases and control group for all populations (Fig. 2).
Next, we performed meta-analyses to investigate the generalizability of the effect of the polymorphic HLA-DRB1 amino acid residues at position 11 on the risk for ACPA-positive RA in the Malay, Chinese, and Indian ethnic groups. Our finding demonstrated significant cumulative OR of the HLA-DRB1 Val11 for risk of ACPA-positive RA (ORcumulative 2.86 2.90, p < 0.0001); however, we observed high heterogeneity within studies (I2 =91%) (supplementary Figure 2a). Interestingly, we observed a decreased risk of developing ACPA-positive RA associated with serine and glycine residues (i.e., serine, ORcumulative 0.49, p < 0.00001, I2 =0%; glycine, ORcumulative 0.68, p < 0.002, I2= 0%) (Supplementary Figure. 2b and c).
Our study confirmed that the HLA-DRB1 genes with their functional characteristics are the major determinants in the pathogenesis of RA, specifically in the ACPA-positive RA subset in the multi-ethnic Malaysian population, supporting the notion of shared RA risk across different populations. We found HLA-DRB1 Val11 conferred the strongest risk effect in the ACPA-positive RA in the Malay population, one of the predominant ethnic groups in Southeast Asia. Additionally, HLA-DQB1*03:02 demonstrated a novel and independent protective effect for ACPA-positive RA in the Malay group. Interestingly, Indian RA patients carrying HLA-DRB1 His96 are protected from risk of developing ACPA-positive RA.
The observed RA risk of HLA-DRB1 Val11 in our study population is generally concordant with the published data from different large-scale genetic association studies of Caucasian, African, and East Asian populations, in terms of the amino acid position as well as magnitude of risk. It is notable that HLA-DRB1 Val11 is located within the peptide-binding groove of the HLA Class II molecules. This suggests the pathogenic role of the identified amino acid at position 11 of the HLA-DRB1 protein (i.e., HLA-DRB1 position 11), which enables peptide binding and further recognition of MHC-peptide complexes by T cells involved in providing help to B cells expressing and producing ACPA IgG. Future study of this replicated and validated risk variant, i.e., HLA-DRB1 position 11, is needed to generate new insights and better understanding of the implication of the risk variant for the pathophysiology of ACPA-positive RA.
The valine or leucine residue at position 11 within the HLA-DRB1 protein (i.e., HLA-DRB1 Val11 and HLA-DRB1 Leu11) is associated, predominately in the Caucasian and Spanish populations, with increased risk of severe radiographic progression in ACPA-positive RA, independent of HLA-DRB1 SE status . The present extension of this observation to other populations, including the Malaysian population, may lead to better understanding of the pathogenic role of HLA-DRB1 Val11 and/or HLA-DRB1 Leu11 and their effect on the clinical phenotype of the disease. In this current study, the clinical data from the recruited RA cases were limited. Future studies of the implications of the identified RA risk factor on the disease progression will provide new insights/knowledge that may aid in the characterization of the RA phenotype in the clinical setting.
HLA-DRB1 His13 was reported to have the strongest association with risk of ACPA-positive RA in a mixed East Asian population comprising South Korean and Han Chinese in Beijing , while an earlier study in a homogenous Korean population demonstrated HLA-DRB1 Val11 was strongly associated with risk for ACPA-positive RA . However, both HLA-DRB1 Val11 and HLA-DRB1 His13 are in tight LD. The HLA-DRB1 His13 observed in the mixed population study could be due to the influence of the different genetic profile of Han Chinese individuals (16.8% in RA cases and 20.2% in controls). Although the South Korean and Chinese populations have common ancestry, the genetic profiles of these populations are distinctive .
Interestingly, our findings demonstrated HLA-DQB1*03:02 allele as a novel potentially protective factor regarding risk of developing ACPA-positive RA in the Malay ethnic group. Of a different note, the HLA-DQB1*03:02 allele was reported to associate with increased risk of developing celiac disease in the Iranian population . Taken together, it is suggested that HLA-DQB1*03:02 allele may have opposing effects, being a protective allele in one disease and a risk factor in another disease.
Recently, aspartic acid residue at position 160 within the HLA-DQA1 protein was reported to be the most significant risk factor for ACPA-positive RA in the Han Chinese population of Beijing, with HLA-DRB1 Val11 as the second strongest risk factor . This pattern was however not observed in the Malaysian Chinese ethnic group in our study. The most plausible explanation is the genetic differences between the Beijing Han Chinese and the Malaysian Chinese. The Malaysian Chinese are mainly descendants of nineteenth and early twentieth century Han Chinese immigrants from Southern China (particularly the provinces of Fujian, Guandong, and Hainan) . Furthermore, genetic population studies have shown that the Southern and Northern Han Chinese are two distinctive populations [39, 41].
High imputation accuracy observed in our studied dataset suggested the suitability of the Caucasian-based Immunochip microarray  and usefulness of the admixture Pan-Asian reference panel  for HLA imputation in the multi-ethnic Malaysian population. Based on these local evidences, utilizing the Immunochip microarray and admixture Pan-Asian reference panel for fine-mapping of HLA variants in other autoimmune diseases such as systemic lupus erythematosus, multiple sclerosis, and ankylosing spondylitis can be recommended.
Our new findings in Southeast Asian populations are in concordance with the data from other populations, suggesting HLA-DRB1 Val11 valine as the most important genetic component for the risk of ACPA-positive RA. Notably, our data also showed a novel protective allele in the HLA-DQB1 gene (i.e., HLA-DQB1*03:02) associated with the risk of developing ACPA-positive RA in the Malay ethnic group. The different risk and protective residues of HLA-DRB1 amino acid at positions 11 and 96 in the Malay and Indian patients with ACPA-positive suggested different amino acid residues within the same HLA protein may promote or counteract the pathogenesis of RA. In common with the Caucasian population, there is little risk from HLA locus for ACPA-negative RA in the multi-ethnic Malaysian population.
Availability of data and materials
All data generated or analyzed during this study are included in this published article and its supplementary information files.
Human leukocyte antigen
Human leukocyte antigen A
Human leukocyte antigen B
Human leukocyte antigen DP beta 1
Human leukocyte antigen DQ alpha 1
Anti-citrullinated protein antibody
Malaysian Epidemiological Investigation of Rheumatoid Arthritis
- 95% CI:
95% confidence interval
- PGWAS :
Genome-wide significance threshold of less than 5 × 10−08
- HLA-DRB1 SE:
Human leukocyte antigen DR beta 1 shared epitope
Single nucleotide polymorphisms
- HLA-DRB1 Val11:
HLA-DRB1 amino acid at position 11 with valine residue
- HLA-DRB1 Leu11:
HLA-DRB1 amino acid at position 11 with leucine residue
- HLA-A Asn77:
HLA-A amino acid at position 77 with asparagine residue
- HLA-B Asp9:
HLA-B amino acid at position 9 with aspartic acid
- HLA-DPB1 Phe9:
HLA-DPB1 amino acid at position 9 with phenylalanine residue
- HLA-DQA1 Asp160:
HLA-DQA1 amino acid at position 160 with aspartic acid
Major histocompatibility complex
American College of Rheumatology
Anti-cyclic citrullinated peptide second-generation (anti-CCP2)
Polymerase chain reaction sequence-specific oligonucleotide
Minor allele frequency
Akaike information criterion
Bayesian information criterion
- HLA-DRB1 Asn120:
HLA-DRB1 amino acid at position 120 with asparagine residue
- HLA-DRB1 His96:
HLA-DRB1 amino acid at position 96 with histidine residue
Stastny P. Association of the B-cell alloantigen DRw4 with rheumatoid arthritis. N Engl J Med. 1978;298(16):869–71.
Gregersen PK, Silver J, Winchester RJ. The shared epitope hypothesis. An approach to understanding the molecular genetics of susceptibility to rheumatoid arthritis. Arthritis Rheum. 1987;30(11):1205–13.
Lee HS, et al. Increased susceptibility to rheumatoid arthritis in Koreans heterozygous for HLA-DRB1*0405 and *0901. Arthritis Rheum. 2004;50(11):3468–75.
Huizinga TW, et al. Refining the complex rheumatoid arthritis phenotype based on specificity of the HLA-DRB1 shared epitope for antibodies to citrullinated proteins. Arthritis Rheum. 2005;52(11):3433–8.
Klareskog L, et al. A new model for an etiology of rheumatoid arthritis: smoking may trigger HLA-DR (shared epitope)-restricted immune reactions to autoantigens modified by citrullination. Arthritis Rheum. 2006;54(1):38–46.
Lin L, et al. The association of HLA-DRB1 alleles with rheumatoid arthritis in the Chinese Shantou population: a follow-up study. Biochem Cell Biol. 2007;85(2):227–38.
Liu SC, et al. Influence of HLA-DRB1 genes and the shared epitope on genetic susceptibility to rheumatoid arthritis in Taiwanese. J Rheumatol. 2007;34(4):674–80.
Balsa A, et al. Influence of HLA DRB1 alleles in the susceptibility of rheumatoid arthritis and the regulation of antibodies against citrullinated proteins and rheumatoid factor. Arthritis Res Ther. 2010;12(2):R62.
Chun-Lai T, et al. Shared epitope alleles remain a risk factor for anti-citrullinated proteins antibody (ACPA)--positive rheumatoid arthritis in three Asian ethnic groups. PLoS One. 2011;6(6):e21069.
Terao C, et al. Brief report: Main contribution of DRB1*04:05 among the shared epitope alleles and involvement of DRB1 amino acid position 57 in association with joint destruction in anti-citrullinated protein antibody-positive rheumatoid arthritis. Arthritis Rheumatol. 2015;67(7):1744–50.
Liu WX, et al. HLA-DRB1 shared epitope allele polymorphisms and rheumatoid arthritis: a systemic review and meta-analysis. Clin Invest Med. 2016;39(6):E182–203.
Xue Y, et al. The HLA-DRB1 shared epitope is not associated with antibodies against cyclic citrullinated peptide in Chinese patients with rheumatoid arthritis. Scand J Rheumatol. 2008;37(3):183–7.
Wakitani S, et al. An association between the natural course of shoulder joint destruction in rheumatoid arthritis and HLA-DRB1*0405 in Japanese patients. Scand J Rheumatol. 1998;27(2):146–8.
Lagha A, et al. HLA DRB1/DQB1 alleles and DRB1-DQB1 haplotypes and the risk of rheumatoid arthritis in Tunisians: a population-based case-control study. HLA. 2016;88(3):100–9.
Alrogy A, et al. Association of human leukocyte antigen DRB1 with anti-cyclic citrullinated peptide autoantibodies in Saudi patients with rheumatoid arthritis. Ann Saudi Med. 2017;37(1):38–41.
Okada Y, et al. HLA-DRB1*0901 lowers anti-cyclic citrullinated peptide antibody levels in Japanese patients with rheumatoid arthritis. Ann Rheum Dis. 2010;69(8):1569–70.
Raychaudhuri S, et al. Five amino acids in three HLA proteins explain most of the association between MHC and seropositive rheumatoid arthritis. Nat Genet. 2012;44(3):291–6.
Jia X, et al. Imputing amino acid polymorphisms in human leukocyte antigens. PLoS One. 2013;8(6):e64683.
Okada Y, et al. Risk for ACPA-positive rheumatoid arthritis is driven by shared HLA amino acid polymorphisms in Asian and European populations. Hum Mol Genet. 2014;23(25):6916–26.
Govind N, et al. HLA-DRB1 amino acid positions and residues associated with antibody-positive rheumatoid arthritis in black South Africans. J Rheumatol. 2019;46(2):138–44.
Han B, et al. Fine mapping seronegative and seropositive rheumatoid arthritis to shared and distinct HLA alleles by adjusting for the effects of heterogeneity. Am J Hum Genet. 2014;94(4):522–32.
Guo J, et al. Sequencing of the MHC region defines HLA-DQA1 as the major genetic risk for seropositive rheumatoid arthritis in Han Chinese population. Ann Rheum Dis. 2019;78(6):773–80.
Trynka G, et al. Dense genotyping identifies and localizes multiple common and rare variant association signals in celiac disease. Nat Genet. 2011;43(12):1193–201.
Yahya A, et al. Smoking is associated with an increased risk of developing ACPA-positive but not ACPA-negative rheumatoid arthritis in Asian populations: evidence from the Malaysian MyEIRA case-control study. Mod Rheumatol. 2012;22(4):524–31.
Tan LK, et al. HLA-A, -B, -C, -DRB1 and -DQB1 alleles and haplotypes in 951 Southeast Asia Malays from Peninsular Malaysia. Hum Immunol. 2016;77(10):818–9.
Cortes A, Brown MA. Promise and pitfalls of the Immunochip. Arthritis Res Ther. 2011;13(1):101.
Purcell S, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81(3):559–75.
Pillai NE, et al. Predicting HLA alleles from high-resolution SNP data in three Southeast Asian populations. Hum Mol Genet. 2014;23(16):4443–51.
Hinks A, et al. Dense genotyping of immune-related disease regions identifies 14 new susceptibility loci for juvenile idiopathic arthritis. Nat Genet. 2013;45(6):664–9.
Seabold, S.a.P., J., Statsmodels: econometric and statistical modeling with Python. Proceedings of the 9th Python in Science Conference, 2010.
Holoshitz J. The rheumatoid arthritis HLA-DRB1 shared epitope. Curr Opin Rheumatol. 2010;22(3):293–8.
van Heemst J, et al. Protective effect of HLA-DRB1*13 alleles during specific phases in the development of ACPA-positive RA. Ann Rheum Dis. 2016;75(10):1891–8.
Mariaselvam CM, et al. HLA class II alleles influence rheumatoid arthritis susceptibility and autoantibody status in South Indian Tamil population. HLA. 2016;88(5):253–8.
Oka S, et al. Protective effect of the HLA-DRB1*13:02 allele in Japanese rheumatoid arthritis patients. PLoS One. 2014;9(6):e99453.
Too CL, et al. HLA-A, -B, -C, -DRB1 and -DQB1 alleles and haplotypes in 194 Southeast Asia Chinese from Peninsular Malaysia. Hum Immunol. 2019;80(11):906–7.
Lundstrom E, et al. Opposing effects of HLA-DRB1*13 alleles on the risk of developing anti-citrullinated protein antibody-positive and anti-citrullinated protein antibody-negative rheumatoid arthritis. Arthritis Rheum. 2009;60(4):924–30.
Nurul-Aain AF, Tan LK, Heselynn H, Nor-Shuhailan S, Eashwary M, Wahinuddin S, Lau IS, Gun SC, Mohd-Shahrir MS, Ainon MM, Azmillah R, Muhaini O, Shahnaz M, Too CL. HLA-A, -B, -C, -DRB1 and -DQB1 alleles and haplotypes in 271 Southeast Asia Indians from Peninsular Malaysia. Human Immunol. 2020;81(6):263–4.
van Steenbergen HW, et al. Association of valine and leucine at HLA-DRB1 position 11 with radiographic progression in rheumatoid arthritis, independent of the shared epitope alleles but not independent of anti-citrullinated protein antibodies. Arthritis Rheumatol. 2015;67(4):877–86.
Wang Y, et al. Genetic structure, divergence and admixture of Han Chinese, Japanese and Korean populations. Hereditas. 2018;155:19.
Zamani M, et al. The involvement of the HLA-DQB1 alleles in the risk and the severity of Iranian coeliac disease patients. Int J Immunogenet. 2014;41(4):312–7.
Xu S, et al. Genomic dissection of population substructure of Han Chinese and its implication in association studies. Am J Hum Genet. 2009;85(6):762–74.
The authors would like to thank the Director General of Health, Ministry of Health Malaysia, for supporting the work described in this article. Special thanks to the members of the MyEIRA study group and the rheumatologists for their dedication and excellent assistance in this study. We truly value the patients and controls for their generous participation. Dr. Daniel Ramskold’s help with the data analysis using Python language is highly appreciated. We would also like to thank Ms. Janet Ahlberg for language editing.
This study was supported by grants from the Ministry of Health, Malaysia (i.e., MRG-2005-12, JPP-IMR 07-017, JPP-IMR 07-046, JPP-IMR 08-006, JPP-IMR 08-012) and The Swedish National Research Council (DNR 348-2009-6468).
Ethics approval and consent to participate
This study was approved by the Medical Research and Ethics Committee, Ministry of Health Malaysia (KKM/JEPP/02 Jld 1 (86); (14) dlm. KKM/NIHSEC/08/0804/MRG-2005-12) and Stockholm Regional Ethics Committee, Sweden (2012/1381-31/1). All participants gave written informed consent.
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Overall concordance rate of HLA-A, HLA-B, HLA-C, HLA-DRB1 and HLA-DQB1 genes in the control group, stratified by ethnicity. Supplementary Table 2. Classical 2-digit HLA alleles imputation accuracy in the control group, stratified by ethnicity. Supplementary Table 3. Classical 4-digit HLA alleles imputation accuracy in the control group, stratified by ethnicity. Supplementary Table 4. Number of imputed classical HLA alleles and amino acid polymorphisms achieved published genome-wide threshold of p< 5 × 10− 08. Supplementary Table 5. Logistic regression results of the association between imputed HLA amino acids and alleles, and risk of developing ACPA-positive rheumatoid arthritis in the Malay, Chinese and Indian ethnic groups. Supplementary Table 6. Stepwise logistic regression analysis for risk of ACPA-positive RA in the Malay ethnic group. Supplementary Figure 1. Plot of stepwise logistic regression analysis to fine-map HLA variants as risk factor for ACPA-positive RA in the Chinese and Indian ethnic groups. Supplementary Figure 2. Meta-analysis of polymorphic HLA-DRB1 amino acid residues position 11 and risk of developing ACPA-positive RA in Malay, Chinese and Indian ethnic groups.
About this article
Cite this article
Tan, L.K., Too, C.L., Diaz-Gallo, L.M. et al. The spectrum of association in HLA region with rheumatoid arthritis in a diverse Asian population: evidence from the MyEIRA case-control study. Arthritis Res Ther 23, 46 (2021). https://doi.org/10.1186/s13075-021-02431-z
- Rheumatoid arthritis
- HLA amino acid residues
- Risk variants
- HLA fine-mapping
- Multi-ethnic Malaysian population