- Research article
- Open Access
MHC associations of ankylosing spondylitis in East Asians are complex and involve non-HLA-B27 HLA contributions
Arthritis Research & Therapy volume 22, Article number: 74 (2020)
The association of HLA-B*27 with AS is amongst the strongest of any known association of a common variant with any human disease. Nonetheless, there is strong evidence indicating that other HLA-B alleles are involved in the disease. European ethnicity studies have demonstrated risk associations with HLA-B*40 and multiple other HLA-B, HLA-A, and HLA class II alleles, and demonstrated that in that ethnic group, the amino acid sequence at position 97 in HLA-B is the key determinant of HLA associations with AS. A recent study in Korean AS cases and controls additionally identified association at HLA-C*15:02. In the current study, we examined the MHC associations of AS in an expanded East Asian cohort.
A total of 1637 Chinese, Taiwanese, and Korean AS cases meeting the modified New York Criteria for AS, and 1589 ethnically matched controls, were genotyped with the Illumina Immunochip, including a dense coverage of the MHC region. HLA genotypes and amino acid composition were imputed using the SNP2HLA programme using the Han-MHC reference panel based on the data of Han Chinese subjects (n = 9689), and association tested using logistic regression controlling for population stratification effects.
A strong association was seen with HLA-B*27 (odds ratio (OR) = 205.3, P = 5.76 × 10−244). Controlling for this association, the strongest risk association is seen with HLA-C*15 at genome-wide significant level (OR = 7.62, P = 9.30 × 10−19), and confirmed association is also seen with HLA-B*40 at suggestive level (OR = 1.65, P = 2.54 × 10−4). At amino acid level, the strongest association seen in uncontrolled analysis was with histidine at position 114 in HLA-B (P = 7.24 × 10−241), but conditional analyses suggest that the primary amino acid associations are with lysine at position 70 and asparagine at position 97. Restriction of the ERAP1 association with HLA-B27-positive AS, previously reported in European subjects, was confirmed in East Asians.
This study confirms in East Asians that the HLA associations of AS are multiple, including previously reported associations at HLA-B*27, HLA-B*40, and HLA-C*15, as well as novel association with HLA-DQB1*04. The HLA-B associations are driven by the amino acids at positions 70 and 97, in the B pocket of HLA-B.
Ankylosing spondylitis (AS) is a highly heritable rheumatic disease characteristically causing chronic inflammation of the spine and sacroiliac joints, as well as in some patients affecting the peripheral joints, the anterior uvea, and less commonly other organs. The worldwide distribution of AS is closely related to the prevalence of HLA-B*27, although the underlying mechanism remains unclear. Whilst the HLA-B*27 allele is found in approximately 85% of patients, there is strong evidence indicating that other HLA-B alleles and MHC genes are involved in the disease, as well as non-MHC loci.
Direct genotyping studies in European case-control cohorts have demonstrated risk associations consistently with HLA-B*40 and variably reported associations with multiple other HLA-B, HLA-A, and HLA class II alleles. The development of accurate HLA imputation methods from single nucleotide polymorphism (SNP) microarray data has enabled far larger case-control studies to be performed, with, for the first time, proper control for population stratification effects. Using this approach and studying 22,647 AS cases and controls of European descent, Cortes et al. demonstrated that the amino acid sequence of HLA-B at position 97, in the epitope-binding groove, is the key determinant of HLA associations with AS. After controlling for the associated alleles in HLA-B, independent associations with variants in the HLA-A, HLA-DPB1, and HLA-DRB1 loci were observed .
Differences in HLA-B*27 subtype distributions between Asian and European descent populations have been well reported, and further non-HLA-B*27 HLA class I associations in East Asian AS have been reported. Also using HLA imputation methods, a study in 654 Korean cases of AS and 3166 controls additionally identified association at HLA-C*15:02 . Additionally, using direct genotyping in 360 Han Chinese AS cases and 350 controls with no genomic control for population stratification, risk association of HLA-B*40 and protective association of HLA-B*07 have been demonstrated .
In this study, using HLA imputation methods, we analyse the associations of AS with major histocompatibility complex (MHC) polymorphisms to identify functional and potentially causal variants using a large cohort of East Asian ancestry AS cases and controls . In addition to our primary analysis of this cohort, we perform fine mapping of the MHC region with imputation of SNPs, HLA class I and II classical alleles, and amino acid residues within the classical HLA proteins. In addition to HLA-B*27, we identify further HLA-B and other HLA class I and II alleles associated with AS.
Subjects and SNP data
A total of 1637 Chinese, Taiwanese, and Korean AS cases meeting the modified New York Criteria for AS  as confirmed by qualified rheumatologists, and 1589 ethnically matched controls (Table 1), were genotyped with the customised SNP array (Illumina Immunochip ), including a dense coverage of the MHC region. Cohort descriptions and genotyping protocols are as previously reported . By standard quality control procedures, SNPs with a minor allele frequency of at least 1% (MAF > 0.01), call rates of ≥ 0.98, and P values in Hardy-Weinberg disequilibrium tests ≤ 10−7 were analysed in this study. To confirm ethnicity, we performed a continental principal components analysis (PCA), merging the study genotype data available from 51 available populations genotyped by Illumina 650Y from the Human Genome Diversity Panel (HGDP-CEPH) . Cases or controls lying more than 6 standard deviations from the population mean on principal components (PCs) 1–10 were then excluded.
HLA imputation and association analysis
We conducted a 2-step imputation. We densely imputed SNPs across the MHC using the Michigan Imputation Server  and the 1000 Genomes Phase 3 reference dataset (26 populations across the world), then further using the Han-MHC reference panel , to ensure maximum SNP coverage to enable accurate imputation of HLA-B alleles, including of particular interest, HLA-B27. Using this SNP data and the Han Chinese reference panel (N = 9869), the programme SNP2HLA was used to impute the classic HLA alleles and amino acid residues of the 8 HLA genes (HLA-A, HLA-B, HLA-C, HLA-DPB1, HLA-DQB1, HLA-DRB1, HLA-DPA1, HLA-DQA1) in a total of 3007 East Asian subjects. In the output file of SNP2HLA, imputed classical HLA alleles and HLA protein amino acid positions were defined as binary markers coding the presence or absence of the allele or residue being tested, and each different allele or residue was tested as a biallelic position. Association with AS was then tested using logistic regression function in PLINK  by including all allele/residues/SNP conditioning on 10 principal components to control for population stratification effects. We then performed conditional analysis repeatedly in an iterative fashion by adding the dosage of HLA-B*27 allele and other significant alleles/residues/SNPs as covariates until no significant allele/residue/SNP was observed. Only HLA alleles or amino acids with imputation information scores > 0.5 were considered. All results are presented unadjusted for multiple testing.
PCA indicated that all study subjects were ethnically East Asian (Supplementary Figure 1). The genomic inflation factor calculated using a set of 1767 negative control SNPs in regions included on Immunochip for studies of reading and writing disabilities, psychosis, and schizophrenia was 1.03 (lambda (1000) = 1.02). No evidence of statistical inflation is seen in the Q-Q plot (Supplementary Figure 2). After quality control and imputation, 15,748 SNPs across the MHC (from 25 to 35 Mb, hg18) were available for analysis in 1482 cases and 1512 controls. Imputed HLA-B allele frequencies amongst controls in the current study were not significantly different from those in previously reported directly genotyped studies (P > 0.05), confirming the high accuracy of HLA imputation, particularly at a two-digit resolution .
The strongest SNP association with AS observed was a missense variant of HLA-B, rs1071652 (SNP-B*31432180_CG, odds ratio (OR) = 180, P = 4.45 × 10−256, Fig. 1). The previously reported East Asian HLA-B*27 tagSNP rs13202464 (31452562)  was also found significantly associated with AS (OR = 58.73, P = 1.92 × 10−211). Controlling for rs1071652, residual association is seen with HLA-B*27 (4.48 × 10−22) and SNP rs41553720 (SNP-B*31432843_A, P = 3.87 × 10−31), indicating that combinations of SNPs are currently required to tag HLA-B*27 in East Asian populations, in contrast to the situation in European descent populations .
After SNP imputation in the MHC region, the expected strong association was observed with HLA-B*27 (odds ratio (OR) = 205, P = 5.76 × 10−244, Table 2). Controlling for the HLA-B*27 association and studying other HLA-B alleles, risk association is seen with HLA-B*40 at suggestive level (OR = 1.65, P = 2.54 × 10−4). Controlling for both HLA-B*27 and HLA-B*40, no association was observed in the 2-digit HLA-B allele with MAF > 1% (only rare alleles HLA-B*53 and HLA-B*38 were associated at suggestive level; P values were 2.9 × 10−4 and 3.3 × 10−4, respectively).
At the amino acid level, the strongest association seen in the uncontrolled analysis was with histidine at position 114 in HLA-B (P = 7.24 × 10−241), followed by multiple HLA-B amino acids including lysine at 70 (P = 1.49 × 10−237) and asparagine 97 (P = 2.51 × 10−237) (Table 3). Asparagine 97 and histidine 114 were previously reported to be the main amino acid determining HLA-B associations with AS in European descent and Korean populations, respectively [1, 2].
Conditional analyses for these individual amino acids and their combinations reveal that only the association of histidine 114 can be attenuated by conditioning on asparagine 97 (Table 4). No other individual amino acid explains the association of the other amino acids. For HLA-B alleles, both combinations of lysine 70 + asparagine 97 and lysine 70 + histidine 114, but not asparagine 97 + histidine 114, controlled for association with any other 2-digit HLA-B allele (P > 0.0017, correcting for 30 2-digit HLA-B alleles tested). Controlling for all of lysine 70, asparagine 97, and histidine 114, the strongest HLA amino acid association remains with several positions including HLA-B position 97 (serine, found on HLA-B*7, *8, *15, *2707, *40, *41, *48; OR = 2.14, P = 2.14 × 10−5).
Controlling for HLA-B*27 alone or in combination with HLA-B*40 did not fully control for the association of asparagine 97, lysine 70, or histidine 114 (P < 5 × 10−8 for both analyses, Table 3).
Non-HLA-B susceptibility loci in the MHC
Considering HLA alleles other than HLA-B, several HLA-A, HLA-C, and HLA class II alleles showed significant associations (Table 2). Controlling for the HLA-B*27 association, independent risk association was confirmed with HLA-C*15 (OR = 2.13, P = 9.30 × 10−19) and HLA-C*1502 (OR = 7.62, P = 6.78 × 10−23), both associations at the amino acid level tagged by a leucine 116 in HLA-C (P = 6.61 × 10−21) located in the HLA-C epitope-binding groove. Stepwise conditional analyses on both HLA-B*27 and HLA-C*15 demonstrated significant associations with HLA-DQB1*04 (OR = 2.13, P = 7.91 × 10−5) after correcting for multiple comparisons (499 signals across MHC, as defined by regions with LD r2 < 0.2, Bonferroni correction threshold = 10−4). Conditioning on both HLA-B*27 and HLA-B*40, the association was confirmed with HLA-C*15 (OR = 4.97, P = 9.88 × 10−17) and HLA-DQB1*04 (OR = 2.42, P = 1.86 × 10−6).
ERAP1 variants in association with AS
The key ERAP1 variant associated with AS is rs30187 (ccc-5-96150086-T-C, chr5:96150086[hg18], encoding K528R) [4, 13] (Table 5). It has previously been observed in European populations that the association with the variant rs30187 in the ERAP1 locus is restricted to HLA-B*27-positive subjects, or HLA-B*40-positive, HLA-B27-negative subjects, consistent with epistatic interactions. Here, we investigated the possibility of interaction between the HLA-B*27 and HLA-B*40 alleles and the previously reported tag SNP of ERAP1 locus (rs30187) . When testing for interaction with the HLA-B*27 alleles, we found that rs30187-A risk allele increased the risk of disease in the strata where HLA-B*27 was present (OR = 1.29; P = 2.71 × 10−6) (Table 5), but no association was seen in HLA-B27-negative cases (OR = 1.06, P = 0.61). No evidence of interaction was observed between rs30187 and the HLA-B*40 allele, although the power to identify this was low as the number of HLA-B27-negative cases was low.
This study confirms that in East Asians, the primary MHC associations with AS are with HLA-B*27 and HLA-B*40, and confirms the risk association of HLA-C*1502 with the disease. The association of HLA-B*40 with AS has been convincingly demonstrated now in both European descent [1, 14,15,16] and East Asian studies [3, 17], using both direct genotyping- and imputation-based methods. HLA-B*4001 has also been shown to be associated with IgA nephropathy (OR = 1.34, P = 5.64 × 10−7) , a known though uncommon association of AS. The functional mechanism of association of this allele has been little studied. It does not share the lysine 70, asparagine 97, or histidine 114 residues found in most HLA-B*27 alleles. As with HLA-B*27, it is known to interact with AS-associated ERAP1 variants to cause AS, suggesting that it is likely to operate by the same mechanism. Further studies to compare its properties with HLA-B27, such as its peptide-binding characteristics, folding rate, and whether it forms homodimers, are indicated to investigate its association further.
No protective association was seen with HLA-B*07 as has previously been reported in East Asians  and European descent cohorts [1, 15], although the allele frequency was very low and the study may not have had adequate power to detect any association with the allele (frequency = 0.024).
The study indicates that in East Asians, the key amino acid drivers of the HLA-B associations in AS are amino acid positions 70 and 97. These remain AS-associated controlling for any other HLA-B amino acid. HLA-B position 97 was previously shown in European descent cohorts to be the key amino acid association in the broad ethnicity, whereas in a Korean study, the association of histidine 114 could not be distinguished from associations with lysine 70 and asparagine 97 . The difference in these findings may be explained by three key factors, sample size, ethnicity, and the reference haplotype dataset. Cortes et al.’s study of European descent subjects involved 9069 AS cases and 13578 controls, over seven times as many subjects as involved in the current study (1637 cases, 1589 controls) and nearly six times the number involved in the previous Korean study (654 cases, 3166 controls). Therefore, the European descent study had greater power, potentially explaining the absence of signal in the East Asian cohorts for some of the HLA-B allele and the HLA class II associations seen in the European dataset. The European descent study also has greater power in conditional analyses, potentially explaining the differences in results regarding the role of lysine 70, which remains positively associated with AS after conditioning on asparagine 97 in the current study, but not in the European descent dataset. The different studies have also used different reference haplotype datasets, potentially affecting the accuracy of the imputation data. Ethnic differences could also play a role through differences in HLA-B*27 subtypes or other HLA-B allele frequencies, particularly comparing the European descent and East Asian cohorts.
Both HLA-B amino acid residues 70 and 97 are found within the B pocket of the HLA-B peptide-binding groove. However, it has been noted that position 70 is tightly coupled with positions 67 and 97 and that position 70 hardly changes the peptide-binding repertoire, suggesting that position 70 is “hitch-hiking” along with positions 67 and 97 in their ability to change the peptide-binding repertoire . Our study and the previous HLA amino acid imputation studies suggest that other amino acid positions in addition to 70 (like position 97 and 114) are also involved in HLA-B risk attribution. The association of these amino acids independent of other amino acids found in the HLA-B27 B pocket, and having controlled for HLA-B27, indicates that their effect on disease risk is partially independent of HLA-B27.
Although the HLA allele frequencies imputed in controls in this study closely match those reported by direct genotyping studies in Han Chinese , the accuracy of imputation in such studies is very dependent on the ethnic matching of the imputed and reference datasets. Whilst the Han-MHC reference dataset used here is of large size (n = 9689), the number of East Asian in 1000 Genomes Phase 3 (n = 524), which we used in the Michigan Imputation Server, is far smaller than the European dataset used in Cortes et al. (Type 1 Diabetes Genetics Consortium dataset, n = 5225) . The smaller reference dataset size precluded imputation to four-digit levels and may have affected the accuracy of the imputation of low-frequency alleles in particular. As SNP-based HLA imputation is a highly efficient method enabling large-scale HLA association studies, there is a clear need for much larger publicly available HLA imputation reference datasets for Asian populations.
In this study, we have also confirmed the interaction between ERAP1 and HLA-B*27, with association only observed of the key ERAP1 variant, rs30187, only seen in HLA-B*27-positive individuals. This confirms the original finding in Europeans  and the previous finding in a case-only analysis of Taiwanese AS patients of different ERAP1 genotypes in HLA-B*27-positive and HLA-B*27-negative cases . We did not see an association of ERAP1 variants in HLA-B*27-negative and HLA-B*40-positive individuals as previously reported , although the sample size was not large. The confirmation of the gene-gene interaction in an East Asian population increases the evidence that this is a true-positive interaction and is critical to AS pathogenesis.
This study confirms that the HLA associations of AS are complex and that multiple non-HLA-B*27 alleles, including both HLA class I and likely HLA class II variants, also contribute to risk and protection from the disease. Further investigation of the mechanisms involved in these associations is likely to assist in determining the pathogenesis of this disease.
Availability of data and materials
Summary data for the datasets used are available at Harvard Dataverse (https://doi.org/10.7910/DVN/NJ7XSO) and on request from the authors.
Human Genome Diversity Panel
Major histocompatibility complex
Minor allele frequency
Principal components analysis
Single nucleotide polymorphism
Cortes A, Pulit SL, Leo PJ, Pointon JJ, Robinson PC, Weisman MH, Ward M, Gensler LS, Zhou X, Garchon HJ, et al. Major histocompatibility complex associations of ankylosing spondylitis are complex and involve further epistasis with ERAP1. Nat Commun. 2015;6:7146.
Kim K, Bang SY, Lee S, Lee HS, Shim SC, Kang YM, Suh CH, Sun C, Nath SK, Bae SC, et al. An HLA-C amino-acid variant in addition to HLA-B*27 confers risk for ankylosing spondylitis in the Korean population. Arthritis Res Ther. 2015;17:342.
Yi L, Wang J, Guo X, Espitia MG, Chen E, Assassi S, Zou H, Reveille JD, Zhou X. Profiling of HLA-B alleles for association studies with ankylosing spondylitis in the Chinese population. Open Rheumatol J. 2013;7:51–4.
International Genetics of Ankylosing Spondylitis C, Cortes A, Hadler J, Pointon JP, Robinson PC, Karaderi T, Leo P, Cremin K, Pryce K, Harris J, et al. Identification of multiple risk variants for ankylosing spondylitis through high-density genotyping of immune-related loci. Nat Genet. 2013;45(7):730–8.
van der Linden S, Valkenburg HA, Cats A. Evaluation of diagnostic criteria for ankylosing spondylitis. A proposal for modification of the New York criteria. Arthritis Rheum. 1984;27(4):361–8.
Cortes A, Brown MA. Promise and pitfalls of the Immunochip. Arthritis Res Ther. 2011;13(1):101.
Lopez Herraez D, Bauchet M, Tang K, Theunert C, Pugach I, Li J, Nandineni MR, Gross A, Scholz M, Stoneking M. Genetic variation and recent positive selection in worldwide human populations: evidence from nearly 1 million SNPs. PLoS One. 2009;4(11):e7888.
Das S, Forer L, Schonherr S, Sidore C, Locke AE, Kwong A, Vrieze SI, Chew EY, Levy S, McGue M, et al. Next-generation genotype imputation service and methods. Nat Genet. 2016;48(10):1284–7.
Zhou F, Cao H, Zuo X, Zhang T, Zhang X, Liu X, Xu R, Chen G, Zhang Y, Zheng X, et al. Deep sequencing of the MHC region in the Chinese population contributes to studies of complex disease. Nat Genet. 2016;48(7):740–6.
Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, Maller J, Sklar P, de Bakker PI, Daly MJ, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81(3):559–75.
Lin Z, Bei JX, Shen M, Li Q, Liao Z, Zhang Y, Lv Q, Wei Q, Low HQ, Guo YM, et al. A genome-wide association study in Han Chinese identifies new susceptibility loci for ankylosing spondylitis. Nat Genet. 2011;44(1):73–7.
Evans DM, Spencer CC, Pointon JJ, Su Z, Harvey D, Kochan G, Oppermann U, Dilthey A, Pirinen M, Stone MA, et al. Interaction between ERAP1 and HLA-B27 in ankylosing spondylitis implicates peptide handling in the mechanism for HLA-B27 in disease susceptibility. Nat Genet. 2011;43(8):761–7.
Roberts AR, Appleton LH, Cortes A, Vecellio M, Lau J, Watts L, Brown MA, Wordsworth P. ERAP1 association with ankylosing spondylitis is attributable to common genotypes rather than rare haplotype combinations. Proc Natl Acad Sci U S A. 2017;114(3):558–61.
Robinson WP, van der Linden SM, Khan MA, Rentsch HU, Cats A, Russell A, Thomson G. HLA-Bw60 increases susceptibility to ankylosing spondylitis in HLA-B27+ patients. Arthritis Rheum. 1989;32(9):1135–41.
Diaz-Pena R, Vidal-Castineira JR, Lopez-Vazquez A, Lopez-Larrea C. HLA-B*40:01 is associated with ankylosing spondylitis in HLA-B27-positive populations. J Rheumatol. 2016;43(6):1255–6.
Brown MA, Pile KD, Kennedy LG, Calin A, Darke C, Bell J, Wordsworth BP, Cornelis F. HLA class I associations of ankylosing spondylitis in the white population in the United Kingdom. Ann Rheum Dis. 1996;55(4):268–70.
Wei JC, Tsai WC, Lin HS, Tsai CY, Chou CT. HLA-B60 and B61 are strongly associated with ankylosing spondylitis in HLA-B27-negative Taiwan Chinese patients. Rheumatology (Oxford). 2004;43(7):839–42.
Yu XQ, Li M, Zhang H, Low HQ, Wei X, Wang JQ, Sun LD, Sim KS, Li Y, Foo JN, et al. A genome-wide association study in Han Chinese identifies multiple susceptibility loci for IgA nephropathy. Nat Genet. 2011;44(2):178–82.
van Deutekom HW, Kesmir C. Zooming into the binding groove of HLA molecules: which positions and which substitutions change peptide binding most? Immunogenetics. 2015;67(8):425–36.
Wang CM, Ho HH, Chang SW, Wu YJ, Lin JC, Chang PY, Wu J, Chen JY. ERAP1 genetic variations associated with HLA-B27 interaction and disease severity of syndesmophytes formation in Taiwanese ankylosing spondylitis. Arthritis Res Ther. 2012;14(3):R125.
The authors would like to thank Erika de Guzman, Sharon Song, and Lisa Anderson from the Australian Translational Genomics Centre (https://research.qut.edu.au/translationalgenomicsgroup/atgc/) for their assistance in SNP microarray genotyping.
MAB was funded by a National Health and Medical Research Council (Australia) Senior Principal Research Fellowship (1024879) and Queensland State Premier’s Fellowship for Science. Support for this study was received from a National Health and Medical Research Council (Australia) programme grant (566938) and project grant (569829). This research was funded/supported by the National Institute for Health Research (NIHR) Biomedical Research Centre based at Guy’s and St Thomas’ NHS Foundation Trust and King’s College London and/or the NIHR Clinical Research Facility. The views expressed are those of the authors and not necessarily those of the NHS, the NIHR, or the Department of Health. HX was supported by the National Natural Science Foundation of China (Grant 31821003) and the China Ministry of Science and Technology (Grant 2018AAA0100300).
Ethics approval and consent to participate
This study protocol was reviewed and approved by the relevant ethics committees of the hospitals and institutions involved. All subjects provided written informed consent.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Wang, G., Kim, TH., Li, Z. et al. MHC associations of ankylosing spondylitis in East Asians are complex and involve non-HLA-B27 HLA contributions. Arthritis Res Ther 22, 74 (2020). https://doi.org/10.1186/s13075-020-02148-5
- Ankylosing spondylitis