Skip to main content
  • Research article
  • Open access
  • Published:

Bias in effect size of systemic lupus erythematosus susceptibility loci across Europe: a case-control study



We aimed to investigate whether the effect size of the systemic lupus erythematosus (SLE) risk alleles varies across European subpopulations.


European SLE patients (n = 1,742) and ethnically matched healthy controls (n = 2,101) were recruited at 17 centres from 10 different countries. Only individuals with self-reported ancestry from the country of origin were included. In addition, participants were genotyped for top ancestry informative markers and for 25 SLE associated SNPs. The results were used to compare effect sizes between the Central Eureopan and Southern European subgroups.


Twenty of the 25 SNPs showed independent association with SLE, These SNPs showed a significant bias to larger effect sizes in the Southern subgroup, with 15/20 showing this trend (P = 0.019) and a larger mean odds ratio of the 20 SNPs (1.46 vs. 1.34, P = 0.02) as well as a larger difference in the number of risk alleles (2.06 vs. 1.63, P = 0.027) between SLE patients and controls than for Central Europeans. This bias was reflected in a very significant difference in the cumulative genetic risk score (4.31 vs. 3.48, P = 1.8 × 10-32). Effect size bias was accompanied by a lower number of SLE risk alleles in the Southern subjects, both patients and controls, the difference being more marked between the controls (P = 1.1 × 10-8) than between the Southern and Central European patients (P = 0.016). Seven of these SNPs showed significant allele frequency clines.


Our findings showed a bias to larger effect sizes of SLE loci in the Southern Europeans relative to the Central Europeans together with clines of SLE risk allele frequencies. These results indicate the need to study risk allele clines and the implications of the polygenic model of inheritance in SLE.


The systemic lupus erythematosus (SLE) genetic component has been partially elucidated thanks to large studies that have uncovered more than 30 loci reaching very convincing disease association [112]. These studies have shown that a large fraction of the SLE loci (such as STAT4, TNFSF4 or BLK) are shared in the different ethnic groups; however, other loci are not (such as PTPN22, which is exclusive of Europeans). These latter loci can be due to the absence or rarity of the polymorphism in one of the ethnic groups (as for PTPN22, which is absent in Asians), but other SLE loci show a similar frequency in discordant populations (as for PXK or FCGR2A). Possible explanations for these conflicting results have been envisaged, including differences in linkage between the causal polymorphism and the analysed SNPs, limitations of study design, or differences in the interactions with other genetic loci or with environmental exposures [13].

Gene-gene interaction could be behind the observation that SLE loci show variability in effect sizes in function of the genetic background. For example, the Amerindian genetic background is associated with a higher effect size of several SLE loci in Hispanics [14]. We wondered whether genetic heterogeneity of this type could exist among European subjects. Support for this hypothesis is provided by the recent evidence of differences in SLE clinical features among Europeans [1517] and by opposed results of SLE association with PDCD1 [18, 19]. Notably, these two observations showed a North-South axis of variations, which is the main axis of European population differentiation [2023].

Our study included 25 top-associated SNPs in the better known SLE loci studied in 1,742 patients with SLE and in 2,101 controls from 17 collections recruited in 10 European countries, each of them with homogeneous local ancestry. The results showed a bias to larger effect sizes of the risk alleles in the Southern Europeans relative to the Central Europeans. We also found clines of risk allele frequencies.

Materials and methods

Patient data

We used DNA samples from European SLE patients and ethnically matched healthy controls recruited at 17 centres from 10 different countries (Table 1). Most of these samples have already been described [24, 25]. Each recruiting centre was asked for about 100 patients with SLE according to the revised American College of Rheumatology classification criteria [26] and for about 100 controls, producing a total of 1,742 cases and 2,101 controls. Only individuals with self-reported ancestry from the country of origin were included. All participants gave their written informed consent to participate and the study was approved by the relevant ethics committees.

Table 1 Sample collections, proportion of women, frequencies of the three more informative AIMs and North/South score


DNA samples were amplified in a multiplex PCR with the KAPA2G fast HotStart (Kapa Biosystems, Woburn, MA, USA) in a final volume of 10 μl (20 ng genomic DNA), using 3 mM MgCl2 and 0.2 μM each primer. Products were purified by Exo-SAP digestion with exonuclease I (Epicentre, Madison, WI, USA) and shrimp alkaline phosphatase (GE Healthcare, Barcelona, Spain). Subsequently, single-base extension reactions were performed with the SNaPshot Multiplex kit (Applied Biosystems, Foster City, CA, USA). The genotyping call rate success of the newly studied SNPs was 99.12%. Sequences of primers and probes are available from the authors upon request.

Selection of SNPs

Six ancestry informative markers (AIMs) were selected (see Table S1 in Additional file 1). Three of these are the most informative AIMs in differentiating Northern Europeans from Southern Europeans according to a large study [22]. Another two AIMs are the most informative for East-West place of origin according to the same study [22]. rs12913832 is a SNP associated with large differences in frequency across Europe and unrelated to the previous [23]. In addition, we used genotype data from another 25 SNPs tagging 22 SLE loci reported in large European studies (see Table S1 in Additional file 1). These included nine SNPs in nine SLE loci we had already replicated and that made up the first phase of the current study [24]. We selected 14 additional SNPs for de novo genotyping in our samples. These SNPs were the top SLE-associated SNPs in large previous studies [2, 3, 7, 27, 28]. Not all of them have reached a genome-wide association level, but they are considered solid because they were found in large studies and with an odds ratio (OR) of the risk allele > 1.15 in at least one study. These 14 SNPs together with two IRF5 SNPs we had already studied [25] made up the 16 SNPs included in the second phase of our study.

Statistical analysis

Analysis of results was based on R and Statistica 7.0 (StatSoft, Tulsa, OK, USA). Conformity with Hardy-Weinberg equilibrium was tested in control samples. Allele frequencies of the AIMs were compared between patients and controls from each collection with 2 × 2 contingency tables. We created a global score by sample collection for the North-South axis of European population differentiation (N/S score) with the allele frequencies of the AIMs. First, we defined AIM allele frequencies as a function of the allele more common in Northern European populations. The most informative, nonredundant three AIMs were then selected. Finally, as a normalisation step we rescaled the frequencies from each of these three AIMs to 0 to 100%. This was done by considering 0% the frequency in the sample collection where it was less abundant and 100% the frequency where it was most abundant. The rescaled values of the three AIMs were averaged to obtain a combined normalised unique score for each collection.

Case-control allele frequencies were compared with fixed-effects and random-effects models stratifying by sample collection. For the fixed-effect model, the Mantel-Haenszel approach was used. For the random-effect model, an inverse variance meta-analysis approach was followed. Heterogeneity of effect sizes was evaluated with the inconsistency parameter I2 derived from the Cochran Q statistic. A high, moderate and low level of inconsistency was attributed to levels of I2 over 75%, 50% and 25%, respectively, as described previously [29].

Distributions of ORs in Central European and Southern European populations were compared with the binomial distribution. Geometric mean (Gmean) values of the ORs were obtained and compared. The sum of SLE risk alleles was obtained for the 20 SNPs showing independent association with SLE. The sum of genetic risk scores (GRSs) was also calculated for the same 20 SNPs. The total GRS for each patient with SLE was the sum of the products of the natural logarithm of the OR by the number of risk alleles at each locus that was carried by this patient. The ORs used to calculate GRS were the specific Mantel-Haenszel ORs for the corresponding European subgroup. The Gmean OR, mean of sum of risk alleles and mean cumulative GRS values were compared between groups with Student t tests. Correlation between the N/S score and sum of risk alleles or the mean of the natural logarithm of the OR was analysed with the weighted Pearson correlation coefficient. The threshold for significance was set at P ≤ 0.05.


Analysis of population differentiation

Our study included samples from 1,742 patients with SLE and 2,101 healthy controls recruited in 17 centres (Table 1). Recruiters at each centre asked the patients and the controls for their ancestry, and only those reporting uniform known ancestry from the respective country were included. In addition, we checked with six top AIMs informative for European population differentiation whether there were differences between cases and controls from each recruitment centre. Five of the six AIMs provided completely independent information (pairwise r2 between them < 0.03) - only rs6730157 and rs4988235 were redundant (r2 = 0.85). This analysis showed significant differences in the samples from two collections, the Czech Republic and Belgium. Consequently, these two collections of samples were discarded from all subsequent analyses.

Next, we divided the remaining 15 collections following the major axis of European population differentiation, the North-South axis. According to previous studies we expected two groups [2022]: one with all samples from Central Europe, with samples from the Netherlands, Germany, Hungary and Slovakia (383 patients with SLE and 463 healthy controls); and a second with all samples from Portugal, Spain, Italy and Greece (1,111 patients and 1,432 controls). The AIM frequencies in samples from each collection were congruent with this division (Table 1). We obtained the N/S score (for the place of each collection on the North-South axis), which was as expected from geographical distribution and previous studies [2022] and was in agreement with the division we made (Table 1).

Nine SLE susceptibility loci with a Southern bias

We have already replicated association of top SNPs in nine SLE susceptibility loci in the samples included in this study (Table 2) [24]. When these SNPs were analysed separately in the two subgroups, Central European and Southern European, we found a bias for stronger association in the latter (Figure 1A). This bias was observed in eight of the nine SNPs. This distribution is significantly different from that expected by chance (P = 0.039), and was observed with the Mantel-Haenszel OR (Figure 1A) and with the random-effect meta-analysis OR (see Figure S1 in Additional file 1). Mantel-Haenszel analysis was preferred because none of the 18 analyses showed high heterogeneity and only three showed a moderate level of inconsistency. It should be noted that none of the SNPs taken individually was significantly different between the two groups.

Table 2 Association results of the 25 systemic lupus erythematosus SNPs
Figure 1
figure 1

Bias for a stronger association in Southern Europeans than in Central Europeans. (A) Nine systemic lupus erythematosus (SLE) loci from Suarez-Gestal and colleagues [24]. (B) Eleven newly studied SLE-associated SNPs. Mantel-Haenszel odds ratio (OR) for the risk allele and their 95% confidence interval presented in descending order from left to right. *Loci showed significant differences between the two groups. MECP2 data only from women because it maps to the X chromosome. SNPs used for each locus are detailed in Table 2.

Association analysis of additional SLE loci

We wanted to assess whether the bias found was a general phenomenon of SLE loci. We therefore selected 16 SNPs identifying other SLE genetic loci in Europeans [2, 3, 7, 27, 28]. All of the SNPs were genotyped successfully and all were in Hardy-Weinberg equilibrium when analysed by collection (the two IRF5 SNPs have already been studied in a fraction of the samples [25]). The combined data showed significant differences between SLE cases and controls for 12 SNPs (Table 2). All of the significant differences were in the same direction as originally reported. Only four SNPs were similar in cases and controls, so they were excluded from further analysis. To avoid redundancy, we checked with conditional logistic regression whether the two associated SNPs in IRF5, TNFAIP3, TNFSF4 or the two SNPs in the HLA (rs2187668 and rs3131379) contributed independently to the association. One of the TNFSF4 SNPs (rs844644) showed no association when conditioned in the other TNFSF4 SNP (P = 0.072), and therefore was no longer considered. On the contrary, the two IRF5, the two TNFAIP3 and the two HLA SNPs remained associated and were included in the following analyses.

Stratification of the SLE patients and controls in Central Europeans and Southern Europeans showed that two of the 11 SNPs (rs3131379 in MSH5, P = 0.003; and rs2187668 in HLA-DQA1, P = 0.046) were significantly more associated in the Southern subgroup than in the Central European subgroup in the Mantel-Haenszel meta-analysis (Figure 1B); but none was significantly different in the random-effects meta-analysis (see Figure S2 in Additional file 1). The Mantel-Haenszel meta-analysis was favoured because none of the 22 analyses showed high inconsistency and only five showed a moderate level. In total, seven of the 11 associated SNPs were numerically more associated in the Southern European subgroup (Figure 1B). Only three SNPs were more associated in the Central Europeans (rs573775 in ATG5, rs729302 in IRF5, and rs5754217 in UBE2L3) and one was equally associated in the two subgroups (rs2205960 in TNFSF4), but none of these differences were significant.

Southern bias for all of the SLE-associated SNPs together

When the 20 SNPs (nine from the first phase and 11 from the second) that have shown independent association in our samples were considered together, a bias towards a stronger association in the Southern subgroup was observed both as a significant deviation of the OR from a random binomial distribution (P = 0.019) and as a significant difference between the OR means, which was larger in the Southern samples (Gmean = 1.46 ± 1.30) than in the Central European samples (Gmean = 1.34 ± 1.17, P = 0.02). Because it was possible that a fraction of the effect size attributed to a SNP is dependent on other SLE-associated SNPs, we also compared the mean of the OR for each SNP conditional on all the other SNPs. This analysis also showed a larger effect size in the Southern subjects (Gmean = 1.43 ± 1.24) than in the Central Europeans (Gmean = 1.28 ± 1.18, P = 0.02).

We also analysed our data in a different way by counting the SLE risk alleles carried by each subject. Although it was possible to have from zero to 40 risk alleles, none of the subjects had less than five or more than 24 risk alleles. The distribution of frequencies stratified by disease status and by Central versus Southern subpopulations showed a gradient of values (Figure 2). The lowest number of risk alleles was observed in the healthy controls from the Southern European group (mean ± standard deviation = 12.0 ± 2.5). Immediately higher was the number of risk alleles corresponding to the Central European controls (12.8 ± 2.7, P = 1.1 × 10-8 vs. the Southern European controls). This group was followed for the Southern European SLE patients (14.0 ± 2.5, P = 2.0 × 10-17 vs. the Central European controls) and, finally, for the Central European patients (14.4 ± 2.8, P = 0.016 vs. the Southern SLE patients).

Figure 2
figure 2

Systemic lupus erythematosus risk alleles in Central and Southern European patients and controls. Distribution of the number of systemic lupus erythematosus (SLE) risk alleles in Central European and Southern European SLE patients and controls. y axis, percentage of subjects in each of the four strata with the corresponding total number of SLE risk alleles. Distance-weighted least-squares fitting lines are presented. White bars and dotted line, Southern European controls; light grey bars and short-slashed line, Central European controls; medium grey bars and long-slashed line, Southern European SLE patients; dark grey bars and continuous line, Central European SLE patients.

The differences in number of risk alleles were not due to confounding by deviations from Hardy-Weinberg equilibrium in any of the four sample groups for any of the SNPs or by differences in call rate between the four groups (data not shown). We also checked that the differences and order hold when only women were analysed, with the lowest number in Southern controls (12.0 ± 2.4) followed by Central controls (13.0 ± 2.6), Southern SLE patients (14.1 ± 2.5) and Central European patients (14.4 ± 2.8) - showing significant differences between each of these groups except the last two (P = 5.3 × 10-9, P = 2.1 × 10-8 and P = 0.056, respectively). The same sequence was observed when the comparison was made in men (Southern controls 12.0 ± 2.7 and Central controls 12.7 ± 3.0, P = 0.01; Southern SLE patients 13.5 ± 2.5, P = 0.036; Central SLE patients 14.7 ± 2.8, P = 0.027). In this case, the differences between groups were all significant in spite of the small size of the SLE patient groups.

These results showed from a different perspective the same bias in effect sizes that has been described in the previous paragraphs, because the difference in number of risk alleles between SLE patients and controls of Southern origin (difference = 2.06, 95% confidence interval = 1.85 to 2.26) was significantly larger than for patients and controls of Central European origin (difference = 1.63, 95% confidence interval = 1.25 to 2.01; P = 0.027). In addition, these results showed that the SLE risk alleles were less frequent in the Southern European subjects overall, but the difference was significantly more marked between controls from the Southern and Central subgroups (difference = 0.81, 95% confidence interval = 0.53 to 1.09) than between SLE patients from the same subgroups (difference = 0.38, 95% confidence interval = 0.07 to 0.69; P = 0.023).

We also compared the cumulative GRS between Southern and Central European SLE patients. This parameter includes information from the sum of risk alleles and from the OR, and therefore is not independent of previous comparisons (Figure 3). The mean sum GRS was significantly larger in Southern European patients than in Central European patients (4.31 ± 1.17 vs. 3.48 ± 0.93, P = 1.8 × 10-32). The difference persisted after excluding the two HLA SLE-associated SNPs from the analysis (3.74 ± 0.84 vs. 2.92 ± 0.68, P = 2.4 × 10-58).

Figure 3
figure 3

Sum of genotypic risk score in Central and Southern European patients with systemic lupus erythematosus. y axis, percentage of subjects in each of the two groups with sum genetic risk score (GRS) over 20 SNPs in the indicated intervals. Distance-weighted least-squares fitting lines are presented. Light grey bars and long-slashed line, Southern European systemic lupus erythematosus (SLE) patients; dark grey bars and continuous line, Central European SLE patients.

Finally, we wanted to analyse the mean number of SLE risk alleles for each sample collection as a function of its position along the North-South axis of population differentiation (Figure 4). This analysis showed that the number of SLE risk alleles in controls and in patients with SLE increased with the N/S score (R2 = 0.67, P = 0.002 for controls; and R2 = 0.45, P = 0.002 for patients). Similarly, the mean of the natural logarithm of OR correlated with the N/S score (R2 = 0.40, P = 0.012). The gradual change in function of the score indicates that the previously performed analyses were not sensitive to the point used to separate Central European from Southern European populations.

Figure 4
figure 4

Correlation between North-South population differentiation axis and number of systemic lupus erythematosus risk alleles. Mean number of SLE risk alleles per sample collection is represented against the North/South score (N/S score) obtained in the same samples. Collection letter codes are as in Table 1.

Frequency clines of the SLE risk alleles

The previous results suggest the possibility of frequency gradients or clines of SLE risk alleles along the North-South axis of population differentiation. We therefore analysed this possibility for each of the SNPs. For this analysis we only used data from controls because they are more representative of the general population. None of the SLE-associated SNPs showed significant linkage disequilibrium with the AIMs included in this study (all pairwise r2 between AIMs and SLE-associated SNPs < 0.05). However, 10 SNPs were significantly different between Central Europeans and Southern Europeans (Table 3). Five of these showed differences in excess of 5%, including the two SNPs in the HLA region, PTPN22 (which is already known to show a cline in Europe [30]), BLK and PXK. Eight of the 10 SNPs were more common in controls from the Central group than from the Southern group in accordance with the direction of change observed with the sum of all risk alleles. The two exceptions were the ITGAM and FCGR2A SNPs. These two SNPs, however, have shown the same bias in effect sizes as the other eight. We also checked how the SNP frequencies in controls fitted a linear regression as a function of the N/S score. Results were similar to the obtained with the Central versus Southern group comparisons, except for SNPs with modest differences and for the SNP in BLK (Table 3).

Table 3 Risk allele frequencies in Central and Southern European controls and relationship with the North/South score


Two main aspects of our results should be highlighted: a significant bias to larger effect sizes of the SLE susceptibility loci in subjects from Southern Europe than those from Central Europe, and a lower frequency of the SLE risk alleles at these loci in subjects from Southern Europe.

The bias to stronger association among Southern Europeans was shown with four types of analysis. The first, comparing the number of SLE loci showing a trend to stronger associations beyond the expected at random, is independent of the loci characteristics. In contrast, comparison of the mean OR reflects the magnitude of the differences in effect size. An advantage of these two analyses is that they were done with meta-analysis approaches to account for sample collection factors. Their limitation is that they convey little information about the nature of the differences. The next analysis, comparison of the number of SLE risk alleles carried by each subject, is more informative but does not account for sample collection effects. The last analysis, comparison of sum GRS, combines information from the two previous analyses and is therefore not independent. Concordance of results from the different analyses is reassuring.

Further confidence was gained from the gradual change of the number of SLE risk alleles per sample collection as a function of the N/S score, implying insensitivity of the results to the specific partition of Europeans we have used. It is also important to note that the studies which identified SLE risk loci were carried out with subjects of a dominant Northern European ancestry (full references in Table S1 in Additional file 1), and therefore the stronger association we have found in Southern Europeans cannot be attributed to ascertainment bias. In other words, any bias due to the discovery of SLE loci in Northern-Central Europeans will favour a stronger effect size in that subpopulation, which is the opposite of our findings. This makes it very unlikely that our results are due to a tighter linkage disequilibrium between causal SNPs and the studied SNPs in the Southern subjects. All of these considerations support the validity of the our findings. One should note that it was an average effect, however, because not all of the SLE risk alleles showed a trend to stronger effect sizes in the Southern samples and, in most of those that showed the trend, differences were small and not significant when assessed individually.

The second aspect of our results is the differential distribution of SLE risk alleles, with a lower frequency in Southern Europeans than in Central Europeans. The difference was observed both in SLE patients and in controls, but was more marked in the latter. It was maximal between the subjects from Greece and Italy, on one side, and those for the Netherlands on the other. All of the remaining groups were in between. Seven of the SNPs showed a frequency cline correlating with the N/S score. Only one of these clines has already been described for the risk allele of PTPN22 [30].

The existence of allele frequency clines in the European population and their main axis of differentiation are well established [2023]. What is surprising about our results is that the number of SLE risk alleles followed a gradient along this axis instead of varying randomly, with some alleles more common towards the North and others towards the South. This observation suggests the possible effect of selective forces acting along the history of the European populations, probably through resistance to infections [31, 32]. These ideas demand new studies aiming to explore the relationship of autoimmunity with infection vulnerability.

A notable facet of the cline of SLE risk alleles in our study was that the difference between Southern Europeans and Central Europeans is less marked in SLE patients than in controls. We propose that this is due to the genetic structure of SLE that makes patients more similar at SLE loci across European subpopulations than the average member of the same subpopulations. This hypothesis is derived from the polygenic model of genetic inheritance [33], which includes the concept of liability threshold: disease appears when the contribution of multiple genetic factors to disease liability overcomes a threshold. Diseased subjects are therefore, on average, more similar for the genes involved in the disease than are the controls. More complex scenarios with involvement of differential environmental factors are also possible, however, and the risk allele of a particular locus could follow a different pattern because of specific factors besides the generic effect proposed here.

The validity of our results will be reinforced by replication in other sample collections. This will be particularly important in relation to the Central European subgroup - this was the smallest in our study, causing lower precision in the OR values and lower power to detect differential effects. Data from additional populations would also be interesting, in particular from the more extreme European subpopulations, because coverage of the full European spectrum would allow detecting additional clines. In addition, the AIMs we used were sufficient for group-level analyses but not for classification of individual subjects, which could have made our analyses more powerful. The studied AIMs were able to show the European population substructure along the North-South differentiation, however, as in the studies where these AIMs were selected [22, 23].

Our study also provides independent replication of SLE loci. Most of these loci were already strongly established and do not require comment (full references in Table S1 in Additional file 1). In contrast, four SNPs with previous solid association were not replicated in our analysis. Probably the most solid of them is rs10156091 in ICA1. This SNP was first associated with SLE in a large genome-wide association scan (GWAS; P = 1.9 × 10-7, OR = 1.32) [3]. This result was confirmed in an even larger replication study, but with a lower effect size (OR = 1.16, P = 6.5 × 10-4) [7]. The allelic frequency of this SNP in controls (10.0%) implies that our study had 97% power to detect the originally described effect with P < 0.05, but only 51% power for an OR like that observed in the replication study.

Also very solid is the record of rs2667978 in LYN. This was discovered in the same GWAS (P = 5.1 × 10-8, OR = 0.81) [3] and confirmed in a large replication study with European descent samples, but more weakly (P = 0.016) [34]. However, this association was not confirmed in the already mentioned large replication study where a different SNP was used, rs7829816 [7]. Our study was powered to detect an effect as that originally described with P < 0.01. The rs4240671 SNP in XKR6 is also backed by strong evidence. It was identified in the same SLE GWAS as ICA1 and LYN with very strong evidence including five SNPs with P < 5.0 × 10-8 [3]. The association was not uniformly observed across sample collections in this study, however, and no replication in any large study has been reported. The only independent replication was obtained for one of the SNPs in a 245-family study in Canadians (rs6985109; P = 0.008) [35]. Our study had enough power to detect the originally reported association (OR = 0.75) with P < 10-7 given its elevated allele frequency (49%). Finally, the weakest previous support was for rs6922466 in PERP. This SNP was associated with SLE (P = 1.0 × 10-4) only in a large study [28].


Our study has uncovered a bias for stronger effect sizes of SLE risk alleles in Southern Europeans than in Central Europeans. This bias was accompanied by a lower frequency of the risk alleles in the Southern European group. Difference in frequencies was more marked in controls than in patients with SLE. These results should be taken into account for genetic studies of SLE and for understanding the genetic structure of the disease and the possible presence of autoimmune disease risk allele clines and their causes. In addition, these results call for exploration of the assumptions and implications of the liability threshold concept - in particular, whether a constant threshold is consistent with GWAS data - as well as for exploration of environmental or other factors that could explain the effect size bias. Our findings therefore contribute to define the genetic epidemiology of SLE and suggest new lines of research for understanding the deep genetic structure of SLE.



ancestry informative marker


confidence interval

G mean :

geometric mean


genetic risk score


genome-wide association scan

N/S score:

North/South score


odds ratio


polymerase chain reaction


systemic lupus erythematosus


single nucleotide polymorphism.


  1. Yang W, Shen N, Ye DQ, Liu Q, Zhang Y, Qian XX, Hirankarn N, Ying D, Pan HF, Mok CC, Chan TM, Wong RW, Lee KW, Mok MY, Wong SN, Leung AM, Li XP, Avihingsanon Y, Wong CM, Lee TL, Ho MH, Lee PP, Chang YK, Li PH, Li RJ, Zhang L, Wong WH, Ng IO, Lau CS, Sham PC, et al: Genome-wide association study in Asian populations identifies variants in ETS1 and WDFY4 associated with systemic lupus erythematosus. PLoS Genet. 2010, 6: e1000841-

    Article  Google Scholar 

  2. Hom G, Graham RR, Modrek B, Taylor KE, Ortmann W, Garnier S, Lee AT, Chung SA, Ferreira RC, Pant PV, Ballinger DG, Kosoy R, Demirci FY, Kamboh MI, Kao AH, Tian C, Gunnarsson I, Bengtsson AA, Rantapaa-Dahlqvist S, Petri M, Manzi S, Seldin MF, Ronnblom L, Syvanen AC, Criswell LA, Gregersen PK, Behrens TW: Association of systemic lupus erythematosus with C8orf13-BLK and ITGAM-ITGAX. N Engl J Med. 2008, 358: 900-909.

    Article  CAS  Google Scholar 

  3. Harley JB, Alarcon-Riquelme ME, Criswell LA, Jacob CO, Kimberly RP, Moser KL, Tsao BP, Vyse TJ, Langefeld CD, Nath SK, Guthridge JM, Cobb BL, Mirel DB, Marion MC, Williams AH, Divers J, Wang W, Frank SG, Namjou B, Gabriel SB, Lee AT, Gregersen PK, Behrens TW, Taylor KE, Fernando M, Zidovetzki R, Gaffney PM, Edberg JC, Rioux JD, Ojwang JO, et al: Genome-wide association scan in women with systemic lupus erythematosus identifies susceptibility variants in ITGAM, PXK, KIAA1542 and other loci. Nat Genet. 2008, 40: 204-210.

    Article  CAS  Google Scholar 

  4. Kozyrev SV, Abelson AK, Wojcik J, Zaghlool A, Linga Reddy MV, Sanchez E, Gunnarsson I, Svenungsson E, Sturfelt G, Jonsen A, Truedsson L, Pons-Estel BA, Witte T, D'Alfonso S, Barizzone N, Danieli MG, Gutierrez C, Suarez A, Junker P, Laustrup H, Gonzalez-Escribano MF, Martin J, Abderrahim H, Alarcon-Riquelme ME: Functional variants in the B-cell gene BANK1 are associated with systemic lupus erythematosus. Nat Genet. 2008, 40: 211-216.

    Article  CAS  Google Scholar 

  5. Han JW, Zheng HF, Cui Y, Sun LD, Ye DQ, Hu Z, Xu JH, Cai ZM, Huang W, Zhao GP, Xie HF, Fang H, Lu QJ, Li XP, Pan YF, Deng DQ, Zeng FQ, Ye ZZ, Zhang XY, Wang QW, Hao F, Ma L, Zuo XB, Zhou FS, Du WH, Cheng YL, Yang JQ, Shen SK, Li J, Sheng YJ, et al: Genome-wide association study in a Chinese Han population identifies nine new susceptibility loci for systemic lupus erythematosus. Nat Genet. 2009, 41: 1234-1237.

    Article  CAS  Google Scholar 

  6. Graham RR, Cotsapas C, Davies L, Hackett R, Lessard CJ, Leon JM, Burtt NP, Guiducci C, Parkin M, Gates C, Plenge RM, Behrens TW, Wither JE, Rioux JD, Fortin PR, Graham DC, Wong AK, Vyse TJ, Daly MJ, Altshuler D, Moser KL, Gaffney PM: Genetic variants near TNFAIP3 on 6q23 are associated with systemic lupus erythematosus. Nat Genet. 2008, 40: 1059-1061.

    Article  CAS  Google Scholar 

  7. Gateva V, Sandling JK, Hom G, Taylor KE, Chung SA, Sun X, Ortmann W, Kosoy R, Ferreira RC, Nordmark G, Gunnarsson I, Svenungsson E, Padyukov L, Sturfelt G, Jonsen A, Bengtsson AA, Rantapaa-Dahlqvist S, Baechler EC, Brown EE, Alarcon GS, Edberg JC, Ramsey-Goldman R, McGwin G, Reveille JD, Vila LM, Kimberly RP, Manzi S, Petri MA, Lee A, Gregersen PK, et al: A large-scale replication study identifies TNIP1, PRDM1, JAZF1, UHRF1BP1 and IL10 as risk loci for systemic lupus erythematosus. Nat Genet. 2009, 41: 1228-1233.

    Article  CAS  Google Scholar 

  8. Yang J, Yang W, Hirankarn N, Ye DQ, Zhang Y, Pan HF, Mok CC, Chan TM, Wong RW, Mok MY, Lee KW, Wong SN, Leung AM, Li XP, Avihingsanon Y, Rianthavorn P, Deekajorndej T, Suphapeetiporn K, Shotelersuk V, Baum L, Kwan P, Lee TL, Ho MH, Lee PP, Wong WH, Zeng S, Zhang J, Wong CM, Ng IO, Garcia-Barcelo MM, et al: ELF1 is associated with systemic lupus erythematosus in Asian populations. Hum Mol Genet. 2011, 20: 601-607.

    Article  CAS  Google Scholar 

  9. Zhao J, Wu H, Khosravi M, Cui H, Qian X, Kelly JA, Kaufman KM, Langefeld CD, Williams AH, Comeau ME, Ziegler JT, Marion MC, Adler A, Glenn SB, Alarcon-Riquelme ME, Pons-Estel BA, Harley JB, Bae SC, Bang SY, Cho SK, Jacob CO, Vyse TJ, Niewold TB, Gaffney PM, Moser KL, Kimberly RP, Edberg JC, Brown EE, Alarcon GS, Petri MA, et al: Association of genetic variants in complement factor H and factor H-related genes with systemic lupus erythematosus susceptibility. PLoS Genet. 2011, 7: e1002079-

    Article  CAS  Google Scholar 

  10. Sanchez E, Comeau ME, Freedman BI, Kelly JA, Kaufman KM, Langefeld CD, Brown EE, Alarcon GS, Kimberly RP, Edberg JC, Ramsey-Goldman R, Petri M, Reveille JD, Vila LM, Merrill JT, Tsao BP, Kamen DL, Gilkeson GS, James JA, Vyse TJ, Gaffney PM, Jacob CO, Niewold TB, Richardson BC, Harley JB, Alarcon-Riquelme ME, Sawalha AH: Identification of novel genetic susceptibility loci in African American lupus patients in a candidate gene association study. Arthritis Rheum. 2011, 63: 3493-3501.

    Article  CAS  Google Scholar 

  11. Lessard CJ, Adrianto I, Kelly JA, Kaufman KM, Grundahl KM, Adler A, Williams AH, Gallant CJ, Anaya JM, Bae SC, Boackle SA, Brown EE, Chang DM, Criswell LA, Edberg JC, Freedman BI, Gregersen PK, Gilkeson GS, Jacob CO, James JA, Kamen DL, Kimberly RP, Martin J, Merrill JT, Niewold TB, Park SY, Petri MA, Pons-Estel BA, Ramsey-Goldman R, Reveille JD, et al: Identification of a systemic lupus erythematosus susceptibility locus at 11p13 between PDHX and CD44 in a multiethnic study. Am J Hum Genet. 2011, 88: 83-91.

    Article  CAS  Google Scholar 

  12. Han S, Kim-Howard X, Deshmukh H, Kamatani Y, Viswanathan P, Guthridge JM, Thomas K, Kaufman KM, Ojwang J, Rojas-Villarraga A, Baca V, Orozco L, Rhodes B, Choi CB, Gregersen PK, Merrill JT, James JA, Gaffney PM, Moser KL, Jacob CO, Kimberly RP, Harley JB, Bae SC, Anaya JM, Alarcon-Riquelme ME, Matsuda K, Vyse TJ, Nath SK: Evaluation of imputation-based association in and around the integrin-alpha-M (ITGAM) gene and replication of robust association between a non-synonymous functional variant within ITGAM and systemic lupus erythematosus (SLE). Hum Mol Genet. 2009, 18: 1171-1180.

    Article  CAS  Google Scholar 

  13. Fu J, Festen EA, Wijmenga C: Multi-ethnic studies in complex traits. Hum Mol Genet. 2011, 20: R206-R213.

    Article  CAS  Google Scholar 

  14. Sanchez E, Webb RD, Rasmussen A, Kelly JA, Riba L, Kaufman KM, Garcia-de la Torre I, Moctezuma JF, Maradiaga-Cecena MA, Cardiel-Rios MH, Acevedo E, Cucho-Venegas M, Garcia MA, Gamron S, Pons-Estel BA, Vasconcelos C, Martin J, Tusie-Luna T, Harley JB, Richardson B, Sawalha AH, Alarcon-Riquelme ME: Genetically determined Amerindian ancestry correlates with increased frequency of risk alleles for systemic lupus erythematosus. Arthritis Rheum. 2010, 62: 3722-3729.

    Article  Google Scholar 

  15. Chung SA, Tian C, Taylor KE, Lee AT, Ortmann WA, Hom G, Graham RR, Nititham J, Kelly JA, Morrisey J, Wu H, Yin H, Alarcon-Riquelme ME, Tsao BP, Harley JB, Gaffney PM, Moser KL, Manzi S, Petri M, Gregersen PK, Langefeld CD, Behrens TW, Seldin MF, Criswell LA: European population substructure is associated with mucocutaneous manifestations and autoantibody production in systemic lupus erythematosus. Arthritis Rheum. 2009, 60: 2448-2456.

    Article  Google Scholar 

  16. Richman IB, Chung SA, Taylor KE, Kosoy R, Tian C, Ortmann WA, Nititham J, Lee AT, Rutman S, Petri M, Manzi S, Behrens TW, Gregersen PK, Seldin MF, Criswell LA: European population substructure correlates with systemic lupus erythematosus endophenotypes in North Americans of European descent. Genes Immun. 2010, 11: 515-521.

    Article  CAS  Google Scholar 

  17. Alonso-Perez E, Suarez-Gestal M, Calaza M, Witte T, Papasteriades C, Marchini M, Migliaresi S, Kovacs A, Ordi-Ros J, Bijl M, Santos MJ, Ruzickova S, Pullmann R, Carreira P, Skopouli FN, D'Alfonso S, Sebastiani GD, Suarez A, Blanco FJ, Gomez-Reino JJ, Gonzalez A: Association of systemic lupus erythematosus clinical features with European population genetic substructure. PLoS One. 2011, 6: e29033-

    Article  CAS  Google Scholar 

  18. Ferreiros-Vidal I, Gomez-Reino JJ, Barros F, Carracedo A, Carreira P, Gonzalez-Escribano F, Liz M, Martin J, Ordi J, Vicario JL, Gonzalez A: Association of PDCD1 with susceptibility to systemic lupus erythematosus: evidence of population-specific effects. Arthritis Rheum. 2004, 50: 2590-2597.

    Article  CAS  Google Scholar 

  19. Ferreiros-Vidal I, D'Alfonso S, Papasteriades C, Skopouli FN, Marchini M, Scorza R, Migliaresi S, Sebastiani GD, Endreffy E, Mavromati M, Kappou-Rigatou I, Ruzickova S, Dostal C, Schmidt RE, Witte T, Gomez-Reino JJ, Gonzalez A: Bias in association studies of systemic lupus erythematosus susceptibility due to geographical variation in the frequency of a programmed cell death 1 polymorphism across Europe. Genes Immun. 2007, 8: 138-146.

    Article  CAS  Google Scholar 

  20. Seldin MF, Shigeta R, Villoslada P, Selmi C, Tuomilehto J, Silva G, Belmont JW, Klareskog L, Gregersen PK: European population substructure: clustering of northern and southern populations. PLoS Genet. 2006, 2: e143-

    Article  Google Scholar 

  21. Tian C, Kosoy R, Nassir R, Lee A, Villoslada P, Klareskog L, Hammarstrom L, Garchon HJ, Pulver AE, Ransom M, Gregersen PK, Seldin MF: European population genetic substructure: further definition of ancestry informative markers for distinguishing among diverse European ethnic groups. Mol Med. 2009, 15: 371-383.

    Article  CAS  Google Scholar 

  22. Tian C, Plenge RM, Ransom M, Lee A, Villoslada P, Selmi C, Klareskog L, Pulver AE, Qi L, Gregersen PK, Seldin MF: Analysis and application of European genetic substructure using 300 K SNP information. PLoS Genet. 2008, 4: e4-

    Article  Google Scholar 

  23. Han J, Kraft P, Nan H, Guo Q, Chen C, Qureshi A, Hankinson SE, Hu FB, Duffy DL, Zhao ZZ, Martin NG, Montgomery GW, Hayward NK, Thomas G, Hoover RN, Chanock S, Hunter DJ: A genome-wide association study identifies novel alleles associated with hair color and skin pigmentation. PLoS Genet. 2008, 4: e1000074-

    Article  Google Scholar 

  24. Suarez-Gestal M, Calaza M, Endreffy E, Pullmann R, Ordi-Ros J, Domenico Sebastiani G, Ruzickova S, Jose Santos M, Papasteriades C, Marchini M, Skopouli FN, Suarez A, Blanco FJ, D'Alfonso S, Bijl M, Carreira P, Witte T, Migliaresi S, Gomez-Reino JJ, Gonzalez A: Replication of recently identified systemic lupus erythematosus genetic associations: a case-control study. Arthritis Res Ther. 2009, 11: R69-

    Article  Google Scholar 

  25. Ferreiro-Neira I, Calaza M, Alonso-Perez E, Marchini M, Scorza R, Sebastiani GD, Blanco FJ, Rego I, Pullmann R, Pullmann R, Kallenberg CG, Bijl M, Skopouli FN, Mavromati M, Migliaresi S, Barizzone N, Ruzickova S, Dostal C, Schmidt RE, Witte T, Papasteriades C, Kappou-Rigatou I, Endreffy E, Kovacs A, Ordi-Ros J, Balada E, Carreira P, Gomez-Reino JJ, Gonzalez A: Opposed independent effects and epistasis in the complex association of IRF5 to SLE. Genes Immun. 2007, 8: 429-438.

    Article  CAS  Google Scholar 

  26. Hochberg MC: Updating the American College of Rheumatology revised criteria for the classification of systemic lupus erythematosus. Arthritis Rheum. 1997, 40: 1725-

    Article  CAS  Google Scholar 

  27. Cunninghame Graham DS, Graham RR, Manku H, Wong AK, Whittaker JC, Gaffney PM, Moser KL, Rioux JD, Altshuler D, Behrens TW, Vyse TJ: Polymorphism at the TNF superfamily gene TNFSF4 confers susceptibility to systemic lupus erythematosus. Nat Genet. 2008, 40: 83-89.

    Article  CAS  Google Scholar 

  28. Musone SL, Taylor KE, Lu TT, Nititham J, Ferreira RC, Ortmann W, Shifrin N, Petri MA, Kamboh MI, Manzi S, Seldin MF, Gregersen PK, Behrens TW, Ma A, Kwok PY, Criswell LA: Multiple polymorphisms in the TNFAIP3 region are independently associated with systemic lupus erythematosus. Nat Genet. 2008, 40: 1062-1064.

    Article  CAS  Google Scholar 

  29. Higgins JP, Thompson SG, Deeks JJ, Altman DG: Measuring inconsistency in meta-analyses. BMJ. 2003, 327: 557-560.

    Article  Google Scholar 

  30. Gregersen PK, Lee HS, Batliwalla F, Begovich AB: PTPN22: setting thresholds for autoimmunity. Semin Immunol. 2006, 18: 214-223.

    Article  CAS  Google Scholar 

  31. Fumagalli M, Sironi M, Pozzoli U, Ferrer-Admettla A, Pattini L, Nielsen R: Signatures of environmental genetic adaptation pinpoint pathogens as the main selective pressure through human evolution. PLoS Genet. 2011, 7: e1002355-

    Article  CAS  Google Scholar 

  32. Barreiro LB, Quintana-Murci L: From evolutionary genetics to human immunology: how selection shapes host defence genes. Nat Rev Genet. 2010, 11: 17-30.

    Article  CAS  Google Scholar 

  33. Cummings M: Human Heredity: Principles and Issues. 2010, Belmot: Brooks/Cole

    Google Scholar 

  34. Lu R, Vidal GS, Kelly JA, Delgado-Vega AM, Howard XK, Macwana SR, Dominguez N, Klein W, Burrell C, Harley IT, Kaufman KM, Bruner GR, Moser KL, Gaffney PM, Gilkeson GS, Wakeland EK, Li QZ, Langefeld CD, Marion MC, Divers J, Alarcon GS, Brown EE, Kimberly RP, Edberg JC, Ramsey-Goldman R, Reveille JD, McGwin G, Vila LM, Petri MA, Bae SC, et al: Genetic associations of LYN with systemic lupus erythematosus. Genes Immun. 2009, 10: 397-403.

    Article  CAS  Google Scholar 

  35. Budarf ML, Goyette P, Boucher G, Lian J, Graham RR, Claudio JO, Hudson T, Gladman D, Clarke AE, Pope JE, Peschken C, Smith CD, Hanly J, Rich E, Boire G, Barr SG, Zummer M, Fortin PR, Wither J, Rioux JD: A targeted association study in systemic lupus erythematosus identifies multiple susceptibility alleles. Genes Immun. 2011, 12: 51-58.

    Article  CAS  Google Scholar 

Download references


EA-P is the recipient of a predoctoral bursary from the Instituto de Salud Carlos III (Spain). This work was supported by Fondo de Investigacion Sanitaria of the Instituto de Salud Carlos III (Spain) grant 08/0744 and by RETICS Program RD08/0075 that are partially financed by the European Regional Development Fund of the European Union, and by grants from the Xunta de Galicia and BMBF KN Rheuma (C2.12 to TW).

Other contributors to the European Consortium of SLE DNA Collections: Myriam Liz, Laboratorio de Investigacion 10 and Rheumatology Unit, Instituto de Investigacion Sanitaria - Hospital Clinico Universitario de Santiago, Santiago de Compostela, Spain; Reinhold E Schmidt, Division of Clinical Immunology, Department of Internal Medicine of the Hannover Medical School, Hannover, Germany; Iris Kappou-Rigatou, Department of Histocompatibility and Immunology, Evangelismos Hospital, Athens, Greece; Raffaella Scorza, Clinical Immunology, University of Milan and Fondazione IRCCS Ospedale Maggiore Policlinico, Mangiagalli e Regina Elena, Milan, Italy; Emöke Endereffy, Department of Paediatrics, Albert Szent-Györgyi Clinical Centre, University of Szeged, Szeged, Hungary; Eva Balada, Internal Medicine, Research Laboratory in Autoimmune Diseases Hospital Vall d'Hebron, Barcelona, Spain; Cees G Kallenberg, Department of Rheumatology and Clinical Immunology, University Medical Center Groningen, Groningen, the Netherlands; Filipe Vinagre, Rheumatology Department, Hospital Garcia de Orta, Almada (Portugal) and Rheumatology Research Unit, Instituto Medicina Molecular, Facultade de Medicina da Universidade de Lisboa, Portugal; Ctibor Dostal, Molecular Biology and Immunogenetics Department, Institute of Rheumatology. Prague, Czech Republic; Rudolf Pullmann Jr, Gerontology Research Center, National Institute on Aging, Nathan Shock Drive, Baltimore, Maryland, USA; Maria Mavromati, Pathophysiology Department, Athens University Medical School, Athens, Greece; Sandra D'Alfonso, Department of Medical Sciences and IRCAD, Eastern Piedmont University, Novara, Italy; Carmen Gutierrez, Department of Functional Biology, Hospital Universitario Central de Asturias, Universidad de Oviedo, Oviedo, Spain; and Ignacio Rego, Laboratorio de Investigación Osteoarticular y del Envejecimiento, Servicio de Reumatología, CH Universitario A Coruña, A Coruña, Spain.

Author information

Authors and Affiliations



Corresponding author

Correspondence to Antonio Gonzalez.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

EA-P and MS-G participated in the design of the study, genotyped the samples, and participated in the interpretation of the results and writing the manuscript. MC participated in the statistical analysis and in the interpretation of results. GDS, RP, CP, AK, FNS, MB, AS, MM, SM, PC, JO-R, TW, SR, MJS, NB, FJB, BRL and JJG-R participated in the acquisition of clinical data and collection of samples and in the analysis and interpretation of results. AG participated in the design of the study and in coordination of the acquisition of clinical data and collection of samples, and supervised laboratory work and interpretation and writing of the manuscript. All authors read and approved the final manuscript.

Electronic supplementary material


Additional file 1: Table S1 presenting a detailed description of all the SNPs included in the study, and Figures S1 and S2 with random effect meta-analysis results corresponding to the same data as Figure 1 in the manuscript. (DOC 143 KB)

Authors’ original submitted files for images

Rights and permissions

Reprints and permissions

About this article

Cite this article

Alonso-Perez, E., Suarez-Gestal, M., Calaza, M. et al. Bias in effect size of systemic lupus erythematosus susceptibility loci across Europe: a case-control study. Arthritis Res Ther 14, R94 (2012).

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: