rs582757

TNFAIP3 TNFAIP3 A20 regulatory variant

Strong Risk Factor

TNFAIP3 rs582757 — A Common NF-kB Brake Variant Linked to Psoriasis

The TNFAIP3 gene on chromosome 6q23 encodes A20, the immune system's primary brake on NF-κB inflammatory signaling11 A20, the immune system's primary brake on NF-κB inflammatory signaling
A20 is a dual-function ubiquitin-editing enzyme — it removes activating K63-linked ubiquitin chains from signaling proteins (deubiquitinase activity) and attaches inhibitory K48-linked chains (E3 ligase activity), together terminating NF-κB activation after an immune response has served its purpose
. Multiple independent genetic signals cluster within and around this gene, each perturbing A20-mediated NF-κB suppression through a different mechanism. rs582757 is an intronic variant in intron 5 of TNFAIP3 that has been independently identified as the strongest signal at this locus in psoriasis GWAS — distinct from the upstream intergenic variants (rs6920220, rs10499194/rs13207033) that dominate the RA and SLE literature. Its C allele is the psoriasis risk allele at a population frequency of approximately 28% in European populations, making heterozygous CT carriers a common and clinically relevant group.

The Mechanism

rs582757 resides in intron 5 of TNFAIP3, placing it within the gene body in a regulatory context distinct from the intergenic signals upstream. Like other intronic GWAS hits, it likely acts as a proxy for one or more causal variants within the same linkage disequilibrium block22 linkage disequilibrium block
A chromosomal region where alleles tend to be inherited together; rs582757 tags a haplotype in LD within the TNFAIP3 gene body whose causal functional element has not been fully resolved
. Possible functional mechanisms include effects on intronic regulatory elements, enhancer sequences, or pre-mRNA splicing efficiency — any of which could reduce the amount or activity of A20 produced in immune-relevant cell types such as epidermal keratinocytes and dermal dendritic cells.

The downstream consequence is impaired A20-mediated termination of NF-κB signaling in skin. A20 expression is decreased in psoriatic lesional skin compared to non-lesional areas33 expression is decreased in psoriatic lesional skin compared to non-lesional areas
Sohn et al. 2016 — A20 significantly inhibited poly(I:C)-induced cytokine production in keratinocytes, and its downregulation in psoriatic skin heightens keratinocyte sensitivity to external triggers that drive plaque formation
, and this genetic variant's association with psoriasis is consistent with A20 acting as a gatekeeper of keratinocyte NF-κB activation. When A20 activity is reduced, inflammatory cytokines including TNF-α, IL-17, and IL-23 drive the hyperproliferative keratinocyte response that defines psoriatic plaques.

The psoriasis risk haplotype tagged by rs582757-C is genetically distinct from the haplotypes driving RA and SLE risk44 genetically distinct from the haplotypes driving RA and SLE risk
The Nititham 2015 meta-analysis explicitly demonstrated that the psoriasis risk haplotype at the TNFAIP3 locus does not overlap with haplotypes reported for other autoimmune diseases, suggesting disease-specific regulatory architecture within the same chromosomal region
. This genetic specificity parallels the clinical biology: A20's anti-inflammatory role in skin keratinocytes is a tissue-specific function that differs mechanistically from its role in B-cell tolerance and synovial fibroblast biology.

The Evidence

The strongest evidence for rs582757 comes from a meta-analysis of five European-ancestry psoriasis cohorts55 meta-analysis of five European-ancestry psoriasis cohorts
4,704 psoriasis cases and 7,805 controls; Nititham et al. Genes Immun 2015 — five independent replication datasets all showed consistent direction of effect with the C allele
totaling 4,704 cases and 7,805 controls. rs582757 emerged as the top TNFAIP3-region signal for psoriasis (P=6.07×10⁻¹², OR=1.23). Conditional analysis on rs582757 revealed a secondary independent signal at rs6918329 (OR=1.15, P=7.22×10⁻⁵), indicating at least two independent genetic effects within the TNFAIP3 region for psoriasis — a pattern consistent with the multi-signal architecture seen for RA at this locus.

For rheumatoid arthritis, fine-mapping of the 6q23 locus in 1,651 Spanish RA patients and 1,619 controls66 fine-mapping of the 6q23 locus in 1,651 Spanish RA patients and 1,619 controls
Dieguez-Gonzalez et al. Arthritis Res Ther 2009 — weak evidence for rs582757 association as part of the TNFAIP3 haplotype structure; effect direction and precise magnitude not fully reported in the abstract
found rs582757 in the context of a broader TNFAIP3 haplotype associated with RA susceptibility, though the evidence for this variant's independent RA effect is weaker than for psoriasis.

In a Chinese Han population study of rheumatic heart disease77 rheumatic heart disease
RHD is an acquired autoimmune condition where streptococcal infection triggers immune responses that damage heart valves, mechanistically involving NF-κB-driven inflammation similar to psoriasis and RA
(239 cases, 478 controls), rs582757 showed a striking allele-frequency difference: the C allele was protective with OR 0.57 per allele (95% CI 0.42–0.78, P=0.0004). This means TT homozygotes were at substantially higher RHD risk in this population, illustrating that the risk directionality of rs582757 is disease-specific and population-dependent.

The mechanistic evidence connecting TNFAIP3 to psoriasis pathology is reinforced by expression data: A20 is downregulated in psoriatic lesional skin88 downregulated in psoriatic lesional skin
Sohn et al. 2016 — A20 mRNA and protein reduced in plaques vs peri-lesional skin; restoring A20 inhibited poly(I:C)-induced keratinocyte cytokine production
and its overexpression in keratinocytes attenuates the cytokine production that drives plaque formation, placing TNFAIP3 function directly in the disease mechanism, not merely associated with it statistically.

The actionable intervention evidence comes from the VITAL randomized controlled trial99 VITAL randomized controlled trial
5-year RCT, 25,871 adults, vitamin D3 2,000 IU/day or omega-3 1 g/day vs placebo; Costenbader et al. BMJ 2022 — incident autoimmune disease as primary endpoint, confirmed by medical record review
which found vitamin D3 supplementation reduced new autoimmune disease by 22% (HR 0.78, P=0.05) and omega-3 by 15%, both acting on the NF-κB pathway that TNFAIP3/A20 regulates.

Practical Actions

The C allele at rs582757 primarily increases psoriasis susceptibility. For CT and CC carriers, the practical implications center on two areas: skin-directed NF-κB modulation and monitoring for early psoriasis onset. Anti-TNF biologics (adalimumab, etanercept, infliximab) are standard second-line treatments for moderate-to-severe psoriasis and directly target the TNF-α cytokine that A20 normally controls — making TNFAIP3 genotype data relevant to treatment response assessment. Vitamin D, which suppresses NF-κB through nuclear VDR signaling, has both an VITAL-supported systemic autoimmune benefit and topical efficacy as a psoriasis treatment.

Interactions

rs582757 operates within a complex multi-signal TNFAIP3 locus. The psoriasis-specific haplotype it tags is structurally distinct from the RA/SLE-associated haplotypes defined by rs6920220, rs10499194/rs13207033, and rs5029937. A secondary independent psoriasis signal at rs6918329 was identified in the same meta-analysis, suggesting at least two independent regulatory effects within the TNFAIP3 region for psoriasis specifically.

The TNFAIP3 missense variant rs2230926 (F127C) impairs A20 enzymatic activity through a completely different mechanism — reducing the catalytic function of the A20 protein rather than altering its expression or splicing. Carriers of both rs582757 C allele (potentially reduced A20 expression) and rs2230926 G allele (reduced A20 enzymatic function) may face compounded impairment of A20-mediated NF-κB suppression.

TNIP1 (TNFAIP3-interacting protein 1) variants interact functionally with TNFAIP3 — TNIP1 stabilizes A20 protein and directs its substrate specificity. GWAS studies of psoriasis and other autoimmune diseases consistently find both TNFAIP3 and TNIP1 locus signals, suggesting a functional gene-gene interaction worth capturing in compound action analysis.

The Cleavage Trap — How a Single Amino Acid Change Dismantles Your Clotting Glue

Von Willebrand factor is the body's primary molecular glue at sites of vascular injury. Synthesized by endothelial cells and platelets, VWF11 VWF
von Willebrand factor — a large glycoprotein that bridges damaged blood vessel walls and circulating platelets to form an initial platelet plug
normally circulates as giant multimeric strings. The largest of these high-molecular-weight (HMW) multimers are the most adhesive: they are best at capturing platelets under the shear stress of fast-flowing blood. rs61750584 (G allele) introduces a single amino acid change — isoleucine to threonine at position 1628 (I1628T) — that makes the VWF protein physically fragile at exactly the point where it needs to be strong. The result is von Willebrand disease type 2A, the most common type 2 bleeding disorder, characterized by loss of HMW multimers and a lifelong tendency toward mucocutaneous bleeding.

The Mechanism

The VWF protein contains a central A2 domain22 A2 domain
a compact globular domain flanked by disulfide bonds that conceals the ADAMTS13 cleavage site under physiological conditions
. Normally, this domain unfolds only when VWF experiences tensile forces at sites of vascular injury — exposing the Tyr1605–Met1606 bond to proteolysis by ADAMTS1333 ADAMTS13
a metalloprotease that trims excessively large VWF multimers and prevents pathological platelet clumping
. The I1628T substitution sits in a hydrophobic core of the A2 domain. The threonine residue is more polar and bulkier than the native isoleucine, disrupting the network of contacts that hold the domain folded.

Molecular dynamics simulations44 Molecular dynamics simulations
computational models applying tensile force to the protein chain and measuring domain stability
show that I1628T lowers the tensile force needed to separate the terminal helix α6 from the A2 domain body — the first step in unfolding. Experimental studies confirm that this destabilization translates directly to enhanced ADAMTS13 cleavage: Hassenpflug et al.55 Hassenpflug et al.
Impact of mutations in the von Willebrand factor A2 domain on ADAMTS13-dependent proteolysis. Blood 2006;107:2339-45
showed that I1628T increases susceptibility to ADAMTS13 proteolysis under non-denaturing (physiological) conditions, with the in vitro proteolytic pattern closely paralleling the multimer defect seen in affected patients. The consequence is that HMW VWF multimers are cleaved away continuously in the circulation, leaving only small, less-adhesive forms.

The Evidence

ClinVar classifies this variant as Pathogenic for von Willebrand disease type 2A with a 4-star expert panel review (ClinGen VWD Variant Curation Expert Panel, FDA-recognized, last evaluated August 2024). The evidence package meeting this classification includes: at least 16 documented patients with VWD type 2A carrying this variant, segregation with disease across three generations in the original kindred, laboratory confirmation of very low VWF ristocetin cofactor activity and absent HMW multimers in affected carriers, and consistent computational predictions (REVEL score 0.703).

The variant is absent from gnomAD across all major ancestry groups (fewer than 2 alleles observed in 1.4 million chromosomes), consistent with a highly penetrant pathogenic variant under purifying selection pressure. Unlike the common VWD type 1 variants, which typically reduce VWF quantity without structural abnormality, the I1628T mutation causes a qualitative defect: the protein is produced normally but lacks functional HMW multimers.

A landmark functional study by Hassenpflug et al. (2006)66 A landmark functional study by Hassenpflug et al. (2006)
examining 13 A2 domain mutations and their ADAMTS13 susceptibility
established that nearly all type 2A mutations in the A2 region share this property of enhanced proteolysis, and that the degree of proteolysis in the laboratory predicts the clinical bleeding phenotype. A subsequent structural dynamics study by Interlandi et al. (2012)77 structural dynamics study by Interlandi et al. (2012)
PLoS One simulation study of I1628T, L1657I and E1638K
confirmed the molecular basis: the threonine substitution reduces the energy barrier for A2 unfolding and is sufficient to render the normally cryptic cleavage site constitutively accessible.

Practical Actions

Carriers of the G allele have pathogenic type 2A VWD inherited in an autosomal dominant pattern — one copy is sufficient to cause disease. Clinical management focuses on three areas: diagnosing the subtype correctly, preparing for bleeding challenges (surgery, trauma, dental procedures), and monitoring for heavy menstrual bleeding.

Laboratory testing should include VWF antigen level, VWF ristocetin cofactor activity (VWF:RCo), VWF:RCo/VWF:Ag ratio (typically less than 0.6 in type 2A), and multimer analysis to confirm the HMW multimer defect. Because type 2A is caused by structural abnormality rather than quantity deficiency, desmopressin (DDAVP)88 desmopressin (DDAVP)
which releases VWF from endothelial storage sites but cannot correct the structural defect
is generally ineffective or only transiently helpful in type 2A disease. VWF concentrate replacement therapy (plasma-derived or recombinant VWF) is the preferred treatment during bleeding episodes and peri-operative cover.

Interactions

The mucocutaneous bleeding phenotype of VWD type 2A can be compounded by concurrent use of platelet-function inhibitors (aspirin, NSAIDs, clopidogrel), which impair the platelet side of haemostasis that VWF already bridges less effectively. Any prescription involving antiplatelet or anticoagulant drugs requires explicit discussion with a haematologist aware of the VWD diagnosis.

Inherited thrombocytopenia variants (affecting platelet number) could theoretically worsen the bleeding phenotype additively, though published compound heterozygosity data for this specific combination are limited. Factor VIII levels should be checked at baseline, as VWF normally stabilizes FVIII in the circulation; in type 2A, FVIII may be modestly reduced.

EDNRA Upstream Variant — Reduced Endothelin Receptor Expression and Intracranial Aneurysm Risk

The endothelin system is one of the most potent regulators of vascular tone in the human body. Endothelin-1 (ET-1) acts through two receptor subtypes — EDNRA (type A) and EDNRB (type B) — to orchestrate vasoconstriction, vascular smooth muscle proliferation, and arterial wall remodeling. EDNRA is the dominant receptor in vascular smooth muscle and mediates the sustained vasoconstrictive response that maintains cerebrovascular tone11 mediates the sustained vasoconstrictive response that maintains cerebrovascular tone
EDNRA activates Gq-protein signaling, driving IP3-mediated calcium release and PKC activation in smooth muscle cells, producing contraction that can last minutes to hours
. rs6841581 sits approximately 900 bp upstream of the EDNRA coding sequence in a region that controls how much receptor protein the cell produces — and one allele makes substantially less of it.

The Mechanism

rs6841581 is a regulatory variant, not a protein-coding change. It lies in the promoter or 5' regulatory region of EDNRA on chromosome 4q31.22 and alters the binding affinity of nuclear transcription factors. Low et al. (2012, Hum Mol Genet) demonstrated that the two alleles of rs6841581 have measurably different affinities for a nuclear protein, and that the susceptible allele drives significantly lower transcriptional activity in luciferase reporter assays22 Low et al. (2012, Hum Mol Genet) demonstrated that the two alleles of rs6841581 have measurably different affinities for a nuclear protein, and that the susceptible allele drives significantly lower transcriptional activity in luciferase reporter assays
Low SK et al. "Genome-wide association study for intracranial aneurysm in the Japanese population identifies three candidate susceptible loci and a functional genetic variant at EDNRA." Human Molecular Genetics, 2012
. The result is that carriers of the A allele produce less EDNRA protein per cell.

With fewer functional EDNRA receptors on vascular smooth muscle, the vasoconstrictive response to endothelin-1 is blunted. In cerebral arteries, this impairs the normal maintenance of wall tension and hemodynamic resistance. A heterozygous EDNRA deletion rat model confirmed that reduced EDNRA function is sufficient to trigger intracranial aneurysm formation when combined with hypertensive stress33 A heterozygous EDNRA deletion rat model confirmed that reduced EDNRA function is sufficient to trigger intracranial aneurysm formation when combined with hypertensive stress
Lampmann et al. 2022 (Brain Sci); EDNRA mutant rats showed more extensive aneurysm formation than wild-type controls under equivalent hypertensive conditions
. This animal model provides direct causal evidence that the GWAS signal at rs6841581 reflects true EDNRA dysfunction rather than an indirect association.

The Evidence

The original discovery GWAS by Yasuno et al. (2011, PNAS), combining Japanese discovery cohorts with European replication, linked rs6841581 to intracranial aneurysm risk across 5,891 cases and 14,181 controls44 The original discovery GWAS by Yasuno et al. (2011, PNAS), combining Japanese discovery cohorts with European replication, linked rs6841581 to intracranial aneurysm risk across 5,891 cases and 14,181 controls
Yasuno K et al. "Common variant near the endothelin receptor type A (EDNRA) gene is associated with intracranial aneurysm risk." Proc Natl Acad Sci USA, 2011
. The association reached genome-wide significance (OR 1.22, P = 2.2 × 10⁻⁸) and held in both East Asian and European populations, making it one of the first trans-ethnic GWAS findings in cerebrovascular genetics.

A meta-analysis of more than 116,000 individuals across 61 studies confirmed rs6841581 as among the most robust genetic risk factors for sporadic intracranial aneurysm (OR 1.22, 95% CI 1.14–1.31)55 A meta-analysis of more than 116,000 individuals across 61 studies confirmed rs6841581 as among the most robust genetic risk factors for sporadic intracranial aneurysm (OR 1.22, 95% CI 1.14–1.31)
Alg VS et al. Neurology, 2013
. The consistency of effect across diverse populations and study designs places this association in the strong evidence tier for genetic risk research.

An updated East Asian meta-analysis encompassing 20,609 individuals across 6 populations (Hong et al. 2019, World Neurosurg) refined the estimate to OR 1.244 (95% CI 1.174–1.318, P = 1.36 × 10⁻¹³)66 An updated East Asian meta-analysis encompassing 20,609 individuals across 6 populations (Hong et al. 2019, World Neurosurg) refined the estimate to OR 1.244 (95% CI 1.174–1.318, P = 1.36 × 10⁻¹³)
Hong EP et al. "Association of Endothelin Receptor Type A with Intracranial Aneurysm in 20,609 East Asians."
. The A allele frequency is nearly twice as high in East Asian populations (~28%) compared to Europeans (~14%), meaning the absolute genetic attributable risk is greater in East Asian individuals.

Genome-wide pairwise interaction analyses in a Korean cohort identified 11 SNPs showing genome-wide significant interactions with rs6841581, including variants in RYK (a Wnt co-receptor implicated in vascular development) and TNIK (a MAP3K involved in cytoskeletal regulation)77 Genome-wide pairwise interaction analyses in a Korean cohort identified 11 SNPs showing genome-wide significant interactions with rs6841581, including variants in RYK (a Wnt co-receptor implicated in vascular development) and TNIK (a MAP3K involved in cytoskeletal regulation)
Hong EP et al. J Korean Neurosurg Soc, 2023
. This interaction architecture suggests that rs6841581's effect on IA risk is amplified by co-occurring variants in vascular developmental pathways.

Practical Actions

Carriers of one or two A alleles at rs6841581 face an incrementally elevated lifetime risk of intracranial aneurysm (unruptured or ruptured as subarachnoid hemorrhage). The per-allele OR of ~1.22–1.24 is modest in absolute terms — population prevalence of unruptured intracranial aneurysms is roughly 2–5% — but the risk is amplifiable by modifiable factors including uncontrolled hypertension and smoking, both of which are potent independent risk factors for aneurysm formation and rupture.

Blood pressure management is the most evidence-supported modifiable lever. The EDNRA pathway mediates ET-1-driven vasoconstriction; reduced EDNRA signaling already impairs vascular wall tension maintenance, and superimposed hypertension exacerbates the hemodynamic stress that triggers aneurysm development. Smoking independently upregulates ET-1 production and promotes vascular inflammation, creating a compounding pro-aneurysm environment for A allele carriers.

MRI angiography (MRA) without contrast can detect unruptured intracranial aneurysms ≥3 mm with high sensitivity and involves no radiation. For A allele carriers with additional risk factors (family history of IA or subarachnoid hemorrhage, hypertension, or smoking history), discussing screening MRA with a neurologist or vascular neurosurgeon is a proportionate response to the genetic finding.

Interactions

rs5335, a 3'UTR variant in the same EDNRA gene, also alters EDNRA expression and is associated with blood pressure variation. Carrying risk alleles at both rs6841581 and rs5335 would be expected to compound the EDNRA downregulation phenotype. The two variants are located at opposite ends of the EDNRA gene (rs6841581 upstream regulatory vs. rs5335 3'UTR) and likely act through independent regulatory mechanisms.

rs6842241, a nearby intergenic variant in the same GWAS locus (Low et al. 2012, OR 1.25), may tag the same functional haplotype as rs6841581 or represent a second independent signal at the EDNRA locus. Both are captured when sequencing this chromosomal region.

rs767603 (LOC105378189) is another intracranial aneurysm susceptibility locus in the heart_inflammation category — the two variants act through independent mechanisms (EDNRA downregulation vs. putative non-coding RNA regulation of vascular wall integrity genes). Both A allele carriers at rs6841581 and T allele carriers at rs767603 benefit from the same blood pressure and smoking avoidance actions.

SERPINA1 and ANCA Vasculitis — A Protease Balance Variant

Alpha-1 antitrypsin (AAT), encoded by SERPINA1, is the body's most abundant serine protease inhibitor and serves as the primary natural inhibitor of Proteinase 3 (PR3)11 primary natural inhibitor of Proteinase 3 (PR3)
PR3 is a neutrophil-derived serine protease that is the main antigenic target of PR3-ANCA antibodies in granulomatosis with polyangiitis (GPA)
. When AAT function is compromised, unbound PR3 accumulates on neutrophil surfaces and in the circulation, driving autoantibody formation and vascular inflammation. The rs7151526 variant lies approximately 6.7 kilobases downstream of the SERPINA1 coding sequence, in a non-coding regulatory region near the gene's 3' flank. Though not itself a coding change, it has been identified as a risk factor for ANCA-associated vasculitis (AAV)22 ANCA-associated vasculitis (AAV)
AAV encompasses granulomatosis with polyangiitis (GPA), microscopic polyangiitis (MPA), and eosinophilic GPA — a group of rare but serious autoimmune conditions affecting small blood vessels
, with the A allele significantly overrepresented among GPA patients.

The Mechanism

SERPINA1 encodes AAT, whose primary job is to neutralize PR3 released by activated neutrophils during inflammatory episodes. When PR3 is left uninhibited — whether due to reduced AAT levels or impaired AAT function — it can trigger an autoimmune cascade: unbound PR3 on neutrophil surfaces becomes an exposed antigen, stimulating production of anti-PR3 ANCA antibodies. These antibodies then activate more neutrophils in a self-amplifying loop, causing necrotizing inflammation of small vessels — the hallmark of AAV.

The rs7151526 variant is located in a non-coding region (primarily affecting lncRNA transcripts) with "MODIFIER" impact by VEP annotation. Its exact molecular mechanism of action is not fully characterized; however, regulatory variants in the 3' flanking region of genes can affect mRNA stability, expression level, or splicing of nearby transcripts. Research shows that even in patients with normal AAT protein concentrations, AAT functional activity is dramatically lower in Wegener's granulomatosis33 normal AAT protein concentrations, AAT functional activity is dramatically lower in Wegener's granulomatosis
Mota et al. found TIC and specific AAT activity significantly decreased despite equivalent protein levels (p=0.001)
, suggesting that subtle changes in AAT expression or function — potentially mediated by non-coding regulatory variants like rs7151526 — can tip the protease-antiprotease balance toward disease.

The Evidence

A meta-analysis of 18 eligible studies44 meta-analysis of 18 eligible studies
Banerjee et al., International Journal of Rheumatic Diseases, 2022
confirmed rs7151526-A as a significant predisposing allele for GPA (Meta-OR = 2.70, 95% CI 1.51–4.85, p = 0.0008). This finding was consistent across studies and confirms that carrying the A allele approximately doubles to triples the odds of developing GPA compared to CC homozygotes.

The clinical stakes are considerable. A 2024 prospective cohort study of 115 Brazilian AAV patients55 2024 prospective cohort study of 115 Brazilian AAV patients
Giardini et al., Clinics (São Paulo), 2024
— predominantly GPA (65.2%) — found SERPINA1 polymorphisms to be "the most significant factor linked to mortality" in multivariate analysis (HR = 6.2, 95% CI 1.4–27.1, p = 0.015). Carriers of rs7151526 had a mean survival of 57.4 years, compared to 68.0 years in non-carriers — a reduction of more than a decade. A French retrospective cohort of 142 AAV patients66 French retrospective cohort of 142 AAV patients
Deshayes et al., Journal of Rheumatology, 2019
found that patients with deficient AAT alleles had significantly higher rates of intraalveolar hemorrhage (p < 0.01), a severe and potentially fatal pulmonary complication.

Population frequencies confirm this is a predominantly European variant: the A allele occurs at ~4.8% in Europeans, versus ~0.1% in East Asians and ~0.9% in Africans. The overall global minor allele frequency is approximately 3.4% (gnomAD v4). Given the additive genetic architecture and the rarity of AAV itself (~3 per 100,000 per year), the absolute risk increase is modest, but the prognostic impact once disease develops appears substantial.

Practical Actions

Carrying the A allele does not mean you will develop AAV — the condition remains rare even in genetically predisposed individuals, and environmental triggers (infections, silica exposure, drugs) are required. However, awareness of this genetic signal supports a lower threshold for clinical evaluation when relevant symptoms appear. ANCA-associated vasculitis characteristically presents with upper respiratory tract symptoms77 upper respiratory tract symptoms
ENT symptoms including chronic sinusitis, nosebleeds, saddle-nose deformity in GPA; subglottic stenosis
, lower respiratory disease (cough, hemoptysis), kidney involvement (hematuria, proteinuria, rising creatinine), and systemic features (fatigue, weight loss, fever). Prompt ANCA testing and rheumatology referral at symptom onset significantly improves outcomes.

For those who carry this variant, early disease recognition is the most actionable step. ANCA-associated vasculitis treated in early disease stages — before irreversible organ damage — has substantially better outcomes than disease diagnosed late with severe renal or pulmonary involvement.

Interactions

rs7151526 is distinct from the classical SERPINA1 deficiency alleles (rs28929474, the Z allele E342K, and rs17580, the S allele E264V). The Z allele (rs28929474) is strongly pathogenic for classic AAT deficiency lung and liver disease, and is independently associated with even stronger GPA risk (Meta-OR = 12.60 for the Z allele versus 2.70 for rs7151526 in Banerjee 2022). The related SNP rs28929454 (also studied in the Giardini 2024 mortality cohort) showed even more pronounced survival reduction. Individuals carrying rs7151526-A alongside classical deficiency alleles would likely face compounded risk, though direct interaction data is limited.

The mechanism connects rs7151526 to the broader ANCA vasculitis genetic architecture, which includes variants in CTLA4 (rs231775) and other immune regulatory genes identified in the same meta-analysis. PR3-ANCA positive AAV has distinct genetic risks from MPO-ANCA positive disease, and rs7151526's association appears primarily driven by GPA (PR3-ANCA) cases.

UGT1A1 Q331R — A Rare Cause of Crigler-Najjar Syndrome Type II

Every day your liver handles a continuous stream of spent red blood cells, converting their released haemoglobin into bilirubin and then rendering that bilirubin water-soluble so it can be excreted in bile. The enzyme that carries out this final conjugation step is UGT1A111 UGT1A1
UDP-glucuronosyltransferase 1A1, a phase II detoxification enzyme that attaches glucuronic acid to unconjugated bilirubin, making it soluble and excretable
, encoded by the UGT1A1 gene on chromosome 2q37. When this enzyme is severely reduced, unconjugated bilirubin — fat-soluble and capable of crossing the blood-brain barrier — accumulates in blood and tissues, causing jaundice and, at extreme levels, neurological damage.

The rs72551348 variant (c.992A>G) causes a glutamine-to-arginine substitution at position 331 of the UGT1A1 protein (p.Gln331Arg, also written Q331R). This missense change disrupts the enzyme's catalytic efficiency for bilirubin conjugation, leaving only a fraction of normal activity. ClinVar classifies it as Pathogenic22 ClinVar classifies it as Pathogenic
RCV000013058, associated with Crigler-Najjar syndrome type II
, and it was first described by Moghrabi et al.33 Moghrabi et al.
Moghrabi N et al. Identification of an A-to-G missense mutation in exon 2 of the UGT1 gene complex that causes Crigler-Najjar syndrome type 2. Genomics, 1993
in a 72-year-old Irish man born of consanguineous parents, whose diagnosis was established when phenobarbital treatment significantly lowered his chronically elevated bilirubin.

The variant affects a shared exon of the UGT1A gene complex, meaning the same nucleotide change disrupts multiple UGT1A isoforms (UGT1A1, UGT1A3–A10 all read from this exon). However, UGT1A1 is the only isoform directly responsible for bilirubin glucuronidation, and Crigler-Najjar syndrome is the primary clinical consequence.

The Mechanism

Gln331 sits in the C-terminal membrane-anchoring domain of UGT1A1, a region important for proper enzyme folding and substrate binding. The arginine substitution introduced by the G allele alters local protein conformation, substantially impairing the enzyme's ability to glucuronidate bilirubin. Unlike type I Crigler-Najjar mutations, which completely abolish UGT1A1 activity, Q331R leaves residual activity — estimated at less than 10% of normal in type II patients — a threshold established by functional expression studies44 a threshold established by functional expression studies
Seppen J et al. Discrimination between Crigler-Najjar type I and II by expression of mutant bilirubin uridine diphosphate-glucuronosyltransferase. J Clin Invest, 1994
. This residual activity is the mechanistic basis for the key clinical feature of type II disease: phenobarbital responsiveness. Phenobarbital is an inducer of UGT1A1 gene expression; boosting transcription from the residual functional allele can meaningfully increase the amount of conjugated bilirubin produced, lowering serum levels by 30% or more.

The Evidence

Crigler-Najjar syndrome type II is rare — fewer than 300 cases have been documented in the medical literature. The Q331R variant (rs72551348) is ultrarare even among UGT1A1 disease alleles, with a G allele frequency of approximately 3.7 × 10⁻⁵ in the gnomAD exome dataset (predominantly in individuals of European ancestry). No homozygous individuals have been observed in population databases, consistent with the rarity of type II Crigler-Najjar syndrome.

Functional characterisation established55 Functional characterisation established
Seppen et al. 1994
that the hallmark of type II mutations is partial enzyme inactivation (residual activity 4–38% across patients), versus zero activity in type I. This residual activity keeps serum bilirubin below 20 mg/dL in most type II patients — well below the levels that cause kernicterus — and explains the far better neurological prognosis compared to type I.

A comprehensive genotype-phenotype review66 A comprehensive genotype-phenotype review
Kadakol A et al. Hum Mutat, 2000
cataloguing more than 50 UGT1A1 mutations confirmed that the partial-vs-complete enzyme inactivation distinction maps reliably to type II vs type I phenotype, with the critical threshold being whether any residual bilirubin-conjugating activity remains.

Practical Actions

For homozygous carriers (GG) — clinically the relevant genotype for Crigler-Najjar type II — the management priorities are phenobarbital therapy to upregulate residual UGT1A1 expression, bilirubin monitoring, and awareness of drug interactions. Because UGT1A1 metabolises several chemotherapy agents (particularly irinotecan and belinostat), any cancer treatment plan should account for the severely reduced enzyme activity. UV-A phototherapy can be used adjunctively during bilirubin spikes. Liver transplantation is curative and is typically reserved for cases where phenobarbital cannot maintain safe bilirubin levels or when quality of life is severely impaired.

Heterozygous carriers (AG) — one G allele with one normal A allele — have approximately 50% of normal UGT1A1 activity and are clinically normal. Their primary concern is reproductive: if both partners carry a pathogenic UGT1A1 allele, there is a 25% probability of a homozygous child with Crigler-Najjar syndrome.

Interactions

rs72551348 falls on the same UGT1A1 gene that harbours several other clinically significant variants. Compound heterozygosity with the common UGT1A1*28 promoter variant (rs8175347, extra TA repeat reducing expression by ~70% when homozygous) or with *6 Gly71Arg (rs4148323, prevalent in East Asia) can produce a combined UGT1A1 impairment intermediate between Gilbert syndrome and Crigler-Najjar type II. Any individual carrying one Q331R allele should be tested for other UGT1A1 variants, particularly if bilirubin levels are higher than expected for a simple carrier.

rs727428

SHBG SHBG +1091 C>T

Strong Risk Factor

SHBG +1091 C>T — The Hormone Bioavailability Switch Downstream of SHBG

Sex hormone-binding globulin is the liver's primary traffic controller for testosterone and estradiol. Only about 1–3% of testosterone circulates as "free" bioactive hormone — the rest is bound to SHBG (roughly 44%) or albumin (54%). When SHBG levels rise, more testosterone gets locked away; when SHBG levels fall, more free testosterone is available to act on tissues. Rs727428 is one of the best-replicated [genetic regulators | This variant has been identified as genome-wide significant in multiple independent GWAS and validated across ancestries] of circulating SHBG levels, located just outside the SHBG gene itself in a downstream regulatory region.

The Mechanism

Rs727428 sits approximately 1 kb downstream of the SHBG gene on chromosome 17p13.1, in a [regulatory region | This area contains chromatin elements that influence SHBG gene expression in the liver; GERP conservation score and Ensembl RegBuild both annotate it as a functional regulatory feature] outside the protein-coding sequence. The variant does not change the SHBG protein structure — it acts by influencing how much SHBG the liver produces. The T allele at this position is associated with reduced SHBG transcription relative to the C allele. Each copy of the T allele reduces serum SHBG by approximately 3–4 nmol/L [| Grigorova et al. 2017, PMID 29264510 — P=7.3×10⁻¹¹, effect −3.74 nmol/L per allele], meaning TT homozygotes produce measurably less SHBG than CC homozygotes. Because SHBG binds testosterone with roughly five times higher affinity than albumin, small changes in SHBG concentration have outsized effects on how much testosterone is biologically active in tissues.

The Evidence

The original genome-wide significant association between rs727428 and circulating SHBG was established in a GWAS of ~1,600 postmenopausal women11 GWAS of ~1,600 postmenopausal women
Prescott et al. PLoS One, 2012
where the T allele was associated with lower SHBG (β=−0.126 on a log scale, P=2.09×10⁻¹⁶). This finding was simultaneously replicated in a GWAS of 3,225 European men22 GWAS of 3,225 European men
Jin et al. Human Molecular Genetics, 2012
, where rs727428 showed genome-wide significant associations with both total testosterone (P=1.26×10⁻¹²) and dihydrotestosterone (DHT, P=1.47×10⁻¹¹).

A validation study in 1,687 Japanese men33 validation study in 1,687 Japanese men
Sato et al. Human Reproduction Open, 2019
replicated the SHBG association (β=0.21, P=3.4×10⁻¹⁰) but did not replicate testosterone association after multiple testing adjustment, suggesting that in East Asian populations the effect on SHBG may not translate directly into measurable testosterone differences — possibly because other loci or lifestyle factors modulate free testosterone differently in this population.

A population-based sibling study of 999 Dutch men44 population-based sibling study of 999 Dutch men
Walravens et al. Journal of Clinical Endocrinology & Metabolism, 2025
found that rs727428 T allele homozygotes had 10.8–23.1% lower SHBG and 3.9–21.4% lower total testosterone compared to CC homozygotes. Notably, calculated and directly measured free testosterone showed minimal difference across genotypes — suggesting the body compensates for SHBG-driven total testosterone changes by adjusting LH/FSH feedback to maintain free testosterone homeostasis in healthy men.

In women, a Mediterranean PCOS case-control study of 1,004 premenopausal women55 Mediterranean PCOS case-control study of 1,004 premenopausal women
Martínez-García et al. Human Reproduction, 2012
found the T allele was more frequent in PCOS patients than controls (OR=1.29), independent of obesity. A family-based PCOS study in 758 women66 family-based PCOS study in 758 women
Wickham et al. Journal of Clinical Endocrinology & Metabolism, 2011
confirmed that rs727428 genotype was independently associated with SHBG levels after controlling for BMI, insulin resistance, and free testosterone. Meta-analyses have shown mixed results: one 2020 meta-analysis (1,660 cases, 1,312 controls) found the association with PCOS susceptibility was not statistically significant after pooling, while individual studies — particularly in Mediterranean and Middle Eastern populations — consistently found associations. The discrepancy likely reflects population-specific allele frequencies and PCOS diagnostic heterogeneity.

Practical Implications

The clinical consequences of rs727428 depend heavily on context. In women, lower SHBG from the T allele means more free testosterone and a higher [free androgen index | FAI = total testosterone ÷ SHBG × 100; values above 4–5 in women are associated with androgenic symptoms and PCOS]. Women with TT genotype benefit most from monitoring free androgen index rather than total testosterone, and from strategies that support SHBG production — particularly insulin-sensitizing approaches, since insulin is a potent suppressor of hepatic SHBG synthesis. In men, lower SHBG generally keeps more testosterone available in tissue, but the Walravens 2025 data suggest the body's hormonal feedback loop largely compensates in healthy young men. Where this compensation may fail is in older men, men with metabolic syndrome, and men undergoing testosterone monitoring for hypogonadism — in these contexts, SHBG genotype should inform how total testosterone is interpreted.

Interactions

rs1799941 (SHBG promoter G-68A): This variant in the SHBG promoter acts independently from rs727428 to regulate SHBG levels. The A allele of rs1799941 increases SHBG by 15–25%, opposing the T allele effect at rs727428. The Grigorova 2017 study (PMID 29264510) found these variants replicate independently — both are significant GWAS hits with distinct mechanisms (one alters promoter transcription factor binding; the other affects downstream regulatory architecture). A person carrying T at rs727428 AND G at rs1799941 has two additive SHBG-lowering variants; a person carrying T at rs727428 AND A at rs1799941 has opposing forces that may partially cancel. This is a strong candidate for a compound action given the opposite-direction effects in the same pathway.

rs6259 (SHBG Asp327Asn): A missense variant in the SHBG protein (p.Asp327Asn) that reduces testosterone binding affinity by approximately 10%, thereby increasing free testosterone bioavailability even when total SHBG concentration is unaffected. In men, the A allele of rs6259 was associated with increased free testosterone in the Walravens 2025 study. Compound carriers of both the rs727428 T allele (less SHBG protein) and the rs6259 A allele (SHBG protein with lower binding affinity) would have a compounded increase in free testosterone — relevant for interpreting androgen status in both sexes.

Compound action proposal for rs727428 TT + rs1799941 GG: Women carrying TT at rs727428 AND GG at rs1799941 carry two independent SHBG-lowering variants — one reducing transcription rate (promoter), one reducing downstream regulatory expression. The combined recommendation: measure free androgen index (FAI = total testosterone ÷ SHBG × 100); track SHBG as a metabolic risk biomarker; implement insulin-sensitizing strategies (inositol supplementation has evidence for PCOS with low SHBG); and discuss combined SHBG genotype result with clinician before interpreting any sex hormone panel. Evidence level: moderate (individual effects well-established; combined effect inferred from independent pathway data).

rs730882105

LDLR p.Val524Met

Moderate Likely Pathogenic

LDLR Val524Met — A Rare Familial Hypercholesterolemia Mutation

The LDL receptor (LDLR) encoded by the LDLR gene is the primary mechanism by which the liver clears low-density lipoprotein (LDL) cholesterol from the bloodstream. Each functional LDLR molecule captures LDL particles at the hepatocyte surface and draws them into the cell via receptor-mediated endocytosis, where cholesterol is released for cellular use. Pathogenic LDLR mutations disrupt this clearance system, allowing LDL cholesterol to accumulate in the blood from birth — the defining feature of familial hypercholesterolemia (FH)11 familial hypercholesterolemia (FH)
autosomal dominant disorder causing severe, lifelong LDL-C elevation and dramatically accelerated atherosclerosis
.

rs730882105 is an extremely rare missense variant in LDLR that substitutes methionine for valine at amino acid position 524 (c.1570G>A, p.Val524Met). It has been classified as likely pathogenic by the British Heart Foundation LDLR-LOVD database22 likely pathogenic by the British Heart Foundation LDLR-LOVD database
the LDLR Leiden Open Variation Database maintained by the BHF is the most comprehensive curated registry of LDLR variants with clinical significance assignments
, though a second submitter classified it as uncertain significance under stricter ACMG 2015 criteria, and a Merck Research Labs functional study found no significant impairment in vitro. The conflicting evidence reflects a genuine ambiguity in this rare variant: population frequency is too low for robust statistical association, and functional assay results diverge from clinical reports. This YAML entry reflects the intermediate evidence state.

The Mechanism

Val524 sits within the [ligand-binding domain cluster of LDLR | the extracellular ligand-binding domain consists of seven cysteine-rich repeats (LBD1–7) that directly contact apolipoprotein B-100 on LDL particles and apolipoprotein E on VLDL/IDL particles] — specifically in or near repeat 7 (LBD-7), a region required for efficient LDL binding and cellular uptake. Valine-to-methionine substitutions introduce a larger, more polar side chain that can disrupt the local protein conformation, though the magnitude of functional impairment varies by exact position and surrounding structure. The Merck in vitro data suggesting no effect may reflect assay conditions not replicating the full hepatic context of LDL binding and recycling; real-world FH reports in carriers are the stronger signal for a receptor with known genotype-phenotype correlation.

Untreated heterozygous FH (one mutated LDLR copy) typically produces LDL-C of 190–400 mg/dL from birth — levels that accelerate atherosclerotic plaque formation decades earlier than in the general population. Ference et al. 201733 Ference et al. 2017
Low-density lipoproteins cause atherosclerotic cardiovascular disease. European Heart Journal
established that the cumulative LDL burden from birth (not just current levels) drives atherosclerosis; this is why FH carriers begin accumulating plaques in their teens and 20s and may have their first myocardial infarction before age 50.

The Evidence

rs730882105 is too rare (2 alternate alleles in 1.4 million gnomAD samples) to accumulate direct statistical evidence for this specific variant. Its likely pathogenic classification rests on: (1) location in a functionally critical LDLR domain; (2) clinical reports of FH phenotype in at least one carrier submitted to the LDLR-LOVD; (3) the prior probability that missense variants disrupting LDLR ligand-binding repeats are pathogenic, which is high based on the 3,200+ characterized LDLR variants catalogued by Abifadel & Boileau 202344 Abifadel & Boileau 2023
Genetic and molecular architecture of familial hypercholesterolemia. J Intern Med
, where missense variants account for ~60% of all pathogenic LDLR mutations.

The broader FH evidence base is compelling: untreated heFH carries a [substantially elevated risk of CHD | coronary heart disease — up to 13-fold excess risk per the EAS consensus] Defesche et al. 201755 Defesche et al. 2017
Familial hypercholesterolaemia. Nat Rev Dis Primers
. The global prevalence of FH is approximately 1:250 (1:80 in founder populations such as French Canadians and Afrikaners) — far higher than previously thought. Most remain undiagnosed. Tokgozoglu & Kayikcioglu 202166 Tokgozoglu & Kayikcioglu 2021
Familial Hypercholesterolemia: Global Burden and Approaches. Curr Cardiol Rep
estimated that >85% of FH individuals globally are undetected. With statin therapy achieving ≥50% LDL reduction, the excess cardiovascular risk is substantially attenuated — making early identification and treatment directly life-extending.

Practical Actions

Heterozygous carriers of rs730882105 should have a fasting lipid panel performed to establish baseline LDL-C levels. LDL-C >190 mg/dL in an adult with a likely pathogenic LDLR variant typically meets criteria for high-intensity statin therapy (rosuvastatin 20–40 mg or atorvastatin 40–80 mg). The LDL-C target is <100 mg/dL for those without established cardiovascular disease, and <70 mg/dL for those with prior ASCVD events. If statin therapy alone is insufficient, ezetimibe (adds ~15% LDL reduction) and PCSK9 inhibitors (alirocumab, evolocumab — add ~50% LDL reduction) are guideline-recommended additions. Sturm et al. 201877 Sturm et al. 2018
Clinical Genetic Testing for Familial Hypercholesterolemia: JACC Scientific Expert Panel
recommends cascade screening of all first-degree relatives when a pathogenic LDLR variant is identified — each child of a carrier has a 50% chance of inheriting the mutation.

Saturated fat restriction to below 7% of total calories specifically reduces hepatic LDL production and complements statin therapy in FH; this is one of few dietary interventions with FH-specific evidence because it operates through the same LDL-receptor pathway that LDLR mutations impair.

Interactions

LDLR variants interact in severity with PCSK9 gain-of-function variants (rs28942078, rs72658867) and APOB p.Arg3527Gln: carriers of both a pathogenic LDLR variant and a PCSK9 gain-of-function variant have substantially more severe LDL elevation than either alone, as PCSK9 degrades the LDL receptor — compounding the LDLR mutation's reduced receptor availability. Double heterozygotes are estimated to have FH severity approaching homozygous FH. APOE ε4 (rs429358) also modestly elevates LDL-C and is relevant context for cardiovascular risk assessment in carriers.

rs804279

GATA4 GATA4/NEIL2 PCOS Susceptibility

Moderate Risk Factor

GATA4/NEIL2 — A Chromosomal Neighbourhood Tied to PCOS Androgen Excess

On the short arm of chromosome 8, at position 8p23.1, a compact genomic neighbourhood hosts two functionally distinct genes: GATA411 GATA4
GATA-binding protein 4, a zinc-finger transcription factor essential for cardiac and gonadal development; expressed in ovarian granulosa and theca cells where it regulates steroidogenic gene expression
and NEIL222 NEIL2
nei-like DNA glycosylase 2, a base excision repair enzyme that removes oxidised purines from DNA
. Between and around them sits rs804279, an intergenic variant that large-scale genome-wide association studies have linked to polycystic ovary syndrome (PCOS) susceptibility in women of European ancestry.

The Mechanism

The rs804279 variant lies approximately 6,400 bp upstream of GATA4 and 3,300 bp from NEIL2 (within the NEIL2 RefSeqGene interval, NG_053043.1:g.1718). As an intergenic regulatory variant, it does not change a protein sequence; instead, it likely modulates the expression or chromatin accessibility of one or both flanking genes in hormone-responsive tissues. GATA4 is a transcription factor expressed in ovarian theca cells, where it drives expression of steroidogenic enzymes including CYP17A1, the key enzyme in androgen biosynthesis. The locus also harbours FDFT133 FDFT1
farnesyl-diphosphate farnesyltransferase 1, the first committed enzyme in cholesterol biosynthesis that provides the substrate for all steroid hormones
, which sits approximately 29 kb downstream — giving this chromosomal neighbourhood three converging mechanisms for steroid hormone biology.

The T allele is the globally common allele (~72% frequency) but is the risk allele for PCOS at this locus. This contrasts with the typical GWAS pattern where the minor allele carries risk, and implies that the protective A allele (reference, ~28% globally) represents a recent or population-specific buffering variant — or that regulatory activity at this locus is required for normal PCOS-associated androgen elevation, with the A allele conferring partial protection.

The Evidence

The key PCOS association at rs804279 was established in a large European meta-analysis44 large European meta-analysis
Day FR et al. Large-scale genome-wide meta-analysis of polycystic ovary syndrome suggests shared genetic architecture for different diagnosis criteria. PLoS Genet 14:e1007813, 2018
combining 10,074 PCOS cases and 103,164 controls. Among 14 genome-wide significant PCOS loci, rs804279 was notable for one specific reason: it was the only locus showing significant heterogeneity across diagnostic criteria (heterogeneity p=2.6×10⁻⁵). The effect was largest in women diagnosed by NIH criteria (hyperandrogenism plus oligo-anovulation) and smallest in self-reported cases — consistent with a variant that specifically tags the androgen-excess component of PCOS rather than the full syndrome as broadly defined.

Computational functional analysis has categorised rs804279 among variants with predicted functional relevance55 variants with predicted functional relevance
Prabhu BN et al. Conceptualization of functional SNPs of PCOS genes: an in silico approach. J Endocrinol Invest 44:1783–1793, 2021
in the GATA4/NEIL2 region, supporting its role as a regulatory rather than neutral variant.

The evidence level is moderate: the locus has genome-wide significance in the Day 2018 meta-analysis, but functional characterisation of what rs804279 specifically does to GATA4 or NEIL2 expression remains incomplete. The diagnostic-criteria heterogeneity means the association is most reliable when PCOS is defined by the NIH standard (requiring both hyperandrogenism and cycle abnormality) rather than by self-report or broader Rotterdam criteria alone.

Practical Actions

Because this locus appears most strongly associated with the hyperandrogenic PCOS subtype, monitoring should focus on androgen biomarkers: free testosterone, DHEAS, and androstenedione are the most directly relevant, with AMH and LH:FSH ratio providing additional characterisation of the reproductive phenotype.

Inositol supplementation (myo-inositol) has the strongest evidence for improving ovarian responsiveness in PCOS with FSH resistance, acting downstream of the gonadotropin signalling cascade that GATA4 expression influences. Women with the TT genotype who have clinical signs (irregular cycles, hirsutism, acne, difficulty conceiving) have clear grounds for proactive evaluation.

Interactions

The GATA4/NEIL2 locus acts independently of the DENND1A (rs7852296) and LHCGR (rs13405728) PCOS susceptibility loci established in the same European meta-analyses. Women carrying risk alleles at multiple PCOS loci accumulate additive PCOS susceptibility. The GATA4 locus appears to operate primarily through the hyperandrogenic axis (linked to androstenedione and testosterone excess) while DENND1A operates through FSH receptor trafficking — distinct but complementary mechanisms both converging on follicular arrest and androgen excess. A compound interaction between homozygous TT at rs804279 and the DENND1A risk haplotype (rs7852296-AA or homozygous risk) would represent convergent failure of both steroidogenic regulation and FSH receptor recycling; this combination warrants investigation as a compound action (no published combined effect size yet).

HLA-DRA rs9268839 — The Class II Sentinel for Rheumatoid Arthritis

The region just upstream of HLA-DRA on chromosome 6p21.32 is home to one of the most powerful common genetic risk signals for rheumatoid arthritis (RA) in the human genome. rs9268839 is an intergenic variant located approximately 16 kilobases upstream of HLA-DRA11 HLA-DRA
HLA class II histocompatibility antigen, DR alpha chain — the gene encoding the alpha subunit of the HLA-DR heterodimer on antigen-presenting cells
, within a dense cluster of HLA class II genes including HLA-DRB5, HLA-DRB9, and TSBP1. The G allele carries an odds ratio of approximately 2.47 per copy for RA in European populations — among the largest effect sizes of any common non-coding variant discovered in autoimmune disease genetics.

The Mechanism

HLA-DRA encodes the invariant alpha chain of the HLA-DR molecule, a class II major histocompatibility complex (MHC) protein expressed on the surface of antigen-presenting cells — dendritic cells, macrophages, and B cells. HLA-DR is a heterodimer: the alpha chain (encoded by HLA-DRA) pairs with a beta chain (encoded by HLA-DRB1, DRB3, DRB4, or DRB5 depending on the haplotype). The combined molecule presents peptide fragments from pathogens and self-proteins to CD4+ helper T cells, initiating adaptive immune responses. The critical insight is that HLA-DRA is largely non-polymorphic22 HLA-DRA is largely non-polymorphic
Unlike HLA-DRB1, which has hundreds of functionally distinct alleles, HLA-DRA varies very little between individuals — its alpha chain is nearly identical across all people
. rs9268839 is therefore not a coding variant in HLA-DRA itself, but a tag SNP marking an extended haplotype block that includes the HLA-DRB5 locus and influences local regulatory architecture through linkage disequilibrium with functional variants in flanking HLA-DRB genes.

The precise functional variant this SNP tags has not been fully resolved — a common situation in the HLA region, where extreme polymorphism and LD make fine-mapping challenging. What is clear is that the G-allele haplotype is enriched for HLA-DR conformations that present citrullinated self-peptides particularly efficiently to autoreactive CD4+ T cells, driving the production of anti-citrullinated protein antibodies (ACPA)33 anti-citrullinated protein antibodies (ACPA)
The hallmark autoantibody in seropositive RA; detected by the anti-CCP test and predictive of joint erosion
— the defining pathological antibodies in seropositive RA.

The Evidence

Rheumatoid arthritis. The GWAS signal at rs9268839 is among the most replicated findings in autoimmune genetics. In the landmark Okada et al. 2014 meta-analysis44 Okada et al. 2014 meta-analysis
Genetics of rheumatoid arthritis contributes to biology and drug discovery. Nature 2014; 29,880 RA cases and 73,758 controls
spanning European and Asian populations, the HLA class II locus at chromosome 6p21.32 (of which rs9268839 is a primary tag) showed the strongest association in the entire genome. In the Laufer et al. 2019 trans-ethnic fine-mapping study55 Laufer et al. 2019 trans-ethnic fine-mapping study
Genetic influences on susceptibility to rheumatoid arthritis in African-Americans. Hum Mol Genet 2019; 916 AA cases + >100,000 European/East Asian participants
, the HLA-DRB1/DRA region reached p<1×10⁻²⁵⁰ in Europeans — the maximum reportable significance threshold. The rs9268839-G allele carries an OR of approximately 2.47 (95% CI 2.39–2.55) in European cohorts and 1.90 (95% CI 1.81–1.99) in East Asian cohorts, both at p=1×10⁻²⁵⁰.

Sarcoidosis. The G allele at rs9268839 has also been implicated in sarcoidosis susceptibility, a granulomatous inflammatory disease driven by dysregulated antigen presentation in the lung and lymph nodes. In Levin et al. 201366 Levin et al. 2013
Association of ANXA11 genetic variation with sarcoidosis in African Americans and European Americans. Genes Immun 2013; 1,689 cases and 1,252 controls
, rs9268839 near HLA-DRA was identified as a sarcoidosis risk SNP showing significant SNP-SNP interaction with ANXA11 rs1049550, suggesting that the HLA-DRA region risk variant amplifies disease susceptibility when combined with ANXA11 variants that impair calcium-dependent immune regulation.

Practical Actions

Carrying one or two G alleles does not mean RA or sarcoidosis will develop — these are probabilistic risk elevations, not deterministic outcomes. However, G carriers should be alert to early symptoms and pursue earlier investigation when musculoskeletal symptoms appear. The most actionable step for G carriers is anti-CCP antibody testing: ACPA can appear years before clinical RA and predicts severity and progression. G carriers who are also ACPA-positive should seek rheumatology review proactively.

Smoking is the most important environmental co-trigger: the HLA class II region and cigarette smoke interact to produce citrullinated lung antigens that drive ACPA production, multiplying RA risk superadditively in G-allele carriers.

Interactions

rs9268839 operates in the same antigen-presentation pathway as rs660895 (HLA-DRB1 shared epitope tag), rs6910071 (TSBP1/C6orf10 intronic variant), and rs2476601 (PTPN22 R620W). These variants act at distinct steps — peptide groove composition, MHC region haplotype structure, T-cell receptor signalling — and their co-occurrence is associated with multiplicative rather than additive RA risk. rs9268839 and rs660895 are in moderate LD in Europeans and may partly tag the same underlying haplotype, but they are not fully redundant: rs9268839 explains additional variation in RA susceptibility beyond the shared epitope alone.

IL1RL1 (ST2) — A Protective Variant in the IL-33 Alarm Circuit

The IL1RL1 gene11 IL1RL1 gene
Interleukin-1 Receptor-Like 1, encoding the ST2 protein — the primary cell-surface receptor for the alarmin cytokine IL-33, expressed on mast cells, eosinophils, ILC2s, and Th2 lymphocytes
is one of the most replicated loci in allergy and asthma genetics. Most research focuses on variants that increase risk. rs13424006 is notable for the opposite reason: the C allele at this intronic position is protective — carriers have a measurably lower risk of developing late-onset wheeze, the wheezing phenotype most strongly linked to persistent adult asthma.

The IL-33/ST2 axis functions as an epithelial damage alarm. When airway cells are injured by viruses, allergens, or pollutants, they release IL-3322 IL-33
A nuclear alarmin cytokine that, when released from damaged epithelial cells, binds to ST2 on ILC2s, mast cells, and eosinophils, triggering a rapid Th2 cascade
. The intensity of that alarm depends on the ratio between membrane-bound ST2 (which transmits the signal) and soluble sST2 (which mops up free IL-33 before it can reach its target cells). IL1RL1 variants modulate this ratio — some increasing signalling capacity, others buffering it. rs13424006 sits in the intron where this regulatory balance is calibrated.

The Mechanism

rs13424006 maps to position 102,350,776 on chromosome 2 (GRCh38), within the 10th intron of IL1RL1. Like rs10208293 — the other intronic IL1RL1 variant that specifically tags late-onset wheeze in the same study — rs13424006 is presumed to act through an eQTL mechanism, influencing the relative production of membrane-bound IL1RL1 and the soluble decoy sST2 without altering the protein sequence. The C allele tags a haplotype associated with more effective IL-33 buffering: greater production or activity of sST2, which intercepts IL-33 before it activates eosinophils and ILC2s.

rs13424006 and rs10208293 both sit in the 10th intron and both specifically associate with the late-onset wheeze phenotype, but they are genotyped separately and may tag distinct regulatory elements within the same intronic region. Whether their effects are additive, redundant, or mechanistically coupled is not yet established.

The Evidence

The protective effect of the C allele was first defined in a large two-cohort study by Savenije et al. (2014)33 Savenije et al. (2014), which meta-analysed data from 2,007 children in the Dutch PIAMA cohort and 7,247 children in the UK ALSPAC cohort, following wheeze phenotypes from birth to age 8. The analysis distinguished four phenotypes: no wheeze, early-transient wheeze, intermediate-onset, and late-onset wheeze. rs13424006 was one of only two IL1RL1 SNPs that associated specifically with late-onset wheeze (OR approximately 0.74, 95% CI 0.63–0.87), meaning C allele carriers develop this phenotype at substantially lower rates than TT homozygotes.

The late-onset specificity is clinically important. Late-onset wheeze — wheeze that develops after the toddler years, typically in mid-childhood — has a much stronger association with persistent adult asthma than early-transient wheeze, which is largely virus-driven and often resolves. The IL-33/ST2 pathway specifically drives the eosinophilic subtype44 eosinophilic subtype
Eosinophilic asthma is characterised by airway eosinophilia, elevated FeNO, and good response to inhaled corticosteroids and IL-5/IL-13 pathway biologics
of asthma that underlies most late-onset disease.

Confirmation came from Chinese Han children (265 asthma cases, 153 controls) in Wu et al. (2021)55 Wu et al. (2021), where CT or CC genotypes were associated with lower asthma susceptibility (adjusted OR 0.584, 95% CI 0.362–0.941, p=0.027) and, in those with asthma, a lower probability of elevated FeNO at baseline (adjusted OR 0.286, 95% CI 0.125–0.652, p=0.003). The FeNO finding directly connects genotype to eosinophilic airway inflammation — the mechanism the IL-33/ST2 axis drives.

Fourteen years of IL1RL1 population genetics, most recently summarised in Savenije et al. (2011)66 Savenije et al. (2011), established that 13 of 15 IL1RL1 SNPs tested were significantly associated with circulating sST2 levels in childhood, confirming the IL1RL1 locus as the primary genetic determinant of the soluble decoy receptor that buffers IL-33 signalling.

Practical Implications

For TT homozygotes, the actionable priority is awareness and monitoring. The TT genotype represents the population baseline — lacking the extra sST2-buffering effect the C allele confers. This does not mean TT individuals are at elevated risk in an absolute sense; it means they do not carry the C allele's protective advantage. If there is any history of childhood wheeze, asthma, or respiratory allergy, checking FeNO is the highest-yield approach to characterise whether eosinophilic airway inflammation is present — and whether eosinophil-targeting treatments (ICS, dupilumab, mepolizumab, benralizumab, itepekimab) would be particularly well-matched.

For CC homozygotes, the protective genotype reduces eosinophilic airway inflammation risk through this particular regulatory element — but this doesn't override other asthma risk factors and shouldn't be interpreted as blanket protection against all respiratory allergy.

Interactions

rs13424006 and rs10208293 both tag the late-onset wheeze phenotype at the IL1RL1 locus, sitting in the same intronic region (10th intron). Their effects may reflect distinct regulatory elements — having the protective C allele at rs13424006 alongside a risk allele at rs10208293 may represent partial compensation rather than full protection.

The IL-33 ligand variant rs992969 determines how much IL-33 is produced upstream. A TT carrier at rs13424006 who also carries the IL-33 production-increasing allele at rs992969 faces elevated ligand meeting reduced buffering — a combined effect on the entire axis. The TSLP variant rs1837253 acts upstream in the same Th2 cascade; multiple risk alleles in this pathway produce substantially higher cumulative eosinophilic inflammation risk than any single SNP predicts.