ST3GAL4 — The Sugar Coat That Keeps Clotting Factors in Circulation

Your blood carries von Willebrand factor (VWF)11 von Willebrand factor (VWF)
A large multimeric protein that anchors platelets to damaged vessel walls and escorts Factor VIII through the circulation; elevated VWF is an independent VTE risk factor
and Factor VIII continuously, but the liver clears these proteins at a rate that determines their plasma concentration. What regulates clearance? In large part, a molecular sugar coating. Sialic acid residues attached to the outer glycan chains of VWF and Factor VIII shield them from asialoglycoprotein receptors (ASGPR) on hepatocytes — the liver's galactose-recognition machinery that flags aging or under-glycosylated proteins for destruction. The ST3GAL4 enzyme is responsible for adding α2,3-linked sialic acids to these protective glycan termini. Variants near the ST3GAL4 gene alter this sialylation activity, which in turn shifts steady-state VWF and Factor VIII levels — and therefore VTE risk.

The Mechanism

ST3GAL4 (ST3 beta-galactoside alpha-2,3-sialyltransferase 4)22 ST3GAL4 (ST3 beta-galactoside alpha-2,3-sialyltransferase 4)
One of a family of six ST3GAL enzymes that catalyze transfer of sialic acid in the α2,3 linkage onto N- and O-linked glycans
is expressed in endothelial cells and hepatocytes, where it acts on newly synthesized VWF and Factor VIII. Fully sialylated VWF has reduced affinity for ASGPR on liver macrophages and hepatocytes; removing sialic acid from VWF dramatically accelerates its hepatic clearance33 removing sialic acid from VWF dramatically accelerates its hepatic clearance
Byrne et al. 2025 showed that enzymatic desialylation of both plasma-derived and recombinant VWF produced identical, rapid clearance regardless of other glycan differences
.

The rs35257264 T allele is located in an intronic region overlapping the ST3GAL4 genomic locus. It acts as a cis-regulatory variant — most likely influencing ST3GAL4 expression or splicing in endothelial cells — rather than altering the enzyme's amino acid sequence directly. The net effect of the T allele is an upward shift in steady-state sialylation activity, meaning VWF and Factor VIII are more effectively protected from clearance, circulate longer, and accumulate at higher plasma concentrations. The coagulation balance tilts toward clotting.

The Evidence

The VWF/FVIII-elevating effect of ST3GAL4 variants was established in a 2016 ARIC cohort analysis: Song et al. studied 12,117 participants44 Song et al. studied 12,117 participants
Multi-ethnic ARIC cohort; associations held in both European American and African American ancestry groups after adjustment for ABO blood group, age, BMI, hypertension, and diabetes
and identified three ST3GAL4 intronic SNPs significantly associated with both VWF antigen levels and Factor VIII activity.

The VTE connection was subsequently confirmed in two large independent GWAS meta-analyses. Thibord et al. 202255 Thibord et al. 2022
Cross-ancestry meta-analysis of 30 cohorts; 81,669 VTE cases including European, African, and Hispanic ancestry populations; identified 135 independent genomic loci
identified the ST3GAL4 region among 135 independent VTE risk loci. Ghouse et al. 202366 Ghouse et al. 2023
Nature Genetics; 81,190 VTE cases and 1,419,671 controls across six cohorts; 93 significant loci, 62 previously unreported; polygenic risk score performance comparable to monogenic thrombophilia testing
replicated the ST3GAL4 region in a non-overlapping cohort with 93 total loci, confirming it as a reproducible signal.

A 2024 GWAS of 45,289 participants for circulating Factor VIII and VWF levels77 circulating Factor VIII and VWF levels
de Vries et al. 2024; identified 7 novel loci for FVIII including ST3GAL4 at genome-wide significance P < 5×10⁻⁹
identified ST3GAL4 as a novel genome-wide significant locus for Factor VIII levels, closing the mechanistic loop between the GWAS VTE signal and the VWF/FVIII axis.

For clinical context: a systematic review and meta-analysis of 15 studies88 a systematic review and meta-analysis of 15 studies
Lowe et al. 2023; 5,327 VTE cases; pooled OR for FVIII quartile 4 vs. quartile 1: 3.92 (95% CI 1.61–5.29)
found that Factor VIII levels above the 90th percentile are associated with a 3-fold higher VTE risk, confirming that elevated FVIII — the proximal mechanism by which ST3GAL4 T allele exerts its effect — is a dose-dependent independent VTE risk factor.

The effect size for rs35257264 itself is modest (OR approximately 1.21 per T allele), which is typical for common regulatory GWAS variants acting through quantitative trait changes. This places it in the context of a contributing factor rather than a high-penetrance thrombophilia like Factor V Leiden or prothrombin G20210A.

Practical Implications

The T allele's relevance is highest when added to other thrombotic risk — surgery, immobility, oral contraceptives, pregnancy, or co-inherited thrombophilias. In these settings, the additional nudge toward elevated VWF/FVIII that the T allele provides compounds with environmental and genetic co-factors. Carriers should ensure providers are aware of this variant when planning high-risk situations, and may benefit from VWF or Factor VIII level testing to understand their baseline quantitative risk.

There are no drugs or supplements that specifically modulate ST3GAL4 activity in clinical practice. Standard VTE prevention measures apply with heightened attention to provocation management.

Interactions

The ST3GAL4 T allele acts through the same VWF/FVIII quantitative axis as other clotting factor loci. Its effect is most clinically relevant when combined with other thrombophilic variants — particularly [Factor V Leiden (rs6025, F5 R506Q) | Activated protein C resistance; ~5% carrier frequency in Europeans; 3-8 fold VTE risk for heterozygotes] or [prothrombin G20210A (rs1799963) | ~1-3% carrier frequency; 2-5 fold VTE risk], which impair anticoagulant mechanisms at a different node of the coagulation cascade. Combined carrier status for ST3GAL4 T allele plus either classical thrombophilia would compound risks additively or possibly synergistically.

ABO blood group is a critical co-determinant of VWF levels: blood group O individuals have ~25% lower VWF than non-O individuals through a separate glycosylation-clearance mechanism. The ST3GAL4 T allele effect operates independently of ABO (Song et al. 2016 adjusted for ABO) but their absolute VWF levels are determined by both factors together.

TRAF1-C5 — The Inflammatory Signal Amplifier at Chromosome 9q33

Deep within the long arm of chromosome 9 sits a genetic locus that helps explain why rheumatoid arthritis clusters in families and why some patients respond poorly to the most effective treatments available. The TRAF1-C5 locus11 The TRAF1-C5 locus
TRAF1 encodes TNF receptor-associated factor 1, a scaffold protein inside immune cells that amplifies inflammatory NF-kB signaling; C5 encodes complement component 5, a protein that drives joint inflammation through the complement cascade
at 9q33.3 contains two genes, each contributing to the inflammatory cascade that erodes joints in autoimmune arthritis. rs3761847 is an intronic variant in TRAF1 that serves as the principal GWAS tag SNP for this entire locus — its G allele marks a chromosomal haplotype that increases susceptibility to rheumatoid arthritis by approximately 32% per copy.

The Mechanism

rs3761847 sits in an intron of TRAF1 on chromosome 9 (GRCh38 position 120,927,961), with the TRAF1 gene encoded on the minus strand. The variant itself is not a coding change — it does not alter any amino acid. Instead, it acts as a linkage disequilibrium tag22 linkage disequilibrium tag
A tag SNP marks a haplotype block: it travels with nearby functional variants through generations because recombination between them is rare, making the tag SNP a reliable proxy for the causal variant(s)
for a haplotype that alters TRAF1 expression in immune cells.

TRAF1 protein is a key intracellular scaffold in the TNF receptor signaling pathway. When TNF-alpha binds its receptor on immune cells, TRAF1 is recruited to the receptor complex and modulates the downstream activation of NF-kB33 NF-kB
Nuclear factor kappa-light-chain-enhancer of activated B cells — a master transcription factor that switches on hundreds of pro-inflammatory genes including cytokines, chemokines, and adhesion molecules that drive joint inflammation
. Higher TRAF1 expression — associated with the G-allele haplotype — amplifies NF-kB activation in CD14+ monocytes and other inflammatory cells, tilting the immune response toward persistent inflammation rather than resolution. Studies in Japanese RA patients found significantly elevated TRAF1 protein in monocytes from G-allele carriers, providing a direct mechanistic link between genotype and cellular phenotype.

The neighboring C5 gene contributes through a separate mechanism: complement component 5 is cleaved to C5a in the synovial fluid of inflamed joints, where it drives macrophage activation and inflammatory cytokine release. The G-allele haplotype at rs3761847 spans both genes, potentially influencing both TRAF1-mediated NF-kB amplification and complement-driven synovial inflammation.

The Evidence

The foundational discovery came from Plenge et al. (2007)44 Plenge et al. (2007)
NEJM genome-wide study analyzing 317,503 SNPs in 1,522 anti-CCP-positive RA cases and 1,850 controls from NARAC and EIRA cohorts, replicated in an additional 997 cases and 1,777 controls
, who identified rs3761847 as the top signal at the TRAF1-C5 locus with an odds ratio of 1.32 (95% CI 1.23–1.42, p = 4×10⁻¹⁴) in anti-CCP-positive rheumatoid arthritis — one of the most significant non-HLA genetic associations for RA at that time. Crucially, the effect was strongest in seropositive (anti-CCP-positive) disease, which is the most genetically driven, erosive form of RA.

A meta-analysis by Song et al. (2014)55 meta-analysis by Song et al. (2014)
Immunological Investigations meta-analysis encompassing 24 studies with 22,682 RA cases and 23,493 controls across European and Asian populations
confirmed the European association (OR 1.156, 95% CI 1.006–1.327 per G allele) while finding no statistically significant effect in Asian cohorts (OR 1.049, p = 0.333). This ethnic divergence is explained by differences in linkage disequilibrium66 linkage disequilibrium
LD refers to the non-random co-inheritance of alleles; if the causal variant is in strong LD with rs3761847 in Europeans (r²=0.67) but weaker LD in Asians (r²=0.37), the tag SNP is more informative in the former
between rs3761847 and the actual functional variant(s) in the two population groups.

Replication in East Asian populations confirmed the locus itself matters: Zhu et al. (2011)77 Zhu et al. (2011)
BMC Medical Genetics, 576 RA patients and 689 controls in Han Chinese
found significant association of rs3761847 (p = 0.0018, OR 1.28) in their Han Chinese cohort, independent of anti-CCP and RF concentrations — demonstrating that the genetic effect at this locus is not simply a proxy for serological status.

Beyond susceptibility, the TRAF1-C5 locus influences treatment outcomes. Canhão et al. (2015)88 Canhão et al. (2015)
Biomedical Research International, Southern European RA patients on anti-TNF therapy
found that the G risk allele at rs3761847 predicted poor response to anti-TNF biologics in multivariate analyses. This is mechanistically coherent: if the G haplotype drives higher TRAF1 expression and stronger NF-kB activation, those same pathways may counteract TNF blockade.

Practical Actions

For G-allele carriers — particularly GG homozygotes — the key clinical implications are threefold: early awareness of seropositive RA symptoms, proactive monitoring of relevant biomarkers (anti-CCP antibodies and CRP), and awareness that standard anti-TNF biologic therapy may be less effective for GG carriers. These individuals may benefit from JAK inhibitors or IL-6 pathway inhibitors as preferential first-line biologic options when conventional DMARDs fail, since these targets operate downstream of or parallel to TRAF1-mediated NF-kB signaling.

The G allele is approximately 43% globally and 42% in Europeans, making it common rather than rare. Most Europeans carry at least one copy. GG homozygotes (~18% in Europeans) carry the highest genetic burden at this locus. Seronegative RA patients should note that the evidence for this locus is substantially weaker outside the anti-CCP-positive disease subtype.

Interactions

rs3761847 sits in a pathway heavily influenced by other autoimmune susceptibility loci. The TNFAIP3 locus on chromosome 6q23 — specifically rs6920220 and rs13207033 — encodes A20, the primary brake on NF-kB signaling that TRAF1 activates. Carriers of both a TRAF1-C5 G risk allele and TNFAIP3 risk variants face a double hit: amplified NF-kB signaling (via TRAF1) combined with impaired NF-kB termination (via A20 deficiency). The PTPN22 rs2476601 variant (R620W) represents a third independent RA risk signal that alters T-cell receptor signaling thresholds, and its combination with TRAF1-C5 G-allele carriage further elevates seropositive RA risk.

RSBN1 rs3789604 — A Psoriasis Haplotype Tag at the PTPN22 Locus

The PTPN22 gene region on chromosome 1p13 is one of the most studied loci in autoimmune genetics. Most attention focuses on the R620W missense variant (rs2476601), which alters T-cell signaling threshold and drives risk for rheumatoid arthritis, type 1 diabetes, and lupus. But haplotype analyses have revealed that the locus harbors multiple independent signals11 multiple independent signals
Deep sequencing and haplotype studies of 37+ PTPN22 variants identified secondary association signals that are statistically independent of R620W and tag different disease patterns
. rs3789604 is one such secondary variant — a synonymous change in the neighboring RSBN1 gene that marks a distinct haplotype block associated specifically with early-onset psoriasis and Graves' disease, rather than with the RA/T1D pattern driven by R620W.

RSBN1 (round spermatid basic protein 1) encodes a protein involved in chromatin regulation, but rs3789604's disease associations operate through linkage disequilibrium with nearby PTPN22 regulatory elements22 linkage disequilibrium with nearby PTPN22 regulatory elements
rs3789604 is a synonymous coding change — it doesn't alter the RSBN1 protein. Its clinical significance comes from being a tag SNP that marks a chromosomal haplotype block also covering PTPN22 regulatory regions. Changes in the haplotype's regulatory architecture modulate PTPN22 expression or splice isoform ratios in immune cells
. PTPN22 encodes LYP (lymphoid tyrosine phosphatase), a potent suppressor of T-cell receptor signaling that sets the activation threshold for both T and B lymphocytes.

The Mechanism

rs3789604 sits at GRCh38 chr1:113,812,320 within an exon of RSBN1, which is transcribed on the minus strand. The plus-strand alleles are T (reference, ~83% frequency) and G (alternate, ~17%). The nucleotide change is synonymous in RSBN1 — it does not alter the arginine at codon 31 — but rs3789604 resides in the same chromosomal neighborhood as PTPN22 and is in linkage disequilibrium33 linkage disequilibrium
LD means alleles co-occur more frequently than chance because of their proximity on the chromosome — rs3789604 and nearby variants travel together through generations, so any one serves as a proxy for the others
with multiple PTPN22 intronic and upstream variants. The G allele marks a specific haplotype block (documented in scleritis studies as the "TTATACGCG" haplotype) that is enriched in several inflammatory conditions.

The functional effect is inferred to operate through non-coding regulatory changes in the PTPN22 gene: altered promoter activity, enhancer function, or splicing signals that modulate how much LYP protein is produced in immune cells. Lower LYP activity (from reduced expression) lowers the activation threshold for T cells, increasing the likelihood of autoreactive responses. Higher LYP activity (from elevated expression) may conversely impair normal immune responses. The exact direction depends on the specific haplotype and disease context — which explains why the G allele shows risk in psoriasis but some Asian studies report different directional effects for Graves' disease.

The Evidence

The strongest evidence comes from psoriasis. Smith et al. 200844 Smith et al. 2008
Polymorphisms in the PTPN22 region are associated with psoriasis of early onset. Br J Dermatol 2008
found rs3789604 significantly associated with Type I (early-onset) psoriasis (P=0.0002) in a UK cohort of 647 cases and 566 controls, replicated in a combined dataset of 900 cases and 2,590 controls. Crucially, R620W showed no association with psoriasis in the same samples — demonstrating that rs3789604 tags a distinct functional signal, not the classic R620W haplotype.

This finding was independently replicated by Li et al. 200955 Li et al. 2009
Further genetic evidence for three psoriasis-risk genes: ADAM33, CDKAL1, and PTPN22. J Invest Dermatol 2009
in a combined meta-analysis of 2,823 psoriasis cases and 4,066 controls (P=3.45×10⁻⁵), one of the largest psoriasis genetics datasets assembled at that time. The consistent signal across two independent study designs in different populations substantially strengthens the association.

For Graves' disease, data show population heterogeneity. Ichimura et al. 200866 Ichimura et al. 2008
Associations of PTPN22 gene polymorphisms with susceptibility to Graves' disease in a Japanese population. Thyroid 2008
found the A allele (T on plus strand, the reference) significantly elevated in Japanese GD patients (OR=1.45, P=0.0085; 414 patients, 231 controls). In contrast, a Chinese study (Gu et al. 2010, PMID 19438904) found the AA genotype correlated with reduced GD risk. This apparent contradiction likely reflects differences in haplotype background77 haplotype background
The same rs3789604 allele may sit on chromosomes with different flanking PTPN22 variants in different populations, meaning the allele tags different regulatory configurations in Japanese vs Chinese vs European genomes
.

For rheumatoid arthritis, Wesoly et al. 200788 Wesoly et al. 2007
The 620W allele is the PTPN22 genetic variant conferring susceptibility to RA in a Dutch population. Rheumatology 2007
found no independent RA association for rs3789604 (P=0.134), confirming that at least in Europeans, R620W is the dominant signal and rs3789604 does not independently tag RA risk.

Practical Implications

The clinical takeaway from rs3789604 is primarily about psoriasis and related spondyloarthropathy risk, operating through the PTPN22 locus independently of R620W. Carriers of one or two G alleles face modestly elevated risk for early-onset psoriasis. Psoriasis affects about 2–3% of the global population and can transition to psoriatic arthritis in approximately 30% of cases — a destructive inflammatory arthropathy that may present years after skin disease onset. Catching psoriasis early enables topical and systemic interventions that reduce the progression to joint involvement.

The scleritis and Graves' disease signals are secondary findings with more limited or population-specific evidence. Carriers should be aware that autoimmune conditions at the PTPN22 locus tend to cluster — having one PTPN22-region variant increases the probability of other inflammatory conditions in the same pathway.

Interactions

rs3789604 is in close proximity to rs1217414 (an intronic PTPN22 variant also associated with psoriasis and ankylosing spondylitis) and was found to interact in the psoriasis signal. Smith et al. found that carrying risk alleles at both rs1217414 and rs3789604 showed a stronger combined association (P=0.002) than either alone, consistent with a shared haplotype effect. Both variants are largely independent of R620W (rs2476601), which primarily drives RA and T1D risk. Together, rs3789604 and rs1217414 define a psoriasis/spondyloarthropathy-specific PTPN22 haplotype distinct from the R620W haplotype that governs RA and T1D risk.

TLR6 Val327Met — A Rare Damaging Variant in the Mycoplasma Alarm System

Every time your immune cells encounter diacylated lipopeptides — the molecular calling-card of mycoplasma, staphylococcal lipoproteins, and certain gram-positive bacteria — they rely on a precise molecular handshake between Toll-like Receptor 2 (TLR2)11 Toll-like Receptor 2 (TLR2)
TLR2 is the central scaffold for bacterial lipopeptide sensing; it forms heterodimers with either TLR1 (for triacylated lipopeptides) or TLR6 (for diacylated lipopeptides), activating NF-κB and driving TNF-α, IL-6, and IL-12 production
and TLR622 TLR6
Toll-like Receptor 6, encoded at chromosome 4p14; forms obligate heterodimers with TLR2 and is distinguished from TLR1 by a blocked lipid channel that restricts ligand recognition to diacylated — not triacylated — lipopeptides
. The rs3796508 Val327Met variant substitutes a structurally conservative valine with a bulkier, polar methionine at position 327 in TLR6's extracellular leucine-rich repeat (LRR) domain — a change that SIFT and PolyPhen independently predict to be damaging to protein function.

This variant is extremely rare in European populations (T allele ~0.2%) but reaches a frequency of approximately 6% in East Asian populations (gnomAD), making it clinically relevant primarily for individuals of East Asian ancestry. It has no ClinVar entry, but published functional predictions and emerging association data across multiple disease contexts — colorectal cancer, allergic rhinitis, smoking-related disease — consistently point to meaningful functional consequences in the TLR2/TLR6 signaling arm of innate immunity. Its placement here in the autoimmune-inflammation category captures TLR6's role not only in infectious defense but in the chronic, low-grade inflammatory processes that contribute to autoimmune pathology.

The Mechanism

The 2009 crystal structure of the TLR2/TLR6 heterodimer (Kang et al., Immunity 200933 Kang et al., Immunity 2009
PMID 19931471; first crystal structure of TLR2/TLR6 bound to diacylated lipopeptide; determined at 2.1 Å resolution
) revealed the molecular basis for TLR6's unique specificity. Unlike TLR1, which accommodates a third fatty acid chain in an open lipid channel, TLR6 has its corresponding channel blocked by two conserved phenylalanines — physically preventing triacylated lipopeptide binding and enforcing diacylated specificity. When these phenylalanines are mutated, TLR2/TLR6 loses its diacyl-exclusivity. The compensating structural feature is a dimerization interface that is 80% larger than TLR2/TLR1, stabilising the heterodimer through extensive hydrophobic contacts in the absence of a full three-chain lipid anchor.

Position 327 falls within the leucine-rich repeat domain of TLR6 that forms part of this enlarged dimerization interface and contributes to the extracellular geometry that coordinates diacylated lipopeptide presentation. Valine at position 327 is a hydrophobic residue well-suited to packing into the hydrophobic dimerization interface; methionine is larger and introduces a flexible sulphur-containing side chain. The Val327Met substitution likely perturbs local packing of the LRR domain, potentially altering the geometry of TLR2/TLR6 heterodimer assembly and reducing signaling efficiency upon ligand engagement. SIFT (deleterious) and PolyPhen-2 (probably damaging) predictions, reported in the colorectal cancer association study by Semlali et al. 201944 Semlali et al. 2019
PMID 31281474
, support the inference of functional disruption.

The Evidence

Compared to TLR6 Ser249Pro (rs5743810) — the more extensively studied variant in the same gene — rs3796508 Val327Met has a much smaller published literature, and no study has directly measured the impact of Met327 on TLR2/TLR6 heterodimer formation or NF-κB activation in primary cells. The evidence base is emerging and largely association-derived.

The most direct disease association comes from a Saudi Arabian colorectal cancer case-control study55 colorectal cancer case-control study
115 CRC cases, 102 healthy controls; genotyped by TaqMan assay
by Semlali et al. (2019). In the global analysis, rs3796508 showed no significant association with CRC risk overall, but in subgroup analyses the Val/Met heterozygous genotype demonstrated a protective effect in males (OR=0.095, p=0.034) and in individuals over 57 years66 protective effect in males (OR=0.095, p=0.034) and in individuals over 57 years
these are suggestive findings from a small study (n=217) and require independent replication
. The same paper noted that TLR6 expression was significantly reduced in colon cancer tissue versus normal colon (p<0.001), positioning TLR6 as a potential tumour-suppressive innate immune sensor in the gut. Reduced TLR2/TLR6 signalling from Met327 could plausibly impair this innate surveillance.

In an inflammatory context, a 2026 Chinese Han study (Wang et al., Sci Rep 202677 Wang et al., Sci Rep 2026
PMID 41663697; 992 AR patients versus 992 healthy controls; primary focus on NFKB1 variants with TLR6 rs3796508 as a secondary finding
) found that rs3796508 contributed to increased allergic rhinitis (AR) risk specifically in women. This sex-specific effect is consistent with reports of sex-dimorphic TLR6 phenotypes in other variants (notably rs5743810 Ser249Pro, where cardiovascular and leprosy associations show sex-specific patterns). The AR finding is notable because it suggests Met327 may shift TLR2/TLR6 responses in a direction that promotes type-2 (Th2) allergic inflammation — potentially through altered NF-κB dynamics that affect the Th1/Th2 balance during early immune development and sensitisation.

A smoking study (Kohailan et al., Onco Targets Ther 201688 Kohailan et al., Onco Targets Ther 2016
PMID 27920557; 177 smokers vs 126 non-smokers; Saudi population
) examined rs3796508 among TLR6 variants in smokers, proposing the Val327Met variant as a potential index for smoking-related disease susceptibility. Tobacco smoke contains lipopolysaccharide- and lipopeptide-like compounds that chronically activate TLR2/TLR6; a damaging variant at position 327 could blunt this response — a mechanism consistent with altered inflammatory signalling in smoking-exposed airways.

Practical Implications

The evidence for rs3796508 is emerging and requires cautious interpretation. The variant is predicted damaging by two independent computational tools, which provides a biological prior for functional effects on TLR2/TLR6 signalling. However, no functional cellular studies have directly measured NF-κB output, cytokine production, or heterodimer formation efficiency in cells carrying the Met327 allele — these experiments are needed to validate computational predictions.

For carriers — predominantly individuals of East Asian descent (~6%) where the variant reaches population significance — the actionable implications centre on monitoring contexts where TLR2/TLR6 competence matters: gut barrier function (TLR6 plays a role in colonic innate surveillance), respiratory allergy (sex-specific AR risk increase in women), and chronic inflammatory conditions where TLR2/TLR6 pathway tone modulates disease expression.

The TLR2/TLR6 pathway also serves as a critical interface between commensal gut bacteria and mucosal immune regulation. The beneficial gut bacterium Faecalibacterium prausnitzii99 Faecalibacterium prausnitzii
a major short-chain fatty acid producer and key modulator of gut immune homeostasis; low in IBD patients
signals through TLR2/TLR6 to prime regulatory T cells — a pathway that could be impaired by reduced Met327 signalling.

Interactions

TLR6 operates exclusively through TLR2 (rs5743708). The TLR2 R753Q variant independently impairs TLR2 signalling from the intracellular TIR domain; any individual carrying both rs5743708 (TLR2 R753Q) and rs3796508 (TLR6 Val327Met) faces potential compound reduction of TLR2/TLR6 output from two separate structural points. The Ser249Pro variant (rs5743810) in TLR6 modulates NF-κB activation efficiency from the extracellular domain; individuals with both rs3796508 and rs5743810 risk alleles on the same TLR6 molecule (in cis) would carry two independent damaging changes in the same receptor — a scenario that warrants investigation in populations where both alleles are present (East Asian populations carry both at detectable frequencies). TLR1 (rs5743618) handles the triacylated lipopeptide arm of pattern recognition and operates independently of TLR6.

rs3877899

SELENOP Ala234Thr

Moderate Risk Factor

SELENOP Ala234Thr — When Your Selenium Carrier Protein Carries Less

Selenoprotein P (SELENOP, formerly SEPP1) is the primary selenium transport protein in human plasma. Unlike most proteins, which are built from the standard 20 amino acids, SELENOP incorporates selenocysteine11 selenocysteine
the 21st amino acid, structurally similar to cysteine but with selenium replacing sulfur — it gives selenoproteins their exceptional antioxidant power
at 10 positions. SELENOP accounts for roughly 50–60% of all plasma selenium and is the main vehicle by which the liver ships selenium to peripheral tissues — brain, testes, kidney, and thyroid among them. rs3877899 changes a single amino acid at position 234 of the protein, swapping alanine for threonine, and this substitution measurably alters how much selenium your body successfully delivers.

The Mechanism

SELENOP circulates as a mixture of isoforms. The full-length 60-kDa form contains all 10 selenocysteine residues and binds the apoER2 receptor (LRP8) on target cells, enabling receptor-mediated uptake of selenium into tissues. Shorter truncated isoforms containing six or fewer selenocysteine residues cannot bind apoER2 and are therefore poor selenium donors to tissues. The Ala234Thr substitution falls within a region of SELENOP that influences this isoform balance. In colorectal cancer patients, the GG coding genotype (CC plus-strand) combined with a 3'UTR variant in rs7579 was associated with decreased expression of the full-length 60-kDa isoform22 colorectal cancer patients, the GG coding genotype (CC plus-strand) combined with a 3'UTR variant in rs7579 was associated with decreased expression of the full-length 60-kDa isoform
Short SP et al. Free Radic Biol Med. 2018
, suggesting the variant's effects on isoform ratios are context-dependent.

A randomized dietary intervention in 170 Danish adults aged 50–7433 randomized dietary intervention in 170 Danish adults aged 50–74
Kopp TI et al. Genes Nutr. 2018
provided the clearest functional evidence: participants who ate 1,000 g of selenium-rich fish and mussels weekly for 26 weeks showed a genotype-dependent rise in plasma SELENOP. CC homozygotes achieved plasma selenoprotein P levels averaging 4.68 ng/mL higher than T-allele carriers at week 26 (95% CI −8.49 to −0.871, p=0.018). The T allele blunts the protein-level response to dietary selenium — meaning T carriers absorb selenium but synthesize or retain SELENOP less efficiently.

The Evidence

Selenium transport efficiency: The Kopp et al. RCT (n=170) is the most direct human evidence that the T allele compromises SELENOP upregulation in response to selenium intake. The CC genotype more effectively converts dietary selenium into circulating SELENOP, the protein responsible for distributing it.

Pregnancy and GPx response: A UK cohort of 230 pregnant women by Mao et al. 201644 Mao et al. 2016
Mao J et al. Am J Clin Nutr. 2016
found that T-allele carriers (coding A allele) maintained blood selenium better during gestation (p=0.005, explaining 8% of variance) and showed greater glutathione peroxidase-3 (GPx3) activity increases in response to selenium supplementation (p=0.01). This suggests the T allele may partially reroute selenium toward GPx synthesis rather than SELENOP.

Prostate cancer: A Chicago cohort study of prostate cancer patients by Ekoue et al. 201855 Ekoue et al. 2018
Ekoue DN et al. Prostate. 2018
found that men homozygous for the TT genotype (coding AA, Thr/Thr) had nearly 6-fold higher odds of treatment failure within 2 years of surgery (OR=5.75, 95% CI 1.09–30.5, p=0.021) compared to CC (Ala/Ala) homozygotes. This was independent of serum selenium levels, suggesting the functional isoform produced by the Ala234Thr variant — rather than circulating selenium per se — influences cancer progression.

Retinopathy of prematurity: In a cohort of 173 premature infants (gestational age ≤32 weeks), Strauss et al. 202366 Strauss et al. 2023
Strauss E et al. Int J Mol Sci. 2023
found the rs3877899 A allele (coding A = plus-strand T) significantly associated with advanced retinopathy of prematurity requiring treatment (OR=1.8, p=0.045). TT homozygote infants showed OR=7.0 for treatment-requiring ROP and OR=13.6 for treatment failure (p=0.005). The proposed mechanism is impaired antioxidant protection during critical retinal vascularization due to reduced selenium bioavailability.

Metabolic effects: A Brazil nut supplementation RCT in 130 healthy volunteers by Donadio et al. 201877 Donadio et al. 2018
Donadio JLS et al. Eur J Nutr. 2018
found that CT+TT carriers had significantly lower fasting glucose compared to CC carriers, suggesting the Thr234 protein variant may interact with selenoprotein P's emerging role in glucose metabolism.

Practical Implications

The T allele does not simply lower selenium — it changes how selenium is packaged and delivered. T-allele carriers may benefit from ensuring consistent selenium intake through food sources with high selenomethionine bioavailability, since selenomethionine (from Brazil nuts, fish, eggs) is incorporated non-specifically into all proteins and provides a buffer that selenocysteine-dependent SELENOP cannot. Selenium supplementation above 200 mcg/day carries toxicity risk regardless of genotype; targeting the 55–200 mcg/day range through diet is appropriate for T carriers.

Interactions

The companion variant rs7579 in SELENOP (a 3'UTR variant at c.*14G>A) interacts with rs3877899 to influence isoform expression in colorectal cancer patients. The combined GG(rs3877899) + GA(rs7579) genotype was associated with the greatest reduction in full-length 60-kDa SELENOP. rs3877899 also functionally interacts with the selenium pathway partner rs1050450 (GPX1 Pro198Leu) — both variants modulate how effectively dietary selenium is converted into active selenoproteins, and their combined effect on cancer risk is an active area of research. The SELENOP-GPX4 axis (rs713041) is a third interaction point, as GPX4 competes with SELENOP for selenium incorporation and affects ferroptosis susceptibility.

MYH15 and Coronary Risk — An Emerging but Uncertain Link

Myosin heavy chains are the molecular motors that drive muscle contraction. The MYH gene family encodes 15 isoforms, each specialized for different muscle types: cardiac myosins power the heart, skeletal myosins drive limb movement, and a handful of unconventional isoforms serve more specialized roles. MYH15 falls into this last group. Expressed primarily in extraocular muscles (which control eye movement) and muscle spindles11 muscle spindles
proprioceptive sensory organs embedded in skeletal muscle
, it is not a classic cardiac or smooth muscle myosin. Yet genetic studies from the mid-to-late 2000s flagged rs3900940 in MYH15 as a possible contributor to coronary heart disease risk — a finding that remains intriguing but incompletely explained.

The Mechanism

rs3900940 lies on chromosome 3 at position 108,428,881 (GRCh38). Because MYH15 is transcribed from the minus strand, the plus-strand reference allele T corresponds to a coding-strand A at position 3313 of the transcript (NM_014981.3). The alternate C allele on the plus strand produces a coding-strand G, changing codon 1105 from Thr (ACT) to Ala (GCT) — the p.Thr1105Ala substitution. Threonine at position 1105 sits in the C-terminal tail domain22 C-terminal tail domain
the region of myosin heavy chain responsible for filament assembly and interaction with other structural proteins
. Replacing a polar, phosphorylatable threonine with a nonpolar alanine could in principle alter filament packing or post-translational regulation, but the functional consequence of this specific substitution has not been directly characterized.

How a variant in an extraocular/spindle myosin might influence coronary disease risk is mechanistically unclear. One hypothesis: MYH15 may have low-level expression in vascular smooth muscle or coronary endothelium that has been missed by bulk transcriptomic surveys but contributes to vessel wall mechanics. A 2014 study found that intronic MYH15 variants associate with abnormal coronary flow reserve in men33 intronic MYH15 variants associate with abnormal coronary flow reserve in men
Yoshino et al., Coronary Artery Disease 2014
with odds ratios of 2.27–2.60, the authors noting that "further studies are needed to clarify how MYH15 might be involved in vascular biology." Another possibility is that rs3900940 is in linkage disequilibrium with a nearby causal variant in a different gene. Given the limited mechanistic work, the association currently rests on epidemiological evidence without a confirmed molecular pathway.

The Evidence

The original genetic risk signal emerged from prospective cohort work. Bare et al. (2007)44 Bare et al. (2007) examined five genetic variants in the Atherosclerosis Risk in Communities (ARIC) cohort of 9,129 white adults. Participants carrying the highest-risk combination of all five variants (including rs3900940) had a 57% increased hazard for incident coronary heart disease (HR 1.57, 95% CI 1.21–2.04, P = 0.001). However, individual effect estimates for rs3900940 were not reported separately in the published paper — the variant was evaluated only as part of a combined polygenic score, limiting interpretation of its independent contribution.

The stroke connection came from the Vienna Stroke Registry. Luke et al. (2009)55 Luke et al. (2009) tested CHD-linked SNPs in 562 stroke cases and 815 controls, finding that rs3900940 in MYH15 was among those associated with noncardioembolic stroke: OR 1.31 (90% CI 1.07–1.60). Noncardioembolic stroke includes large-artery atherosclerotic stroke and small-vessel disease — phenotypes that share pathophysiological features with coronary atherosclerosis. The same group extended this analysis to the Cardiovascular Health Study, finding that rs3900940's prespecified risk allele showed nominal association with ischemic stroke in white participants (one-sided P < 0.05), though this study specifically flagged seven SNPs including MYH15 without providing individual ORs for each.

The coronary microvascular angle comes from a 2014 candidate-gene study of 643 patients evaluated for coronary flow reserve (CFR). Yoshino et al.66 Yoshino et al. found that two intronic MYH15 variants (rs4855559 and rs7630352, distinct from rs3900940) showed striking male-specific associations with impaired CFR (OR ≈ 2.3–2.6, SNP-sex interaction P < 0.001) with no significant signal in women. This sex-specific pattern suggests MYH15 may interact with androgen-driven vascular biology, but rs3900940 was not among the variants tested in that study.

Taken together, the evidence is consistent but not strong. Multiple small-to-medium studies point in the same direction, but the variant has never been a standalone genome-wide significant hit, effect sizes are modest, and the biological mechanism connecting an extraocular myosin to coronary risk is unresolved. The evidence level is therefore emerging rather than moderate or strong.

Practical Actions

The modest, replicated association between rs3900940 and coronary/stroke risk means that CC homozygotes have a somewhat elevated probability of atherosclerotic cardiovascular disease. The effect size (OR ~1.3 for one copy of the risk allele) is comparable to many common cardiovascular risk variants — meaningful at a population level, modest at an individual level. The most actionable implication is that CC carriers should prioritize surveillance of established cardiovascular risk factors (LDL, blood pressure, glucose) rather than treating this variant as a high-penetrance finding.

TC heterozygotes carry an intermediate risk. The variant appears to act additively, with each copy of the C allele contributing approximately proportionally to risk.

Interactions

rs3900940 was originally evaluated alongside four other variants — rs20455 in KIF6, rs7439293 in PALLD, rs2298566 in SNX19, and rs1010 in VAMP8 — in a combined genetic risk score for coronary heart disease. When all five risk alleles are present, the combined effect (HR 1.57) exceeds what any single variant contributes. This multi-variant interaction context suggests that rs3900940 may be most informative when interpreted in conjunction with these co-validated CHD risk markers.

HLA-C*06:02 — The Master Psoriasis Susceptibility Haplotype

The HLA-C gene sits at the heart of the immune system's antigen presentation machinery, producing the HLA-C protein that displays peptide fragments to CD8+ cytotoxic T cells11 CD8+ cytotoxic T cells
MHC class I molecules present endogenous peptides to CD8+ T cells, enabling immune surveillance of cellular protein content
. One particular version of this protein — encoded by the HLA-C*06:02 haplotype — is the strongest single genetic risk factor for psoriasis, accounting for the largest fraction of genetic predisposition22 the largest fraction of genetic predisposition
HLA-C*06:02 on PSORS1 is the primary psoriasis susceptibility allele in Caucasian populations, responsible for a large fraction of familial risk
to early-onset disease in European populations. The SNP rs4406273, located approximately 28.7 kb upstream of HLA-C, acts as a near-perfect proxy for the HLA-C*06:02 allele — across European, South Asian, and Southeast Asian cohorts, genotype concordance is 0.984–0.996 (r²=0.946–0.984), making this single SNP almost as informative as full HLA typing.

The Mechanism

HLA-C*06:02 drives psoriasis through a specific autoimmune pathway targeting skin melanocytes. The critical autoantigen is ADAMTS-like protein 5 (ADAMTSL5)33 ADAMTS-like protein 5 (ADAMTSL5)
ADAMTSL5 is highly expressed in psoriasis lesions, especially in melanocytes; ADAMTSL5-specific CD8+ T cells are detectable in patients and produce IL-17A
, a peptide that HLA-C*06:02 presents to skin-infiltrating CD8+ T cells. These T cells produce the pathogenic cytokine IL-17A44 IL-17A
IL-17A stimulation of keratinocytes drives the epidermal hyperproliferation that produces psoriatic plaques
, which drives keratinocyte hyperproliferation and the characteristic plaques of psoriasis vulgaris. Carriers of HLA-C*06:02 have odds ratios for psoriasis ranging from 3.4 to 4.9 depending on the population studied.

The interaction with ERAP1 (endoplasmic reticulum aminopeptidase 1) reveals an elegant molecular specificity: ERAP1 trims peptide precursors55 trims peptide precursors
ERAP1 acts as a gatekeeper, trimming NH2-terminal extended precursors to the exact peptide length required for HLA-C*06:02 binding and presentation
to the optimal size for HLA-C*06:02 binding. Different ERAP1 haplotypes produce different amounts of the ADAMTSL5 autoantigen, meaning ERAP1 variants only influence psoriasis risk in HLA-C*06:02 carriers66 ERAP1 variants only influence psoriasis risk in HLA-C*06:02 carriers
ERAP1 variants confer risk only in individuals with the HLA-C risk allele; no effect in non-carriers, establishing HLA-C as a prerequisite for ERAP1-mediated risk
— an example of genetic epistasis where the effect of one gene depends entirely on the context of another.

The Evidence

The landmark psoriasis GWAS by Strange et al. 201077 The landmark psoriasis GWAS by Strange et al. 2010
Large multi-cohort GWAS identifying new psoriasis loci including the HLA-C × ERAP1 epistatic interaction, P_combined=6.95×10−6
established the HLA-C/ERAP1 interaction and confirmed HLA-C as the primary PSORS1 signal. Stuart et al. 2015 validated rs440627388 Stuart et al. 2015 validated rs4406273
Genotype concordance 0.984–0.996; sensitivity 0.965–1.000; specificity 0.9963–0.9994 as a surrogate for HLA-C*06:02 across European, Asian, and some African and admixed populations
as an excellent single-SNP surrogate across multiple diverse cohorts, reporting ORs of 3.38, 2.32, and 4.91 in Michigan, Pakistani, and Thai samples respectively.

HLA-C*06:02 status has emerged as a clinically useful predictive biomarker99 clinically useful predictive biomarker
Meta-analysis (937 patients at 6-month timepoint); risk difference of 0.24 (95% CI 0.14–0.35, P<0.001) in PASI75 response favoring HLA-C*06:02-positive patients on ustekinumab
for biologic drug response in psoriasis. HLA-C*06:02-positive patients achieve substantially higher response rates on ustekinumab (anti-IL-12/IL-23) — approximately 89–92% achieving PASI75 at 6 months versus 62–67% in HLA-C*06:02-negative patients.

Phenotypically, HLA-C*06:02-positive patients are more likely to present with guttate lesions, greater body surface area, and higher PASI scores1010 guttate lesions, greater body surface area, and higher PASI scores
HLA-Cw6 positivity associated with guttate psoriasis, greater body surface area, and higher PASI scores in multiple cohorts
and earlier disease onset (type I psoriasis, onset before age 40).

Practical Actions

HLA-C*06:02 carrier status (tagged by rs4406273-A) has direct, genotype-specific clinical implications. If you carry one or two copies of the A allele, your dermatologist should be informed: this genotype predicts better response to ustekinumab (Stelara) and poorer relative benefit from some other biologics. It also warrants heightened awareness of early psoriasis symptoms — particularly following streptococcal throat infections, which are a known trigger for guttate flares specifically in HLA-C*06:02 carriers.

Homozygous AA carriers face substantially elevated risk and should establish regular dermatological follow-up even without current disease, since psoriasis is often preceded by subclinical immune activation. Triggers known to unmask or exacerbate psoriasis in HLA-C*06:02 carriers include streptococcal infections, stress, skin trauma (Koebner phenomenon), and certain medications (lithium, beta-blockers).

Interactions

The most important interaction is with ERAP1 (rs27524). ERAP1 risk alleles only increase psoriasis susceptibility in individuals who also carry the HLA-C risk allele. A user carrying risk alleles at both rs4406273 (HLA-C*06:02 proxy) and rs27524 (ERAP1) has a substantially higher combined risk than either variant alone, because ERAP1 controls the generation of the very autoantigen that HLA-C*06:02 presents to pathogenic T cells.

The related SNP rs12191877 is another HLA-C*06:02 tagging SNP at a different position; both co-segregate with the same HLA-Cw6 haplotype and share near-identical psoriasis associations. They provide complementary coverage for HLA-C*06:02 detection rather than marking independent biological signals.

rs489693

MC4R MC4R AIWG variant

Strong Risk Factor

MC4R rs489693 — The Antipsychotic Weight Gain Locus

The melanocortin-4 receptor (MC4R) sits at the centre of the hypothalamus's appetite control system. When the brain releases satiety signals — leptin-driven alpha-melanocyte stimulating hormone11 alpha-melanocyte stimulating hormone
α-MSH, derived from the POMC precursor peptide in hypothalamic neurons
— MC4R receives them, suppressing hunger and raising energy expenditure. Rare coding mutations in MC4R cause severe monogenic obesity; common variants near the gene produce subtler, population-wide effects on body weight. rs489693 belongs to this second category: an intergenic variant located approximately 155 kilobases downstream of MC4R on chromosome 18, within a regulatory region that influences MC4R locus activity.

What makes rs489693 distinctive within the MC4R neighbourhood is its clinical specificity. Unlike rs17782313 (the strongest common MC4R-region obesity signal), rs489693 was discovered through a pharmacogenomics GWAS22 pharmacogenomics GWAS
genome-wide association study restricted to patients being treated with antipsychotic medications
rather than a general population obesity scan. The A allele acts recessively — heterozygous AC carriers show modest effects, while AA homozygotes (about 10% of Europeans) experience markedly amplified weight gain when prescribed second-generation antipsychotics.

The Mechanism

rs489693 has no protein-coding consequence — it lies in intergenic DNA without a documented transcript. Its functional effect is presumed to be regulatory: the variant likely resides in an enhancer or repressor element33 enhancer or repressor element
non-coding DNA that controls when and how much a nearby gene is expressed
that modulates MC4R expression in hypothalamic or limbic tissue. Second-generation antipsychotics (SGAs) such as olanzapine and clozapine are known to antagonise histamine H1 and serotonin 5-HT2C receptors, triggering increased appetite and reducing energy expenditure. Reduced MC4R-mediated satiety tone appears to compound this drug effect: when the melanocortin brake on appetite is already weakened by genetic variation at this locus, the appetite-stimulating actions of SGAs go less opposed, producing larger weight increases.

The Malhotra et al. 2012 GWAS44 Malhotra et al. 2012 GWAS
139 pediatric patients in discovery, three independent replication cohorts, total n=344
found that AA homozygotes showed not only greater weight gain but also consistent elevations in triglycerides, leptin, and insulin — a full metabolic syndrome profile. This suggests the variant influences an MC4R-mediated pathway that governs both appetite and peripheral metabolic signalling.

The Evidence

The pharmacogenomics signal at rs489693 is among the most robustly replicated in antipsychotic-induced weight gain (AIWG) research. The pivotal study by Malhotra et al.55 Malhotra et al.
GWAS with three independent replication cohorts across US and European psychiatric centres
achieved genome-wide significance (p=5.59×10-12) in meta-analysis across cohorts — one of the few pharmacogenomics variants to reach this threshold for any drug-induced side effect.

A subsequent naturalistic study by Czerwensky et al. 201366 Czerwensky et al. 2013
341 Caucasian inpatients on SGAs including olanzapine, clozapine, risperidone, paliperidone, quetiapine, and amisulpride
found that AA homozygotes gained 2.2 times as much weight as CC carriers after four weeks of treatment (+2.2 kg vs +1.0 kg, p=0.039). In the subgroup without weight-gain-inducing co-medications, the difference was 3.1-fold; in first-episode patients, 2.7-fold — both p<0.05. This confirms the signal is not confounded by prior drug exposure and is strongest when the genetic variant can be evaluated in isolation.

A 2026 meta-analysis77 2026 meta-analysis
75 studies examined in systematic review, all adult patients with severe mental illness prescribed antipsychotics
confirmed rs489693 as one of six pharmacogenomics loci reaching significance across the AIWG literature (Hedge's g=0.127). Notably, five of the six replicated genes regulate hypothalamic appetite and satiety pathways — MC4R, HTR2C, LEPR, ADRA2A, and CNR1 — pointing to a convergent mechanism through central appetite regulation rather than peripheral metabolic effects.

A real-world polygenic risk score88 real-world polygenic risk score
incorporating rs489693 plus five other AIWG variants in first-episode psychosis patients
predicted weight gain with adjusted r²=0.59 in females, though the PRS showed no significant effect in males — a sex-specific finding that requires replication but highlights the importance of considering sex as a biological variable in AIWG prediction.

Practical Actions

For AA homozygotes, the most clinically useful application is pre-treatment risk stratification. Before starting a high-weight-gain-liability SGA (olanzapine, clozapine, quetiapine, or risperidone), knowing you carry the AA genotype provides a genetic rationale for discussing lower-weight-gain alternatives with your prescriber — aripiprazole, lurasidone, and ziprasidone have substantially lower AIWG profiles in the general population, and genetic risk strengthens the case for preferring them when clinically equivalent.

When SGA treatment is medically necessary and cannot be changed, proactive metabolic monitoring becomes essential. AA carriers should have baseline and regular follow-up measures of weight, waist circumference, fasting glucose, triglycerides, and HbA1c — the full metabolic syndrome panel. Early detection of metabolic drift enables earlier intervention (lifestyle, metformin adjunct, or medication review) before frank metabolic syndrome develops.

Timing matters: the Czerwensky 2013 data shows 2.2 kg excess gain in just four weeks, before many patients would be flagged by routine clinical review. More frequent early weight monitoring (weekly for the first month) in AA carriers could prompt earlier clinical response.

Interactions

rs17782313 (MC4R near-gene variant): Both variants tag the MC4R locus, but they are not in strong linkage disequilibrium and appear to capture partially independent signals. rs17782313 is primarily an obesity signal in the general population; rs489693 is primarily an antipsychotic-weight-gain signal. Carriers of risk alleles at both loci may have compound MC4R-pathway vulnerability — both general appetite dysregulation and amplified drug-induced weight gain. The combined effect has not been quantified in a single study.

rs8087522 (MC4R upstream regulatory variant): A second MC4R upstream variant ~155 kb from the TSS with an exploratory association with clozapine-induced weight gain in European-ancestry patients (PMID 22310352). The rs489693 and rs8087522 signals may partially overlap or independently contribute to MC4R regulatory tone. Combined effects have not been formally assessed.

HTR2C rs3813929: The serotonin 2C receptor variant is the strongest single-gene AIWG signal (Hedge's g=0.76 in the Warner-Levy 2026 meta-analysis). Antipsychotics cause weight gain partly through 5-HT2C antagonism; MC4R pathway impairment at rs489693 operates through a different mechanism (reduced satiety tone). Carriers of risk alleles at both loci face additive AIWG risk through independent pathways, making a combined pharmacogenomics assessment more predictive than either alone.

rs498422

LOC101929163 LOC101929163 Variant

Emerging Risk Factor

LOC101929163 rs498422 — An HLA-Region Variant Associated with Non-Obstructive Azoospermia in East Asian Men

Non-obstructive azoospermia (NOA) is the complete absence of sperm in semen due to a failure of sperm production in the testes — the most severe form of male infertility, affecting roughly 1% of all men and approximately 10% of infertile men. Unlike obstructive azoospermia (where a physical blockage prevents sperm from reaching the ejaculate), NOA reflects a fundamental spermatogenic failure11 spermatogenic failure
inability of the testes to produce mature sperm through the meiotic and mitotic divisions of spermatogenesis
. Genetic factors account for a substantial proportion of NOA cases. Chromosome abnormalities (e.g. Klinefelter syndrome, Y microdeletions) are the most common causes, but common variants of modest effect — particularly in the HLA region — have also been implicated.

The Mechanism

rs498422 lies in an intronic region of LOC101929163 (also annotated as TSBP1-AS1), a non-coding antisense RNA gene situated on chromosome 6p21.32 between TSBP1 (testis-expressed basic protein 1) and BTNL2 (butyrophilin-like protein 2). This locus sits within the extended major histocompatibility complex (MHC)22 major histocompatibility complex (MHC)
the HLA region, spanning ~4 Mb on chromosome 6p21, encoding immune-recognition proteins and dozens of regulatory genes
. The precise causal mechanism is not established. The intronic position suggests the G allele may tag a haplotype that influences the expression of nearby genes, particularly TSBP1 (expressed in testis) or BTNL2 (a co-stimulatory molecule involved in immune regulation). Given the strong immune-regulatory role of the HLA region, one leading hypothesis is that certain MHC haplotypes alter testicular immune tolerance — creating a milieu in which the immune system fails to appropriately protect germ cells from autoimmune attack.

The Evidence

The primary evidence comes from a landmark genome-wide association study by Zhao et al. 201233 genome-wide association study by Zhao et al. 2012
A genome-wide association study reveals that variants within the HLA region are associated with risk for nonobstructive azoospermia. Am J Hum Genet, 2012
, conducted in Han Chinese men across three independent stages: discovery (802 NOA cases, 1,863 controls), northern China replication (818 cases, 1,755 controls), and central/southern China replication (606 cases, 958 controls), totalling 2,226 cases and 4,576 controls. rs498422 at the C6orf10/BTNL2 region reached a combined p-value of 2.43 × 10⁻¹², with an odds ratio of 1.42 — a statistically robust finding that survived genome-wide significance thresholds after multiple-testing correction.

A subsequent case-control study and meta-analysis by Zou et al. 201744 case-control study and meta-analysis by Zou et al. 2017
Association and meta-analysis of HLA and non-obstructive azoospermia in the Han Chinese population. Andrologia, 2017
in 603 NOA cases and 610 controls replicated the association (OR 1.40, p = 0.006). A meta-analysis across five studies confirmed that the three HLA-region loci including rs498422 are consistently associated with NOA susceptibility (p < 0.01 across all), leading the authors to propose these variants as potential diagnostic markers for male infertility risk.

A replication study in Japanese men by Jinam et al. 201355 replication study in Japanese men by Jinam et al. 2013
HLA-DPB1*04:01 allele is associated with non-obstructive azoospermia in Japanese patients. Hum Genet, 2013
(443 patients, 544 controls) showed that the correlated marker rs3129878 (in linkage disequilibrium with rs498422) was associated with NOA in a Japanese population, though HLA-DPB1*04:01 emerged as the primary independently associated allele in that cohort. This confirms the broader HLA-region involvement in NOA across East Asian populations.

Critical limitation: All replication data are from East Asian populations (Han Chinese and Japanese). The G allele of rs498422 is approximately three times more common in East Asians (~18%) than in Europeans (~6%), and the MHC haplotype structure differs substantially between populations. No replication data exist for European, African, South Asian, or Latino men. Until independent replication in non-East-Asian populations is published, this association should be considered population-specific and the evidence classified as emerging for general population use.

Practical Actions

For men of East Asian ancestry, carrying G alleles at rs498422 modestly elevates the prior probability of spermatogenic impairment. The OR of 1.42 means that, in the context of an infertility evaluation, this genotype adds modest incremental information — it does not diagnose NOA or predict it with high certainty, but it can inform how aggressively to investigate spermatogenesis, particularly in the context of other risk factors such as a history of orchitis, cryptorchidism, or prior chemotherapy.

Semen analysis remains the essential diagnostic test. Men with G alleles who are pursuing fertility and have not had a semen analysis should consider obtaining one. If NOA is diagnosed, the presence of this risk genotype does not change management — NOA is evaluated with hormonal testing, testicular biopsy, and potential surgical sperm retrieval regardless of genotype.

Interactions

rs498422 is in linkage disequilibrium with rs3129878 (HLA-DRA) and rs7194, two other HLA-region variants associated with NOA. In the Zhao et al. 2012 GWAS, rs498422 and rs3129878 showed independent signals after conditioning on each other, suggesting they may tag distinct functional effects within the MHC. A 2019 fine-mapping study (Huang et al., PMID 30502936) identified additional loci (rs7194, rs4997052) within the MHC class I region, suggesting that multiple independent signals in the HLA region collectively contribute to NOA susceptibility — possibly through different immune mechanisms acting on spermatogenesis. Men who carry risk alleles at multiple HLA-region NOA loci may have cumulatively higher risk, though formal compound analysis of this specific combination has not been published.

NEDD4L and Salt-Sensitive Blood Pressure — The ENaC Gatekeeper

In the collecting tubule of the kidney, a molecular handshake determines how much salt your body holds onto each day. The epithelial sodium channel (ENaC) sits at the apical membrane of tubular cells and controls the final step of sodium reabsorption. How long ENaC stays at the cell surface — and therefore how much sodium it captures — depends critically on the NEDD4L protein, an E3 ubiquitin ligase that marks ENaC subunits for removal and degradation. Genetic variants in NEDD4L that alter this regulatory efficiency create a spectrum of salt handling phenotypes, with direct consequences for blood pressure and cardiovascular risk. The intronic rs549476 variant lies within this regulatory locus and tags haplotypes associated with altered NEDD4L isoform expression and hypertension susceptibility.

The Mechanism

NEDD4L encodes an HECT-domain E3 ubiquitin ligase11 E3 ubiquitin ligase
An enzyme that attaches ubiquitin tags to target proteins, marking them for proteasomal or lysosomal degradation
that specifically targets the β- and γ-subunits of ENaC through their C-terminal PY (proline-tyrosine) motifs. After binding, NEDD4L catalyzes the transfer of ubiquitin chains to ENaC, triggering internalization and degradation of the channel from the cell surface. Less ENaC at the membrane means less sodium reabsorption and, ultimately, lower extracellular fluid volume and blood pressure.

The rs549476 variant is intronic and tags a haplotype that affects NEDD4L isoform balance. The closely linked functional variant rs4149601, at the last nucleotide of exon 1, creates a cryptic splice site22 cryptic splice site
A splice site normally suppressed; when activated by a variant, it causes alternative splicing
. The G allele at rs4149601 generates isoform I, which retains the Ca²⁺-dependent lipid-binding C2 domain; this C2-domain-containing isoform interacts less efficiently with ENaC under high-sodium conditions. Carriers of the G haplotype therefore reabsorb more sodium under salt challenge, producing higher blood pressure and greater salt sensitivity33 salt sensitivity
A blood pressure increase of ≥10 mmHg in response to a sodium load
.

The renin-angiotensin-aldosterone system (RAAS) responds by suppressing renin — GG carriers show significantly lower plasma renin concentrations under salt loading, confirming that their kidneys are already retaining excess sodium relative to the system setpoint.

The Evidence

The clinical evidence for NEDD4L variants and salt-sensitive hypertension comes from several independent cohorts. In a Swedish crossover study of 39 normotensive subjects, individuals carrying the high-risk NEDD4L haplotype (GG at rs4149601 plus CC at rs2288774) showed salt sensitivity of 8.0 mmHg versus 5.0 mmHg in non-carriers (P=0.007)44 individuals carrying the high-risk NEDD4L haplotype (GG at rs4149601 plus CC at rs2288774) showed salt sensitivity of 8.0 mmHg versus 5.0 mmHg in non-carriers (P=0.007)
Holmberg et al. Polymorphism in NEDD4L is associated with increased salt sensitivity. PLoS One, 2007
, with accompanying suppression of plasma renin (9.0 vs 15.0 mU/L, P=0.03) and elevation of N-terminal pro-atrial natriuretic peptide.

The population-scale impact was established in the Malmö Diet and Cancer Study of 27,564 participants55 Malmö Diet and Cancer Study of 27,564 participants
Dahlberg et al. Genetic variation in NEDD4L is associated with cardiovascular disease and cardiovascular death. J Hypertens, 2014
: carriers of the NEDD4L salt-sensitivity genotype had higher systolic BP (142 vs 141 mmHg, P=0.002) and diastolic BP (86.0 vs 85.6 mmHg, P=0.025), with a multivariate hazard ratio of 1.13 for cardiovascular disease (P=0.018) and 1.20 for coronary events (P=0.005) over 14 years of follow-up.

The pharmacogenomics implications are clinically important. In the PEAR trial of 767 hypertensive patients66 PEAR trial of 767 hypertensive patients
Bress et al. Association of variants in NEDD4L with blood pressure response and adverse cardiovascular outcomes in hypertensive patients treated with thiazide diuretics. J Hypertens, 2013
, NEDD4L G-allele carriers on hydrochlorothiazide achieved significantly greater blood pressure reductions than non-carriers, consistent with the salt-retaining phenotype being particularly responsive to diuretic therapy.

Practical Actions

The actionable takeaway from the NEDD4L literature is clear: individuals with the salt-retaining genotype benefit disproportionately from sodium restriction and from diuretic antihypertensive therapy. Reducing dietary sodium from a typical 3,400 mg/day to below 1,500 mg/day can lower systolic blood pressure by 5–8 mmHg in salt-sensitive individuals — a reduction comparable to some antihypertensive medications. For those who require pharmacological treatment, thiazide diuretics are particularly effective in this genetic context, and this information should inform antihypertensive prescribing decisions.

Interactions

rs549476 tags a haplotype in LD with the functionally characterized rs4149601 and rs2288774 variants. The combined GG+CC haplotype at these two positions confers the highest salt-sensitivity phenotype, present in approximately 9.6% of European populations.

The NEDD4L pathway also interacts with the alpha-adducin (ADD1 rs4961) and WNK1 variants in a convergent renal sodium-handling network. In a physiological interaction study77 In a physiological interaction study
Manunta et al. Physiological interaction between alpha-adducin and WNK1-NEDD4L pathways. Hypertension, 2008
, the combination of ADD1 Trp allele plus WNK1 GG plus NEDD4L GA/AA showed consistent synergistic effects on nocturnal blood pressure, thiazide response, and acute saline-challenge BP. Users carrying risk variants in both pathways should be particularly attentive to sodium intake.