TSHZ1 and Constitutional Vestibular Susceptibility to Motion Sickness

About one in three people is highly susceptible to motion sickness — nausea, dizziness, and malaise triggered by cars, boats, planes, or virtual environments. For most of them, this isn't a character flaw or a question of toughening up: a large portion of the variation is genetic, with twin studies estimating heritability at 57–70%11 57–70%
Reavley et al. 2006 estimated heritability at 57% (95% CI 51–63%) for a composite motion sickness score in adult twins, rising to 70% when measured retrospectively in childhood
. rs10514168 is among the most clearly defined of those genetic contributors.

The variant sits downstream of TSHZ1 (Teashirt Zinc Finger Homeobox 1)22 TSHZ1 (Teashirt Zinc Finger Homeobox 1)
a transcription factor in the teashirt family, expressed during embryonic development and essential for forming the external auditory canal, middle ear ossicles, soft palate, and inner ear structures in mice and humans
. Loss-of-function mutations in TSHZ1 cause congenital aural atresia33 congenital aural atresia
absence or severe malformation of the external auditory canal, often accompanied by middle ear structural abnormalities — the most severe phenotype of TSHZ1 haploinsufficiency
(OMIM 607842), an autosomal dominant condition. The implication of rs10514168 is that constitutional variation in TSHZ1 expression — far subtler than loss-of-function disease mutations — shapes the architecture of the vestibular apparatus itself, leaving some individuals with an inner ear that is structurally more sensitive to conflicting motion signals.

The Mechanism

The vestibular system of the inner ear detects head movement and body position via fluid-filled semicircular canals and otolith organs (utricle and saccule). Motion sickness is thought to arise from sensory conflict: the vestibular system signals movement while the visual system — watching a stationary car interior, for example — reports stillness. The brain's resolution of this conflict is what produces nausea and autonomic symptoms.

rs10514168 is an intergenic regulatory variant located downstream of TSHZ1 on chromosome 18. It does not change any protein sequence. Instead, by modulating TSHZ1 transcriptional activity during inner ear morphogenesis, it appears to influence the structural sensitivity of the vestibular apparatus — an innate architectural predisposition that persists throughout life. The C allele (major, protective) is associated with lower motion sickness scores; each copy of the A allele (minor) adds to constitutional susceptibility.

The Evidence

The primary evidence comes from Hromatka et al. 201544 Hromatka et al. 2015
Genetic variants associated with motion sickness point to roles for inner ear development, neurological processes and glucose homeostasis. Human Molecular Genetics
, the first genome-wide association study of motion sickness, conducted in 80,494 individuals from the 23andMe research cohort. Participants rated their car sickness on a 0–3 scale. Thirty-five SNPs reached genome-wide significance (P < 5×10⁻⁸); rs10514168 reached P = 2.7×10⁻⁹. The C allele was protective with an effect size of β = −0.047 per allele (95% CI −0.062 to −0.031) — meaning each additional C allele reduces the motion sickness score by roughly 0.05 units on a 3-point scale. The A allele confers the complementary risk.

The TSHZ1 locus was specifically highlighted because of the gene's established role in murine inner ear development. Multiple other significant GWAS hits from the same study implicate related developmental pathways — rs12111385 near MUTED (endosomal trafficking in cochlear hair cells) and HOX gene clusters — converging on the hypothesis that motion sickness susceptibility is substantially determined by vestibular organ architecture laid down during embryogenesis.

Population frequencies show marked variation by ancestry: the A risk allele is most common in Latino/Admixed American populations (~22%), moderate in Europeans (~16%) and South Asians (~12%), less common in Africans (~6%), and rare in East Asians (~0.4%). This creates meaningful population-level differences in the baseline prevalence of high motion sickness susceptibility.

Evidence level is classified as strong: a single large GWAS (n=80,494) with genome-wide significance, biologically coherent mechanism through an established developmental gene, and independent heritability data. The variant has not yet been replicated in a second independent GWAS cohort of comparable size.

Practical Actions

Carriers of the A allele — particularly AC heterozygotes and AA homozygotes — have a constitutional vestibular susceptibility that does not resolve with time or willpower. The appropriate responses are: (1) desensitization through vestibular rehabilitation exercises, which have strong evidence for reducing vestibulo-ocular conflict responses; (2) anticipatory mitigation before travel (positioning, gaze strategies, antiemetics as needed); and (3) career counselling for individuals considering occupations with high vestibular demands such as commercial aviation, naval service, or professional maritime work, where motion sickness susceptibility affects performance and certification.

Interactions

rs10514168 is one of several motion sickness–associated GWAS hits near genes involved in inner ear development and vestibular function. rs12111385 (near MUTED, encoding an endosomal trafficking protein important for cochlear hair cell function) and rs1435985 (also near TSHZ1 in some analyses) represent related loci from the same GWAS. Individuals carrying risk alleles at multiple loci would be expected to have additive constitutional susceptibility, though compound analyses have not been published for this specific combination.

The migraine-vestibular overlap is also relevant: several motion sickness GWAS hits are near genes implicated in migraine (e.g., WNT10B locus), and vestibular migraine shares phenomenology with severe motion sickness. Carriers of both TSHZ1 risk variants and migraine-associated variants may have higher combined susceptibility to vestibular symptoms.

FBN1 rs10519177 — The Recessive Aortic Variant: Two Copies Required for Risk

Fibrillin-1 is the primary structural protein of extracellular microfibrils11 microfibrils
microscopic fibrous scaffolds embedded in connective tissue, giving the aortic wall its tensile strength and elasticity
in the aortic wall. Mutations in FBN1 cause Marfan syndrome, but the gene also harbors common intronic variants that, without causing Marfan syndrome, can subtly degrade the aortic wall's resistance to dissection. rs10519177 is one such variant — but it behaves very differently from its better-known FBN1 neighbor rs2118181. Where rs2118181 elevates risk with a single risk allele, rs10519177 requires two copies of the G allele to produce a measurable biological effect.

The Mechanism

The variant sits at position c.4942+570 in an intron of FBN1, approximately 570 nucleotides downstream of exon 40. The FBN1 gene lies on the minus strand of chromosome 15; on the plus strand the alleles are A (reference) and G (risk). Like all intronic variants it does not change the fibrillin-1 amino acid sequence, but it can influence mRNA splicing efficiency, regulatory element binding, or expression levels in a dosage-sensitive way.

The key mechanistic clue comes from a 269-person study by Sepetiene et al. (Mol Med, 2015)22 Sepetiene et al. (Mol Med, 2015)
Sepetiene R, et al. Association between Fibrillin1 Polymorphisms and TGF-β1 Concentration in Human Plasma. Mol Med, 2015
: elevating circulating TGF-β1 levels — the downstream consequence of impaired fibrillin-1 function — required two copies of the rs10519177 G allele. One copy alone produced no measurable effect. This recessive pattern contrasts sharply with rs2118181, where a single risk allele raised TGF-β1 by approximately 1 ng/mL. The implication is that rs10519177 has a weaker per-allele impact on fibrillin-1's TGF-β1 sequestration33 TGF-β1 sequestration
fibrillin-1 normally binds and stores TGF-β1 in the extracellular matrix; when fibrillin-1 function is impaired, TGF-β1 is released into circulation, driving aortic wall inflammation and structural weakening
capacity, and only the homozygous GG state is sufficient to push TGF-β1 meaningfully above baseline.

The Evidence

The strongest positive data come from a Lithuanian surgical cohort by Lesauskaite et al. (Eur J Cardiothorac Surg, 2015)44 Lesauskaite et al. (Eur J Cardiothorac Surg, 2015) studying 312 patients who underwent aortic surgery against 472 reference subjects. The minor allele frequency of rs10519177 was significantly higher in aortic dissection patients compared to controls (p < 0.0001). Crucially, a recessive model best described the rs10519177 association with Stanford Type A aortic dissection — the most severe form requiring emergency open surgery — with an odds ratio of 4.31 (95% CI 2.06–9.01). This is a large effect size, but the confidence interval is wide, reflecting the rarity of GG homozygotes in the study population and the modest overall cohort size.

The picture is complicated by a null result: the Yale multicenter study by Iakoubova et al. (PLoS One, 2014)55 Iakoubova et al. (PLoS One, 2014) — which included 140 TAD cases and 275 controls from the US, Hungary, and Greece — found that rs10519177 was not significantly associated with TAD, TAA, or combined TAAD. The same paper found rs2118181 significant (OR 1.87). One plausible explanation is statistical power: with a recessive model and G allele frequency of ~25% in Europeans, only about 6% of the population is GG, meaning a study of 275 controls would include fewer than 20 GG homozygotes — far too few to reliably detect a recessive effect. The Lithuanian cohort, larger and enriched for surgical cases, may have had sufficient GG representation.

Evidence overall is emerging: two studies with discordant results, no GWAS-level replication, and mechanistic data from a single moderate-sized cohort.

Practical Actions

The recessive pattern is clinically important: heterozygous AG carriers — the most common non-reference genotype at roughly 38% of people — appear biologically equivalent to AA homozygotes for this specific variant. The actionable concern applies to the rare GG group. For GG homozygotes, the recommendations parallel those for other FBN1 risk variants: establish baseline aortic dimensions, maintain tight blood pressure control, and recognize the warning signs of aortic dissection. Recognizing this variant's recessive pattern also means that genome-wide risk assessment for aortic disease should account for both rs10519177 (recessive) and rs2118181 (additive/dominant) separately — carrying one copy of each does not sum in a straightforward way.

Interactions

rs10519177 and rs211818166 rs2118181
the other well-studied intronic FBN1 variant, with a dominant/additive effect on TGF-β1 that acts with a single risk allele copy
are likely in incomplete linkage disequilibrium — both are in the same FBN1 gene but show different inheritance patterns, suggesting they tag different functional elements. An individual who is GG at rs10519177 AND carries the rs2118181 C allele would have disruption from two independent fibrillin-1 mechanisms simultaneously. rs1036477 is a third FBN1 variant identified in the same Zhejiang Han cohort as correlated with increased mortality in male sTAAD patients. No compound analysis of all three variants together has been published.

rs1065852

CYP2D6 *10

Established Risk Factor

CYP2D6*10 - The Decreased Function Variant

The CYP2D6*10 allele11 rs1065852 is the most common decreased-function variant worldwide. While it is most prevalent in East Asian populations (frequency 40-70%), it is also found at lower frequencies in European populations. Unlike the *4 allele which completely abolishes enzyme function, *10 produces a functional but unstable enzyme with reduced activity.

The Mechanism

The rs1065852 variant causes a proline-to-serine substitution at position 34 of the CYP2D6 protein22 Amino acid change: proline to serine at position 34 (P34S). This amino acid change occurs in the N-terminal signal anchor sequence, affecting how the enzyme is folded and inserted into the endoplasmic reticulum membrane. The resulting enzyme has reduced stability and lower catalytic efficiency, typically retaining about 25-50% of normal activity.

Clinical Impact

Because *10 reduces rather than eliminates activity, its clinical impact is more subtle than *4. However, when combined with another reduced or non-functional allele (like *4), the compound effect can push someone into the poor metabolizer category. For medications with narrow therapeutic windows33 Narrow therapeutic window: small difference between effective dose and toxic dose, even moderate reductions in CYP2D6 activity can be clinically meaningful. This variant is the most frequently observed decreased-function allele in East Asian populations44 most frequently observed decreased-function allele in East Asian populations
Bradford et al. CYP2D6 allele frequency study, 2002
, making it a major contributor to the higher prevalence of intermediate metabolizers in these populations.

Combined CYP2D6 Status

Your overall CYP2D6 metabolizer status is determined by the combination of both alleles. Someone carrying *1/*10 (one normal, one decreased) would be an intermediate metabolizer, while someone with *4/*10 (one non-functional, one decreased) would likely be classified as a poor metabolizer. This is why looking at all CYP2D6 variants together is essential for accurate phenotype prediction. The CPIC activity score system55 CPIC activity score system
Gaedigk A et al. Clin Pharmacol Ther, 2008
assigns *10 a value of 0.25, compared to 1.0 for the normal *1 allele and 0 for the non-functional *4.

Practical Considerations

If you carry the *10 allele, your CYP2D6 function is moderately reduced. The clinical significance depends on your other CYP2D6 allele and the specific medication in question. For medications with wide therapeutic windows, this may not matter much. For medications like tamoxifen, codeine, or tricyclic antidepressants, even moderate reductions in CYP2D6 activity can affect outcomes.

ADIPOR2 rs11061946 — A Rare Signal in the Adiponectin Receptor

Adiponectin is one of the few adipokines that works against metabolic disease: it rises with fat loss, improves insulin sensitivity, suppresses hepatic glucose production, and triggers fatty acid oxidation11 fatty acid oxidation
the breakdown of fat for fuel, primarily in liver and muscle through the PPARα pathway
. The receptor through which adiponectin acts in the liver is ADIPOR2 (Adiponectin Receptor 2, gene symbol ADIPOR2, chromosome 12p13.31). Unlike its partner ADIPOR1 — the predominant muscle receptor — ADIPOR2 is most abundantly expressed in hepatic tissue, where it couples adiponectin signaling to the PPARα pathway22 PPARα pathway
Peroxisome proliferator-activated receptor alpha, a nuclear receptor that upregulates enzymes for fatty acid β-oxidation and downregulates hepatic gluconeogenesis and pro-inflammatory gene programs
. Disrupting AdipoR2 in mice impairs hepatic fatty acid oxidation, worsens diet-induced insulin resistance, and elevates fasting glucose, establishing a clear causal role for the receptor in metabolic homeostasis.

rs11061946 is an intronic variant located in intron 1 of ADIPOR2. It does not change any amino acid and has not been shown to alter splicing or gene expression in available functional studies. It may be a marker in linkage disequilibrium with a nearby causal variant, or it may itself influence chromatin accessibility or transcription factor binding in a context not yet captured by available expression datasets.

The Mechanism

The variant sits in intron 1 of ADIPOR233 intron 1 of ADIPOR2
The first intron, between the first and second coding exons; intronic variants can affect mRNA splicing, act as regulatory elements, or simply be neutral markers for nearby functional variants
. No allele-specific differences in ADIPOR2 mRNA expression were detected in peripheral blood mononuclear cells or subcutaneous adipose tissue in the Genobin sub-study (56 subjects). The authors of the Finnish DPS paper explicitly noted that rs11061946 and its LD partner rs11061973 "are intronic SNPs, have no known functional significance, and may therefore be merely markers in LD with a true causal variant." The working hypothesis is that reduced or altered ADIPOR2 activity in rare TT homozygotes impairs hepatic adiponectin signaling, reducing PPARα-driven fatty acid oxidation and leaving the liver less able to suppress gluconeogenesis — a mechanism consistent with T2D progression, but not yet mechanistically demonstrated for this variant specifically.

rs11061946 is in strong linkage disequilibrium (r² = 0.674) with rs11061973, another intronic ADIPOR2 variant. The two SNPs co-segregate tightly: all five TT homozygotes at rs11061946 in the Finnish DPS were also AA homozygotes at rs11061973, meaning the signal may reflect a haplotype effect across this region of intron 1 rather than either SNP individually.

The Evidence

The primary evidence comes from the Finnish Diabetes Prevention Study (DPS)44 Finnish Diabetes Prevention Study (DPS)
A randomized controlled trial of 484 overweight adults with impaired glucose tolerance; intervention arm received intensive diet and exercise counseling, control arm received general information; median follow-up 7 years
. Eight ADIPOR2 SNPs were genotyped. In a Cox proportional-hazards model adjusted for age, sex, study arm, baseline waist circumference, and fasting glucose:

  • TT genotype (n = 5): HR = 5.54 (95% CI 2.01–15.23), p = 0.001
  • CT genotype (n = 49): HR = 0.71 (95% CI 0.41–1.21), p = 0.206
  • CC genotype (n = 428): reference

The TT finding is striking numerically, but critical limitations apply. Only five individuals carried the TT genotype across the entire study, making the confidence interval very wide. The dominant inheritance model (CT + TT vs. CC) showed no significant association, indicating the risk is concentrated in the rare homozygote and does not manifest in heterozygotes. The study has not been replicated in an independent cohort, and a q-value (false discovery rate) correction of 0.369 for the TT result indicates that at this sample size, the finding does not survive multiple testing correction. rs11061946 also deviated from Hardy-Weinberg equilibrium in this Finnish sample — the authors retained it in analysis but flagged this as potentially reflecting small-sample chance deviation.

No association was found with cardiovascular disease outcomes in the same cohort. Broader ADIPOR2 SNP studies in UK populations (PMID 17216283) and Caucasian cohorts (PMID 16505255) did not report rs11061946 specifically, and none of 24 ADIPOR1/R2 polymorphisms were associated with T2D or insulin phenotypes in the larger UK replication sets.

Practical Actions

For the ~1% of people who are TT homozygotes, the Finnish DPS data suggest a substantially elevated hazard of progressing from impaired glucose tolerance to type 2 diabetes, though this signal has not been independently replicated. For CT heterozygotes (~14%), no statistically significant risk above baseline was observed.

Given that ADIPOR2 mediates adiponectin-driven hepatic fatty acid oxidation through PPARα, the metabolically meaningful intervention for anyone with impaired glucose tolerance — independent of genotype — is to support adiponectin activity. Adiponectin rises with weight loss, aerobic training, caloric restriction, and omega-3 fatty acid intake. For TT homozygotes specifically, the implication is that if receptor-level function is compromised, downstream metabolic support becomes more important: prioritizing dietary patterns that minimize hepatic fat accumulation (the chief driver of hepatic insulin resistance) and that avoid further suppression of adiponectin. Fasting glucose, 2-hour glucose tolerance, and HbA1c monitoring at the intervals used for clinical pre-diabetes management apply.

Interactions

rs11061946 is in moderate LD (r² = 0.674) with rs11061973, another intronic ADIPOR2 variant on the same haplotype block. The five TT individuals in the Finnish DPS were uniformly AA at rs11061973, suggesting a shared haplotype drives the risk rather than either variant independently. If both rs11061946 TT and rs11061973 AA are present together, the signal represents the same underlying haplotype — not an independent combinatorial risk.

The broader adiponectin pathway involves ADIPOQ (the adiponectin gene itself), ADIPOR1 (the complementary muscle-expressed receptor), and downstream transcription factors PPARA and PPARGC1A. Variants in these genes that reduce circulating adiponectin or impair receptor coupling could compound the effect of reduced ADIPOR2 activity, though no published compound analysis covers this specific combination.

Supervisor interaction proposal: rs11061946 TT + rs11061937 (the companion ADIPOR2 intronic variant) — if both variants tag a disrupted ADIPOR2 haplotype, combined homozygosity may indicate a more comprehensive ADIPOR2 functional deficit. Evidence is currently speculative (no compound study exists); propose as an interaction candidate for future investigation rather than a compound action today.

rs11152221

MC4R MC4R Proximal LD Block Variant

Strong Risk Factor

The MC4R Proximal Regulatory Signal — A Distinct Obesity Locus

The melanocortin-4 receptor (MC4R) is the hypothalamic master switch for satiety and energy expenditure. When leptin signals through the POMC/α-MSH cascade11 POMC/α-MSH cascade
pro-opiomelanocortin neurons release alpha-melanocyte stimulating hormone, which binds MC4R to suppress appetite and increase thermogenesis
, MC4R tells the brain to stop eating. rs11152221 lies approximately 87 kilobases 3' of MC4R in the proximal LD block22 proximal LD block
a cluster of genetic variants that are inherited together and are physically closer to the MC4R gene, as opposed to the distal LD block ~188 kb downstream where rs17782313 and rs571312 reside
— a region distinct from the more widely studied distal regulatory cluster.

What makes rs11152221 scientifically important is that it identifies a separate regulatory architecture around MC4R. While the distal block (rs17782313, rs571312, rs476828) has been replicated across dozens of GWAS, Evans et al. demonstrated that variants in the proximal block — including rs11152221 — show statistically independent associations with adiposity traits even after accounting for the distal block. The two clusters are not in strong linkage disequilibrium with each other, meaning they may tag different cis-regulatory elements modulating MC4R expression.

The Mechanism

rs11152221 is an intergenic regulatory variant with no effect on the MC4R protein sequence. Its functional role is proposed to operate through the modulation of MC4R transcription in hypothalamic neurons. The Evans et al. fine-mapping study33 Evans et al. fine-mapping study
examining conservation, enhancer activity, and ENCODE annotations in the proximal LD block
identified a putative CTCF-binding site44 CTCF-binding site
CTCF is an insulator protein that organizes chromosomal loops, controls enhancer-promoter contact, and can silence or activate gene expression in a position-dependent manner
in the proximal region, suggesting the T allele may disrupt normal chromatin organization near MC4R rather than directly reducing promoter activity. Enhancer assays in zebrafish and mice did not confirm reporter expression in this specific region, consistent with an insulator or topological mechanism rather than a classic enhancer effect.

The downstream consequence, however, is consistent with reduced effective MC4R signaling: T allele carriers in the Health ABC cohort had higher BMI, greater body fat percentage, and elevated fasting leptin levels even after adjusting for body fat — the elevated leptin itself indicating leptin resistance55 leptin resistance
a state where adipose tissue secretes adequate leptin but the hypothalamus fails to respond proportionately, a hallmark of MC4R pathway dysfunction
rather than simply excess fat mass.

The Evidence

The definitive study for rs11152221 is the Evans et al. 2014 fine-mapping analysis66 Evans et al. 2014 fine-mapping analysis
Health ABC Study, a biracial cohort of 2,163 white and 1,388 Black adults aged 70–79, plus a UCSF severe obesity case-control sample
. In white participants, rs11152221 T allele was associated with:

  • BMI: additive β=0.53±0.15 kg/m² per allele (p=5×10⁻⁴), significant after empirical correction for multiple testing (P_emp=5×10⁻⁴)
  • Body fat percentage: additive β=0.42±0.19% per allele (p=0.03)
  • Leptin (adjusted for fat mass): additive β=0.13±0.05 log-units per allele (p=0.005)

In the case-control obesity analysis within the Health ABC cohort (296 cases, 1,303 controls), the T allele conferred an odds ratio of 1.76 (dominant model, 95% CI=1.34–2.30, p=4×10⁻⁵) and 1.46 (additive, 95% CI=1.20–1.78, p=2×10⁻⁴). Replication in UCSF severe obesity cases vs. Health ABC controls yielded OR=1.28 (additive, 95% CI=1.00–1.64, p=0.05). These effect sizes are comparable to those reported for the distal block anchor variant rs17782313 in European populations.

Notably, Evans et al. observed no significant association in Black participants, consistent with the pattern documented for the distal block variants — likely reflecting different haplotype structure and LD patterns in African ancestry populations.

Practical Implications

The T allele's effects on BMI, adiposity, and leptin track the same appetite-satiety dysregulation seen with other MC4R-region variants. The leptin elevation adjusted for fat mass is particularly meaningful: it signals that the hypothalamic melanocortin system is failing to suppress appetite proportionately to energy stores — a biological drive to eat that operates below the level of conscious awareness. Structural interventions that bypass impaired hypothalamic satiety signaling — consistent meal timing, pre-committed portions, protein-heavy meals that trigger mechanical stretch receptors independently of the MC4R cascade — are the most evidence-aligned approaches for this variant.

Because the proximal LD block may tag an insulator disruption near MC4R, rather than a promoter methylation change like the distal block, the molecular details differ — but the observable appetite phenotype and practical recommendations converge with those for rs17782313 and rs571312 carriers.

Interactions

rs17782313 and rs571312 (distal MC4R LD block): rs11152221 is in the proximal 3' LD block and is NOT in strong LD with the distal block variants (low r² with rs17782313 in the Health ABC cohort). Carriers of T alleles at rs11152221 are not necessarily carriers of risk alleles at rs17782313, and the two blocks may contribute additive effects on MC4R expression. If you carry risk alleles at both loci, you likely have a compounded reduction in effective MC4R signaling — an interaction worth capturing in a combined analysis.

rs17700633: rs11152221 is in high LD with rs17700633 (r²=0.79 in HapMap CEU), the proximal-block anchor SNP identified in the Evans analysis. The two variants together define the proximal LD block signal.

FTO rs9939609: As with all MC4R-region variants, FTO and MC4R operate through distinct mechanisms (thermogenesis regulation vs. appetite signaling). Combined MC4R + FTO risk genotypes have been shown to confer up to 2.45-fold increased obesity risk in Chinese pediatric cohorts compared to neither risk genotype alone.

PRG2 — When Eosinophil Granule Proteins Attack the Gut

Inside your gastrointestinal tract, a normally protective immune cell may be working against you. Eosinophils — immune cells best known for fighting parasites and driving allergic reactions — reside in the gut wall as part of normal tissue defense. When activated, they discharge their granule contents, including major basic protein (MBP)11 major basic protein (MBP)
The predominant crystalline core protein of eosinophil granules, encoded by the PRG2 gene on chromosome 11q12.1. MBP is a cationic protein that disrupts cell membranes through electrostatic interaction, causing cytotoxicity to host epithelial cells when released in excess
. rs11229030 sits in the PRG2/PRG3 gene cluster — two closely related eosinophil major basic protein genes — and its C allele was identified as a Crohn's disease susceptibility signal in a large genome-wide association study of Ashkenazi Jewish populations.

The Mechanism

PRG2 encodes eosinophil major basic protein (MBP), the predominant constituent of the crystalline core of the eosinophil granule. When eosinophils degranulate in the gut wall, MBP directly increases epithelial layer permeability22 epithelial layer permeability
MBP disrupts cell membrane integrity through its highly cationic charge, causing direct toxicity to epithelial cells and increasing paracellular permeability — the same "leaky gut" mechanism central to Crohn's disease pathophysiology
. Mouse studies confirm the causal link: MBP knockout mice are protected from experimental colitis, and in vitro co-culture of eosinophils with intestinal epithelial cells causes dose-dependent epithelial dysfunction attributable to MBP.

rs11229030 at chr11:57,435,536 (GRCh38) lies approximately 45 kb downstream of PRG2 (chr11:57,386,780–57,390,650) and within the broader PRG2/PRG3 regulatory neighborhood. The SNP's intergenic location suggests it acts as a regulatory tag SNP influencing expression levels in this eosinophil protein cluster rather than altering protein sequence directly. The neighboring PRG3 gene encodes a related eosinophil major basic protein homologue (MBPH), also expressed preferentially in eosinophils, providing two potential effectors at this locus.

The Evidence

The association between rs11229030 and Crohn's disease emerged from a genome-wide scan combining 10 Ashkenazi Jewish cohorts33 genome-wide scan combining 10 Ashkenazi Jewish cohorts
Ashkenazi Jews have a 2–4× higher prevalence of Crohn's disease than non-Jewish Europeans, making this population particularly informative for discovering CD susceptibility loci
: 907 cases and 2,345 controls in discovery, followed by 971 cases and 2,124 controls in replication. rs11229030 reached genome-wide significance (OR 1.15, p=8×10⁻⁹), ranking among five novel loci identified in the study. While an OR of 1.15 represents a modest per-allele effect — typical for common GWAS variants in complex diseases — the biological candidacy of PRG2/PRG3 at this locus provides strong mechanistic plausibility.

Evidence for eosinophil-mediated intestinal damage in Crohn's disease is well established independently. Ultrastructural studies show eosinophil MBP granule release and cytotoxic tissue changes specifically in Crohn's disease biopsy specimens. Eosinophil cationic protein (ECP), a related granule protein, correlates with Crohn's disease activity with remarkable precision44 activity with remarkable precision
r=0.89, p<0.0001 in 10 children with Crohn's disease followed prospectively; active disease median ECP 24.5 µg/L vs remission 5.7 µg/L
. And in patients with Crohn's disease, elevated eosinophilic infiltration is associated with fibrosis development and poor response to medical therapy.

Practical Actions

For CC genotype carriers, the actionable implications center on gut barrier support and early monitoring. Eosinophil-driven permeability is worsened by food antigens that trigger eosinophil degranulation; eliminating common triggers can reduce intestinal eosinophil activation. Monitoring stool calprotectin and blood eosinophil counts provides early warning of inflammatory flares before clinical symptoms escalate. In established Crohn's disease, serum ECP can be used as a disease-activity index alongside standard markers.

Interactions

The PRG2 locus operates within the broader landscape of Crohn's disease genetic risk. Variants in rs7234029 (PTPN2) reduce T-cell regulatory phosphatase activity, lowering the threshold for immune activation that drives eosinophil recruitment. The rs2631367 variant in SLC22A5 (OCTN2) impairs carnitine transport in intestinal epithelial cells, compounding the barrier dysfunction that eosinophil MBP degranulation initiates. Carriers of multiple Crohn's disease risk alleles — particularly those affecting both eosinophil effector function (PRG2) and immune activation thresholds (PTPN2, IL23R) — face additive susceptibility. Each of these interactions involves independent biological mechanisms that converge on intestinal barrier disruption and aberrant immune activation.

DEFB1 G-20A — Your First Line of Mucosal Defense

The surfaces lining your gut, mouth, and airways are under constant microbial assault. Beta-defensin 1 (hBD-1)11 Beta-defensin 1 (hBD-1)
a small cationic antimicrobial peptide of 36 amino acids, constitutively expressed by epithelial cells throughout the gastrointestinal and urogenital tracts
is one of the body's key innate antimicrobial peptides, continuously secreted by epithelial cells to maintain the delicate boundary between host tissue and the microbial world. Unlike most defensins that are induced by infection or inflammation, hBD-1 is [constitutively expressed | meaning it is always present as a standing guard, rather than being activated only when infection is detected] — it functions as a permanent sentinel in mucosal surfaces.

The G-20A variant (rs11362) sits in the 5' untranslated region (5' UTR)22 5' untranslated region (5' UTR)
the region of mRNA before the protein-coding sequence begins, which regulates how efficiently the gene is translated into protein
of the DEFB1 gene on chromosome 8. Because DEFB1 is transcribed from the minus strand, the "G-20A" notation used in research literature corresponds to a C-to-T change on the plus strand reported by genome sequencing. The T allele (coding-strand A) is carried by approximately 44% of Europeans and reduces hBD-1 expression, weakening mucosal antimicrobial defense.

The Mechanism

The 5' UTR of a gene controls how efficiently its mRNA is translated into protein. The rs11362 variant alters mRNA secondary structure33 alters mRNA secondary structure
different DEFB1 mRNAs fold in patterns that are haplotype- and length-dependent, potentially driving changes in peptide expression dynamics
in ways that reduce translation efficiency. This region also contains putative NF-kappaB binding sites44 putative NF-kappaB binding sites
NF-kappaB is a master transcription factor for immune response genes; altered binding reduces defensin transcription
, and the variant may impair transcription factor recruitment.

Direct evidence of reduced expression comes from a study of 754 adolescents measuring salivary hBD-1 protein55 study of 754 adolescents measuring salivary hBD-1 protein
CC genotype: 4.12 ng/mL; CT genotype: 2.77 ng/mL; TT genotype: 2.32 ng/mL
. The TT genotype produces roughly 44% less hBD-1 protein than CC — a substantial reduction in antimicrobial peptide output at mucosal surfaces. GTEx data further confirm that the risk genotype associates with lower DEFB1 mRNA expression across multiple tissues66 risk genotype associates with lower DEFB1 mRNA expression across multiple tissues
including aorta, coronary artery, and heart tissue
.

The Evidence

Colonic Crohn's disease. A case-control study of 190 Crohn's patients and 95 controls77 A case-control study of 190 Crohn's patients and 95 controls
Kocsis et al. studied Hungarian patients with detailed disease localization phenotyping
found the heterozygous genotype at 60% frequency among patients with colonic Crohn's versus 39% in controls (OR 2.39). Notably, no association was found with ileal Crohn's disease, which is consistent with the biology: hBD-1 is constitutively expressed in colonic epithelium88 hBD-1 is constitutively expressed in colonic epithelium
unlike alpha-defensins which predominate in ileal Paneth cells, beta-defensin 1 is the primary constitutive defensin of the colon
, so reduced expression would specifically compromise colonic defense.

Dental caries. A meta-analysis of rs11362 and dental caries99 meta-analysis of rs11362 and dental caries
Hatipoglu and Saydam, 2020, pooling multiple case-control studies
found that TT homozygotes have 7-fold higher caries risk compared to CC, with the dominant model showing OR 3.11 (95% CI 1.18-8.21, p=0.022). A separate study found that carrying a copy of the variant allele increased DMFT/DMFS scores more than five-fold1010 carrying a copy of the variant allele increased DMFT/DMFS scores more than five-fold
DMFT = Decayed, Missing, and Filled Teeth index, a standard measure of caries burden
. The mechanism is straightforward: reduced hBD-1 in saliva allows cariogenic bacteria like Streptococcus mutans to colonize tooth surfaces more effectively.

Periodontitis. Despite the oral health connection, a meta-analysis of 7 case-control studies1111 meta-analysis of 7 case-control studies
approximately 1,500 participants, analyzing allelic, dominant, and recessive models
found no significant association between rs11362 and chronic periodontitis (allelic OR 0.86, 95% CI 0.61-1.20). Periodontitis is driven more by inflammatory response than by direct antimicrobial defense, which may explain why a defensin expression variant has less impact.

Other associations. The variant has also been linked to HIV-1 susceptibility in Mexican women1212 HIV-1 susceptibility in Mexican women
-20A allele OR 1.60, 95% CI 1.06-2.40
and to coronary artery disease risk with reduced cardiac hBD-1 expression1313 coronary artery disease risk with reduced cardiac hBD-1 expression
2024 case-control study of 219 CAD patients vs 522 controls
, reflecting the broad antimicrobial and immunomodulatory role of beta-defensin 1 across tissues.

Practical Implications

Beta-defensin 1 is a [zinc-dependent antimicrobial peptide | defensins use cysteine-coordinated zinc binding for structural stability and antimicrobial function]. Zinc is essential both for defensin protein folding and for broader innate immune signaling. Carriers of the reduced-expression genotype should ensure adequate zinc status, as deficiency would compound the genetic reduction in hBD-1 output.

Vitamin D signaling upregulates antimicrobial peptide expression1414 upregulates antimicrobial peptide expression
1,25-dihydroxyvitamin D3 induces cathelicidin and beta-defensin gene expression through VDR-mediated pathways in epithelial cells
through VDR-mediated pathways. While the best-characterized targets are cathelicidin (LL-37) and beta-defensin 2, the broader defensin family benefits from adequate vitamin D status. For individuals with genetically reduced DEFB1 expression, optimizing vitamin D may partially compensate by upregulating other antimicrobial peptides in the same mucosal compartments.

Lactoferrin1515 Lactoferrin
an iron-binding glycoprotein naturally present in mucosal secretions, saliva, and breast milk
works synergistically with defensins as part of the mucosal innate defense system. Oral lactoferrin supplementation enhances mucosal barrier function and antimicrobial activity, complementing reduced defensin output.

Interactions

rs11362 is one of three functional 5' UTR variants in DEFB1 that form haplotypes affecting expression. The others are rs1799946 (G-52A) and rs1800972 (C-44G)1616 rs1799946 (G-52A) and rs1800972 (C-44G)
these three SNPs in the DEFB1 5' UTR create haplotypes with distinct mRNA folding patterns and transcription factor binding affinities
. The rs1800972 GG genotype appears protective for Crohn's disease (OR 3.37 protective), while rs11362 and rs1799946 confer risk. Combined haplotypes show stronger effects than individual variants — the ACA haplotype (coding strand) was associated with 5.82-fold increased HIV-1 susceptibility.

In the context of Crohn's disease, DEFB1 variants may compound with NOD2 (rs2066844) mutations1717 NOD2 (rs2066844) mutations
NOD2 controls Paneth cell defensin production in the ileum, while DEFB1 governs colonic defensin expression
. A patient carrying both DEFB1 risk variants (reduced colonic defensin) and NOD2 variants (reduced ileal defensin) would have compromised antimicrobial peptide defense along the entire intestinal tract, substantially increasing inflammatory bowel disease susceptibility.

rs1144566

RGS16 RGS16 H137R

Moderate Risk Factor

RGS16 H137R — A Coding Variant in the Circadian Clock's Synchronizer

Every cell in your body runs on a roughly 24-hour molecular clock, but those clocks need a conductor to stay synchronized with each other and with the outside world. That conductor lives in the suprachiasmatic nucleus (SCN)11 suprachiasmatic nucleus (SCN)
A cluster of ~20,000 neurons in the hypothalamus directly above the optic chiasm; it receives light signals from the retina and broadcasts a 24-hour timing signal to every organ in the body via neural, hormonal, and temperature rhythms
— and one of its critical molecular regulators is RGS1622 RGS16
Regulator of G-protein Signaling 16; a GTPase-accelerating protein that terminates Gαi/o signaling, thereby controlling when intracellular cAMP can accumulate in SCN neurons
. The rs1144566 variant changes a single amino acid in RGS16 — histidine to arginine at position 137 — in a gene whose protein levels oscillate daily to gate the cAMP pulses that coordinate clock-neuron communication.

The Mechanism

RGS16 operates at a key node in the GPR176–Gz–RGS16 signaling axis33 GPR176–Gz–RGS16 signaling axis
GPR176 is an orphan GPCR enriched in the SCN; it activates the Gz G-protein subunit (a slow-cycling member of the Gαi family), which suppresses cAMP synthesis; RGS16 terminates Gz signaling by accelerating GTP hydrolysis, releasing the cAMP brake
. Each morning, the molecular clock drives a surge in RGS16 expression; this terminates the Gz-mediated cAMP suppression and allows cyclic AMP to accumulate — a biochemical event essential for synchronizing the phase-leading dorsomedial SCN neurons with the light-receiving ventrolateral neurons.

Doi et al. (2011)44 Doi et al. (2011)
Doi M et al. Nature Communications, 2011
showed that mice lacking RGS16 entirely lose their circadian cAMP rhythm in the SCN and develop a lengthened free-running behavioral period. A longer internal period means the clock drifts toward an evening phase — the same directional shift associated with the C allele at rs1144566 in humans.

The H137R change (His→Arg at position 137) falls within the functional RGS domain responsible for G-protein contact. A positively charged arginine replacing a moderately polar histidine at this position is predicted to reduce GTPase acceleration of Gαi/o, impairing the protein's ability to terminate Gz signaling and thereby weakening the daily cAMP gate that sets circadian timing. This is a missense variant on a gene whose null phenotype directly produces the human evening-type circadian shift, making H137R a plausible partial loss-of-function that nudges carriers toward later timing.

The Evidence

The strongest human genetic evidence comes from two large independent GWAS that identified a chronotype signal at the RGS16 locus in high LD with rs1144566.

Hu et al. (2016)55 Hu et al. (2016)
Hu Y et al. GWAS of 89,283 individuals identifies genetic variants associated with self-reporting of being a morning person. Nature Communications, 2016
found rs12736689 (in LD r²=0.89 with rs1144566) to be the single most significant chronotype hit genome-wide at P=7×10⁻¹⁸ in 89,283 participants, and explicitly noted that rs1144566 is the nearby nonsynonymous coding variant in high LD with the lead SNP. This positions H137R as the most compelling functional candidate underlying the association.

Jones et al. (2016)66 Jones et al. (2016)
Jones SE et al. Genome-Wide Association Analyses in 128,266 Individuals Identifies New Morningness and Sleep Duration Loci. PLoS Genetics, 2016
independently confirmed the RGS16 locus in 128,266 UK Biobank participants, with the C-allele at the linked rs516134 conferring an odds ratio of 1.21 (95% CI 1.15–1.27) for morningness at P=3×10⁻¹². The RGS16 locus is the most replicated single locus in human chronotype genetics, confirmed in at least three independent GWAS totaling over 700,000 individuals.

The mechanistic grounding was extended by Nakagawa et al. (2020)77 Nakagawa et al. (2020)
Nakagawa S et al. Time-Restricted G-Protein Signaling Pathways via GPR176, Gz, and RGS16 Set the Pace of the Master Circadian Clock in the SCN. IJMS, 2020
, which showed that RGS16 protein peaks in the early morning in SCN neurons and that this timed expression is indispensable for the cAMP surge that coordinates cellular synchrony. Human genetic variants of RGS16 associating with earlier wake-up times are highlighted as the translational implication of these rodent findings.

The evidence level is rated moderate rather than strong because the functional impact of H137R specifically has not been characterized in cell or animal models — the GWAS signal is well-established, but whether H137R is the causal variant (versus being a proxy for a linked regulatory variant like rs516134) has not been definitively resolved.

Practical Implications

The C allele (H137R / Arg137) is the overwhelmingly common allele globally (~97.5%), making the CC genotype the population default. The H137R form of RGS16 has modestly reduced GTPase activity compared to the ancestral His137 form, but since nearly everyone carries it, this represents the baseline circadian setting rather than a deviation from normal.

The rare T allele (~2.5% globally) encodes the ancestral His137 form with stronger GTPase activity. Carriers of the T allele (CT or TT genotypes, ~5% of people) have a circadian clock that may run with slightly tighter phase advance — the practical expression is a modest morning preference with easier early waking. The T allele is most common in European (~2.5%) and African (~5.5%) ancestry populations and rarest in East Asian populations (~0.1%).

Interactions

rs1144566 sits at the same RGS16 locus as the regulatory variant rs516134 and the linked rs12736689. Both the coding change (rs1144566) and the regulatory signal (rs516134/rs12736689) independently tag the circadian effect at this locus, and they are in high LD (r²=0.89), so most users with the rs1144566 T allele will also carry the morningness-associated allele at rs12736689.

For additive circadian effects, variants in CLOCK (rs1801260) and PER3 (rs5751876) act at different nodes of the same oscillator. Carriers of morningness alleles at both RGS16 (T) and CLOCK may experience stronger morning preference than either variant alone predicts, as the two genes affect different parts of the feedback loop — RGS16 affects intercellular cAMP synchrony while CLOCK affects transcription-factor stability in the core loop.

CYP2J2 rs11572325 — A Cardiac Epoxygenase Variant and MI Risk

Your heart produces its own endogenous cardioprotective molecules. CYP2J2 is a cytochrome P450 enzyme expressed predominantly in cardiomyocytes and vascular endothelial cells, where it converts arachidonic acid into epoxyeicosatrienoic acids (EETs)11 epoxyeicosatrienoic acids (EETs)
Biologically active lipid mediators produced by cytochrome P450 epoxygenases; they dilate coronary arteries, reduce vascular inflammation, inhibit platelet aggregation, and protect cardiac tissue from ischemic injury
. The intronic variant rs11572325 sits within an intron of CYP2J2 on chromosome 1 and forms part of a haplotype block with the promoter variant rs890293 (CYP2J2*7) and the neighboring intronic SNP rs2280275 — all three of which tag the same reduced-CYP2J2-expression signal. Population studies have linked the rs11572325 T allele to both myocardial infarction risk and female-specific essential hypertension.

The Mechanism

CYP2J2 converts arachidonic acid into four EET regioisomers (5,6-EET, 8,9-EET, 11,12-EET, and 14,15-EET). These lipid mediators activate K⁺ channels in vascular smooth muscle to cause vasodilation, suppress nuclear factor-κB signaling to reduce endothelial inflammation, inhibit platelet aggregation, and reduce cardiomyocyte apoptosis after ischemic stress. [| Lai & Chen 2021 (PMID 33716791): EETs from CYP2J2 protect against cardiac hypertrophy, fibrosis, and apoptosis; raising EET levels via CYP2J2 overexpression or sEH inhibition reproducibly reduces myocardial injury in experimental models]. The rs11572325 T allele is in strong [linkage disequilibrium | LD means two alleles are inherited together so frequently that one allele predicts the other across the population] with the CYP2J2*7 promoter variant (rs890293), which disrupts an Sp1 transcription factor binding site and reduces CYP2J2 promoter activity by approximately 50%. T-allele carriers show measurably lower circulating EET metabolite concentrations (14,15-DHET), attenuating the vasodilatory and anti-inflammatory signaling that normally protects the coronary vasculature.

CYP2J2 also preferentially metabolizes omega-3 fatty acids EPA and DHA into 17,18-EEQ and 19,20-EDP22 17,18-EEQ and 19,20-EDP
Omega-3 epoxides with antiarrhythmic and vasodilatory properties; EPA is metabolized by CYP2J2 at approximately 17-fold higher efficiency than arachidonic acid
. This substrate competition means that high dietary omega-3 intake can shift eicosanoid output toward cardioprotective mediators even when EET-generating capacity is reduced.

The Evidence

Myocardial infarction risk: The best-powered study comes from a population-based case-control analysis33 population-based case-control analysis
Marciante et al. Common variation in cytochrome P450 epoxygenase genes and MI risk. Pharmacogenet Genomics, 2008
of 856 incident MI cases and 2,688 controls. The rs11572325 T allele was associated with increased MI risk (OR 1.27, 95% CI 1.08–1.51, P=0.006, Benjamini-Hochberg q=0.090), surviving multiple-testing correction in a panel of 30 tag-SNPs across three CYP epoxygenase genes. Notably, the companion CYP2J2 variant rs10889160 showed an independent association (OR 1.24, 95% CI 1.07–1.43, P=0.004), while neither CYP2C8 nor CYP2C9 variants reached significance — pointing specifically to CYP2J2 as the locus driving cardiovascular EET risk. No association with ischemic stroke was observed.

Hypertension — female-specific effect: A comprehensive Russian population study44 comprehensive Russian population study
Polonikov et al. 2019, n=2,314 (discovery + replication cohorts)
genotyped eight CYP2J2 SNPs including rs11572325. The T allele was associated with essential hypertension in women (OR 1.89, 95% CI 1.22–2.95) in the discovery cohort, with directional replication in an independent Belgorod cohort. Critically, no association was detected in men, and haplotype analysis placed rs11572325 within the highest-risk female haplotype (T–T–G–C–C–C–T–A across eight CYP2J2 SNPs). This sex-dependent effect likely reflects estrogen's modulatory role in arachidonic acid metabolism and CYP2J2 transcriptional sensitivity.

Functional EET reduction: The CYP2J2*7 promoter variant (rs890293), in strong LD with rs11572325, was shown to [reduce promoter activity by ~50% | Liu et al. 2007 (PMID 17126841): CYP2J2*7 T carriers had lower plasma EET metabolites; smoking synergistically increased MI risk 6.7-fold] and produce lower circulating EET concentrations. Smokers carrying the T allele exhibited a 6.7-fold amplification of MI risk relative to non-smoking wild-type carriers (P=0.01 for interaction) — highlighting that environmental exposures compound the genetic EET deficit.

Practical Actions

The most specific actionable intervention is dietary: because CYP2J2 metabolizes EPA and DHA into cardioprotective omega-3 epoxides at far higher efficiency than arachidonic acid, supplementing with concentrated fish oil or algae-based EPA/DHA shifts eicosanoid output toward 17,18-EEQ and 19,20-EDP even when EET-generating capacity is reduced. Cardiovascular monitoring is also warranted, particularly in women, given the replicated hypertension signal. Smoking dramatically amplifies the EET-reduction risk and represents the highest-leverage modifiable factor for carriers.

Interactions

rs11572325 is part of the CYP2J2 haplotype block that includes the promoter variant rs890293 (CYP2J2*7) and the intronic variant rs2280275. Studies of all three variants consistently find similar effect directions — they tag the same reduced-expression signal. The companion MI-associated variant rs10889160 showed an independent association in the same Marciante 2008 study, suggesting the two intronic variants may contribute additively to cardiovascular risk through distinct LD blocks within CYP2J2. The soluble epoxide hydrolase variant rs751141 (EPHX2) modulates EET degradation downstream: reduced EPHX2 activity raises EET levels, potentially offsetting reduced CYP2J2 production. The CYP2C8 epoxygenase variant rs1058932 contributes to the same vascular EET pool via a different gene — together, variants in CYP2J2 and CYP2C8 define an individual's total coronary EET production capacity.

rs1159327

ESR1 ESR1 intron variant

Moderate Risk Factor

ESR1 Intron Variant — Estrogen Receptor Expression and Bone Health

The ESR1 gene encodes estrogen receptor alpha (ERα), the primary nuclear receptor through which estrogen orchestrates bone remodeling, reproductive function, and metabolism. ERα is expressed throughout the skeleton, uterus, ovary, breast, and cardiovascular system — making ESR1 variants some of the most broadly consequential polymorphisms in reproductive-age women. The rs1159327 variant sits deep within an ESR1 intron on chromosome 6 (position 151,726,887 GRCh38) and has been identified at genome-wide significance for bone mineral density11 identified at genome-wide significance for bone mineral density
He et al. Osteoporosis International 2023, n=141,261 UK Biobank participants, p=5×10⁻⁹
.

The Mechanism

As an intronic variant, rs1159327 does not alter the ESR1 protein sequence. Instead, it likely influences gene expression through one of several regulatory mechanisms common to intronic SNPs: disruption of transcription factor binding sites22 transcription factor binding sites
protein complexes that bind DNA and control when and how much a gene is transcribed
, alteration of enhancer activity, or effects on mRNA splicing efficiency. The ESR1 gene is large (~300 kb) with numerous regulatory elements distributed across its introns, and the intron 1 region in particular has been repeatedly implicated in controlling receptor expression levels.

ERα regulates osteoblast (bone-forming cell) proliferation and survival while suppressing osteoclast (bone-resorbing cell) activity. When ERα expression or sensitivity is reduced — as intronic regulatory variants can cause — the balance tips toward net bone resorption. The T allele at rs1159327 is associated with lower average bone mineral density, consistent with a model in which reduced ESR1 expression impairs estrogen's bone-protective signaling.

The Evidence

The primary evidence linking rs1159327 to bone health comes from a 2023 longitudinal GWAS in the UK Biobank33 2023 longitudinal GWAS in the UK Biobank
He et al., Osteoporosis International, n=141,261
, which used trajectory analysis (TrajGWAS) to model heel bone mineral density over time. The ESR1 locus emerged at genome-wide significance (p=5×10⁻⁹) for BMD mean, placing rs1159327 among a handful of ESR1-region variants robustly associated with skeletal health in large populations.

This finding sits within a rich broader literature on the ESR1 locus. A meta-analysis of GWAS in premenopausal women44 A meta-analysis of GWAS in premenopausal women
Koller et al. JBMR 2013, discovery n=4,061 + replication n=5,597
identified multiple independent signals in the ESR1 region, and a separate GWAS in 2,468 European men55 a separate GWAS in 2,468 European men
Holliday et al. PLoS One 2011
confirmed the 6q25 locus affects both calcaneal ultrasound BMD and radial volumetric BMD. The largest early meta-analysis — 18,917 individuals across 8 European centers66 18,917 individuals across 8 European centers
Ioannidis et al. JAMA 2004
— found that ESR1 variants affect fracture risk by mechanisms independent of baseline bone density, suggesting the estrogen receptor pathway influences bone quality and geometry as well as mass.

Population data from dbSNP (ALFA) shows the T allele frequency varies from ~17% in South Asian populations to ~45% in African populations, meaning TT homozygosity (~8% globally) is substantially rarer in South/East Asian ancestry groups but more common in individuals of African descent.

Practical Implications

The most direct implications of the T allele relate to bone health across the lifespan. Carriers of one or two T alleles have a tendency toward lower bone mineral density, which translates to heightened importance of bone-protective behaviors — particularly weight-bearing exercise, calcium intake, vitamin D sufficiency, and proactive bone density monitoring. The effect of ESR1 variants on fracture risk appears to operate partly independently of BMD, which means a normal DEXA scan does not fully reassure against elevated fracture risk in T allele carriers.

For women, the ESR1 locus is also relevant to reproductive outcomes. Although the strongest fertility associations in ESR1 involve the nearby intron-1 variants rs2234693 (PvuII) and rs9340799 (XbaI), these three variants are in linkage disequilibrium and likely index overlapping biology. ESR1 polymorphisms have been examined in endometriosis, IVF outcomes, and ovarian reserve studies — the receptor's role in folliculogenesis, endometrial receptivity, and implantation means that even modest changes in expression can influence reproductive efficiency.

Interactions

rs1159327 sits near two of the most studied ESR1 variants: rs2234693 (PvuII, intron 1) and rs9340799 (XbaI, intron 1). These three variants span a regulatory stretch of ESR1 and are in partial linkage disequilibrium — meaning carriers of the rs1159327 T allele are more likely to also carry the rs2234693 T allele and rs9340799 G allele. Haplotype analyses of the ESR1 region show that combined effects on bone, cardiovascular, and hormone-responsive tissues can differ substantially from any single variant's individual contribution. Individuals carrying T alleles at multiple ESR1 loci may have more pronounced reductions in receptor expression.