Intronic TERT variant that increases uterine fibroid (leiomyoma) risk through impaired telomere maintenance and genome stability, independent of the longevity TERT variants

Uterine fibroids (leiomyomas) are benign smooth-muscle tumors of the uterus that affect up to 70% of women by age 50. While they are common, their underlying biology is clonal — each fibroid arises from a single cell that escapes normal growth control. A genome-wide association study of more than 400,000 women in the UK Biobank pinpointed rs72709458 in the TERT gene as one of the strongest genetic risk factors for leiomyoma, with a genome-wide significant odds ratio of 1.12 (p = 6.9 × 10⁻¹⁶). This is not the same biological story as TERT longevity variants — this variant acts through a different mechanism focused on the uterine smooth-muscle cell's capacity to maintain genome integrity11 uterine smooth-muscle cell's capacity to maintain genome integrity
Telomere maintenance failure allows chromosomal instability, which can initiate clonal tumor growth
.

TERT encodes the catalytic subunit of telomerase, the enzyme that rebuilds telomere caps on chromosome ends after cell division. Without adequate telomerase activity, telomeres shorten with each cell division until they trigger a DNA damage response — usually stopping the cell from dividing further, but sometimes allowing chromosomal instability and aberrant clonal expansion instead.

The rs72709458 variant sits in an intron of TERT at chromosome 5, position 1,283,640 (GRCh38). It does not change the protein sequence directly; rather, it is a regulatory tag variant in strong linkage disequilibrium with nearby functional elements that influence TERT expression levels or splicing in uterine tissue. Importantly, the three LD-independent TERT associations with leiomyoma risk (rs72709458, rs2736100, rs2853676) each contribute independently, and their combined signal points to the same biological axis: telomere maintenance capacity in the uterine myometrium22 telomere maintenance capacity in the uterine myometrium
The myometrium is estrogen-responsive, undergoes repeated cycles of growth and regression, and depends on intact genome surveillance to prevent clonal outgrowth
. The TERT, TERC, and OBFC1 gene cluster together represent the telomere maintenance axis in the leiomyoma GWAS, grouping with TP53 and ATM as a "genome stability" gene cluster distinct from the "genitourinary development" gene cluster (WT1, ESR1/SYNE1, CDC42/WNT4).

Notably, rs72709458 and rs2853676 at TERT did not independently associate with blood leukocyte telomere length in this study — unlike rs2736100, which was significantly associated with shorter telomere length in tumor tissue. This suggests rs72709458 may act through a tissue-specific or context-specific mechanism rather than globally shortening telomeres. The combined effect of all three TERT SNPs plus TERC and OBFC1 variants showed a negative trend toward shorter telomeres (p = 0.055), consistent with a cumulative model.

Välimäki et al. (2018, eLife)33 Välimäki et al. (2018, eLife)
Genetic predisposition to uterine leiomyoma is determined by loci for genitourinary development and genome stability
conducted the largest uterine leiomyoma GWAS to date using the UK Biobank, identifying 22 genome-wide significant loci. The rs72709458 T allele at the TERT locus showed OR = 1.12, p = 6.9 × 10⁻¹⁶ — a highly reproducible, statistically unambiguous association. The effect allele frequency of approximately 0.21 means that a substantial fraction of women carry at least one T allele.

A subsequent Russian case-control study44 A subsequent Russian case-control study
GWAS-Significant Loci and Uterine Fibroids Risk: Analysis of Associations, Gene-Gene and Gene-Environmental Interactions
genotyped 737 hospitalized women with uterine fibroids and 451 controls for seven GWAS-validated SNPs including rs72709458. The study identified synergistic gene-gene interactions among these loci and a gene-environment interaction between the TERT variant and prior abortion history — reinforcing the view that rs72709458 acts within a broader biological context of uterine tissue stress and cell division history.

Vitamin D merits specific mention as a modifiable factor in fibroid biology. A comprehensive mechanistic review55 A comprehensive mechanistic review
The Role of Vitamin D in Uterine Fibroid Biology, Baird & Wise 2015
documents that vitamin D (1,25-dihydroxyvitamin D3) exerts direct anti-fibroid effects: it induces G1 cell cycle arrest in leiomyoma cells, downregulates anti-apoptotic proteins (Bcl-2, Bcl-xL), reduces extracellular matrix remodeling enzymes (MMP-2/9), and suppresses estrogen and progesterone receptor expression. Three large observational studies found inverse associations between serum vitamin D levels and fibroid prevalence, with adjusted ORs ranging from 0.68 to 2.5 depending on the study. African American women — who have 2–3× higher fibroid incidence and carry the rs72709458 T allele at a frequency of ~14.5% — are also 10× more likely to be vitamin D deficient. Whether optimizing vitamin D status specifically attenuates fibroid risk in genetically predisposed women has not been formally tested in a randomized trial, but the mechanistic and epidemiological rationale is substantial.

The rs72709458 T allele also has prior associations to endometriosis, lung adenocarcinoma, and glioma — illustrating the broad neoplasia-predisposing context of TERT variation. This pleiotropy is consistent with the telomere maintenance model: cells that divide frequently in hormone-responsive or inflammatory contexts (uterine myometrium, lung epithelium, neural tissue) are particularly dependent on adequate TERT function.

For women who carry the T risk allele at rs72709458, the key message is that this genetic signal warrants increased awareness and proactive monitoring — not alarm. Uterine fibroids are common, usually benign, and often manageable. The OR of 1.12 per allele confers a modest individual risk increment; the variant matters most when combined with other risk factors (family history of fibroids, Black ethnicity, obesity, early menarche, low vitamin D).

The most evidence-supported modifiable interventions targeting fibroid biology include: - Optimizing vitamin D status: The mechanistic evidence is strong and the intervention is low-risk. Maintaining 25(OH)D levels ≥ 40 ng/mL is a reasonable target. - Early symptom reporting: Many fibroids are asymptomatic; those that become symptomatic cause heavy menstrual bleeding, pelvic pressure, or fertility impairment. Earlier diagnosis allows more conservative management options (myomectomy, UAE, progesterone IUDs). - Pelvic ultrasound awareness: Women with a genetic or family history of fibroids can discuss with their clinician whether surveillance ultrasound is appropriate, particularly in the reproductive years.

rs2736100 (TERT, intron 2): The strongest of the three LD-independent TERT signals for leiomyoma, and the one most directly associated with shorter telomeres in tumor tissue. If you carry risk alleles at both rs72709458 and rs2736100, the combined effect on TERT-related genome stability may be greater than either alone. A compound action analysis across these two variants would capture the combined telomere maintenance burden.

rs2853676 (TERT): A third independent TERT signal for leiomyoma risk, also not individually associated with telomere length. Its biological relationship to rs72709458 is not yet fully characterized.

rs12696304 (TERC): The RNA component of telomerase, which acts with TERT to maintain telomeres. Combined risk alleles at TERT and TERC loci showed a negative trend toward shorter telomeres (p = 0.055), suggesting an additive model of telomere maintenance failure.

CD226 Gly307Ser — The T-Cell Co-Stimulation Checkpoint Variant

CD226 (also called DNAM-1, DNAX Accessory Molecule-1)11 CD226 (also called DNAM-1, DNAX Accessory Molecule-1)
A co-stimulatory receptor expressed on T cells, natural killer cells, and myeloid cells. CD226 binds the ligands PVR (CD155) and Nectin-2 (CD112) on antigen-presenting cells and cancer cells, delivering a secondary activation signal that lowers the threshold for T-cell and NK-cell cytotoxicity
occupies a central position in both antitumour immunity and autoimmune regulation. rs763361 is a missense variant in exon 7 of CD226 on chromosome 18q22 that substitutes glycine for serine at protein position 307. Unlike loss-of-function variants, the Ser307 form of the protein alters co-stimulatory signalling in a way that has been consistently — across more than 50 independent studies — associated with increased susceptibility to a broad spectrum of autoimmune diseases.

The Mechanism

The Gly307Ser substitution lies in the cytoplasmic tail of CD226, near a tyrosine-based signalling motif22 tyrosine-based signalling motif
A short amino acid sequence (typically YxxL or similar) within a receptor's intracellular domain that, when phosphorylated, recruits signalling kinases and adaptor proteins to propagate downstream immune activation
. The Gly→Ser change at position 307 is thought to alter the phosphorylation state of this domain, affecting how efficiently CD226 recruits downstream kinases such as Grb2 and PI3K after ligand engagement. The net biological result is that T cells and NK cells carrying the Ser307 form have a subtly altered activation threshold: they respond more readily to self-antigens presented in the context of inflammatory signals, a pattern consistent with the variant's observed association with multiple autoimmune diseases that have T-cell hyperactivation as a core pathogenic feature.

CD226 also functions in a competitive balance with its inhibitory counterpart TIGIT33 TIGIT
A co-inhibitory receptor on T cells that competes with CD226 for the same ligands (PVR, Nectin-2) but delivers an inhibitory rather than activating signal, normally dampening T-cell responses
. When CD226 Ser307 shifts this co-stimulatory/co-inhibitory balance toward excess activation, the self-tolerance mechanism that TIGIT normally enforces is partially undermined. This mechanistic framework positions CD226 Gly307Ser alongside PTPN22 R620W (rs2476601) and CTLA4 variants (rs3087243, rs231775) as part of a broader network of T-cell checkpoint genes where single-nucleotide variants collectively tune the threshold at which T cells become self-reactive.

The Evidence

The association was first established by Hafler et al. 200944 Hafler et al. 2009
Hafler JP et al. CD226 Gly307Ser association with multiple autoimmune diseases. Genes Immun, 2009
in a European cohort, finding association with type 1 diabetes (P=3.46×10⁻⁹), multiple sclerosis (P=4.2×10⁻⁴), and rheumatoid arthritis (P=0.017). This multi-disease pattern immediately distinguished CD226 from disease-specific loci and placed it among the small number of pan-autoimmune susceptibility genes.

The most comprehensive meta-analysis to date, Bai et al. 202055 Bai et al. 2020
Bai et al. Role of CD226 rs763361 polymorphism in susceptibility to multiple autoimmune diseases. Immunol Invest, 2020; 29 reports, 51 studies, 18,157 cases, 29,904 controls
, confirmed associations with rheumatoid arthritis, SLE, type 1 diabetes, and multiple sclerosis, while finding no stable association for systemic sclerosis. An earlier meta-analysis by Song et al. 201266 meta-analysis by Song et al. 2012
Song G et al. Association between CD226 rs763361 polymorphism and susceptibility to autoimmune diseases. Lupus, 2012; 17 studies, 8,900 cases, 10,295 controls
quantified the per-allele risk: overall OR 1.162 (95% CI 1.097–1.230, P<10⁻⁸), with T1D showing a stronger signal (OR 1.353) than SLE (OR 1.150) or systemic sclerosis (OR 1.126).

A further meta-analysis by Qiu et al. 201377 Qiu et al. 2013
Qiu et al. CD226 Gly307Ser association with multiple autoimmune diseases: a meta-analysis. Hum Immunol, 2013
reported an overall OR of 1.19 (95% CI 1.12–1.27), with markedly higher risk in South American populations (OR 1.72) compared to Asian (OR 1.46) or European (OR 1.29) cohorts. Both the additive and codominant models were significant, confirming that each T allele contributes incrementally to risk — TT homozygotes face approximately OR² ≈ 1.42× the risk of CT heterozygotes relative to CC homozygotes.

Sex-specific analysis in a Brazilian T1D cohort88 Brazilian T1D cohort
Mattana et al. 2014, PMID 24891767
found the TT genotype association was driven primarily by females (P=0.0012), and TT carriers showed both higher GAD65 autoantibody frequency (31.9% vs 24.5%, OR 1.57) and lower residual C-peptide levels — markers of more aggressive autoimmune beta-cell destruction.

Practical Implications

The per-allele OR of ~1.16–1.19 places rs763361 among the moderate-effect common autoimmune risk variants. For CT heterozygotes (approximately 50% of the population), the risk elevation is modest and the absolute lifetime risk of any specific disease remains low. For TT homozygotes (~28%), the cumulative risk is more meaningful, especially when combined with other autoimmune susceptibility variants in the T-cell checkpoint network.

The diseases most consistently associated — T1D, MS, SLE, and RA — all have well-defined autoantibody and inflammatory biomarkers that can detect subclinical disease years before clinical presentation. Targeted monitoring is the primary actionable response.

Interactions

CD226 rs763361 is one node in a T-cell checkpoint network that includes PTPN22 rs2476601 (LYP phosphatase, reduces TCR signalling), CTLA4 rs3087243 and rs231775 (inhibitory co-receptor on T cells), and IL2RA rs2104286 (IL-2 receptor alpha, T-regulatory cell survival). These four genes converge on the decision of whether a T cell becomes activated or tolerized when encountering antigen. Carrying risk alleles at multiple checkpoints compounds the tolerance deficit multiplicatively. The compound effect of CD226 T + PTPN22 A (rs2476601) on T1D risk in particular deserves a compound action: both variants shift the T-cell activation threshold in the same direction (lower tolerance, higher autoimmune activation), and their combined risk is plausibly additive or super-additive.

IL23A Near-Gene Variant — The IL-23/Th17 Axis and Personalized Biologic Selection

Interleukin-2311 Interleukin-23
a heterodimeric cytokine composed of a unique p19 subunit (encoded by IL23A) and a p40 subunit shared with IL-12 (encoded by IL12B)
is one of the master regulators of chronic autoimmune inflammation. IL-23 is produced by dendritic cells and macrophages responding to bacterial pathogens and tissue damage signals, and its primary function is to expand and maintain [Th17 cells | a subset of helper T cells that produce IL-17A, IL-17F, IL-22, and TNF-α — cytokines that collectively drive the keratinocyte hyperproliferation, neutrophil recruitment, and sustained skin inflammation seen in psoriasis]. rs7746808 is an intergenic variant on chromosome 6 captured as part of the broader IL23A disease-susceptibility landscape, in a genomic neighbourhood associated with IL-23-pathway regulation in autoimmune disease cohorts. While the IL23A gene itself sits at 12q13.3, this variant was identified as a biologically relevant marker in the IL-23/Th17 disease context — its C allele tracks with the pro-inflammatory haplotype architecture observed across IL23A-pathway GWAS loci.

The clinical importance of this variant lies in its pharmacogenomic relevance: the IL-23 p19 subunit encoded by IL23A is the specific target of three approved biologics — guselkumab (Tremfya), risankizumab (Skyrizi), and tildrakizumab (Ilumya) — that selectively neutralise IL-23 while sparing IL-12. Variants that modulate IL23A expression or activity in pro-inflammatory contexts are therefore directly relevant to predicting who will benefit most from anti-p19 biologic therapy.

The Mechanism

IL-23 drives autoimmune inflammation through a well-characterised cytokine relay: IL-23 binds its receptor (IL23R) on Th17 progenitors and innate lymphoid cells, activating JAK2 and TYK2 kinases that phosphorylate STAT3. STAT3 then enters the nucleus and drives transcription of [RORγt | the master transcription factor for Th17 cell identity], IL-17A, IL-17F, and IL-22. In psoriatic skin, the resulting IL-17A surge acts on keratinocytes to produce CXCL1, CXCL8, and S100 proteins that recruit neutrophils into the epidermis — forming the characteristic Munro microabscesses and driving epidermal hyperproliferation.

rs7746808 sits in an intergenic region on chromosome 6 that lacks direct coding function, but intergenic variants in established GWAS loci are frequently regulatory in nature: they may tag distal enhancer elements, influence chromatin accessibility, or act as eQTLs for nearby or distant immune genes. The dbSNP allele frequencies show striking ancestry stratification — the C allele reaches 71% in East Asians, 65% in Latinos, and 61% in Europeans, but drops to only 22% in Africans — a pattern consistent with evolutionary selection operating differently across populations on immune-response variants. This population stratification parallels the known ancestry differences in psoriasis prevalence (higher in European and East Asian populations, lower in African-ancestry populations).

The Evidence

The foundational evidence for the IL23A locus in immune-mediated disease comes from a landmark psoriasis GWAS by Capon et al., Nature Genetics 2008/200922 psoriasis GWAS by Capon et al., Nature Genetics 2008/2009
Genome-wide scan reveals association of psoriasis with IL-23 and NF-κB pathways
, which genotyped 438,670 SNPs in 1,409 psoriasis cases and 1,436 controls (European ancestry) with follow-up in 5,048 cases and 5,041 controls. The IL23A-proximal SNP rs2066808 showed a combined p-value of 1×10⁻⁹ with an odds ratio of 1.34 — making the IL23A locus one of the most robustly associated genetic regions in psoriasis outside the MHC.

A UK/Ireland psoriatic arthritis replication study33 UK/Ireland psoriatic arthritis replication study
Confirmation of TNIP1 and IL23A as susceptibility loci for psoriatic arthritis
extended this finding to joint disease, confirming IL23A rs2066808 at p = 9.1×10⁻⁷ in PsA — a more stringent genome-wide significance threshold than the skin-only GWAS achieved independently, establishing that the IL23A genetic association spans both skin and joint manifestations of psoriatic disease.

A Romanian IL23A haplotype study44 Romanian IL23A haplotype study
Sarbu et al., 2013
fine-mapped the IL23A locus using both rs2066808 (intron of STAT2, 3.7 kb downstream of IL23A) and rs11171806 (exon 3 of IL23A). In 94 PsA patients and 161 controls, the IL23A exonic variant rs11171806 showed OR 0.39 (p=0.03), and the AC haplotype was protective (p=0.03), confirming that the IL23A genomic region harbours genuine disease-susceptibility architecture across multiple variants.

A Spanish pharmacogenomic study55 Spanish pharmacogenomic study
Robledinos-Antón et al., 2018, Hum Immunol
examined IL12B, IL23A, IL23R, and HLA-C*06 variants as predictors of both psoriasis susceptibility and biologic treatment response. The IL23A variants studied did not reach significance for PASI75/PASI90 response to ustekinumab — an important finding that suggests the IL23A locus may better predict susceptibility than response to anti-p40 therapy, and that future studies specifically examining anti-p19 (guselkumab, risankizumab) response in IL23A genotype-stratified cohorts are needed.

Practical Actions

The most immediate clinical relevance of IL23A-locus variants is pharmacogenomic. Anti-p19 biologics (guselkumab, risankizumab, tildrakizumab) selectively inhibit IL-23 while preserving IL-12-mediated immunity to intracellular pathogens — an advantage over ustekinumab, which blocks both. For individuals with psoriasis or psoriatic arthritis who carry the C risk allele at rs7746808, the IL-23/Th17 pathway is biologically implicated in their disease architecture, supporting the rationale for anti-p19 biologic selection. However, no formal CPIC/DPWG guideline yet exists for IL23A variants in anti-p19 biologic dosing — this remains an area of active pharmacogenomic research.

For susceptibility, the IL23A C-risk-allele carriers should be aware that inflammatory comorbidities extending beyond skin are relevant: IL-23 pathway dysregulation contributes to Crohn's disease, ulcerative colitis, ankylosing spondylitis, and uveitis — all conditions with elevated prevalence in psoriatic disease populations. Monitoring for gut symptoms and joint pain beyond the skin presentation is warranted.

Interactions

The IL-23/Th17 axis forms a tightly integrated genetic system. rs7746808 (IL23A locus) acts upstream in the pathway: IL23A provides the p19 subunit driving Th17 expansion. rs11209026 (IL23R R381Q) encodes a loss-of-function receptor variant that is strongly protective against psoriasis, PsA, and IBD by reducing IL-23 receptor sensitivity — carriers of the IL23R protective allele may experience attenuation of the IL23A risk-allele effect. rs12188300 (IL12B) encodes the shared p40 subunit; elevated p40 increases both IL-12 and IL-23 availability, compounding IL23A-driven Th17 activation. rs33980500 (TRAF3IP2/Act1) lies downstream, amplifying IL-17 receptor signaling after Th17 cells have already been expanded by IL-23 — making this a downstream amplifier of the IL23A-initiated cascade.

rs9340799

ESR1 XbaI polymorphism

Moderate Risk Factor

ESR1 XbaI — How This Estrogen Receptor Variant Shapes Reproductive Outcomes

Estrogen receptor alpha (ERα), encoded by ESR1 on chromosome 6, is the master mediator of estrogen signaling across reproductive tissues, bone, brain, and the cardiovascular system. When estrogen binds ERα, the receptor dimerizes, translocates to gene promoters, and drives transcription of hundreds of downstream targets governing menstrual cyclicity, endometrial receptivity, follicular growth, and uterine function. The rs9340799 XbaI polymorphism — an A-to-G substitution deep in the first intron of ESR1 — does not change the receptor's amino acid sequence, but accumulating evidence suggests it influences receptor expression levels and estrogen-signaling efficiency in reproductive tissues.

The variant earned its name from the restriction enzyme XbaI: the A allele creates a recognition site that is cut by XbaI, while the G allele abolishes this site. Because it sits in an intron and has been studied extensively by RFLP genotyping, the literature uses two parallel naming conventions — the "XbaI A/G" allele system (used here, as it corresponds to the plus-strand GRCh38 alleles) and the older "X/x" uppercase/lowercase notation where uppercase X denotes absence of the restriction site (the G allele).

The Mechanism

rs9340799 lies within intron 1 of ESR1, a region that contains regulatory elements influencing transcription and alternative splicing. The A>G change is in strong linkage disequilibrium11 linkage disequilibrium
LD means the two variants are inherited together so often that knowing one predicts the other
with the adjacent PvuII polymorphism (rs2234693), and both are typically analyzed as a haplotype. The G allele at rs9340799 disrupts predicted regulatory motifs and appears to reduce baseline ERα expression in endometrial tissue relative to the A allele, which may alter estrogen sensitivity during the implantation window.

The precise molecular mechanism remains incompletely characterized. The variant may act through altered splicing efficiency, effects on enhancer activity, or changes to the local chromatin environment. Crucially, the same G allele that reduces endometrial estrogen responsiveness may paradoxically protect against bone fractures (potentially via altered bone-specific ERα isoform expression) and against male infertility — illustrating how tissue-specific regulatory effects can produce direction-switching outcomes across biological contexts.

The Evidence

Endometriosis and IVF outcomes: A Brazilian case-control study (98 endometriosis patients, 115 IVF-failure patients, 134 fertile controls)22 Brazilian case-control study (98 endometriosis patients, 115 IVF-failure patients, 134 fertile controls)
Paskulin et al. Disease Markers, 2013
found that the GG genotype was strongly associated with endometriosis-related infertility (OR 4.67, 95% CI 1.84–11.83, P = 0.001) and with IVF failure (OR 3.33, 95% CI 1.38–8.03, P = 0.007). In the same cohort, the AA genotype was significantly more common in fertile controls, suggesting the A allele supports normal endometrial function.

Consistent with this, a cross-sectional IVF study of 136 Brazilian infertile women33 cross-sectional IVF study of 136 Brazilian infertile women
de Mattos et al. Journal of Ovarian Research, 2014
found the AA genotype was associated with significantly more follicles, more mature oocytes, and better embryo quality during controlled ovarian hyperstimulation — suggesting the A allele confers superior ovarian responsiveness to FSH stimulation.

Premature ovarian insufficiency: An Iranian case-control study (150 POI cases, 150 controls under age 35)44 Iranian case-control study (150 POI cases, 150 controls under age 35)
Eshaghi et al. International Journal of Reproductive Biomedicine, 2022
found significant associations between rs9340799 genotypes and premature ovarian insufficiency. The T(PvuII)/A(XbaI) haplotype (i.e., carrying the rs2234693 T allele with the rs9340799 A allele) was identified as a risk haplotype for POI, suggesting complex genotype × context effects at this locus.

Severe pre-eclampsia: A meta-analysis of seven studies55 meta-analysis of seven studies
Zhao et al. Bioscience Reports, 2019
found the GG genotype was associated with elevated severe pre-eclampsia risk (OR 1.67, 95% CI 1.10–2.25, P = 0.017), while neither genotype was associated with mild pre-eclampsia. The evidence is modest given the small number of constituent studies and limited statistical power.

Fractures: The large GENOMOS consortium study of 18,917 individuals across eight European centers66 GENOMOS consortium study of 18,917 individuals across eight European centers
Ioannidis et al. JAMA, 2004
found that the GG genotype (absent XbaI site) was associated with a 19% reduction in all fractures (OR 0.81, P = 0.002) and a 35% reduction in vertebral fractures (OR 0.65, P = 0.003) — an effect independent of bone mineral density. This is a protective effect of the G allele that contrasts with its reproductive-tissue risk profile.

Breast cancer: A meta-analysis of 34,721 subjects across 23 studies77 meta-analysis of 34,721 subjects across 23 studies
Tan et al. Scientific Reports, 2021
found no significant association between rs9340799 and breast cancer risk across all genetic models tested, settling a previously contested question.

Practical Implications

For women, the strongest clinically actionable signals from rs9340799 are in the reproductive domain: the GG genotype is associated with elevated endometriosis-related infertility risk and IVF failure, and with greater severe pre-eclampsia susceptibility. Heterozygous AG carriers show intermediate risk. The AA genotype is associated with better ovarian responsiveness during IVF stimulation.

Women carrying the GG genotype who are planning pregnancy should discuss their genetic profile with a reproductive specialist, particularly if IVF is anticipated. Early referral and personalized FSH dosing protocols informed by this variant may improve controlled ovarian hyperstimulation outcomes. GG carriers with symptoms suggestive of endometriosis warrant proactive evaluation rather than symptom normalization.

For men, the G allele appears protective against infertility, suggesting sex-specific regulatory effects on the reproductive axis.

Interactions

rs2234693 (ESR1 PvuII): rs9340799 and rs2234693 are in strong linkage disequilibrium and are most informative as a haplotype. The C(PvuII)/G(XbaI) haplotype was associated with approximately 5-fold elevated coronary artery disease risk in one Indian case-control study. For reproductive outcomes, the T/A haplotype (rs2234693 T + rs9340799 A) appears relevant to premature ovarian insufficiency. Compound action proposal: women carrying both rs2234693 TT and rs9340799 GG genotypes may represent a subgroup with elevated reproductive vulnerability warranting earlier fertility workup; evidence level moderate.

rs2046210 (ESR1 promoter): A 2024 study found rs2046210 GA heterozygosity significantly increased endometriosis risk and elevated ESR1 expression in eutopic endometrium, suggesting that multiple ESR1 regulatory variants contribute independently to endometriosis susceptibility. Women carrying risk alleles at both rs9340799 and rs2046210 may have compounded estrogen-signaling dysregulation.

rs7521902 (near WNT4): WNT4 signaling suppresses androgen production and supports female reproductive development; its GWAS locus is one of the most replicated endometriosis signals. Carrying risk alleles at both ESR1 and WNT4 loci may confer additive endometriosis susceptibility, though formal interaction testing has not been published.

rs17250932

TBX21 TBX21 Promoter Variant

Emerging Risk Factor

TBX21 Promoter — Neonatal Th2 Cytokine Programming and Atopic Risk

The TBX21 gene encodes T-bet, the master transcription factor that drives naive CD4+ T cells toward the Th1 differentiation path11 Th1 differentiation path
Th1 cells produce IFN-γ and suppress IgE production; when T-bet is robust, Th2 overactivation — the driver of allergic disease — is held in check
. rs17250932 is a single-nucleotide variant situated approximately 1.5 kilobases upstream of the TBX21 transcription start site in the promoter region. The position is designated -1514T/C in the literature (numbering from the start codon) — the T allele is the common reference, the C allele the minor variant. Unlike the well-characterized downstream promoter variant rs4794067 (-1993T/C), rs17250932 has not been studied in functional promoter activity assays, but its proximity to the transcription start site places it squarely in the regulatory region where transcription factor binding determines how strongly TBX21 is expressed.

The Mechanism

Variants in the TBX21 upstream promoter region influence binding of transcriptional activators and repressors to core promoter elements. The related -1993T/C variant (rs4794067)22 related -1993T/C variant (rs4794067)
Akahoshi et al. Hum Genet 2005; the C allele creates a novel nuclear protein binding site, directly increasing TBX21 transcriptional activity and associating with aspirin-induced asthma at OR 1.93
demonstrates that single-nucleotide changes in this promoter region have direct, measurable effects on TBX21 expression. For rs17250932, the functional readout is cytokine-level: in a German birth cohort of 200 neonates, Casaca et al. 201233 Casaca et al. 2012
TBX21 and HLX1 polymorphisms influence cytokine secretion at birth; PLoS One; cord blood mononuclear cells genotyped and stimulated with lipid A (LpA)
found that carriers of the C allele showed reduced or trendwise-reduced (p ≤ 0.07) secretion of IL-5, IL-13, and TNF-α after innate immune stimulation with lipid A. IL-5 and IL-13 are canonical Th2 effector cytokines — IL-5 drives eosinophil maturation and survival, while IL-13 promotes mucus production, airway hyperresponsiveness, and IgE class-switching. Reduced secretion of these cytokines at birth suggests that C allele carriers are born with a less Th2-primed innate immune response, providing a window of relative protection during the critical early-life immune programming period.

The T allele (common, reference) is associated with higher neonatal Th2 cytokine output — a pro-atopic starting point that, when compounded by environmental triggers (antibiotic exposure, low microbial diversity, allergen sensitization), can translate into clinical atopic disease by the third year of life.

The Evidence

The primary atopic evidence comes from the Casaca et al. 2012 birth cohort44 Casaca et al. 2012 birth cohort
200 German neonates; cord blood mononuclear cells genotyped for TBX21 and HLX1 polymorphisms; cytokine secretion measured after LpA stimulation; children followed to age 3 for atopic disease outcomes
. Among the 184 genotyped neonates, 113 were TT homozygotes, 61 were TC heterozygotes, and 10 were CC homozygotes. Carriers of the C allele (TC + CC combined) showed reduced IL-5 and IL-13 secretion after innate stimulation — the direction consistent with attenuated neonatal Th2 polarization. The statistical threshold was p ≤ 0.07, reflecting the limited statistical power of a 200-person cohort where only 71 carry at least one C allele. This must be counted as [emerging evidence | a p-value of 0.07 does not meet the conventional 0.05 threshold; the signal is directionally consistent with the TBX21 biology but requires replication in larger cohorts].

The broader TBX21 promoter context extends the picture. In a Japanese autoimmune study, the T allele of the related -1514T/C position55 T allele of the related -1514T/C position
Morita et al. Autoimmunity 2012; 66 intractable GD patients, 47 GD remission, 79 controls
was enriched in patients with intractable Graves' disease compared to those who achieved remission — suggesting that the T allele promotes stronger TBX21-driven Th1 immune activity, which in the autoimmune context amplifies disease persistence. A Chinese SLE study66 Chinese SLE study
You et al. Scand J Rheumatol 2010; 248 SLE cases, 261 controls
found the -1514T allele significantly more frequent in SLE patients, and that the CC haplotype (-1993C/-1514C) was protective against SLE with OR 0.316 (95% CI 0.167–0.599, p = 0.0004). A subsequent meta-analysis in Asian populations77 meta-analysis in Asian populations
Wang et al. Allergol Immunopathol 2021; 12 studies, 3,834 cases, 4,824 controls
did not confirm rs17250932 as a significant autoimmune risk variant in pooled analysis, tempering the earlier single-study SLE signal. Taken together, the evidence is consistent with the C allele reducing TBX21 promoter output — less T-bet means less Th1 (and Th2) cytokine drive, an effect that is measurably protective for neonatal Th2-mediated atopy, and potentially modulates the Th1-driven autoimmune phenotype as well.

Practical Actions

TT homozygotes — the common form carried by roughly 70% of people globally — show the highest neonatal IL-5/IL-13 output at innate stimulation and represent the atopy-susceptible reference. For TT carriers who develop or whose infants develop atopic symptoms, the TBX21 genetic context supports interventions targeting the Th2 cytokine axis: specifically IL-5-driven eosinophilia and IL-13-driven airway and skin inflammation.

TC and CC carriers demonstrate attenuated neonatal Th2 cytokine production. The CC genotype is rare (~2.6% globally), and no prospective atopic outcome data for CC specifically are available from this small cohort (only 10 CC participants in Casaca 2012). The evidence does not support treating CC individuals as categorically protected from atopic disease.

Interactions

rs17250932 lies in the TBX21 5' regulatory region, approximately 500 bp upstream of the better-characterized rs4794067 (-1993T/C) promoter variant. These two promoter variants are in partial linkage disequilibrium — the protective CC haplotype spanning both positions (rs4794067-C/rs17250932-C) was the SLE-protective configuration in You et al. 2010. Within the TBX21 locus, the other variants rs11079788 (intron 3, Treg-related) and rs11650354 + rs16947078 (allergic asthma risk haplotype) are in separate LD blocks and likely operate through distinct regulatory mechanisms. HLX1 variants (including rs2738751) interact with TBX21 polymorphisms to amplify their effects on both neonatal cytokine output and childhood asthma risk.

rs1784931

SORL1 SORL1 intron 39 variant

Moderate Risk Factor

SORL1 — The APP Traffic Controller That Keeps Alzheimer's at Bay

Every neuron in your brain constantly recycles its membrane proteins through a network of intracellular sorting stations called endosomes. One of the most important proteins cycling through this network is APP — amyloid precursor protein — the molecule whose misprocessing produces the sticky amyloid-beta plaques that define Alzheimer's disease. The sorting receptor that decides which fate APP meets is encoded by SORL1 (sortilin-related receptor 111 sortilin-related receptor 1
Also known as SORLA or LR11; a 2,214-amino-acid transmembrane receptor expressed at highest levels in neurons
). When SORL1 is present and functional, it binds APP in early endosomes and hands it off to the retromer complex for retrograde transport back to the trans-Golgi network — diverting it away from the late-endosomal compartments where β-secretase and γ-secretase cleave APP into amyloid-beta fragments. When SORL1 levels fall, APP lingers in late endosomes, amyloid-beta production rises, and the neuronal housekeeping system begins to fail.

rs1784931 is an intronic variant in the 3′ region of SORL1 (intron 39, c.5323-507C>A, chromosome 11 position 121,612,229 GRCh38). It does not alter the protein but tags a haplotype block that influences how efficiently the gene is expressed in brain tissue. The A allele marks a risk haplotype associated with reduced SORL1 mRNA and protein in the brain, while the C allele tags a more protective configuration associated with higher receptor levels.

The Mechanism

SORL1 intersects with APP at multiple trafficking nodes. In the early endosome, SORL1 physically binds APP through both its extracellular LDLR class-A domain and its intracellular FANSHY motif22 FANSHY motif
the cytoplasmic tail sequence that engages the retromer VPS35 subunit
. This dual grip allows SORL1 to transfer APP to the retromer complex, which routes it retrogradely to the trans-Golgi network. The net effect is that APP bypasses the late-endosomal compartment where BACE1 (β-secretase) and γ-secretase are concentrated. When SORL1 is knocked out in human iPSC-derived neurons, APP accumulates in enlarged early endosomes33 APP accumulates in enlarged early endosomes
Knupp et al. 2022 showed significantly increased early-endosomal APP localization, enlarged Rab5+ and Rab11+ compartments, and impaired surface recycling of multiple cargo proteins
, APP trafficking is disrupted, and amyloid-beta production increases. SORL1 also routes Aβ peptides already formed toward lysosomes for degradation via its VPS10 domain — a second line of defense that is also lost when receptor levels fall.

The rs1784931 A allele tags a 3′ haplotype block (in linkage disequilibrium with rs1699102, rs3824968, rs2282649, and rs1010159) that was shown to alter codon usage in SORL1 transcripts, reducing translational efficiency44 reducing translational efficiency
Caglayan et al. 2012 showed the risk haplotype changes frequent-to-rare codon usage in minor transcripts, impairing ribosomal elongation rate
. Carriers of the risk haplotype at the 3′ cluster show reduced SORL1 protein in frontal cortex autopsy tissue from Alzheimer's disease brains. Separately, A-allele homozygotes at rs1784931 showed significantly reduced prefrontal SORL1 mRNA compared to CC homozygotes in post-mortem brain data, with the effect driven by Caucasian samples.

The Evidence

The genetic association between SORL1 common variants and late-onset Alzheimer's disease was first established by Rogaeva et al. in a landmark Nature Genetics paper55 Rogaeva et al. in a landmark Nature Genetics paper
Study of 19 SNPs in four ethnicities: Caucasian (Mayo Clinic, 1,400 cases/controls), Caribbean-Hispanic, Israeli-Arab, and African-American; two haplotype clusters independently replicated across populations
. They identified two distinct LD blocks: a 5′ cluster (SNPs 8-10) and a 3′ cluster (SNPs 22-25). The risk haplotype for the 3′ cluster showed odds ratios for AD of approximately 1.3–1.5 across multiple populations, with consistent directionality of association. rs1784931 falls within the same 3′ linkage disequilibrium block as this cluster.

The association was replicated in a 2007 multiethnic community cohort66 2007 multiethnic community cohort
Lee et al. 2007 — 296 AD cases, 428 controls, African-American, Caribbean-Hispanic, and non-Hispanic white participants from northern Manhattan
and subsequently confirmed by a large meta-analysis of 12,464 AD cases and 17,929 controls77 meta-analysis of 12,464 AD cases and 17,929 controls
Reitz et al. 2011 — both the 5′ CGC haplotype (SNPs 8-10) and 3′ SNPs (19, 23-25) remained significant after Bonferroni correction across white and Asian populations
. The same 3′ haplotype variants were linked to multiple AD endophenotypes: white matter hyperintensities, hippocampal atrophy on MRI, and cerebrospinal fluid amyloid-beta levels.

For rs1784931 specifically, the C;C genotype is associated with approximately 0.83× the population risk of Alzheimer's disease (a ~17% relative risk reduction), while A;A homozygotes show modestly elevated risk. The effect is modest in isolation — SORL1 common variants are secondary risk factors compared to APOE ε4, but the mechanism is well-validated and the biology directly connects reduced SORL1 expression to amyloid-beta accumulation.

Practical Actions

rs1784931 is a common intronic variant, not a rare pathogenic mutation. Carriers of the A allele do not have a SORL1 deficiency syndrome — they simply sit on a haplotype associated with somewhat less efficient SORL1 expression in brain tissue. The modest effect on AD risk (OR ~1.2-1.3 per A allele copy) means this variant is informative for long-range risk awareness rather than urgent clinical action.

SORL1 expression is upregulated by brain-derived neurotrophic factor (BDNF) and by retinoic acid88 retinoic acid
BDNF and retinoic acid signaling promote SORL1 transcription in neurons; A-allele carriers show blunted BDNF-induced SORL1 upregulation in some studies
. Regular aerobic exercise is the best-validated intervention for raising brain BDNF levels, and the SORL1-BDNF connection makes this particularly relevant for carriers of the A allele. Monitoring cognitive function over time — through validated tools such as the MoCA or periodic neuropsychological assessment — is prudent for A;A homozygotes with additional risk factors (APOE ε4, family history, cardiovascular risk).

Interactions

The most clinically important interaction is with APOE ε4 (rs429358 / rs7412). SORL1 mediates neuronal cholesterol uptake by acting as a receptor for apolipoprotein E, with preferential binding to the APOE ε4 isoform. Individuals who carry both an A allele at rs1784931 and APOE ε4 face compounded amyloid-beta accumulation risk: reduced SORL1 levels impair APP routing while APOE ε4 independently amplifies amyloidogenic processing and impairs Aβ clearance. The combination is additive or possibly supra-additive.

The retromer pathway also connects SORL1 to VPS35 variants (rs6733839, VPS35 D620N): retromer dysfunction from either SORL1 deficiency or VPS35 loss produces similar endosomal trafficking failures. Carriers of A alleles at rs1784931 with concurrent retromer pathway variants may warrant earlier cognitive monitoring.

rs1800909

GGH c.16T>C (p.Cys6Arg)

Moderate Risk Factor

GGH c.16T>C — A Signal Peptide Variant That Shifts Folate Balance

Every folate molecule entering a cell faces the same fate: to be useful, it must be trapped. Cells accomplish this by adding chains of glutamate — called polyglutamate tails11 polyglutamate tails
Polyglutamylation: the enzymatic addition of 2–8 glutamate residues to folate, anchoring it inside the cell. Polyglutamylated folate is the metabolically active form used by folate-dependent enzymes; monoglutamylated folate can cross cell membranes and leave the cell
— that anchor folate inside the cell. GGH (gamma-glutamyl hydrolase) is the enzyme that removes these tails, converting retained polyglutamates back to exportable monoglutamates. Adjusting GGH activity is therefore one of the primary levers controlling intracellular folate retention.

The rs1800909 variant (c.16T>C in coding notation) falls in a part of the GGH gene that most genetics tests would classify as "probably benign" — the signal peptide that directs newly synthesized GGH protein to the endoplasmic reticulum. Yet this variant shows up consistently in studies of folate metabolism and methotrexate pharmacokinetics, suggesting the signal peptide change has subtle but real functional consequences.

The Mechanism

GGH sits on chromosome 8 at position 63,038,752 on the plus strand (GRCh38), within the GGH gene which itself runs on the minus strand. The c.16T>C change (T at coding position 16, C in the variant) alters codon 6 of the precursor protein — or equivalently codon -19 relative to the start of the mature protein22 mature protein
Mature protein: GGH contains a signal peptide of about 25 amino acids that is cleaved during processing; the "mature" protein is what remains after signal peptide removal in the ER
. Chave et al. (2003)33 Chave et al. (2003)
Chave KJ et al. Identification of single nucleotide polymorphisms in the human gamma-glutamyl hydrolase gene and characterization of promoter polymorphisms. Mutat Res, 2003
were the first to characterize this polymorphism, noting that it changes the signal peptide from cysteine to arginine (Cys6Arg). Since the signal peptide directs GGH to the ER for processing and eventual lysosomal localization, an amino acid change here could alter protein folding during transit, signal peptide cleavage efficiency, or the quantity of functional GGH that reaches its final destination.

Unlike the promoter variant rs11545076 (-124T>G), which straightforwardly increases GGH transcription, the mechanism of rs1800909 is more subtle. The bioinformatics prediction for this variant is "benign" per in-silico tools noted in the Singapore Chinese cohort study44 noted in the Singapore Chinese cohort study
Oppeneer SJ et al. Genetic variation in FPGS and GGH and plasma homocysteine levels in the Singapore Chinese Health Study. Mol Genet Metab, 2012
, yet three independent human studies find meaningful associations with folate pathway outcomes — suggesting a real but modest functional effect that current computational prediction tools underestimate.

The Evidence

The most direct evidence for a folate-related effect comes from a cohort of 482 Singapore Chinese adults55 cohort of 482 Singapore Chinese adults
Oppeneer SJ et al. Genetic variation in FPGS and GGH and plasma homocysteine levels in the Singapore Chinese Health Study. Mol Genet Metab, 2012
. Among nine GGH SNPs tested, rs1800909 was one of three associated with plasma homocysteine (p=0.005 multivariate-adjusted). The direction is counterintuitive: carriers of the C allele (G on the plus strand) had lower homocysteine than TT homozygotes — TT averaged 10.0 nmol/L versus 9.4 nmol/L in CT heterozygotes and 9.7 nmol/L in CC homozygotes. The R² was 0.022 (explaining ~2% of variance), and the authors noted the ~0.6 nmol/L difference "would not be considered clinically meaningful" on its own. Only the promoter variant rs11545076 survived multiple-comparison correction in this study.

The pharmacogenomics picture is more consistent. A study of 70 Japanese rheumatoid arthritis patients66 study of 70 Japanese rheumatoid arthritis patients
Hakamata J et al. Factors Predicting the Therapeutic Response to Methotrexate in Japanese Patients with Rheumatoid Arthritis. Biol Pharm Bull, 2018
found by multivariate logistic regression that patients carrying the C allele of GGH 16T>C had a better therapeutic response to methotrexate than TT homozygotes. A complementary study of 149 Chinese pediatric ALL patients77 study of 149 Chinese pediatric ALL patients
Li M et al. Frequency distribution of five SNPs in human GGH gene and their effects on clinical outcomes of Chinese pediatric patients with acute lymphoblastic leukemia. Pharmazie, 2020
found that a haplotype including rs1800909 was associated with improved event-free survival (89.2% vs 71.9%) and lower relapse rates versus wild-type haplotype carriers. A European RA study of 352 patients88 European RA study of 352 patients
van der Straaten RJHM et al. Exploratory analysis of four polymorphisms in human GGH and FPGS genes and their effect in methotrexate-treated rheumatoid arthritis patients. Pharmacogenomics, 2007
found no significant association with MTX efficacy or toxicity overall, though there was a trend in haplotype analysis. The GGH 16T>C variant was also identified among polymorphisms associated with chemotherapy toxicity in osteosarcoma99 chemotherapy toxicity in osteosarcoma
Hattinger CM et al. Candidate germline polymorphisms of genes belonging to the pathways of four drugs used in osteosarcoma standard chemotherapy. Oncotarget, 2016
.

Practical Actions

The C allele (G on plus strand) of rs1800909 is associated with modestly lower plasma homocysteine and appears to improve methotrexate response in some populations. For everyday folate metabolism, the effect is subtle — standard dietary folate intake supports adequate homocysteine levels for most carriers. The more actionable consequence is pharmacogenomic: if you carry the G allele (C in coding notation) and take methotrexate for rheumatoid arthritis, psoriasis, or cancer, you may have altered MTX polyglutamate kinetics that your treating clinician should be aware of. Because GGH activity affects how long methotrexate is retained in cells as active polyglutamates, this variant may influence both MTX efficacy and side-effect burden.

The variant's context matters: rs1800909 never occurs in isolation in the GGH gene. Individuals who also carry the promoter variant rs11545076 (C allele) — which more strongly elevates GGH expression and depletes intracellular folate — face a compounded effect. Supporting folate status through methylfolate (5-MTHF) rather than synthetic folic acid makes sense for anyone carrying multiple GGH variants, since it bypasses the conversion step and provides the active form directly.

Interactions

The most clinically significant interaction is with rs11545076 (GGH -124T>G), the promoter variant that dramatically increases GGH transcription. Rs11545076 has a larger and better-replicated effect on folate retention and DNA uracil accumulation (up to 73% higher in CC homozygotes). Individuals carrying the risk allele at rs1800909 alongside the risk allele at rs11545076 face overlapping GGH pathway disruption through two independent mechanisms.

For methotrexate pharmacology, rs1800909 interacts with rs11545078 (GGH 452C>T, Thr127Ile), the coding variant with the strongest characterized functional effect on GGH catalytic activity (2.7-fold higher Km for long-chain substrates). Together, variants in the signal peptide (rs1800909) and catalytic domain (rs11545078) may produce compound effects on MTX polyglutamate retention in target tissues.

Rs1800909 is in partial linkage disequilibrium with rs3758149 (GGH -401C>T promoter variant) and other GGH polymorphisms — haplotype-level analysis often shows stronger associations than any individual SNP, as seen in the Li et al. 2020 leukemia study.

HNF4A Thr130Ile — A Hepatocyte Switch With Metabolic Consequences

HNF4A11 HNF4A
Hepatocyte Nuclear Factor 4-Alpha: a master transcription factor that controls hundreds of target genes in the liver, pancreatic beta cells, kidney, and intestine. It sits at the apex of the MODY transcription factor cascade, directly activating HNF1A expression
. The Thr130Ile variant (T130I) substitutes threonine for isoleucine at position 130 in the DNA-binding domain — a domain so conserved it is essentially identical across vertebrates. The result is a partial, tissue-selective loss of transcriptional activity that affects the liver more than the pancreas, tilting the metabolic balance toward insulin resistance and dyslipidemia in T allele carriers.

The Mechanism

The p.Thr130Ile substitution falls within the DNA-binding domain of HNF4A. Zhu et al. 200322 Zhu et al. 2003
Zhu Q et al. T130I mutation in HNF-4alpha gene is a loss-of-function mutation in hepatocytes and is associated with late-onset Type 2 diabetes mellitus in Japanese subjects. Diabetologia, 2003
demonstrated that T130I retains normal transcriptional activity in HeLa cells and MIN6 beta-cell-like cells, but shows 27–78% of wild-type activity in HepG2 hepatoma cells and primary cultured mouse hepatocytes. The functional impairment is therefore hepatocyte-specific, not pancreatic — a key distinction from the truly pathogenic MODY1 nonsense variants (e.g. rs137853334, p.Gln277Ter) that ablate activity in all tissues.

In hepatocytes, HNF4A normally drives expression of genes controlling [gluconeogenesis | the production of glucose from non-carbohydrate sources in the fasting state], lipid metabolism (ApoAII, ApoB, ApoCI/II, CETP), and bile acid synthesis. Partial loss of HNF4A activity in the liver disrupts these programs, producing the clinical signature seen in T130I carriers: elevated fasting glucose, insulin resistance, hypertriglyceridemia, and reduced HDL-cholesterol.

The Evidence

The largest analysis of T130I and type 2 diabetes comes from a meta-analysis by Jafar-Mohammadi et al. 201133 meta-analysis by Jafar-Mohammadi et al. 2011
Jafar-Mohammadi B et al. A role for coding functional variants in HNF4A in type 2 diabetes susceptibility. Diabetologia, 2011
combining data from 14,279 cases and 26,835 controls: OR 1.20 (95% CI 1.10–1.30), p=2.1×10⁻⁵. This is a modest but well-replicated effect. The original Japanese discovery cohort (Zhu 2003) found a larger OR of 4.3 in a smaller sample — the larger meta-analytic OR of 1.20 is the more reliable estimate.

Population-specific effects are notable. Granados-Silvestre et al. 201744 Granados-Silvestre et al. 2017
Granados-Silvestre MA et al. Susceptibility background for type 2 diabetes in eleven Mexican Indigenous populations: HNF4A gene analysis. Mol Genet Genomics, 2017
found the T allele at 2.7–16% in Mexican indigenous groups, with a consistent association with hypertriglyceridemia. The variant appears to have been positively selected in some Amerindian populations, possibly conferring metabolic advantage under historical nutritional stress while now contributing to T2D risk in a calorie-dense environment.

A pediatric metabolic syndrome association was documented by García-Rodríguez et al. 202055 García-Rodríguez et al. 2020
García-Rodríguez MH et al. Association of the T130I Variant of the HNF4A Gene with Metabolic Syndrome and Its Components in Mexican Children. Metab Syndr Relat Disord, 2020
: OR 2.31 (95% CI 1.10–4.83, p=0.026) for metabolic syndrome in children, with abdominal obesity as the dominant component (OR 1.20). This is the earliest age at which T130I phenotypic consequences have been documented.

The Spanish family study by Cieza-Borrella et al. 201466 Cieza-Borrella et al. 2014
Cieza-Borrella C et al. Early-onset type 2 diabetes mellitus is associated to HNF4A T130I polymorphism in families of central Spain. J Investig Med, 2014
found that T130I carriers in diabetes families had significantly higher HbA1c and fasting glucose than non-carriers, with diabetes onset often triggered by stressful situations and tightly linked to gestational diabetes in female carriers.

Practical Actions

The key implication for T allele carriers is liver-centric metabolic monitoring: fasting lipid panel (particularly triglycerides and HDL), fasting glucose, and HbA1c. The hepatocyte-specific nature of the dysfunction means insulin secretion capacity is less impaired than in full MODY1 patients, but liver-driven insulin resistance and dyslipidemia accumulate over time. Reducing hepatic triglyceride burden — through dietary carbohydrate restriction and omega-3 fatty acids — directly addresses the HNF4A-driven lipid dysregulation pathway.

Homozygous TT carriers are extremely rare (~0.1%) and warrant additional vigilance: both alleles carry the partial loss-of-function, potentially doubling hepatic HNF4A impairment and compounding metabolic risk.

Interactions

HNF4A directly activates HNF1A transcription. Carriers who also have functional variants in HNF1A (rs1169288, Ile27Leu) or HNF1B may have compounded transcription factor cascade disruption. The T130I variant in HNF4A affects the DNA-binding domain, while the truly pathogenic MODY1 nonsense variants (rs137853334) eliminate transactivation entirely — these are mechanistically distinct and should not be conflated in clinical communication.

CYP2E1 — Your Liver's Chemical Processor

CYP2E1 is a cytochrome P450 enzyme11 cytochrome P450 enzyme
Phase I drug-metabolizing enzyme that oxidizes substrates to prepare them for excretion
responsible for metabolizing ethanol, acetaminophen, volatile organic compounds, and several anesthetic agents including sevoflurane and isoflurane. A key variant in its upstream promoter region — rs2070672 (historically the RsaI/PstI polymorphism) — alters how strongly the gene is expressed, shifting the balance between normal detoxification and toxic metabolite generation.

The Mechanism

The rs2070672 SNP sits approximately 2 kilobases upstream of the CYP2E1 transcription start site in the 5' regulatory region. The reference A allele (c1 haplotype) and the alternate G allele (c2 haplotype) affect transcription factor binding at the promoter. The c2 (G) allele is associated with higher basal CYP2E1 mRNA expression22 higher basal CYP2E1 mRNA expression
~1.7-fold higher expression in c1/c2 heterozygotes versus c1/c1 homozygotes in non-drinkers; with alcohol, the induction is ~2.0-fold greater
and greater enzyme inducibility when substrates like ethanol or isoniazid are present. This means c2 carriers metabolize substrates faster, generating more reactive intermediates per unit time.

CYP2E1's two most important substrates for toxicity are acetaminophen and ethanol:

  • Acetaminophen: CYP2E1 converts a small fraction (~5–10%) of each dose into NAPQI33 NAPQI
    N-acetyl-p-benzoquinone imine — the reactive electrophile responsible for centrilobular hepatocellular necrosis in acetaminophen overdose
    . Higher enzyme activity means more NAPQI per dose.
  • Ethanol: CYP2E1 is the microsomal ethanol-oxidizing system (MEOS), converting ethanol to acetaldehyde and generating reactive oxygen species (ROS) in the process. Chronic alcohol consumption itself upregulates CYP2E1 expression, amplifying the oxidative stress cycle.

The Evidence

Acetaminophen metabolism: A study by Ueshima et al. (1996)44 Ueshima et al. (1996)
Ueshima Y et al. Acetaminophen metabolism in patients with different cytochrome P-4502E1 genotypes. Alcohol Clin Exp Res, 1996
found that c2/c2 individuals cleared acetaminophen at more than twice the rate of c1/c1 homozygotes, while c1/c2 heterozygotes showed intermediate metabolism and marked enzyme induction after alcohol exposure.

Anti-tuberculosis drug hepatotoxicity: Isoniazid is metabolized partly by CYP2E1 into reactive intermediates. A meta-analysis by Wang et al. (2016)55 Wang et al. (2016)
Wang FJ et al. Update meta-analysis of the CYP2E1 RsaI/PstI and DraI polymorphisms and risk of antituberculosis drug-induced hepatotoxicity: evidence from 26 studies. J Clin Pharm Ther, 2016
(7,423 participants across 26 studies) found that the c1/c1 genotype (A/A at rs2070672) was associated with significantly higher risk of anti-TB drug-induced hepatotoxicity (OR 1.32, 95% CI 1.03–1.69), particularly in East Asians on combination therapy. This counterintuitive finding — c1/c1 showing higher hepatotoxicity risk — likely reflects the complex interplay between CYP2E1 inducibility and the specific toxicokinetics of isoniazid. A smaller clinical study by Vuilleumier et al. (2006)66 Vuilleumier et al. (2006)
Vuilleumier N et al. CYP2E1 genotype and isoniazid-induced hepatotoxicity in patients treated for latent tuberculosis. Eur J Clin Pharmacol, 2006
found that the *1A/*1A genotype (c1/c1) was significantly associated with isoniazid-induced liver enzyme elevation (OR 3.4, 95% CI 1.1–12).

Alcoholic liver disease: A study in Caucasians by Grove et al. (1998)77 Grove et al. (1998)
Grove J et al. The RsaI polymorphism of CYP2E1 and susceptibility to alcoholic liver disease in Caucasians. Pharmacogenetics, 1998
found that c2 allele carriers presented with alcoholic liver disease roughly 7 years earlier (mean 42 vs. 49 years), often with equally advanced pathology. The association was amplified in patients carrying the ADH3*2/*2 genotype, suggesting gene-gene synergy in alcohol-related liver injury.

Hepatocellular carcinoma: A meta-analysis by Liu et al. (2014)88 Liu et al. (2014)
Liu W et al. CYP2E1 gene polymorphism and alcohol drinking on the risk of HCC: a meta-analysis. Mol Biol Rep, 2014
found no standalone CYP2E1 genotype-HCC association, but when combined with alcohol exposure, the c2 variant allele was associated with OR 2.88 (95% CI 1.25–6.60).

Practical Actions

The key clinical contexts for this variant are:

  1. Acetaminophen dosing: Both genotypes face risk at high doses, but c2 carriers generate reactive NAPQI more rapidly. Strict adherence to dosing limits is especially important. Combining acetaminophen with alcohol (even moderate amounts) dramatically increases NAPQI production regardless of genotype.

  2. Isoniazid/anti-TB therapy: Paradoxically, c1/c1 individuals show higher hepatotoxicity risk from isoniazid — liver function monitoring during TB treatment is warranted regardless of genotype, but c1/c1 individuals may need more frequent checks.

  3. Anesthesia: CYP2E1 is the principal enzyme defluorinating sevoflurane and isoflurane. Altered expression may influence post-operative fluoride accumulation and anesthetic metabolism, particularly with repeated exposures.

  4. Occupational chemical exposure: CYP2E1 activates benzene, vinyl chloride, and other volatile organic compounds into genotoxic intermediates. G allele carriers with higher basal expression may face increased occupational chemical toxicity with sustained VOC exposure.

Interactions

rs2070672 is in strong linkage disequilibrium with rs2031920, the other key CYP2E1 5' flanking variant (RsaI site). These two SNPs form the c1/c2 haplotype block and are often studied together — their combined haplotype status is a more reliable predictor of CYP2E1 expression than either SNP alone. The G allele at rs2070672 alone should be interpreted in the context of the full CYP2E1 5' haplotype. The alcohol-interaction effects at this locus also interact with ADH1B/ADH1C (alcohol dehydrogenase) variants — carriers of both low-activity CYP2E1 and high-activity ADH genotypes may have distinct alcohol-exposure profiles.

rs225014

DIO2 Thr92Ala

Strong Risk Factor

DIO2 Thr92Ala — Why Some People Need T3 Instead of T4 Alone

Your thyroid gland secretes mostly T4 (thyroxine), an inactive prohormone that must be converted to T3 (triiodothyronine) to exert biological effects. This conversion happens locally in tissues11 This conversion happens locally in tissues
The brain derives up to 80% of its intracellular T3 from circulating T4 through local conversion
via the type 2 deiodinase (DIO2) enzyme. The Thr92Ala variant changes a threonine to alanine at position 92 of the DIO2 protein, altering enzyme efficiency22 altering enzyme efficiency
Castagna et al. J Clin Endocrinol Metab 2017
and altering its cellular behavior. This common variant affects roughly 36% of people of European ancestry33 roughly 36% of people of European ancestry
Present in 11-16% as CC homozygotes
and has become a focal point in debates about optimal thyroid hormone replacement therapy.

The Mechanism

The wild-type Thr92 version of DIO2 normally resides in the endoplasmic reticulum, where it efficiently converts T4 to T3. The Ala92 variant protein has altered cellular behavior44 The Ala92 variant protein has altered cellular behavior
Studies suggest reduced enzyme efficiency and altered protein dynamics
, which disrupts normal T4-to-T3 conversion. Comarella et al. found C-allele carriers had lower post-treatment weight variation in Graves' disease55 Comarella et al. found C-allele carriers had lower post-treatment weight variation in Graves' disease
Suggesting impaired metabolic adaptation linked to reduced DIO2 activity
, and the effect is most significant in tissues that rely heavily on local T3 production like the brain and pituitary gland. Because DIO2 activity in the hypothalamus and pituitary regulates TSH secretion through negative feedback, the variant can create a mismatch: normal serum TSH and T4 levels may mask inadequate tissue-level T3, especially in the central nervous system.

The Evidence

The landmark study establishing clinical relevance of this variant is a pharmacogenetic analysis of 552 hypothyroid patients from the WATTS randomized trial66 552 hypothyroid patients from the WATTS randomized trial
Panicker et al. J Clin Endocrinol Metab 2009
. The CC genotype was present in 16% of participants and was associated with worse baseline psychological well-being scores77 worse baseline psychological well-being scores
14.1 vs 12.8 on GHQ-12, P=0.03
compared to TT carriers. More importantly, CC carriers showed greater improvement on T4+T3 combination therapy88 greater improvement on T4+T3 combination therapy
2.3 GHQ points at 3 months, P=0.03
compared to T4 alone, despite no differences in serum thyroid hormone levels between genotypes.

A Danish randomized controlled trial of 45 hypothyroid patients99 Danish randomized controlled trial of 45 hypothyroid patients
Carle et al. Eur Thyroid J 2017
found that preference for T4+T3 combination therapy increased in a dose-dependent manner with genetic burden: 42% preferred combination therapy with no polymorphisms, 63% with one polymorphism (DIO2 or MCT10), and 100% with both1010 42% preferred combination therapy with no polymorphisms, 63% with one polymorphism (DIO2 or MCT10), and 100% with both
p=0.009 for trend
. This suggests the DIO2 variant has a measurable, though incomplete, effect on treatment satisfaction.

In thyroidectomized patients on levothyroxine replacement1111 thyroidectomized patients on levothyroxine replacement
Castagna et al. J Clin Endocrinol Metab 2017
, carriers of the Thr92Ala variant showed significantly lower mean serum free T3 levels1212 significantly lower mean serum free T3 levels
FT3 was lower in carriers of the mutated allele(s) vs wild-type, despite similar TSH
compared to wild-type patients, providing biochemical confirmation that the variant impairs systemic T4-to-T3 conversion. However, not all studies show associations: [a Dutch population study of over 1,000 individuals | Wouters et al. Thyroid 2017] found no association with thyroid hormone levels or quality of life in the general population, suggesting the variant's effects may be most apparent in patients who lack endogenous thyroid function.

Beyond thyroid therapy, the variant has been linked to higher body mass index and altered metabolic regulation in some populations, though findings are inconsistent. Associations have also been reported with osteoarthritis, bipolar disorder, and schizophrenia1313 osteoarthritis, bipolar disorder, and schizophrenia
DIO2 is expressed in growth plate cartilage and multiple brain regions
.

Practical Implications

If you're on levothyroxine (T4) monotherapy and still experience fatigue, weight gain, brain fog, or mood disturbances1414 fatigue, weight gain, brain fog, or mood disturbances
Common persistent symptoms in euthyroid patients
despite normal TSH levels, the Thr92Ala variant could be contributing. The evidence supports considering T4+T3 combination therapy for C-allele carriers who remain symptomatic. Current guidelines suggest an L-T4/L-T3 dose ratio between 13:1 and 20:1 by weight1515 an L-T4/L-T3 dose ratio between 13:1 and 20:1 by weight
European Thyroid Association 2012 guidelines
, with T3 typically split into two daily doses due to its shorter half-life.

Testing for this variant can be useful before thyroidectomy to anticipate which patients may struggle with T4 monotherapy. However, genetic testing is not widely available through standard medical channels; historically, 23andMe included rs225014 on their v3 and v4 chips, but it was removed from the v5 chip1616 it was removed from the v5 chip
No longer genotyped as of 2017
. Specialized laboratories like Regenerus Labs offer targeted DIO2 genotyping.

For those with hypothyroidism who are not on thyroid medication, ensuring adequate selenium and iodine intake1717 selenium and iodine intake
DIO2 is a selenoprotein requiring selenium for function
supports whatever DIO2 enzyme activity remains. However, dietary interventions alone are unlikely to fully compensate for reduced enzyme efficiency in homozygous C-allele carriers.

Interactions

The DIO2 variant interacts with rs17606253 in the MCT10 gene, which encodes a thyroid hormone transporter. The Danish RCT showed that patients with both polymorphisms had 100% preference for T4+T3 combination therapy1818 The Danish RCT showed that patients with both polymorphisms had 100% preference for T4+T3 combination therapy
Carle et al. 2017
, suggesting the combination creates a more severe impairment in cellular thyroid hormone availability than either variant alone. This makes biological sense: MCT10 transports thyroid hormones into cells, and DIO2 converts T4 to T3 once inside; defects in both steps compound the problem.

Another variant within the DIO2 gene, rs12885300 (ORFa-Gly3Asp), has been studied alongside Thr92Ala in several trials. While less consistently associated with clinical outcomes, it may modulate DIO2 expression levels and has been linked to body weight changes after Graves' disease treatment1919 linked to body weight changes after Graves' disease treatment
Combined analysis shows additive effects
.

Compound implication for DIO2 Thr92Ala + MCT10 rs17606253: Individuals carrying both the DIO2 C allele (CT or CC) and the MCT10 variant may experience more pronounced difficulties with T4 monotherapy and show the strongest preference for T4+T3 combination treatment. If you match this profile and have persistent hypothyroid symptoms despite normal TSH on levothyroxine, discuss combination therapy with your endocrinologist, citing the Carle et al. 2017 study.