Vitamin D Binding Protein — The Carrier That Shapes Your Vitamin D Status
Vitamin D travels through your bloodstream bound to a carrier protein called
VDBP11 VDBP
Vitamin D binding protein, also known as group-specific component (GC), carries approximately 85-90% of circulating 25(OH)D and 85% of 1,25(OH)2D in the blood
(vitamin D binding protein). Roughly 85-90% of the vitamin D in your blood is
bound to VDBP, making it the single largest determinant of how vitamin D is
transported to tissues, how long it remains in circulation, and how much is
available for cellular uptake. The rs7041 variant in the GC gene changes a
single amino acid in this carrier protein, creating isoforms with different
binding properties that measurably affect your vitamin D levels.
The Mechanism
The rs7041 variant produces a missense change22 missense change
A missense variant changes one amino acid to another in the protein sequence
at position 432 of the VDBP protein: aspartic acid (Asp) in the reference form
and glutamic acid (Glu) in the alternate form. Together with a second variant
in the same gene (rs458833 rs4588
The companion GC variant at codon 436, where Thr defines Gc1 and Lys defines Gc2),
rs7041 defines the three major VDBP isoforms that differ in binding affinity
and glycosylation:
- Gc1f (rs7041-A + rs4588-C): highest binding affinity for 25(OH)D
- Gc1s (rs7041-C + rs4588-C): intermediate binding affinity
- Gc2 (rs7041-A + rs4588-A): lowest binding affinity
The Gc1f isoform (A allele at rs7041) binds vitamin D metabolites most tightly.
This means more total vitamin D is protein-bound and less circulates as
free 25(OH)D44 free 25(OH)D
The unbound fraction of vitamin D that can enter cells directly without receptor-mediated uptake; represents about 0.03% of total circulating 25(OH)D.
Paradoxically, individuals with the highest-affinity carrier (AA genotype) tend
to have the lowest levels of free, bioavailable vitamin D despite potentially
adequate total levels.
The isoforms also differ in glycosylation patterns that affect conversion to
Gc-MAF55 Gc-MAF
GC protein-derived macrophage activating factor, an immune modulator produced by enzymatic modification of VDBP that activates macrophages.
Gc1 isoforms are more efficiently converted to Gc-MAF than Gc2, with
implications for innate immune function.
The Evidence
A landmark GWAS of 4,501 Europeans66 landmark GWAS of 4,501 Europeans
Ahn J et al. Genome-wide association study of circulating vitamin D levels. Hum Mol Genet, 2010
identified rs7041 as a genome-wide significant determinant of circulating
25(OH)D concentrations (P = 4.1 x 10-22). A subsequent
GWAS focused on VDBP levels77 GWAS focused on VDBP levels
Moy KA et al. Genome-wide association study of circulating vitamin D-binding protein. Am J Clin Nutr, 2014
found even stronger association: mean serum DBP concentrations were 7,335,
5,149, and 3,152 nmol/L for individuals carrying 0, 1, and 2 copies of the
minor allele respectively (P = 1.42 x 10-246).
A study in women across reproductive states88 study in women across reproductive states
Ganz AB et al. Vitamin D binding protein rs7041 genotype alters vitamin D metabolism in pregnant women. FASEB J, 2018
found that AA (TT on coding strand) carriers had 25(OH)D levels at 80% of CC
(GG) carriers (P = 0.05), but paradoxically had 2.5 times higher free 25(OH)D
(P < 0.0001). This reflects lower VDBP concentrations with the A allele,
resulting in less total binding but more unbound vitamin D available for
cellular uptake.
A supplementation study in 234 vitamin D-deficient adults99 supplementation study in 234 vitamin D-deficient adults
Al-Daghri NM et al. Efficacy of vitamin D supplementation according to vitamin D-binding protein polymorphisms. Nutrition, 2019
found that homozygous A allele carriers were 6.2 times more likely to remain
deficient after supplementation, and heterozygotes 4.2 times more likely,
compared to CC homozygotes (P < 0.001). This makes rs7041 one of the
strongest genetic predictors of vitamin D supplementation response.
In a cohort of 414 smokers1010 cohort of 414 smokers
Janssens W et al. Vitamin D deficiency is highly prevalent in COPD and correlates with variants in the vitamin D-binding gene. Thorax, 2010,
AA homozygotes had a 25% reduction in 25(OH)D levels and an increased risk
for COPD (OR 2.11, 95% CI 1.20-3.71, P = 0.009).
Practical Implications
The key insight from rs7041 is the distinction between total and free vitamin D. Standard blood tests measure total 25(OH)D, which is heavily influenced by VDBP levels. If you carry the A allele, your total vitamin D may appear low on standard testing even when your free (bioavailable) vitamin D is adequate. This matters because clinical decisions about supplementation are usually based on total 25(OH)D.
AA carriers should consider testing both total and free 25(OH)D if available, may need higher doses to reach standard target levels on total 25(OH)D, and should take vitamin D3 with fat-containing meals for optimal absorption. Consistent daily dosing (e.g. 2,000-4,000 IU daily) may work better than large intermittent doses for genotypes with altered VDBP kinetics.
Interactions
rs7041 interacts directly with rs4588 in the same gene to determine the three VDBP isoforms (Gc1f, Gc1s, Gc2). The combination of both variants provides more information than either alone. rs7041-A with rs4588-A creates the Gc2 isoform (lowest binding affinity), while rs7041-A with rs4588-C creates Gc1f (highest affinity). This means the same rs7041 genotype can have different functional consequences depending on rs4588 status.
rs7041 also interacts with VDR (rs1544410) and CYP2R1 (rs10741657). If VDBP transport is impaired (rs7041 AA) alongside reduced vitamin D activation (CYP2R1 AA) or reduced receptor sensitivity (VDR TT), the combined effect on vitamin D status is compounded. These multi-gene interactions are addressed in compound implications when all relevant genotypes are present.
GSTP1 Ala114Val -- The Second Hit in Glutathione Detoxification
Glutathione S-transferase Pi 1 (GSTP1) is one of the most abundant
Phase II detoxification enzymes11 Phase II detoxification enzymes
Phase II enzymes conjugate activated toxins with molecules like glutathione, making them water-soluble for excretion via urine or bile
in the human body, expressed at particularly high levels in the lungs, skin,
oesophagus, and placenta. The enzyme catalyzes the conjugation of
reduced glutathione (GSH)22 reduced glutathione (GSH)
A tripeptide (glutamate-cysteine-glycine) that serves as the body's master antioxidant and detoxification cofactor
to a wide range of electrophilic compounds -- from environmental pollutants
like polycyclic aromatic hydrocarbons and heavy metals to chemotherapy
drugs like cisplatin and carboplatin.
The rs1138272 variant causes an alanine-to-valine substitution at position
114 (Ala114Val, also designated c.341C>T) in exon 6 of the GSTP1 gene on
chromosome 11q13.2. This is the second of two well-characterized functional
polymorphisms in GSTP1, the first being
Ile105Val (rs1695)33 Ile105Val (rs1695)
The more common GSTP1 variant, which has a stronger individual effect on enzyme activity and substrate specificity.
Together, these two SNPs define the classical GSTP1 haplotype system:
*A (Ile105/Ala114, wild-type), *B (Val105/Ala114), *C (Val105/Val114,
lowest activity), and *D (Ile105/Val114).
The Mechanism
The Ala114Val substitution sits near the
H-site44 H-site
The hydrophobic substrate-binding pocket of GST enzymes, which determines what electrophilic compounds the enzyme can process
of the GSTP1 enzyme. A comprehensive
functional genomics study55 functional genomics study
Moyer AM et al. Glutathione S-transferase P1: gene sequence variation and functional genomic studies. Cancer Res, 2008
expressed all known GSTP1 variant allozymes in COS-1 cells and measured
their catalytic activity. The Val114 variant retained approximately 80% of
wild-type enzyme activity (79.9 +/- 5.1%, p<0.05). By comparison, the
Val105 variant dropped to just 21.8% of wild-type activity. The double
variant (Val105/Val114, the *C haplotype) showed 74.1% activity --
suggesting that in the context of an already impaired Val105 enzyme,
the Val114 change partially compensates through altered protein folding.
The protein-level explanation involves both reduced
immunoreactive protein66 immunoreactive protein
The amount of GSTP1 protein detectable by antibodies, which reflects both synthesis rate and protein stability
and altered substrate kinetics. The wild-type enzyme (Ile105/Ala114) has
a Km of 0.33 mM for the standard substrate
CDNB77 CDNB
1-chloro-2,4-dinitrobenzene, the standard laboratory substrate used to measure GST enzyme activity,
indicating high affinity. Variants at position 105 raise the Km to 1.15 mM,
reflecting reduced substrate binding. Position 114 modulates thermal
stability and the geometry of the substrate-binding pocket without
dramatically altering Km on its own, but it contributes meaningfully
when both variants are present.
The Evidence
Cancer susceptibility. A
meta-analysis of 43 case-control studies88 meta-analysis of 43 case-control studies
Kuang M et al. Comprehensive analysis of the association between the rs1138272 polymorphism of the GSTP1 gene and cancer susceptibility. Front Physiol, 2019
totalling 15,688 cancer cases and 17,143 controls found that the TT
genotype increases overall cancer risk (OR 1.45, P = 0.002) under a
recessive model. The effect was strongest in Asian populations (TT vs CC:
OR 6.51) and African populations (T allele: OR 3.66), where the variant
is rare and carriers may face higher relative risk. Among Caucasians, the
association was significant for specific cancer sites: head and neck cancer
(TT: OR 3.11) and lung cancer (dominant model: OR 1.22).
A
South African study of oesophageal cancer99 South African study of oesophageal cancer
Li D et al. The 341C/T polymorphism in the GSTP1 gene is associated with increased risk of oesophageal cancer. BMC Genetics, 2010
found the CT genotype carried an OR of 4.98 and the TT genotype an OR of
10.9 compared to wild-type, with risk amplified dramatically by tobacco
smoking (OR 7.51) and alcohol consumption (OR 15.3) -- environmental
exposures that generate the very electrophilic compounds GSTP1 detoxifies.
Haplotype effects. A
Serbian prostate cancer study1010 Serbian prostate cancer study
Savic-Radojevic A et al. GSTP1 rs1138272 polymorphism affects prostate cancer risk. Medicina, 2020
found that carriers of the GSTP1*C haplotype (Val105 + Val114, combining
both rs1695 and rs1138272 variants) had a 5.46-fold higher risk of
prostate cancer compared to those with the *A haplotype. The cumulative
effect of multiple GST risk alleles (including GSTM1 and GSTT1 deletions)
reached a 12-fold risk increase in individuals carrying all four risk
variants.
Enzyme biochemistry. A
study of all four GSTP1 allozymes1111 study of all four GSTP1 allozymes
Pal A et al. Variants of glutathione S-transferase Pi 1 exhibit differential enzymatic activity and inhibition by heavy metals. PLoS One, 2012
confirmed that allozymes with Ile105 had superior catalytic efficiency and
greater substrate affinity. Heavy metal sensitivity varied by genotype --
the Val105/Ala114 variant was most sensitive to mercury, while
Ile105/Val114 was least sensitive, suggesting that the Ala114Val change
may paradoxically improve tolerance to certain environmental metals.
Practical Implications
The Ala114Val variant alone reduces GSTP1 activity modestly (~20% reduction). The practical significance scales with environmental exposure: individuals with reduced GSTP1 activity who are also exposed to tobacco smoke, heavy metals, pesticides, or occupational chemicals face a disproportionately higher risk because their conjugation capacity is already diminished. Supporting glutathione status through N-acetylcysteine (the most effective oral glutathione precursor), cruciferous vegetables rich in sulforaphane (which upregulates Phase II enzymes including GSTP1), and reducing unnecessary toxicant exposure are the primary actionable strategies.
For individuals undergoing platinum-based chemotherapy (cisplatin, carboplatin), GSTP1 genotype may influence both drug efficacy and toxicity, since GSTP1 directly conjugates platinum compounds. Reduced GSTP1 activity may increase platinum sensitivity but also increase toxicity risk -- a double-edged sword that oncologists should be aware of.
Interactions
The most important interaction is with rs1695 (GSTP1 Ile105Val). The *C haplotype (Val105 + Val114) represents the lowest-activity form of the enzyme, with substantially greater cancer risk than either variant alone. A compound implication covering the combined GSTP1*C haplotype (rs1695 AG or GG + rs1138272 CT or TT) would be clinically meaningful, as the combined recommendation (aggressive glutathione support, minimizing environmental exposures, oncology awareness) goes beyond what either variant alone warrants.
Beyond GSTP1 itself, other glutathione transferase genes (GSTM1, GSTT1) that can be fully deleted (null genotypes) compound the effect. Individuals with GSTP1 variants plus GSTM1-null and/or GSTT1-null genotypes have cumulative reductions in Phase II detoxification capacity. However, GSTM1 and GSTT1 are copy number variants not typically assessed by 23andMe SNP arrays, so this interaction is noted for awareness rather than actionable in this context.
HNF1A Ile27Leu — The Transcription Factor Tweak Linking Beta-Cell Function, LDL, and CRP
Your liver and pancreas run largely on a transcription factor called
hepatocyte nuclear factor 1-alpha (HNF1A)11 hepatocyte nuclear factor 1-alpha (HNF1A)
HNF1A binds DNA as a
homodimer and directly activates dozens of genes involved in glucose
metabolism, lipoprotein production, and the acute-phase response;
pathogenic HNF1A mutations cause MODY3, the most common
monogenic diabetes. The rs1169288
variant introduces an isoleucine-to-leucine substitution at codon 27
(p.Ile27Leu, c.79A>C) — a subtle amino acid swap in HNF1A's
dimerization domain that carries measurable metabolic consequences
across the full spectrum from common polygenic diabetes risk to
modifying the onset age of rare monogenic MODY3.
The Mechanism
Position 27 of HNF1A sits within its N-terminal dimerization domain22 dimerization domain
HNF1A must form a dimer to bind DNA efficiently; the dimerization
domain mediates protein-protein contact that stabilizes the
transcription complex at target gene promoters.
The Leucine-27 substitution (C allele) measurably reduces HNF1A
transcriptional activity on target promoters including GLUT2 (glucose
transporter 2) and albumin in experimental cell models — a finding
replicated across multiple expression systems. Because HNF1A directly
activates the CRP promoter, the Leu allele is associated with
moderately lower basal C-reactive protein levels. Simultaneously,
HNF1A regulates hepatic lipoprotein metabolism, and partial loss of
function shifts the lipid phenotype toward higher LDL cholesterol and
apolipoprotein B — even though CRP falls. This paradoxical combination
(lower inflammation marker, higher LDL) has been replicated in
multiple large population studies and is a hallmark of the rs1169288C
haplotype.
The Evidence
The largest single-study characterization of this variant's metabolic
footprint comes from Reiner et al. 200933 Reiner et al. 2009
community-based
European-American adults from CARDIA (n=2,890 younger adults) and
CHS (n=3,680 older adults); two fully independent replication
datasets. The C allele
was associated with 0.10–0.15 SD units lower CRP and GGT, but
concurrently higher LDL cholesterol, apolipoprotein B, creatinine,
and fibrinogen — a profile consistent with partial HNF1A
hypofunction in hepatic CRP suppression versus lipoprotein synthesis.
For type 2 diabetes, the weight-dependency finding by Yamada et al.
201444 Yamada et al.
2014
861 Japanese subjects (300 T2D, 561 controls); I27L was an
independent determinant of beta-cell function in normal-weight
individuals, measured by HOMA-β
clarifies the earlier inconsistent literature: I27L increases T2D
risk specifically in lean individuals, where genetic beta-cell
dysfunction is not masked by obesity-driven insulin resistance. In
overweight and obese individuals, the HNF1A signal is overwhelmed by
weight-related metabolic noise.
The most clinically striking finding involves MODY3: Bacon et al.
201855 Bacon et al.
2018
meta-analysis of 781 HNF1A-MODY patients from UK and Norwegian
national cohorts; stratified by mutation type (protein-truncating vs
missense) showed that
each Leu-27 allele reduced age at diabetes diagnosis by 1.6 years
(95% CI −2.6 to −0.7) in the subset with protein-truncating HNF1A
mutations. The biological interpretation is clear: the I27L
polymorphism already partially impairs the same transcriptional
pathway that pathogenic HNF1A mutations devastate; stacking a
mild common variant onto a severe rare variant advances disease
onset by years.
An unexpected pharmacogenomics application was reported by Cecchin
et al. 201766 Cecchin
et al. 2017
two independent cohorts of metastatic colorectal
cancer patients treated with FOLFIRI; total n=160 discovery +
82 replication: the
rs1169288C allele (Leu-27) was associated with 45% higher irinotecan
drug exposure and significantly improved progression-free survival.
The mechanism involves HNF1A-driven regulation of ABCC2, an
irinotecan efflux transporter — the Leu allele reduces ABCC2
expression, allowing more drug to remain in tumor tissue.
Practical Implications
For most C-allele carriers, the key actionable signal is the LDL elevation. Monitoring fasting LDL alongside standard diabetes screening captures both the lipid and glycemic dimensions of this variant's risk profile. In lean individuals, the diabetes risk is particularly relevant because the variant exerts its effect through beta-cell transcriptional insufficiency, which can be partly compensated by reducing carbohydrate-driven insulin demand.
The lower CRP associated with Leu-27 should not be interpreted as cardiovascular protection — it reflects reduced hepatic acute-phase expression, not reduced vascular inflammation. The LDL elevation tells the more complete cardiometabolic story.
Interactions
The rs1169288 C allele is in strong linkage disequilibrium (r² > 0.8,
D' > 0.8) with rs2464196 (HNF1A Ser487Asn)77 rs2464196 (HNF1A Ser487Asn)
A second common
coding variant at the 3' end of HNF1A at codon 487; S487N may
have partially independent effects on HNF1A C-terminal transactivation
domain function and
with the intronic rs2244608. These variants form a common haplotype
with a combined frequency of ~30% in Europeans. Because they travel
together, published associations with CRP, LDL, and diabetes risk
likely reflect the cumulative effect of the whole haplotype rather than
Ile27Leu alone.
For individuals carrying pathogenic HNF1A mutations (MODY3 families), the I27L status modifies diagnosis age — a compound interaction between this common variant and rare HNF1A mutations that is now clinically documented. Genetic counselors and MODY clinics should consider rs1169288 genotyping as a modifier when advising MODY3 families about expected disease trajectory.
SNCA rs11931074 — The 3′ UTR Variant That Amplifies Alpha-Synuclein and Parkinson's Risk
The SNCA gene11 SNCA gene
Alpha-synuclein (SNCA) encodes the protein that forms the pathological hallmark of Parkinson's disease — Lewy bodies and Lewy neurites — in dopaminergic neurons of the substantia nigra harbours multiple independent risk variants spread across its genomic structure. rs11931074 sits in the 3′ downstream region of SNCA — within or adjacent to an extended 3′ untranslated region (3′ UTR) that recent research has shown stretches much further than previously appreciated. This position gives the variant direct access to the regulatory machinery that controls SNCA mRNA stability, translation efficiency, and ultimately how much alpha-synuclein protein a cell produces.
rs11931074 has been studied across more published meta-analyses than virtually any other common SNCA variant and emerges in the most recent 2025 systematic review as the single most robust SNCA risk marker — showing consistent, low-heterogeneity associations with Parkinson's disease across all tested genetic models and in both Asian and European populations. Its independence from the 3′-block variant rs356219 (which operates through a different regulatory mechanism in a partially overlapping region) makes it an additive source of risk information at the SNCA locus.
The Mechanism
rs11931074 is located at GRCh38 chromosome 4 position 89,718,364, approximately 5–6 kb downstream of the canonical SNCA gene boundary on the plus strand — placing it within an extended 3′ UTR22 extended 3′ UTR
A 2018 study identified alpha-synuclein transcripts in postmortem human brain samples with a 3′ UTR approximately 1,246 nucleotides longer than the canonical form; rs11931074 falls within this extended region that encompasses risk-associated variants previously considered merely downstream of the gene.
The T risk allele at this position is predicted to alter mRNA stability and translation efficiency33 mRNA stability and translation efficiency
The 3′ UTR is a key regulatory hub: RNA-binding proteins including ELAVL1 and TIAR bind the SNCA 3′ UTR and modulate both mRNA stabilization and translational activation; variants that disrupt these binding sites shift the equilibrium between degradation and active translation in a direction that increases steady-state SNCA expression. Consistent with this, the TT genotype has been directly associated with significantly higher alpha-synuclein protein levels in human brain tissue — providing functional confirmation of the predicted mechanism.
The downstream consequence mirrors the rs356219 story: chronically elevated alpha-synuclein increases the probability of misfolding, oligomerization, and seeding of the insoluble fibrillar aggregates that destroy dopaminergic neurons in the substantia nigra and trigger the progressive motor and non-motor symptoms of Parkinson's disease.
The Evidence
The largest meta-analysis44 The largest meta-analysis
Liu et al. An updated analysis with 45,078 subjects confirms the association between SNCA rs11931074 and Parkinson's disease. Neurological Sciences, 2018 pooled 33 studies involving 15,368 PD patients and 29,710 controls. Every tested genetic model reached significance: allelic OR 1.36 (95% CI 1.31–1.42); heterozygous OR 1.44 (95% CI 1.35–1.55); homozygous TT vs. GG OR 1.87 (95% CI 1.68–2.09); recessive OR 1.58 (95% CI 1.46–1.72). Crucially, significant associations appeared in both Asian and Caucasian subgroups, confirming that the risk signal is not ethnicity-specific.
A 2020 meta-analysis55 A 2020 meta-analysis
Du et al. Association between alpha-synuclein (SNCA) rs11931074 variability and susceptibility to Parkinson's disease: an updated meta-analysis of 41,811 patients. Neurological Sciences, 2020 (13,403 cases, 28,408 controls) confirmed these results with allelic OR 1.28 (95% CI 1.12–1.45; p=0.0001) and recessive OR 1.40 (95% CI 1.18–1.68; p=0.0002). The study specifically noted low heterogeneity across studies and no evidence of publication bias — two hallmarks of a genuine, reproducible association rather than an artefact of selective reporting.
The 2025 systematic review and meta-analysis66 The 2025 systematic review and meta-analysis
Mohammadi et al. Common SNCA Genetic Variants and Parkinson's Disease Risk. International Journal of Molecular Sciences, 2025 including 27 studies explicitly identified rs11931074 as showing "consistent associations with PD across all models" and confirmed it as the most robust common SNCA PD risk variant — more consistent than rs356219, which showed greater heterogeneity across geographic regions.
Beyond disease susceptibility, rs11931074 shows associations with PD clinical features. A 2015 Chinese cohort study77 A 2015 Chinese cohort study
Chen et al. Hyposmia correlates with SNCA variant and non-motor symptoms in Chinese patients with Parkinson's disease. Parkinsonism & Related Disorders, 2015 (218 PD patients) found the TT genotype conferred OR 3.24 (95% CI 1.23–8.51) for hyposmia — reduced sense of smell — a well-recognized early non-motor marker of PD. The same study confirmed TT genotype was associated with significantly higher alpha-synuclein levels in brain tissue (p=0.0082), providing the mechanistic link between genotype and phenotype.
A 2019 resting-state fMRI study88 2019 resting-state fMRI study
SNCA rs11931074 polymorphism correlates with spontaneous brain activity and motor symptoms in Chinese patients with Parkinson's disease. Journal of Neural Transmission, 2019 found that TT carriers showed altered spontaneous brain activity in the right angular gyrus compared to GT/GG carriers, with ALFF values negatively correlated with UPDRS III motor scores — suggesting the variant modulates the neural circuitry underlying motor control in affected patients.
Practical Actions
The functional consequence of rs11931074 T-allele carriage — higher alpha-synuclein expression — is the same biological target as for rs356219, meaning the protective strategies that address elevated SNCA expression apply here too: supporting mitochondrial function against alpha-synuclein-driven complex I dysfunction, promoting autophagy to clear misfolded protein, and reducing environmental exposures that independently upregulate SNCA.
The clinically important addition from rs11931074 research is the hyposmia connection. Loss of smell is one of the earliest detectable markers of Parkinson's pathology, appearing years before motor symptoms. TT carriers who notice a declining sense of smell should discuss this with their doctor as a potential early indicator warranting neurological evaluation — the combination of genotype and symptom substantially elevates clinical concern.
Because T allele frequency differs dramatically between populations (about 7% in Europeans versus 53% in East Asians), the genotype interpretation varies substantially by ancestry: TT homozygosity is rare in Europeans (about 1%) but relatively common in East Asians (about 28%), meaning the recessive risk signal has very different population-level impact.
Interactions
rs11931074 and rs356219 are both located in the 3′ region of the SNCA locus but are in different linkage disequilibrium blocks and likely act through partially distinct regulatory mechanisms — rs356219 primarily by altering SNCA transcription, rs11931074 by affecting mRNA stability/translation in the extended 3′ UTR. Independent carriage of risk alleles at both loci may compound alpha-synuclein overproduction through additive regulatory effects, though no single study has formally quantified the joint genotype effect.
rs356182, an intronic SNCA variant affecting neuronal differentiation, represents a third independent risk signal at the SNCA locus. Carriers of risk alleles at rs11931074, rs356219, and rs356182 simultaneously may face substantially elevated cumulative PD susceptibility through three distinct mechanisms — disease risk, elevated expression, and impaired neuronal differentiation — though this triple-carrier scenario has not been formally studied.
ZMYM4 rs12094543 — A Chromatin Regulator at the Crossroads of Fat Distribution and Immunity
The ZMYM4 gene encodes a zinc finger MYM-type containing protein11 zinc finger MYM-type containing protein
a family of
chromatin-associated transcription factors that use zinc-coordinated protein domains
to regulate gene expression by modifying how tightly DNA is wrapped around histone
proteins located on chromosome 1p32. It is expressed broadly across tissues and
has emerging roles in fat distribution, immune cell regulation, and developmental
gene programs. The rs12094543 variant is an intronic tag SNP within ZMYM4 — it
does not alter the protein directly but may influence how and when ZMYM4 is
expressed by tagging a regulatory haplotype across the gene body.
The Mechanism
rs12094543 sits at chr1:35,351,102 (GRCh38), approximately 24 kb upstream of rs559986 — the GWAS-significant variant in the same gene associated with waist-hip ratio — and 51 kb upstream of rs113408476, the indel associated with BMI-adjusted waist circumference and A Body Shape Index. All three variants are intronic to ZMYM4, suggesting they tag a common regulatory haplotype.
ZMYM4 encodes a B-MYB binding protein22 ZMYM4 encodes a B-MYB binding protein
B-MYB is a transcription factor that
controls cell cycle progression, proliferation, and differentiation in many tissues
including adipose and hematopoietic cells.
ZMYM4 is highly SUMOylated33 SUMOylated
SUMOylation is a post-translational modification that
changes where a protein goes in the cell and how it interacts with DNA; it often
marks chromatin regulatory proteins,
and its interaction with B-MYB strengthens after DNA damage, suggesting a role in
transcriptional stress responses that could affect adipocyte differentiation or
immune cell maturation under metabolic challenge.
The Evidence
Fat distribution: Two large body-composition GWAS have implicated the ZMYM4
locus. Kichaev et al. (2019)44 Kichaev et al. (2019)
FINDOR GWAS of 27 UK Biobank traits, N≈416,000
identified rs559986 in ZMYM4 at genome-wide significance for waist-hip ratio, a
marker of central adiposity independent of overall weight. Separately,
Christakoudi et al. (2021)55 Christakoudi et al. (2021)
GWAS of allometric body-shape indices, N=406,697 UK
Biobank participants associated ZMYM4
(rs113408476) with BMI-adjusted waist circumference (p~2×10⁻⁸) and A Body Shape
Index (ABSI), a validated marker of visceral fat burden. The ABSI association is
notable because it captures waist-to-height risk beyond BMI — a better predictor of
cardiometabolic disease than either measure alone.
Rare variant obesity burden: Marenne et al. (2020)66 Marenne et al. (2020)
exome sequencing in 2,737
severely obese children vs 6,704 controls, Cell Metabolism
identified ZMYM4 among three genes carrying an excess burden of very rare predicted-
deleterious variants in cases. This is distinct from the common-variant GWAS signals
— it suggests that, beyond the population-frequency tag SNPs, rare loss-of-function
variants in ZMYM4 may individually cause severe early-onset obesity through disruption
of the transcriptional programs that control fat cell development.
Immune and monocyte regulation: Astle et al. (2016)77 Astle et al. (2016)
173,480 participants, 36
blood cell traits, Cell associated the
ZMYM4 region (rs11581846) with monocyte percentage of white cells at p=1×10⁻⁹.
Elevated monocyte percentage is a marker of chronic low-grade inflammation — the
same inflammatory state that underlies visceral fat accumulation and metabolic
syndrome. A large allergy GWAS (N=360,838)88 large allergy GWAS (N=360,838)
Ferreira et al. 2017, Nature
Genetics also identified a ZMYM4-
region variant among 136 risk loci for asthma, hay fever, and eczema, implying
ZMYM4 sits at an intersection of metabolic and immune gene regulation.
The rs12094543 G allele is uncommon in European populations (~2%) but reaches ~41% in East Asian populations — one of the sharpest allele-frequency gradients at this locus — suggesting meaningful evolutionary differentiation that has not yet been mechanistically explained.
Practical Implications
rs12094543 should be read as a genomic signal pointing to ZMYM4's role in fat distribution, not as a direct functional variant. Individuals carrying the G allele (AG or GG) have a genomic profile consistent with increased central fat accumulation tendency, independent of total body weight. Waist circumference and waist-to-hip ratio are the most actionable proxies: when these diverge from weight-based BMI predictions, the ZMYM4 signal may be part of the explanation.
The co-occurrence of fat-distribution and immune associations at this locus is biologically coherent: visceral adipose tissue is metabolically active immune tissue. Elevated monocyte infiltration into visceral fat depots is an early event in insulin resistance, and chromatin regulators like ZMYM4 may coordinate both the adipogenic differentiation program and the innate immune tone of adipose tissue.
Interactions
rs559986 and rs113408476 (ZMYM4 intronic, same gene): These are the GWAS- significant body-shape variants in ZMYM4, both intronic and located within ~50 kb of rs12094543. They likely tag an overlapping regulatory haplotype. If multiple ZMYM4 intronic variants are present together, they may represent a high-risk haplotype for central adiposity rather than independent signals — their combined effect has not been formally tested.
rs11581846 (ZMYM4 region, monocyte percentage and telomere length): This same ZMYM4-region variant has been independently associated with both monocyte immune regulation and telomere length. The convergence of adiposity, immunity, and cellular aging signals at the ZMYM4 locus is consistent with a chromatin regulator that coordinates stress-responsive transcriptional programs across multiple tissue types.
IL6R Intronic Variant — Where Atopic Risk Meets Cardiovascular Protection
The interleukin-6 receptor sits at one of the most versatile crossroads in human immunology.
IL-6 is the cytokine that bridges innate and adaptive immunity, drives the acute-phase
response (including C-reactive protein production), and modulates the balance between
Th1-driven autoimmune inflammation and Th2-driven allergic disease. rs12133641 is an
intronic variant11 intronic variant
Deep intronic variants (>100 bp from any exon) can act as cis-regulatory
elements, affecting transcription factor binding, alternative splicing enhancers, or
gene expression levels without changing the protein sequence itself
located 1,226 bp downstream of an exon boundary in IL6R (c.1160+1226A>G), and it tags
a regulatory haplotype with a striking bidirectional disease association: the G allele
reduces systemic IL-6 signaling — lowering CRP and cardiovascular inflammation risk —
while simultaneously elevating risk for atopic dermatitis and other Th2-mediated
allergic conditions.
The Mechanism
IL6R encodes the membrane-bound alpha subunit of the IL-6 receptor22 IL-6 receptor
IL-6 signals through
a two-component complex: IL6Rα (CD126, encoded by IL6R) binds IL-6 with low affinity,
then recruits gp130 (IL6ST) to form the high-affinity signaling complex that activates
JAK1/STAT3 pathways. A soluble form of IL6Rα
(sIL-6R) is shed from the cell surface by ADAM10 and ADAM17 proteases; sIL-6R enables
IL-6 trans-signaling33 trans-signaling
Trans-signaling allows IL-6 to activate cells that do not express
membrane-bound IL6R, dramatically broadening IL-6's reach to endothelial cells, neurons,
and other non-immune cell types on cells that
lack membrane IL6R. rs12133641 lies in a deep intronic regulatory region and is in partial
linkage disequilibrium with the coding variant rs2228145 (Asp358Ala, c.1073A>C), which
reduces IL6R ectodomain shedding and lowers circulating sIL-6R levels. The intronic
rs12133641 likely contributes independently to regulatory control of IL6R expression or
alternative splicing.
The apparent paradox — reduced IL-6 signaling causing both cardiovascular protection and atopic risk — reflects IL-6's dual role in immune homeostasis. High systemic IL-6 signaling drives Th17 cell differentiation and pro-inflammatory CRP production (bad for heart disease); but IL-6 also suppresses Th2-type eosinophilic inflammation by promoting regulatory immune states. When IL6R variants dampen this suppressive arm, Th2 skewing and IgE-mediated atopic responses may increase.
The Evidence
The atopic dermatitis association was established in a large multi-ancestry GWAS meta-analysis44 large multi-ancestry GWAS meta-analysis
Budu-Aggrey et al., Nature Communications 2023; combining European, East Asian, Latin American,
and African ancestry datasets with over 800,000 total participants
published in Nature Communications (2023, n > 800,000 combined). The G allele was associated
with increased AD risk at genome-wide significance (OR ~1.04, p=3×10⁻⁴⁵), an effect that is
modest per-allele but highly replicated and consistent across ancestries.
The complementary cardiovascular signals confirm the variant's impact on systemic IL-6 tone.
In CAD GWAS data55 CAD GWAS data
Hartiala et al. and CARDIoGRAMplusC4D consortium, combining 547,261 participants,
the A allele (not G) associated with coronary artery disease risk (p=3×10⁻¹¹) — meaning
the G allele is cardiovascular-protective. The most mechanistically direct signal is the
CRP association66 CRP association
From large-scale CRP GWAS; the G allele consistently lowers CRP across
multiple independent studies and populations:
the G allele is one of the strongest CRP-reducing variants in the genome (β -0.116 log-units,
p=4×10⁻⁴⁷). This confirms that rs12133641 genuinely modifies IL-6 trans-signaling output,
not merely tags it.
In skin-specific immune responses, GWAS of mosquito bite reactions77 GWAS of mosquito bite reactions
Mitchell et al.,
PLOS Genetics 2017; 84,724 participants for bite size, 69,057 for itch intensity
found the G allele associated with reduced itch intensity (β -0.021, p=9×10⁻⁹) and
larger bite size (β +0.028, p=2×10⁻⁷) — consistent with a blunted IL-6-mediated
acute inflammatory response to insect antigen, with compensatory Th2-driven wheal
formation dominating instead.
Practical Implications
For GG homozygotes, the elevated atopic dermatitis risk is meaningful but modest on an absolute scale. AD affects ~15-20% of children and ~5-10% of adults at baseline; an OR of ~1.04 per allele translates to ~8% higher risk for GG versus AA, not a dramatic increase. The more actionable implication is recognizing the IL-6 signaling context when managing skin inflammation: GG carriers may respond differently to IL-6 pathway-targeting treatments, and the lower basal CRP may obscure inflammatory activity that would otherwise appear on standard inflammatory markers.
For AG heterozygotes — the most common genotype — the modest atopic risk elevation warrants awareness of AD triggers but not targeted intervention beyond standard skincare approaches specific to genetic barrier vulnerability.
Carriers of the G allele who are prescribed IL-6 receptor blockers (tocilizumab, sarilumab) for rheumatoid arthritis, giant cell arteritis, or cytokine release syndrome should be aware that rs12133641 status may influence baseline IL6R expression levels, potentially modifying therapeutic response magnitude.
Interactions
The coding variant rs2228145 (Asp358Ala, 1,313 bp upstream of rs12133641) is the most studied functional variant in IL6R and reduces IL6R ectodomain shedding. rs2228145 is the primary Mendelian randomization instrument for IL-6 signaling studies and has been used to model the effects of tocilizumab pharmacologically. rs12133641 and rs2228145 lie in the same gene and likely tag overlapping but not identical regulatory signals; the complete IL6R haplotype structure including both variants gives the most accurate representation of an individual's IL-6 receptor biology.
rs4129267 is another intronic IL6R variant at chr1:154,453,788 with similar population frequencies (~39% T allele in Europeans) that has been associated with CRP levels, asthma, and cardiovascular phenotypes — it may be in partial LD with rs12133641 and represent the same functional haplotype from a different tag position.
ABCA1 — The Cholesterol Efflux Transporter and HDL Factory
Every cell in your body faces a fundamental challenge: how to safely remove
excess cholesterol before it can accumulate and trigger damage. The solution
is a large membrane pump called
ABCA111 ABCA1
ATP-binding cassette transporter A1 — a 220 kDa membrane protein
that exports cholesterol and phospholipids from cells to form nascent HDL
particles. This protein is
the master regulator of the first step in reverse cholesterol transport,
and variation in the ABCA1 gene substantially shapes an individual's HDL
cholesterol level and cardiovascular risk trajectory.
rs12686004 is a variant located deep within an intron of ABCA1 (transcript position c.67-1950, approximately 1,950 base pairs upstream of exon 2 in the reference sequence). It sits within a gene that spans 149 kb across chromosome 9q31.1, containing 50 exons and 49 introns — a large genomic territory with multiple regulatory elements controlling when and how much ABCA1 protein each tissue makes.
The Mechanism
Because rs12686004 lies in a non-coding intronic region, it does not change
the ABCA1 protein sequence directly. Its biological impact is most likely
regulatory — either altering enhancer activity within the intron, modifying
pre-mRNA splicing efficiency, or tagging a nearby functional variant through
linkage disequilibrium. This pattern is well-established in ABCA1: two other
independent intronic variants,
rs2575875 (intron 2) and rs3847301 (intron 3)22 rs2575875 (intron 2) and rs3847301 (intron 3)
Howard et al. demonstrated
that both SNPs function as allele-specific enhancers that physically loop back
to contact the ABCA1 promoter via chromatin remodeling. PLoS One, 2019,
have been shown to act as allele-specific enhancers that physically interact
with the ABCA1 promoter through chromatin looping, with genome-wide significant
effects on HDL cholesterol (p = 1×10⁻¹⁰ to 9×10⁻¹³).
ABCA1 expression is regulated at multiple levels. The liver X receptor (LXR) — activated by oxysterols when intracellular cholesterol rises — drives ABCA1 transcription through a response element in the proximal promoter. Intron 1 contains an LXR response element that is critical for dietary fat-responsive upregulation in animal models. Intronic variants that alter enhancer activity or splicing can shift the set-point of this response, changing how much ABCA1 protein is produced when cholesterol loads increase.
The Evidence
The most direct evidence for the importance of ABCA1 intronic regulation on HDL comes from two independent signals within the gene itself. Howard et al. (2019) demonstrated that rs2575875 and rs3847301 each independently associate with HDL at genome-wide significance and function as allele-specific enhancers, with the active allele at rs2575875 creating a STAT3 binding site that drives ABCA1 transcription.
Delgado-Lista et al.33 Delgado-Lista et al.
Delgado-Lista et al. ABCA1 gene variants regulate
postprandial lipid metabolism in healthy men. Arterioscler Thromb Vasc Biol, 2010
showed in 88 healthy men that carriers of the major allele at ABCA1 intronic
variants (rs2575875/rs4149272) had significantly higher fasting and postprandial
apoA1, lower postprandial triglycerides, and a better apoA1/apoB ratio compared
to minor allele carriers — demonstrating that intronic ABCA1 variation
functionally shapes lipid metabolism after fat intake.
rs12686004 itself is cited in studies examining ABCA1 genetic variation in the
context of cholesterol homeostasis and
Alzheimer's disease44 Alzheimer's disease
Koldamova et al. Role of ABCA1 in Alzheimer's disease
and neurodegeneration. Biochim Biophys Acta, 2010.
ABCA1 controls brain cholesterol efflux and apoE lipidation, and the
LXR–ABCA1–APOE regulatory axis is considered a therapeutic target in
neurodegeneration. The A allele's population frequency pattern —
approximately 12% in Europeans but only 3% in African populations and
approximately 21% in East Asians — suggests population-specific allelic
history consistent with ancient demographic events rather than selection for
a deleterious variant.
The evidence level for rs12686004 specifically is emerging: it appears in population-scale databases as an ABCA1 intron variant and has been cited in mechanistic studies of the gene, but a definitive effect size for HDL modulation from this variant alone has not been published in a primary association study. The broader framework of ABCA1 intronic regulation is well-established; this variant is a less-characterized member of that class.
Practical Actions
For A allele carriers, the modifiable lever is dietary fat composition and strategies that directly support cholesterol efflux. Because ABCA1 is transcriptionally activated by LXR when intracellular cholesterol accumulates, ensuring adequate dietary cholesterol efflux support (via omega-3 fatty acids and niacin-rich foods that raise HDL) can partially compensate for any intrinsic reduction in ABCA1 regulatory response. Monitoring fasting HDL and triglycerides provides the most direct readout of ABCA1 efflux capacity in practice.
GG homozygotes — carrying the common reference allele — need not take special steps, but awareness of ABCA1's diet-sensitive regulation is relevant for anyone making long-term cardiovascular risk decisions.
Interactions
ABCA1 function is intimately connected to the downstream HDL lifecycle. ABCA1's lipid-export product — nascent HDL — is immediately esterified by LCAT (rs4420638 region) and ultimately cleared by SR-BI receptors in the liver (SCARB1 variants). Variants in APOA1 (the structural protein of HDL) and CETP (the cholesteryl ester transfer protein) further modulate how HDL particles are remodeled downstream of ABCA1's initial efflux step. The APOE genotype (rs429358, rs7412) also interacts with ABCA1 activity in the brain, where APOE isoform affects how well ABCA1-derived cholesterol is recycled for neuronal membrane maintenance.
IFNL4 — The Antiviral Immunity Switch
When your body encounters a viral infection in the liver or airways, it deploys
type III interferons11 type III interferons
a family of antiviral signalling proteins that activate the JAK-STAT
pathway and interferon-stimulated genes in epithelial and hepatocyte cells
as a first line of defence. The IFNL4 gene on chromosome 19q13.2 encodes
interferon lambda-422 interferon lambda-4
one of four interferon lambda proteins (IFNL1–4) that restrict
viral replication at mucosal and hepatic barriers,
but only in people who carry the T allele at rs12979860. Carriers of the CC
genotype produce no functional IFN-λ4 protein at all — their IFNL4 gene is silenced
— and paradoxically, this silencing is protective against hepatitis C.
rs12979860 was originally attributed to the nearby IL28B (IFNL3) gene in the landmark 2009 GWAS studies. The discovery in 2013 that the variant actually lies in intron 1 of a newly identified gene, IFNL4, resolved the biological mystery of how an intronic change could exert such profound effects on viral clearance.
The Mechanism
The rs12979860 C>T variant is in strong
linkage disequilibrium33 linkage disequilibrium
non-random co-inheritance of nearby variants
with a dinucleotide frameshift variant (ss469415590 TT/ΔG) that either
creates or destroys IFNL4 as a functional gene. The T allele at rs12979860
tags the ΔG allele, which generates a functional IFNL4-encoded protein with
genuine antiviral activity. The C allele tags the TT allele, which is a
loss-of-function that silences IFNL4 entirely.
The apparent paradox — a functional interferon protein
impairing viral clearance — is explained by recent mechanistic work.
IFN-λ4 is largely retained in the endoplasmic reticulum44 IFN-λ4 is largely retained in the endoplasmic reticulum
it fails to be secreted efficiently
rather than being released to activate neighbouring cells. Instead, ER-retained IFN-λ4
induces ER stress, and ER-stressed hepatocytes are substantially weaker activators
of HCV-specific CD8+ T cells, crippling the adaptive immune response needed to
eradicate the virus. Additionally, chronic IFN-λ4 signalling causes pre-activation
of interferon-stimulated genes (ISGs) that desensitises hepatocytes to exogenous
interferon treatment — explaining both natural and treatment-related impairment.
The Evidence
The 2009 GWAS studies identified rs12979860 as the strongest host genetic predictor of
HCV treatment response55 strongest host genetic predictor of
HCV treatment response
measured as sustained virologic response, SVR, meaning undetectable
virus 12–24 weeks after completing therapy,
with an odds ratio of approximately 5.8 for SVR in European patients carrying CC
versus non-CC (TT/CT) genotypes. In a Japanese cohort, SVR rates were
76.9% in CC, 56.4% in CT, and 12.5% in TT patients66 76.9% in CC, 56.4% in CT, and 12.5% in TT patients
receiving peginterferon/ribavirin.
For spontaneous viral clearance (never needing treatment at all), a meta-analysis of 17 studies77 meta-analysis of 17 studies found rs12979860 CC confers OR 2.98 (95% CI 2.53–3.50) for HCV elimination without treatment versus CT or TT genotypes. The effect is stronger in Caucasian and African populations than in Asians, where the favourable C allele is near-universal (East Asian T allele frequency ~0.04).
Beyond HCV, the T allele has been associated with impaired viral defences more broadly. A Spanish study found the T allele was overrepresented in COVID-19 patients88 the T allele was overrepresented in COVID-19 patients relative to the general population (36.2% vs 26.4%; OR 0.63 for the protective C allele, p=6.4×10⁻⁴). The variant's role in hepatitis B clearance is debated — some studies show the CC genotype predicts HBsAg seroclearance in interferon-treated HBeAg-negative patients, while others find no effect on untreated HBV natural history.
With modern direct-acting antiviral (DAA) regimens for HCV, the IFNL4 variant retains clinical relevance. In the pivotal NEUTRINO trial of sofosbuvir-based therapy, SVR12 was 99% in CC versus 87% in non-CC patients99 SVR12 was 99% in CC versus 87% in non-CC patients, and the IFNL4-ΔG genotype is specifically associated with slower early viral decay kinetics even with DAA treatment, influencing whether shorter (8-week) treatment courses can be used safely.
The variant is also an aetiology-independent predictor of liver fibrosis: in a cohort of 4,172 patients with diverse liver diseases1010 4,172 patients with diverse liver diseases — including NAFLD — those with non-CC genotypes (carrying the T/ΔG allele) showed greater hepatic inflammation and fibrosis, confirming IFNL4 signalling promotes liver inflammation beyond viral contexts.
Practical Actions
People with CT or TT genotypes who have ever been exposed to hepatitis C or are at risk should discuss screening and, if infected, the implications for treatment duration with their doctor. While DAA therapy achieves high cure rates even with unfavourable genotypes, the IFNL4 genotype affects how quickly the virus clears and whether shorter treatment protocols are suitable.
For TT carriers who were treated with older peginterferon-based regimens and failed — this failure was largely biologically predetermined, and modern DAA regimens offer a much better chance of cure.
The broader implication for viral immunity (COVID-19, other respiratory viruses) is emerging but suggests that CT and TT carriers have a subtly impaired first-line antiviral response at hepatic and mucosal surfaces. Prioritising vaccination against preventable infections is a rational response.
Interactions
rs12979860 is in strong linkage disequilibrium with rs80999171111 rs8099917
another IFNL
locus variant frequently used for HCV pharmacogenomics testing, r²=0.43–0.65
depending on population and with
rs129802751212 rs12980275
third IFNL3 region variant, r²=0.68–1.0 with rs12979860.
Commercial HCV pharmacogenomics panels often report all three; rs12979860 is
generally considered the most predictive. The variants should not be summed
as independent effects — they tag the same underlying IFNL4 functional state.
NFKBIZ p.Gly102Ala — A Rare Shield Against Autoimmune Thyroid Disease
Your thyroid is one of the most common targets of the immune system turning
against itself. Hashimoto's thyroiditis11 Hashimoto's thyroiditis
The most prevalent autoimmune
thyroid disease globally, in which immune cells progressively destroy thyroid
tissue, causing hypothyroidism in millions
affects up to 10% of women and is the leading cause of hypothyroidism in
iodine-sufficient countries. A 2025 large-scale genetic study revealed that a
rare variant in NFKBIZ — the gene encoding NF-κB inhibitor zeta (IκBζ) —
provides meaningful protection against this autoimmune attack.
The Mechanism
NFKBIZ encodes IκBζ (IkappaBzeta)22 IκBζ (IkappaBzeta)
A nuclear protein of the IκB family
that, unlike classical IκBs which sequester NF-κB in the cytoplasm, acts
inside the nucleus as a selective transcriptional coactivator for specific
NF-κB target genes. IκBζ is
indispensable for the differentiation of Th17 cells33 Th17 cells
A subset of pro-inflammatory
CD4+ helper T cells that produce interleukin-17 (IL-17), a cytokine central
to many autoimmune diseases. It
cooperates with nuclear receptors RORγt and RORα to bind directly to the
regulatory region of the IL17A gene, amplifying IL-17 production.
The rs149007883 variant substitutes glycine (GGG codon) for alanine (GCG) at position 102 of the IκBζ protein. This glycine-to-alanine change occurs in the N-terminal region of the protein before the conserved ankyrin-repeat domain. Adding a methyl side chain where there was none alters the local backbone geometry, likely reducing the efficiency with which IκBζ assembles into transcriptional complexes at the IL17A locus. The net result is a partial dampening of Th17 differentiation and IL-17 output — enough to reduce the thyroid-directed autoimmune pressure without abrogating normal immune defense.
The Evidence
The protective association was identified in a landmark
genome-wide meta-analysis by Rand et al. 202544 genome-wide meta-analysis by Rand et al. 2025
Rand SA et al. Genome-wide
association study and polygenic risk prediction of hypothyroidism. Nature
Genetics, 2025. This study
analysed 113,393 hypothyroidism cases and 1,065,268 controls — the largest
genetic study of hypothyroidism to date. Among the 350 significant loci
identified, the NFKBIZ p.Gly102Ala variant (C allele) emerged as protective
with an odds ratio of approximately 0.83, meaning C allele carriers have
roughly 17% lower odds of hypothyroidism compared to GG individuals. The
study highlighted that many hypothyroidism risk loci regulate immune cell
counts and inflammatory pathways, consistent with the Th17-axis biology
of NFKBIZ.
The mechanistic rationale is well-supported by a foundational
mouse knockout study by Okamoto et al. 201055 mouse knockout study by Okamoto et al. 2010
Okamoto K et al. IkappaBzeta
regulates T(H)17 development by cooperating with ROR nuclear receptors.
Nature, 2010, which demonstrated
that NFKBIZ-null mice cannot generate Th17 cells and are resistant to
experimental autoimmune disease. Separately,
Konca Degertekin et al. 201666 Konca Degertekin et al. 2016
Konca Degertekin C et al. Circulating Th17
cytokine levels are altered in Hashimoto's thyroiditis. Cytokine,
2016 showed that IL-17 and IL-23
levels are significantly elevated in patients with Hashimoto's thyroiditis
(p=0.041 for IL-17 vs controls), directly linking the Th17 axis — which
NFKBIZ governs — to human autoimmune thyroid pathology.
Practical Actions
This variant is rare (C allele frequency ~0.9% in Europeans, essentially absent elsewhere), so the GC genotype is found in roughly 1 in 100 people of European descent. Carriers of the C allele have a moderately reduced lifetime risk of autoimmune hypothyroidism. This does not eliminate thyroid risk, but it is a meaningful biological buffer.
For GC carriers, the protective signal primarily justifies reduced vigilance for early hypothyroidism screening compared to individuals with other autoimmune risk variants. The effect operates through reduced Th17 activity, so conditions that amplify Th17 responses — such as vitamin D deficiency, gut dysbiosis, or chronic iodine excess — could theoretically partially offset this genetic advantage.
Homozygous CC carriers are so rare (<0.003% globally) that they would represent the most extreme dampening of IκBζ-driven Th17 activity, though no clinical data exist for this genotype given the tiny sample sizes involved.
Interactions
NFKBIZ sits at the intersection of NF-κB signaling and Th17 biology. Its protective effect on autoimmune thyroid disease may compound with other immunomodulatory variants. The PTPN22 R620W variant (rs2476601) is a major risk factor for Hashimoto's thyroiditis via T-cell activation thresholds; individuals carrying both PTPN22 risk alleles and NFKBIZ protection (GC/CC) may experience partial offset of PTPN22-mediated risk. CTLA4 rs3087243 is another autoimmune thyroid locus affecting T-cell co-stimulation — the two pathways (co-stimulation threshold via CTLA4 vs Th17 effector output via NFKBIZ) are mechanistically additive rather than redundant.
IL-6 Upstream Variant rs17147230: An Independent Inflammaging Signal
Interleukin-6 (IL-6) is the master cytokine of inflammaging11 inflammaging
the chronic, low-grade
sterile inflammation that accumulates with age and drives most age-related diseases,
coined by Claudio Franceschi — the biological state where the immune system runs
a persistent low-level inflammatory programme that damages tissues over decades.
The rs17147230 variant sits approximately 3,300 base pairs upstream of the IL6 gene
on chromosome 7, in a position that can influence how the gene is expressed. It
represents an independent IL-6 genetic signal beyond the more-studied -174G/C variant
(rs1800795), and its primary clinical evidence links elevated IL-6 output to
hepatocellular carcinoma22 hepatocellular carcinoma
primary liver cancer arising from hepatocytes, one of the
most prevalent cancers globally and strongly associated with chronic liver inflammation
risk and altered inflammatory protein regulation.
The Mechanism
IL6 spans a tightly regulated promoter region with multiple transcription factor binding sites scattered across several kilobases upstream of the transcription start site. The rs17147230 variant at chr7:22,722,557 lies in this upstream regulatory zone. Though its precise molecular mechanism has not been characterised in the same detail as the -174G/C (rs1800795) promoter variant, its position within a region known to modulate IL-6 transcription suggests it can influence the amount of IL-6 produced in response to inflammatory stimuli such as viral infection, tissue damage, or metabolic stress.
IL-6 signals through two pathways:
classic signalling33 classic signalling
IL-6 binds a membrane-bound IL-6 receptor (IL-6R) on cells
that express it, mainly immune and liver cells — largely anti-inflammatory in context
and trans-signalling44 trans-signalling
IL-6 binds a soluble form of IL-6R (sIL-6R) and signals
to virtually all cell types — the mode most associated with chronic inflammation
and disease. In the liver, chronically elevated IL-6 activates
STAT355 STAT3
Signal Transducer and Activator of Transcription 3 — a transcription
factor that, when persistently activated by IL-6, promotes cell survival, proliferation,
and immune evasion in tumour cells, which drives hepatocyte proliferation and
survival signalling that can tip chronically inflamed liver tissue toward malignancy.
The haplotype pairing of the rs17147230 T allele with the rs2069837 G allele produces a particularly potent HCC risk signal (OR 3.125 in the Wang et al. study), suggesting these two upstream variants co-operate to create an IL-6 expression profile that is especially conducive to chronic hepatic inflammation and carcinogenesis.
The Evidence
Hepatocellular carcinoma association: A case-control study by
Wang et al.66 Wang et al.
Association of interleukin-6 polymorphisms with susceptibility to
hepatocellular carcinoma. World J Gastroenterol, 2015
in 226 HCC cases and 220 healthy controls found the TT genotype carried a 2.1-fold
increased HCC risk (OR=2.089, 95% CI 1.135–3.845, P=0.017) and the T allele carried
a 1.3-fold increased risk (OR=1.326, 95% CI 1.010–1.740, P=0.042). The G-T haplotype
combining rs2069837-G with rs17147230-T showed the strongest signal (OR=3.125,
95% CI 1.845–5.294, P<0.001).
Meta-analytic confirmation: A meta-analysis by He et al.77 meta-analysis by He et al.
Association between
interleukin 6 polymorphisms and hepatocellular carcinoma susceptibility. J Clin Lab
Anal, 2021 pooling 13 studies confirmed
that the rs17147230 T allele (OR=1.31, P=0.03) and TT genotype (OR=1.83, P=0.02)
were significantly associated with increased HCC susceptibility. This meta-analytic
OR of 1.31–1.83 places rs17147230 in the moderate-risk category, consistent with
a regulatory variant that modulates cancer risk rather than causing it directly.
Adrenomedullin regulation: A study by Lam et al.88 Lam et al.
A single nucleotide
polymorphism of interleukin-6 gene is related to plasma adrenomedullin levels.
Ann Med, 2013 found rs17147230 was
associated with plasma adrenomedullin99 adrenomedullin
a vasodilatory peptide hormone with
anti-inflammatory and cardioprotective properties (ADM) levels (β=−0.096,
P=0.034) after adjusting for age and sex. Individuals with the TT genotype had
approximately 12.8% lower ADM levels than AA homozygotes — a significant finding
because lower ADM is associated with reduced vascular protection and higher
inflammatory tone. The effect was significant in women (β=−0.115, P=0.021) but
not men, suggesting sex-specific modulation.
Population context: The risk evidence for rs17147230 derives primarily from East Asian (Chinese) study populations, where the T allele has a frequency of approximately 34–41% — making it the minor allele in this ancestry group. Globally, T is the common allele (~81%), while in East Asian populations the allele frequency pattern inverts. This population stratification is clinically important: the HCC risk appears most pronounced in populations where T is the minority allele (East Asian), and may not translate uniformly to European populations.
Practical Implications
The core message of rs17147230 is about chronic liver inflammation management. The T allele, particularly in East Asian populations and in combination with the rs2069837-G haplotype, creates an IL-6 regulatory environment that can amplify hepatic inflammatory signalling over decades. Given that the transition from chronic liver inflammation → fibrosis → cirrhosis → HCC takes years to decades, this variant is most relevant as an early warning to be proactive about liver health behaviours.
The variant also connects to the broader inflammaging framework: IL-6 is one of the most important drivers of the chronic low-grade inflammatory state associated with accelerated biological aging. TT homozygotes — especially in East Asian populations where T is the minor allele — face the combination of elevated HCC risk and potentially accelerated inflammaging trajectories.
Because IL-6 regulation is influenced by diet, exercise, sleep, alcohol, and viral exposures (hepatitis B and C), there are concrete lifestyle levers available to individuals with this genotype.
Interactions
rs17147230 shows strong haplotypic interaction with rs2069837 (also in the IL6 upstream region): the G-T haplotype (rs2069837-G + rs17147230-T) triples HCC risk compared to the reference haplotype. Both variants lie upstream of IL6 and may co-operatively shape the IL-6 expression response to hepatic stress. The primary IL6 promoter variant rs1800795 (-174G/C) is an independent signal in strong linkage disequilibrium with rs1800797 but in weaker LD with rs17147230, providing an additive independent contribution to overall IL-6 regulation. For individuals carrying both TT at rs17147230 and GG at rs1800795, the combined inflammatory drive to the liver warrants heightened attention to hepatic health monitoring.