rs17612852

HLA-DQA1 HLA-DQA1 Peanut Allergy Tag SNP

Strong Risk Factor

HLA-DQA1 rs17612852 — The Peanut Allergy Haplotype Tag That Switches Sides

Buried in the most complex region of the human genome — the HLA class II locus11 HLA class II locus
Human Leukocyte Antigen class II genes on chromosome 6p21 encode the antigen-presenting molecules that determine which proteins the immune system tolerates and which it attacks
— lies a variant with a paradoxical property: the same genetic signal that increases peanut allergy risk when peanut is avoided promotes powerful immune protection when peanut is consumed early. rs17612852 is an intronic variant in HLA-DQA1 at chromosome 6:32,652,795 (GRCh38), in strong linkage disequilibrium (D′=0.99, r²=0.65) with the HLA-DQA1*01:02 haplotype22 HLA-DQA1*01:02 haplotype
A specific variant of the alpha chain of the HLA-DQ antigen-presenting molecule, determined by classical HLA typing
. Because G and A are the two alleles at this position and the G allele nearly always travels with HLA-DQA1*01:02, rs17612852 functions as a convenient tag SNP — a genotyping proxy for a classical HLA haplotype that is expensive and technically demanding to type directly.

The Mechanism

HLA-DQ molecules are heterodimers on the surface of antigen-presenting cells33 antigen-presenting cells
Dendritic cells, B cells, and macrophages that process proteins into peptide fragments and display them to naïve CD4+ T cells, shaping whether tolerance or immune activation follows
. The specific DQ alpha chain encoded by DQA1*01:02 has a binding groove that presents peanut protein peptides — particularly from Ara h 244 Ara h 2
The dominant IgE target in clinical peanut allergy; a storage protein in the peanut seed that drives most anaphylactic reactions
— with high efficiency to naïve T cells.

Crucially, rs17612852 is not just a passive marker of the haplotype. It is the primary cis-eQTL55 cis-eQTL
A variant that controls expression of a nearby gene in cis — on the same chromosome segment
for HLA-DQB1 in CD4+ T cells: each additional copy of the G minor allele drives significantly higher HLA-DQB1 RNA levels (p=8.34×10⁻¹³), increasing the amount of HLA-DQ molecule on the antigen-presenting cell surface. The direction of immune outcome this creates — tolerance or allergy — depends entirely on whether peanut protein arrives via the gut route (oral, tolerogenic) or the skin/airway route (sensitising)66 gut route (oral, tolerogenic) or the skin/airway route (sensitising)
The route of first antigen encounter decisively shapes T cell polarisation; gut mucosal exposure favours Treg and IgG4; cutaneous exposure favours Th2 and IgE
. More HLA-DQ expression on antigen-presenting cells in the gut amplifies the tolerogenic signal; the same upregulation in sensitised individuals amplifies the allergic response.

The Evidence

The Hong et al. 2015 GWAS77 Hong et al. 2015 GWAS
Genome-wide association study identifies peanut allergy-specific loci and evidence of epigenetic mediation in US children. Nature Communications, 2015; 2,197 participants of European ancestry from the Chicago Food Allergy Study
was the first genome-wide study of peanut allergy and identified the HLA-DQ/DR region at 6p21.32 as the dominant susceptibility locus, with the lead SNPs rs9275596 and rs7192 tagging the same HLA-DQA1*01:02 haplotype that rs17612852 also marks. The Asai et al. 2018 Canadian GWAS88 Asai et al. 2018 Canadian GWAS
A Canadian genome-wide association study and meta-analysis confirm HLA as a risk factor for peanut allergy independent of asthma. JACI, 2018; >7,800 subjects across 8 studies
confirmed this locus as the primary genetic signal for peanut allergy and identified rs17612852 specifically among HLA SNPs significantly associated with peanut allergy, particularly in individuals with mild reaction history.

The mechanistic insight came from the LEAP trial. Kanchan et al. 202699 Kanchan et al. 2026
Genetic Determinants of Peanut-Specific IgG4 Levels in the Context of Sustained Oral Peanut Exposure in the LEAP Study. Immunology, 2026; 267 LEAP peanut-consumption group participants
conducted a GWAS of peanut-specific IgG4 in participants who consumed peanut throughout the trial. rs17612852 showed the strongest genome-wide signal (p=5.80×10⁻⁷) for total peanut-specific IgG4, with a component-specific association for Ara h 2 IgG4 (p=7.28×10⁻⁶, beta=0.324). Critically, no association was observed in the peanut-avoidance group — the relationship between the G allele and IgG4 exists only when peanut is regularly consumed.

Dantzer et al. 20221010 Dantzer et al. 2022
HLA-associated outcomes in peanut OIT trials. Frontiers Immunology, 2022; IMPACT (ages 12–48 months) and POISED (ages 7–55 years) OIT trials
demonstrated that HLA-DQA1*01:02 carriers — a group overwhelmingly identified by their G allele at rs17612852 — show substantially superior outcomes in peanut oral immunotherapy: desensitization in 93% vs 78% (IMPACT) and sustained unresponsiveness in 52% vs 31% (POISED). The protective immune benefit weakens significantly with age — younger children show the strongest effect, consistent with the critical window for immune tolerance induction.

Practical Actions

The most important clinical application of this variant is early-life peanut introduction. For G allele carriers whose infants are at elevated risk for peanut allergy (eczema, egg allergy), early peanut introduction (LEAP protocol: before 11 months) completely overrides the genetic susceptibility and channels the DQA1*01:02 haplotype toward high IgG4 protection rather than IgE sensitisation. The G allele that raises allergy risk under avoidance conditions actively promotes the strongest protective response under consumption conditions.

For individuals who have already developed peanut allergy, G allele status predicts superior response to oral immunotherapy across multiple independent trials — making this genotype a useful predictor of OIT candidacy and expected outcomes.

Interactions

rs17612852 operates within the broader HLA class II haplotype architecture centred on chromosome 6p21. Its closest functional interaction is with rs9275596, the intergenic tag SNP between HLA-DQB1 and HLA-DQA2 that also tags HLA-DQA1*01:02 and was the top GWAS hit in the Hong 2015 study. Both SNPs are in LD with the same DQA1*01:02 haplotype and have overlapping but distinct LD relationships; carrying risk alleles at both positions provides the strongest haplotype resolution.

The interaction with rs2187668 (tagging HLA-DQ2.5, associated with celiac disease and type 1 diabetes) is architecturally important: DQA1*01:02 and DQ2.5 are generally on different haplotypes, so compound heterozygotes carrying the G allele at rs17612852 and the A allele at rs2187668 carry risk across distinct HLA-mediated immune-condition pathways simultaneously. This combination is documented in the compound actions in this database.

rs1800547

MAPT H1/H2 Haplotype Splice Regulator

Strong Risk Factor

MAPT rs1800547 — The Molecular Switch in Tau's H1/H2 Divide

Within the microtubule-associated protein tau (MAPT) gene on chromosome 17q21, a single nucleotide difference at rs1800547 marks one of the most consequential forks in human neurological risk: the ancient H1/H2 haplotype boundary. While the broader H1/H2 distinction spans a 900-kilobase chromosomal inversion, rs1800547 is the canonical SNP that directly differentiates the H1 and H2 clades11 canonical SNP that directly differentiates the H1 and H2 clades
The GenePD Study found rs1800547 was the single most statistically significant variant in the region for Parkinson's disease after multiple testing correction
. Unlike rs17649553, which is another H1/H2 tag in the region, rs1800547 has been shown to have direct molecular function — not merely a passive marker of haplotype membership.

The MAPT gene produces tau, a protein whose primary job is stabilizing the neuronal microtubule skeleton and supporting axonal transport. When tau becomes hyperphosphorylated and misfolds, it aggregates into neurofibrillary tangles, the pathological hallmark of tauopathies including Alzheimer's disease, Parkinson's disease, progressive supranuclear palsy (PSP), and corticobasal degeneration (CBD).

The Mechanism: Splice Factor Binding and Exon 3

A landmark 2017 study using whole-locus MAPT expression vectors demonstrated that rs1800547, working together with rs17651213, directly controls haplotype-specific inclusion of MAPT exon 322 A landmark 2017 study using whole-locus MAPT expression vectors demonstrated that rs1800547, working together with rs17651213, directly controls haplotype-specific inclusion of MAPT exon 3
Exon 3 encodes a region of the N-terminal projection domain of tau that affects its interaction with membranes and the neuronal cytoskeleton
. Using electrophoretic mobility shift assays, researchers found that the H1 (A) and H2 (G) alleles at rs1800547 create distinct RNA-protein binding patterns with splicing factors hnRNP F and hnRNP Q — critical regulators of alternative splicing in neurons. The H2 sequence at this position allows 1.76-fold higher exon 3 inclusion compared to H1, altering the proportion of N1 and N2 tau isoforms.

This matters because the balance of tau isoforms — not just total tau levels — appears critical to which tauopathy, if any, develops. H1 is associated with elevated 4-repeat (4R) tau isoforms, which are the primary constituents of pathological aggregates in PSP and CBD. H2's different splicing pattern may shift the balance away from aggregation-prone 4R isoforms.

The Evidence for Parkinson's Disease

The GenePD Study genotyped 21 SNPs across the MAPT region in PD families and controls, finding that rs1800547 emerged as the most statistically significant variant for PD association, surviving multiple testing correction33 The GenePD Study genotyped 21 SNPs across the MAPT region in PD families and controls, finding that rs1800547 emerged as the most statistically significant variant for PD association, surviving multiple testing correction
The study also found 4-repeat MAPT isoforms significantly elevated in PD brains (p=0.002), linking the H1 splicing signature to disease pathology
. A large case-control study of 1,762 PD patients and 2,010 controls found that H1/H1 homozygotes had an odds ratio of 1.46 (95% CI 1.25–1.69, p=8×10⁻⁷) for PD compared to H1/H2 and H2/H2 carriers44 1,762 PD patients and 2,010 controls found that H1/H1 homozygotes had an odds ratio of 1.46 (95% CI 1.25–1.69, p=8×10⁻⁷) for PD compared to H1/H2 and H2/H2 carriers
The association held across familial and sporadic disease, both sexes, and early- and late-onset subgroups
.

Progressive Supranuclear Palsy: The Strongest Association

The H1/H1 genotype is found in approximately 94% of PSP patients compared to ~64% of the general population — a striking enrichment. A JAMA Neurology study of 802 neuropathologically confirmed PSP cases identified H1 subhaplotypes with markedly elevated risk: H1d (OR 1.86), H1g (OR 3.64), and H1o (OR 2.60)55 802 neuropathologically confirmed PSP cases identified H1 subhaplotypes with markedly elevated risk: H1d (OR 1.86), H1g (OR 3.64), and H1o (OR 2.60)
These sub-haplotype associations suggest that specific combinations of H1-background variants, on top of the rs1800547 A allele, determine the magnitude of PSP risk
. The chromosome 17q21.31 region — anchored by rs1800547 — represents the single strongest genetic risk locus for PSP identified to date.

Alzheimer's Disease: A Different Pathway

A study of 17,996 participants (8,559 AD cases, 9,437 controls) across Spanish and international cohorts found that rs1800547 itself was associated with AD risk (OR 1.12, p=0.0025)66 A study of 17,996 participants (8,559 AD cases, 9,437 controls) across Spanish and international cohorts found that rs1800547 itself was associated with AD risk (OR 1.12, p=0.0025)
The effect was strongest in APOE ε4 non-carriers — suggesting MAPT H1 represents an alternative causal pathway to AD distinct from amyloid-driven disease
. The risk was highest in individuals over age 77 without APOE ε4 (p=0.001), suggesting a late-life tau-driven pathway independent of beta-amyloid accumulation. For people without the APOE ε4 allele, the MAPT H1 haplotype tagged by rs1800547 becomes a more prominent contributor to AD risk.

ALS and Frontotemporal Spectrum

Beyond the primary tauopathies, a 2023 study of Bulgarian ALS patients found the H1b subhaplotype (containing the rs1800547 A allele) conferred a nearly 2-fold increased risk for sporadic ALS77 a 2023 study of Bulgarian ALS patients found the H1b subhaplotype (containing the rs1800547 A allele) conferred a nearly 2-fold increased risk for sporadic ALS
The authors propose that fine transcriptional regulation at the MAPT locus, including rs1800547's splice factor interactions, may influence ALS susceptibility through shared tau biology with FTD
. ALS and FTD share genetic and pathological overlap, and MAPT variation may contribute to the clinical spectrum between them.

Practical Actions

For H1/H1 carriers (AA genotype), the relevant clinical considerations are monitoring for motor symptoms that might indicate early parkinsonism, PSP, or CBD — conditions where early specialist evaluation matters for accurate diagnosis and prognosis. PSP in particular is frequently misdiagnosed as Parkinson's disease but responds differently to treatment. For the Alzheimer's disease risk — especially relevant for APOE ε4 non-carriers — knowing your MAPT status can help contextualize the late-life cognitive monitoring picture.

There are currently no approved pharmacological agents specifically targeting MAPT splicing or H1-driven tau isoform imbalance, though several anti-tau therapies are in clinical trials. Lifestyle factors — particularly aerobic exercise and head trauma prevention — have independent evidence for neuroprotection across multiple pathways relevant to tauopathy risk.

Interactions

rs1800547 and rs17649553 both tag the same H1/H2 haplotype and are in very strong linkage disequilibrium. If a person's genome contains both SNPs, their results should be concordant. The H1 risk at this locus compounds with rs356182 (SNCA) for Parkinson's disease risk — though interaction analyses have found these act independently rather than epistatically. In Alzheimer's disease, the H1/H2 distinction interacts with APOE genotype (rs429358), with H1 risk most pronounced in APOE ε4 non-carriers.

rs1801133

MTHFR C677T

Established Risk Factor

MTHFR C677T — The Methylation Gatekeeper

MTHFR (methylenetetrahydrofolate reductase) is arguably the most talked-about gene in nutritional genomics, and for good reason. It encodes the enzyme that converts 5,10-methylenetetrahydrofolate into 5-methyltetrahydrofolate 11 The active form of folate that enters the methylation cycle (methylfolate), the biologically active form of folate that your body actually uses. Methylfolate is essential for the methylation cycle 22 The methylation cycle adds methyl groups to DNA, proteins, and neurotransmitters — essential for hundreds of reactions, which affects DNA repair, neurotransmitter production, detoxification, and hundreds of other biochemical reactions.

The Mechanism

The C677T variant (rs1801133) causes an alanine-to-valine substitution 33 Alanine-to-valine substitution at position 222 of the enzyme (p.Ala222Val) at position 222 of the MTHFR enzyme. This makes the enzyme thermolabile 44 Thermolabile: the enzyme loses stability and function at normal body temperature — it loses activity at body temperature. The AA genotype 55 TT on the coding strand — 23andMe reports the complementary strand retains only about 30% of normal enzyme activity, while the AG genotype 66 CT on the coding strand retains about 65%. This means less dietary folate and supplemental folic acid gets converted to the methylfolate your cells need.

The Evidence

The C677T variant is one of the most extensively studied genetic variants in human biology. A meta-analysis of over 80 studies77 meta-analysis of over 80 studies
Wen YY et al. Meta-analysis across 82 studies confirming the MTHFR-homocysteine link
confirmed that the TT genotype is associated with 25% higher homocysteine levels when folate intake is low. Elevated homocysteine is an independent risk factor for cardiovascular disease88 cardiovascular disease
Mangoni AA & Jackson SHD. Homocysteine and cardiovascular disease. Am J Med, 2002
, neural tube defects, and possibly cognitive decline. However, the key finding is that adequate folate intake essentially normalizes homocysteine in most TT individuals. A large meta-analysis99 large meta-analysis
Clarke R et al. Homocysteine and coronary heart disease meta-analysis, 2012
found a 15% excess coronary heart disease risk in TT homozygotes compared to CC homozygotes.

The Folic Acid Question

Synthetic folic acid (found in fortified foods and cheap supplements) must be converted by MTHFR to become active methylfolate. If your MTHFR is working at only 30% capacity, this conversion is a bottleneck. Methylfolate supplements bypass this step entirely, which is why they are often recommended for people with the TT genotype. Riboflavin (vitamin B2) is an essential cofactor for MTHFR and has been shown to lower blood pressure1010 shown to lower blood pressure
McNulty H et al. showed riboflavin 1.6mg/day lowers blood pressure in MTHFR TT individuals by stabilizing the thermolabile enzyme
in TT individuals by stabilizing the thermolabile enzyme.

Practical Implications

The MTHFR C677T variant is extremely common — about 10-15% of Europeans are TT and about 40% are CT. It is not a disease-causing mutation. With adequate folate (especially as methylfolate), B12, B2, and B6 intake, most people with the TT genotype function perfectly normally. The key is knowing your status so you can optimize your B vitamin strategy.

Interactions

The C677T variant interacts importantly with the A1298C variant (rs1801131) — compound heterozygosity (one copy of each) can reduce MTHFR activity to 40-50%. It also interacts with SLC19A1 (rs1051266), which controls folate transport into cells, and COMT (rs4680), which determines tolerance for methyl donors. Methotrexate, an antifolate drug, has increased toxicity in C677T carriers.

rs1884613

HNF4A HNF4A P2 Promoter T2D Variant

Moderate Risk Factor

HNF4A P2 Promoter — The Population-Specific Beta-Cell Diabetes Switch

HNF4A11 HNF4A
Hepatocyte Nuclear Factor 4 Alpha — a nuclear receptor transcription factor that controls dozens of metabolic genes in liver, intestine, kidney, and pancreatic beta cells
is one of the master regulators of glucose and lipid homeostasis. It has a unique dual-promoter architecture: P1 drives the adult-liver isoform, while the P2 promoter22 P2 promoter
Located approximately 46 kb upstream of P1, the P2 promoter is active specifically in pancreatic beta cells and the fetal liver. Essentially all beta-cell HNF4A expression derives from P2-driven transcripts — making this promoter region critical for pancreatic function while being largely irrelevant to hepatic HNF4A activity.
drives the "beta-cell-specific" isoform (HNF4A7–12). rs1884613 is an intronic variant that falls within the P2 haplotype block — a cluster of highly correlated SNPs tagging the P2 promoter risk haplotype. Rare P2 mutations that inactivate HNF4A cause MODY133 MODY1
Maturity-Onset Diabetes of the Young type 1 — an autosomal dominant, early-onset monogenic diabetes arising from HNF4A haploinsufficiency in beta cells
. rs1884613 is not a MODY mutation, but it operates along the same biological axis at vastly smaller effect sizes.

The Mechanism

The P2 promoter drives HNF4A isoforms that regulate a suite of genes essential for glucose-stimulated insulin secretion — including glucokinase (the islet glucose sensor), the Kir6.2 subunit of the ATP-sensitive potassium channel, and the insulin gene itself. The G allele at rs1884613 is in near-perfect LD (r²≈0.99) with rs2144908 and marks carriers of a P2 haplotype associated with subtly reduced P2 promoter activity and lower HNF4A expression in pancreatic beta cells. Less HNF4A means quieter insulin secretion signaling, which over years accumulates into impaired glucose tolerance and type 2 diabetes susceptibility.

In established type 2 diabetes, the HNF4A P2 isoform becomes aberrantly re-expressed in adult liver44 aberrantly re-expressed in adult liver
Glucagon (chronically elevated in T2D) activates TET3, which demethylates the P2 promoter. FOXA2 then drives P2 transcription in hepatocytes, causing the fetal HNF4A isoform to stimulate excessive hepatic glucose output — a feed-forward loop worsening hyperglycemia.
The P2 risk haplotype may predispose to this aberrant hepatic re-activation under metabolic stress, compounding both impaired beta-cell secretion and excess hepatic glucose production.

The Evidence

The HNF4A P2 haplotype was first linked to T2D in 2004 when Weedon et al.55 Weedon et al.
Weedon MN et al. Common variants of the hepatocyte nuclear factor 4-alpha P2 promoter are associated with type 2 diabetes in the U.K. population. Diabetes 2004. PMID:15504983
identified rs1884613 among four P2 region variants associated with T2D in 5,256 UK subjects (risk haplotype OR 1.15, 95% CI 1.02–1.33). A 2007 Scandinavian meta-analysis by Johansson et al.66 Johansson et al.
Johansson S et al. Studies in 3,523 Norwegians and meta-analysis in 11,571 subjects indicate that variants in the HNF4A P2 region are associated with type 2 diabetes in Scandinavians. Diabetes 2007. PMID:17827402
confirmed the signal with OR 1.14 (95% CI 1.06–1.23, P=0.0004) across 4,000 Norwegian cases and 7,571 controls.

The most striking finding emerged from the Barroso et al. 2008 population comparison77 Barroso et al. 2008 population comparison
Barroso I et al. Population-specific risk of type 2 diabetes conferred by HNF4A P2 promoter variants: a lesson for replication studies. Diabetes 2008. PMID:18728231
: rs1884613 showed OR ~1.7 in Ashkenazi Jewish subjects (n=991; P<1.6×10⁻⁶) but only OR 1.04 (non-significant) in UK populations (n=4,022). This dramatic difference at nearly identical G allele frequencies (~17–23%) implies the causal variant within the P2 haplotype block is in closer LD with rs1884613 in Ashkenazi Jewish ancestry due to extended founder-effect haplotype structure.

The Ashkenazi signal was independently confirmed by Neuman et al. 201088 Neuman et al. 2010
Neuman RJ et al. Gene-gene interactions lead to higher risk for development of type 2 diabetes in an Ashkenazi Jewish population. PLoS One 2010. PMID:20361036
in 974 cases and 896 controls: rs1884613 gave unadjusted OR 1.69 (95% CI 1.40–2.03, P<0.0001) and adjusted OR 1.77 (95% CI 1.39–2.24). Critically, gene-gene interaction analyses revealed that carriers of both rs1884613 G and WFS1 rs10010131 risk alleles had OR 3.0 (95% CI 1.7–5.3) for T2D, and carriers of both HNF4A and TCF7L2 rs12255372 risk alleles had OR 2.4 (95% CI 1.7–3.4) — demonstrating that HNF4A P2 haplotype risk is substantially amplified by co-occurring beta-cell stressors.

Practical Actions

Because the primary risk mechanism is impaired HNF4A-driven beta-cell insulin secretion, the most actionable dietary lever is reducing the insulin secretory demand placed on pancreatic beta cells — specifically choosing lower-glycemic-load carbohydrates (legumes, intact grains, non-starchy vegetables) over rapidly absorbed refined starches that demand peak insulin secretory responses. Periodic monitoring of fasting glucose and HbA1c allows early detection of declining beta-cell reserve. For Ashkenazi Jewish individuals carrying the G allele, the ~1.7-fold per-allele risk estimate is clinically meaningful and warrants more proactive surveillance than for non-Ashkenazi Europeans where the effect is modest (~OR 1.14).

Interactions

rs1884613 and rs2144908 are in near-perfect LD (r²=0.99) and tag the same P2 haplotype signal — they cannot be separated in published studies. Other P2 haplotype tags (rs4810424, rs1884614, rs6031552) probe the same signal with varying LD. rs4812829 is in a different HNF4A intronic LD block and represents an independent GWAS signal in South Asians; carrying risk alleles at both rs1884613 and rs4812829 would represent additive, not correlated, HNF4A-linked risk.

The Neuman 2010 gene-gene interaction data identifies WFS1 rs10010131 (beta-cell ER homeostasis) and TCF7L2 rs12255372 (Wnt-driven incretin signaling) as the two most important interacting variants. The WFS1 interaction (OR 3.0) is especially notable because WFS1/wolframin controls ER calcium handling in beta cells — an independent beta-cell stress mechanism that synergizes with transcriptional HNF4A deficiency.

rs2108622

CYP4F2 V433M (*3)

Established Risk Factor

CYP4F2*3 — Vitamin K Metabolism and Warfarin Dosing

CYP4F2 encodes a cytochrome P450 enzyme11 cytochrome P450 enzyme
The CYP4F2 enzyme is the primary hepatic vitamin K1 oxidase
that metabolizes vitamin K1 to hydroxyvitamin K1, effectively removing it from the vitamin K cycle. This serves as a counterbalance to VKORC1 (vitamin K epoxide reductase), preventing excessive accumulation of vitamin K. The V433M variant, also known as CYP4F2*3, is a common missense mutation that significantly impacts warfarin dosing requirements22 warfarin dosing requirements
Warfarin is an anticoagulant that works by inhibiting VKORC1, thereby limiting vitamin K availability for clotting factor activation
.

The Mechanism

The rs2108622 variant causes a valine-to-methionine substitution at position 433 in the CYP4F2 protein. Research using human liver microsomes33 human liver microsomes
Tissue samples analyzed from liver banks
genotyped for this variant demonstrates that individuals carrying the T allele (433Met) have both reduced CYP4F2 protein concentrations and decreased vitamin K1 oxidation activity. The T allele is associated with approximately 40-45% reduction in enzyme activity compared to the wild-type. Because less vitamin K is being metabolized and removed, hepatic vitamin K1 levels rise, providing more substrate for VKORC1 to convert into the active form needed for clotting factor synthesis. This elevated vitamin K counteracts warfarin's anticoagulant effect, necessitating higher warfarin doses to achieve the same therapeutic response.

The Evidence

The association between CYP4F2*3 and warfarin dosing was first identified in 200844 first identified in 2008
Caldwell et al. CYP4F2 genetic variant alters required warfarin dose. Blood, 2008
through a genome-wide association study that screened over 1,200 SNPs in a cohort of warfarin patients. The discovery study found that TT homozygotes required approximately 1 mg/day more warfarin than CC homozygotes across three independent cohorts representing diverse US geographic regions.

This finding has been extensively replicated worldwide55 extensively replicated worldwide
Liang et al. Influence of CYP4F2 genotype on warfarin dose requirement: a systematic review and meta-analysis. Thrombosis Research, 2012
. A 2012 meta-analysis of 30 studies involving 9,470 participants confirmed that T-allele carriers require an 8.3% higher mean daily coumarin dose than CC homozygotes (95% CI: 5.6-11.1%, P < 0.0001). The effect is consistent across European and Asian populations but appears less pronounced in individuals of African ancestry, where the T allele is also much rarer.

The Clinical Pharmacogenetics Implementation Consortium (CPIC)66 Clinical Pharmacogenetics Implementation Consortium (CPIC)
Johnson et al. CPIC Guideline for Pharmacogenetics-Guided Warfarin Dosing: 2017 Update. Clinical Pharmacology & Therapeutics, 2017
incorporated CYP4F2*3 into their 2017 warfarin dosing guideline update. While the effect size is smaller than that of CYP2C9 and VKORC1 variants (which collectively explain ~40% of dose variability), CYP4F2*3 contributes an additional 1-4% to the explained variance and improves the accuracy of pharmacogenetic dosing algorithms.

Practical Implications

If you are prescribed warfarin and carry one or two copies of the T allele, you will likely need a higher maintenance dose to reach your target INR (International Normalized Ratio, typically 2-3 for most indications). CPIC recommends an optional 5-10% dose increase for non-African American individuals with at least one T allele when using a pharmacogenetic algorithm that already accounts for CYP2C9 and VKORC1 genotypes. This translates to approximately 0.5-1 mg/day additional warfarin.

The effect appears most clinically relevant in patients who also carry VKORC1 variants associated with low warfarin requirements. In these individuals, CYP4F2*3 can explain a significant portion of the remaining dose variability. The interaction makes sense mechanistically: VKORC1 variants that increase warfarin sensitivity reduce the amount of active vitamin K, while CYP4F2*3 increases available vitamin K — the two variants work in opposite directions.

It's important to note that CYP4F2 testing is considered optional rather than essential in clinical warfarin management. The major determinants remain CYP2C9 (which metabolizes warfarin itself) and VKORC1 (warfarin's direct target). However, incorporating CYP4F2*3 into dosing algorithms does incrementally improve prediction accuracy and may be particularly valuable for patients who are difficult to stabilize or who fall outside the predicted dose range from CYP2C9/VKORC1 alone.

Interactions

CYP4F2*3 is one of four genetic variants incorporated into modern pharmacogenetic warfarin dosing algorithms, alongside VKORC1 rs9923231, CYP2C9*2 (rs1799853), and CYP2C9*3 (rs1057910). For individuals of African ancestry, the CYP2C cluster variant rs12777823 is more clinically relevant than CYP4F2*3.

The combined effect of these variants is complex but predictable. A person with VKORC1 AA genotype (high warfarin sensitivity) plus CYP2C9*1/*1 (normal metabolism) plus CYP4F2 TT (reduced vitamin K metabolism) represents competing influences: the VKORC1 variant lowers dose requirements substantially, while CYP4F2 TT modestly increases them. The net effect is still a lower-than-average dose, but not as low as VKORC1 AA alone would predict. Modern dosing algorithms such as those validated by the International Warfarin Pharmacogenetics Consortium77 such as those validated by the International Warfarin Pharmacogenetics Consortium
Available at www.warfarindosing.org
incorporate all these variants simultaneously to generate personalized dose predictions.

Gene-gene interactions worth noting for compound implications include CYP4F2 TT + VKORC1 low-sensitivity genotypes (requiring careful upward dose titration) and CYP4F2 TT + CYP2C9 poor metabolizer status (where warfarin clearance is slow but more drug is needed to overcome elevated vitamin K). However, these interactions are generally handled by existing pharmacogenetic algorithms rather than requiring separate clinical decision-making.

rs228697

PER3 Pro864Ala

Moderate Risk Factor

PER3 Pro864Ala — Your Internal Clock's Tempo

The PER3 gene encodes Period Circadian Regulator 311 Period Circadian Regulator 3
One of three Period proteins (PER1, PER2, PER3) that form the negative arm of the mammalian circadian clock feedback loop
, a protein at the heart of the molecular clock that governs your ~24-hour sleep-wake cycle. Every cell in your body runs a version of this clock, and PER3 helps set its tempo. The rs228697 variant swaps a proline for an alanine at position 864, subtly changing how the clock protein behaves — and, with it, whether you lean toward being a morning lark or a night owl.

PER3 is best known for its VNTR polymorphism22 VNTR polymorphism
A variable number tandem repeat (4 or 5 copies of a 54-bp repeat) in exon 18 that strongly influences sleep timing and homeostatic sleep drive, but is not on SNP genotyping chips
(4-repeat vs 5-repeat), which strongly predicts sleep timing and sleep need but cannot be genotyped on standard SNP chips. The Pro864Ala missense variant (rs228697) is the best SNP-chip proxy for PER3 circadian effects and has its own independent functional consequences.

The Mechanism

The circadian clock runs on a transcription-translation feedback loop33 transcription-translation feedback loop
CLOCK and BMAL1 proteins activate transcription of PER and CRY genes; the PER/CRY protein complex then feeds back to repress CLOCK-BMAL1, creating a ~24-hour oscillation
. CLOCK and BMAL1 proteins bind to E-box elements44 E-box elements
Short DNA sequences (CACGTG) in gene promoters that CLOCK-BMAL1 heterodimers recognize to activate transcription of clock-controlled genes
to activate PER and CRY genes. The resulting PER and CRY proteins accumulate, form complexes, enter the nucleus, and repress their own transcription — completing one cycle roughly every 24 hours.

The Pro864Ala substitution sits in a region containing two potential SH3-binding motifs55 SH3-binding motifs
Src Homology 3 domains mediate protein-protein interactions; the proline-to-alanine change disrupts these binding sites, altering how PER3 interacts with partner proteins
. Replacing proline (a rigid amino acid that enforces tight bends in protein structure) with alanine (a flexible, small amino acid) alters the local protein conformation. Functional experiments66 Functional experiments
Lavebratt C et al. Molecular analyses of circadian gene variants. Transl Psychiatry, 2016
showed that the variant (G allele) protein is more stable than the wild-type — it degrades more slowly, accumulates to higher levels, and recruits more PER2 into the transcription repression complex. The result is a stronger repressor of CLOCK-BMAL1-driven transcription.

When the variant hPER3 was expressed in mammalian fibroblasts, it caused a significant, dose-dependent lengthening of the circadian period. A computational model77 computational model
Liberman AR et al. Circadian clock model supports molecular link between PER3 and human anxiety. Sci Rep, 2017
estimated this lengthening at 2-6% — enough to shift a 24-hour period toward roughly 25 hours. People whose internal clock runs long tend to drift toward later sleep and wake times — the hallmark of evening chronotype.

The Evidence

The initial genetic association88 initial genetic association
Hida A et al. Screening of clock gene polymorphisms demonstrates association of a PER3 polymorphism with morningness-eveningness preference and circadian rhythm sleep disorder. Sci Rep, 2014
came from a Japanese study of 925 controls, 182 delayed sleep phase patients, and 67 free-running type patients. The G allele was significantly associated with eveningness preference (sex-adjusted OR 2.48, 95% CI 1.34-4.60, corrected P = 0.012). More strikingly, G allele frequency was doubled in free-running type patients — people whose internal clock fails to entrain to the 24-hour day (age- and sex-adjusted OR 2.02, 95% CI 1.16-3.52, P = 0.017).

An Italian replication study99 Italian replication study
Lazar AS et al. Diurnal preference, mood and the response to morning light in relation to polymorphisms in the human clock gene PER3. Sci Rep, 2017
of 786 Caucasian subjects confirmed the chronotype association (OR 2.10, 95% CI 1.21-3.65, P = 0.008) and found that G carriers showed lower mood scores in the late afternoon and early evening — the time when a longer-period clock would be most misaligned with the external day.

Beyond chronotype, a case-control study1010 case-control study
Lavebratt C et al. Molecular analyses of circadian gene variants reveal sex-dependent links between depression and clocks. Transl Psychiatry, 2016
of 592 major depressive disorder (MDD) cases and 776 controls found the G allele associated with MDD risk (OR 1.39, allelic P = 0.007), with a stronger effect in women (allelic P = 0.041). Separately, anxiety levels1111 anxiety levels
Liberman AR et al. Sci Rep, 2017
were significantly higher in G allele carriers (F(2,305) = 3.195, P = 0.042), consistent with the broader finding that circadian misalignment elevates anxiety and depression risk.

Practical Implications

This variant does not cause disease. It shifts your circadian tendency. If you carry the G allele and find yourself naturally gravitating toward later bedtimes, the biology supports what you already feel. The key is to work with your chronotype rather than fight it:

Morning light exposure is the most powerful tool for advancing a late-running clock. Even 20-30 minutes of outdoor light before 10 AM can shift your circadian phase earlier. Conversely, avoiding bright light (especially blue-enriched screens) in the 2-3 hours before desired bedtime prevents the clock from being pushed even later.

Meal timing also entrains peripheral clocks. Eating your last substantial meal at least 3 hours before sleep, and anchoring breakfast to a consistent time, provides a secondary timing cue that reinforces the light signal.

For the mood dimension, the association between this variant and depression/anxiety appears to operate through circadian misalignment rather than a direct effect on mood neurocircuitry. Maintaining regular sleep-wake timing — even on weekends — reduces social jetlag1212 social jetlag
The discrepancy between your biological clock and your social schedule, measured as the difference between midpoint of sleep on work days vs free days
and may mitigate the mood risk.

Interactions

PER3 Pro864Ala interacts with the PER3 VNTR (4-repeat vs 5-repeat). The G allele combined with the PER3-4 repeat haplotype shows a stronger association with morningness than either variant alone (OR 2.19 for the haplotype vs OR 2.10 for the SNP alone). However, the VNTR is not genotyped on standard SNP chips, so this interaction cannot be assessed from 23andMe data.

PER3 is part of a broader circadian gene network including CLOCK, BMAL1 (ARNTL), PER1, PER2, CRY1, and CRY2. Variants in these genes may compound or buffer PER3 effects on chronotype, but specific SNP-SNP interactions with rs228697 have not been well characterized in the published literature.

LYPLAL1 and Fat Distribution: A Sex-Dimorphic Signal at Chromosome 1q41

Where you store fat matters as much as how much fat you carry. Two people with identical BMIs can have very different metabolic and health profiles depending on whether fat accumulates centrally (around the abdomen) or peripherally (in the hips, buttocks, and legs). The LYPLAL1 locus on chromosome 1q41 is one of the earliest and most replicated genetic signals for waist-hip ratio adjusted for BMI11 waist-hip ratio adjusted for BMI
WHR-adjBMI is a phenotype that captures fat distribution independently of total adiposity — it reflects where fat is located, not how much there is
, and its effects are dramatically stronger in women than in men.

rs2605100 sits within an intronic region of the LYPLAL1 antisense RNA 1 gene (LYPLAL1-AS1), approximately 259 kb from the LYPLAL1 protein-coding gene itself. LYPLAL1 (Lysophospholipase-Like 1) encodes a serine hydrolase with unclear endogenous substrate22 serine hydrolase with unclear endogenous substrate
Crystal structure shows LYPLAL1 is structurally similar to acyl-protein thioesterases but with a closed hydrophobic tunnel preferring short acyl chains; it has been proposed to act as a triglyceride lipase in adipose tissue and to regulate protein depalmitoylation
. Despite its name, the protein is not a classical lysophospholipase. Its precise physiological substrate and pathway remain under active investigation.

The Mechanism

The G allele at rs2605100 is the major allele (frequency ~71% in Europeans, up to ~89% in Africans) and is the variant associated with higher WHR — meaning more centrally distributed fat — in women. The A allele is the protective (favorable) form linked to a more gynoid (hip-and-thigh-predominant) fat distribution pattern. The variant likely acts as a regulatory signal rather than directly altering the LYPLAL1 protein sequence, possibly influencing the expression of LYPLAL1 or its antisense RNA in adipose tissue.

Mouse knockout studies33 Mouse knockout studies
CRISPR-Cas9 whole-body Lyplal1 KO on high-fat/high-sucrose diet; n=20 per sex per diet
demonstrate a sex-specific role: female (but not male) Lyplal1 KO mice on a high-fat, high-sucrose diet weighed approximately 5 g less than wildtype controls, had reduced total body fat percentage, and showed smaller white adipose tissue depots across inguinal, gonadal, and perirenal sites. This mirrors the human GWAS pattern where LYPLAL1 variants affect fat distribution exclusively or predominantly in women. One proposed mechanism is that LYPLAL1 functions as an acyl thioesterase involved in protein palmitoylation44 protein palmitoylation
Palmitoylation is the reversible attachment of palmitate to cysteine residues, which regulates membrane targeting and activity of proteins including estrogen receptors and metabolic signaling intermediates
, potentially influencing estrogen receptor localization and thus coupling the gene's function to sex hormone signaling.

The Evidence

The original discovery came from a meta-analysis of 38,580 individuals55 meta-analysis of 38,580 individuals
Lindgren et al. 2009, PLoS Genetics; 32 GWAS cohorts in stage 1, followed by replication in 70,689 subjects
that identified rs2605100 as the sentinel variant at the LYPLAL1 locus. The association with WHR reached genome-wide significance in women (p = 1.3×10⁻⁸) but was completely absent in men (p = 0.50). The absolute effect — 0.0014 units of WHR per G allele — is modest, explaining approximately 0.02% of WHR variance in women. This is typical for GWAS hits: individually small effects that are biologically real and replicated.

A subsequent GIANT consortium analysis66 subsequent GIANT consortium analysis
Heid et al. 2010, Nature Genetics; n=77,167 discovery, n=113,636 replication; 32 genome-wide association studies
confirmed the LYPLAL1 signal alongside 13 newly discovered loci. Seven of the 14 confirmed WHR loci showed marked sex dimorphism, all with stronger effects in women — a pattern consistent with the well-established sex differences in adipose tissue distribution driven by estrogen signaling.

Metabolic consequences of the G allele extend beyond fat location. In a Danish population study77 Danish population study
Dalgaard et al. 2011, PLoS ONE; Inter99 cohort, n=6,038 adults
, each additional G allele was associated with 3% higher fasting triglycerides (p = 0.003), 3% higher fasting insulin (p = 0.003), and 4% higher HOMA-IR (p = 0.001). Notably, the triglyceride effect was male-restricted (6% per G allele in men, p = 2.4×10⁻⁴; interaction p = 0.02), suggesting different metabolic routes through which the variant affects men vs. women.

Evidence from bariatric surgery outcomes88 bariatric surgery outcomes
Lund et al. 2016, n=251 RYGB patients (186 women); rs4846567, which is in high LD with rs2605100
shows that individuals homozygous for the favorable T allele of the correlated SNP rs4846567 lost 7% more excess body weight after gastric bypass surgery and reported 74% lower hunger scores and 53% lower disinhibition scores on validated eating behavior questionnaires compared to G-allele carriers.

Overall, evidence for rs2605100 is strong: multiple large, independent GWAS in European and Asian populations, consistent replication, plausible biological mechanism supported by animal models, and metabolic downstream effects that extend beyond the index phenotype.

Practical Actions

For carriers of one or two copies of the G allele — particularly women — the key insight is a genetic tendency toward central fat redistribution. This does not override lifestyle factors but does establish a meaningful background predisposition. Two areas are specifically supported by the biological evidence:

First, triglyceride management is directly implicated. The G allele is associated with elevated fasting triglycerides regardless of fat distribution. Dietary approaches that specifically lower triglycerides — not generic "healthy eating" — are warranted: reducing refined carbohydrate and added sugar intake, increasing omega-3 fatty acids (EPA/DHA from marine sources), and limiting alcohol.

Second, body composition monitoring matters more than scale weight. Since the LYPLAL1 locus affects fat distribution rather than total fat mass, standard BMI tracking misses the relevant phenotype. Waist circumference and waist-hip ratio track the fat depot that this variant influences most directly.

Interactions

The LYPLAL1 locus signal is most commonly indexed by rs4846567, which is in high linkage disequilibrium99 linkage disequilibrium
LD means the two variants are almost always co-inherited; knowing your rs2605100 genotype reliably predicts your rs4846567 genotype
with rs2605100. For practical interpretation, the two variants can be treated as equivalent.

The LYPLAL1 fat distribution signal operates independently of FTO/MC4R total adiposity signals — having a high-risk FTO genotype (rs9939609 AA) drives increased overall fat mass, while LYPLAL1 determines where that fat is preferentially stored. These can compound: an individual with high-risk genotypes at both loci may gain more total fat and distribute it more centrally.

Supervisor note — candidate compound action: individuals carrying the G allele at rs2605100 (LYPLAL1, central redistribution) AND the AA genotype at rs9939609 (FTO, higher total fat mass) represent the highest-risk combination for central obesity among these two loci. A combined recommendation addressing both elevated total fat accumulation and its central distribution — specifically targeting fasting triglycerides and waist circumference rather than weight alone — would be appropriate and supported by additive effects documented in stratified GWAS analyses.

FADS2 rs2727271 — Your Delta-6 Desaturase Throttle

FADS211 FADS2
Fatty acid desaturase 2, encoding the delta-6 desaturase (D6D) enzyme — the first and rate-limiting step in converting the plant-based omega-3 alpha-linolenic acid (ALA) into EPA and ultimately DHA
is one of the most consequential enzymes in human lipid metabolism. It processes both the omega-3 pathway (ALA → stearidonic acid → EPA) and the omega-6 pathway (linoleic acid → gamma-linolenic acid → DGLA → arachidonic acid). Without adequate FADS2 activity, the plant-based omega-3 fats in flaxseed, walnuts, and chia seeds cannot be converted into the long-chain forms that the brain, cardiovascular system, and immune cells actually use.

rs2727271 is an intronic variant within FADS2 on chromosome 11q12-13 that tags a haplotype block associated with altered FADS2 expression. The T allele appears in about 14% of Europeans but in 41% of East Asians, pointing to population-specific evolutionary pressures on fatty acid conversion capacity — likely tied to differences in dietary omega-3 sources across ancestral populations.

The Mechanism

rs2727271 sits deep within an intron of FADS2 (position c.142-1892 relative to the coding sequence), with no direct protein change. Its functional effect is presumed to be cis-regulatory — altering FADS2 mRNA transcription efficiency22 cis-regulatory — altering FADS2 mRNA transcription efficiency
Analogous to the promoter-methylation mechanism of rs174537 in FADS1, where intronic/regulatory variants in the same FADS cluster modulate expression via allele-specific DNA methylation at enhancer sites between FADS1 and FADS2
. The T allele is associated with reduced enzyme activity, which produces a predictable biochemical cascade: ALA and linoleic acid accumulate because the first desaturation step is slowed, while downstream long-chain products (EPA, arachidonic acid) are produced at lower rates.

A genome-wide association study identified rs2727271-A (the major allele) as associated with decreased circulating cis/trans-18:2 fatty acid levels (p=7×10⁻⁹), consistent with A-allele carriers more efficiently converting 18:2 precursors through the desaturation pathway. By contrast, T-allele carriers accumulate more precursor fatty acids — a biochemical signature of reduced D6D throughput.

The Evidence

The strongest evidence for the FADS2 locus comes from large genome-wide association studies of circulating polyunsaturated fatty acids. A landmark 2009 GWAS in 1,075 InCHIANTI participants33 landmark 2009 GWAS in 1,075 InCHIANTI participants
Tanaka et al., PLoS Genetics — the FADS1/2/3 cluster on chromosome 11 emerged as the dominant genetic determinant of plasma PUFA levels, with rs174537 alone explaining 18.6% of additive variance in arachidonic acid (p=5.95×10⁻⁴⁶) and associating strongly with EPA (p=1.07×10⁻¹⁴)
established that natural variation at this locus has an unusually large effect on fatty acid status — larger than most single SNPs in metabolic genetics.

Subsequent studies refined the picture for FADS2 specifically. Zec et al. 2020 in Nutrition Research44 Zec et al. 2020 in Nutrition Research
Cross-sectional study of 286 Serbian adults; FADS2 rs174576 minor allele carriers showed plasma AA β=−1.14 (95% CI: −2.25 to −0.43) and reduced estimated desaturase-5 activity after multivariate adjustment
confirmed that FADS2 minor alleles consistently associate with lower arachidonic acid. Schuchardt et al. 201655 Schuchardt et al. 2016
111 MCI patients; FADS2 minor allele carriers at rs3834458, rs1535, rs174575, and rs174576 showed higher precursor PUFA levels and lower AA in erythrocyte membranes
demonstrated that the effect on circulating PUFA composition is detectable in red blood cell membranes — the most clinically relevant tissue compartment for omega-3 status assessment.

Isotope tracer work directly quantifying in-vivo ALA conversion found that FADS minor allele homozygotes had lower plasma EPA and lower [13C]EPA enrichment at 24 and 48 hours after ALA tracer dosing66 FADS minor allele homozygotes had lower plasma EPA and lower [13C]EPA enrichment at 24 and 48 hours after ALA tracer dosing
Gillingham et al. 2013, Am J Clin Nutr, n=103; four FADS SNPs studied across all dietary conditions
, providing direct mechanistic confirmation that reduced FADS activity translates into measurably less ALA-to-EPA conversion.

The East Asian frequency of the T allele (41%) versus African populations (3%) is striking. This stratification suggests that the T allele may have been neutral or mildly adaptive in populations historically relying on preformed dietary EPA and DHA from fish, while being selected against in populations that depended more on plant-based omega-3 conversion.

Practical Actions

For T-allele carriers, the core problem is that plant-based omega-3 sources (flaxseed, chia, walnuts, hemp) are unreliable. These foods supply ALA, which requires functional FADS2 to produce EPA and DHA. The T allele slows this conversion at the first and rate-limiting step, meaning even high ALA intakes may not translate to adequate EPA and DHA status.

The practical solution is to bypass the impaired conversion step by providing preformed EPA and DHA directly — from fatty fish, concentrated fish oil, or algae-based EPA/DHA supplements. AT heterozygotes benefit from 1–2 g EPA+DHA daily; TT homozygotes require 2–4 g daily to compensate for substantially reduced D6D throughput.

Omega-3 index testing (erythrocyte EPA+DHA as a percentage of total fatty acids) provides the most direct readout of whether supplementation is working. A value below 4% indicates clinically significant deficiency requiring dose adjustment.

Interactions

rs2727271 is in linkage disequilibrium with other FADS2 cluster variants, including rs3834458, rs174575, rs174576, and rs1535. Carrying multiple minor alleles across the FADS cluster (both FADS1 rs174537 T allele and FADS2 rs2727271 T allele) would compound the conversion deficit, since FADS1 (delta-5 desaturase) acts downstream of FADS2 in the same pathway. Combined FADS1+FADS2 minor allele carriers face a double bottleneck in long-chain PUFA synthesis.

FADS2 activity is also substrate-competitive: when dietary saturated fat (palmitic acid, 16:0) is high, FADS2 preferentially processes the saturated substrate rather than ALA or linoleic acid, further reducing long-chain PUFA synthesis in T-allele carriers who have less enzyme capacity to begin with.

ANRIL and Functional Aging — The 9p21.3 Longevity Locus

A single stretch of chromosome 9 — the 9p21.3 locus — is the most consistently replicated region in genome-wide association studies for age-related disease. Coronary artery disease, type 2 diabetes, glioma, melanoma, and now physical aging have all been linked to variation here. At the heart of the locus sits ANRIL (antisense non-coding RNA in the INK4 locus)11 ANRIL (antisense non-coding RNA in the INK4 locus)
Also designated CDKN2BAS or CDKN2B-AS1 — a long non-coding RNA transcribed antisense to the CDKN2A/CDKN2B tumor suppressor genes
, a molecular rheostat for cellular senescence.

rs2811712 is a tag SNP within ANRIL that was identified in a landmark study as associated with physical function in older people22 associated with physical function in older people
Melzer D et al. A common variant of the p16INK4a genetic region is associated with physical function in older people. Mech Ageing Dev. 2007
. The G allele — carried by roughly one in five people of European ancestry — is associated with substantially better preserved physical function in old age, while the common A allele is the risk-conferring genotype.

The Mechanism

The 9p21.3 locus contains three protein-coding genes — CDKN2A (encoding p16INK4a and p14ARF), CDKN2B (encoding p15INK4b), and MTAP — all flanked and overlapped by ANRIL. p16INK4a and p15INK4b are cyclin-dependent kinase inhibitors33 cyclin-dependent kinase inhibitors
CDK inhibitors block CDK4/6, preventing phosphorylation of Rb and halting cell-cycle progression from G1 to S phase — the canonical senescence checkpoint
that drive cellular senescence: as cells age and accumulate damage, p16 levels rise, arresting the cell cycle and converting cells into senescent "zombie" cells44 senescent "zombie" cells
Senescent cells stop dividing but remain metabolically active, secreting pro-inflammatory cytokines (the SASP — senescence-associated secretory phenotype) that damage neighboring tissue
that fuel the chronic inflammation underlying age-related functional decline.

ANRIL regulates CDKN2A/CDKN2B expression in cis through Polycomb group protein recruitment, particularly the PRC2 complex, which methylates histone H3K27 to silence the INK4 locus. In proliferating cells, ANRIL keeps senescence genes suppressed; as ANRIL expression shifts with aging or genetic variation, the balance tips toward elevated p16 and p15, accelerating the senescent phenotype. rs2811712 is an eQTL for CDKN2B expression: studies show that variation here correlates with altered ANRIL isoform abundance and downstream CDKN2B levels, providing a molecular pathway from genotype to functional aging phenotype.

The Evidence

The primary association was established by Melzer et al. (2007)55 Melzer et al. (2007)
A common variant of the p16INK4a genetic region is associated with physical function in older people. Mechanisms of Ageing and Development.
across three independent European cohorts totaling 3,372 elderly individuals (EPIC-Norfolk sets 1 and 2 plus InCHIANTI). Severely limited physical function — defined by performance-based tests of walking speed, chair stand, and balance — was present in 15.0% of AA homozygotes versus 7.0% of GG homozygotes, nearly half the rate. The per-A-allele odds ratio was 1.48 (95% CI 1.17–1.88, p = 0.001), adjusted for age, sex, and study. The association held across all three cohorts tested independently, including a trend in the Iowa-EPESE cohort (n=419, p=0.079 one-sided), making this one of the first robust genetic associations for functional aging specifically.

A Han Chinese case-control study Wu et al. (2012)66 Wu et al. (2012)
Heterozygote genotypes at rs2222823 and rs2811712 SNP loci are associated with cerebral small vessel disease in Han Chinese population
found the heterozygote genotype at rs2811712 was associated with OR 1.75 (CI 1.13–2.71, p=0.004) for cerebral small vessel disease — consistent with the broader pattern that the A (risk) allele at this locus impairs vascular and neurological aging beyond musculoskeletal function.

The broader 9p21.3 locus literature confirms that ANRIL variants influence CDKN2B expression and cellular senescence77 CDKN2B expression and cellular senescence
SNPs here show inverse effects on ANRIL and CDKN2B expression, supporting a role of antisense transcription in regulating senescence pathways
, with the risk alleles generally increasing ANRIL expression in ways that paradoxically reduce appropriate p16/p15 upregulation in vascular cells, impairing the normal senescence response to damage. This locus is the strongest genetic signal for coronary artery disease in Europeans (OR ~1.3 per risk allele), though rs2811712 itself does not independently associate with CAD — its primary documented phenotype is functional physical aging.

Practical Actions

For carriers of two A alleles (AA), the evidence supports a targeted approach to slowing the accumulation of senescent cells and their inflammatory secretome. The most directly relevant interventions for the CDKN2B/p16 axis include:

  • Senolytic compounds: Fisetin (a flavonoid concentrated in strawberries and onions) and quercetin have demonstrated senolytic activity in human studies — selectively promoting apoptosis in p16-expressing senescent cells. Intermittent high-dose fisetin (≥500 mg/day for 2–3 days per month) reduced senescent cell burden and SASP markers in published clinical data.
  • Cardiovascular monitoring: The 9p21.3 region is the strongest common genetic locus for coronary artery disease. While rs2811712 itself does not independently associate with CAD, it tags a haplotype block where other risk variants reside. AA carriers should ensure thorough cardiovascular risk assessment.
  • Strength and resistance training: p16-driven muscle senescence contributes directly to sarcopenia (age-related muscle loss) — the leading mediator of physical impairment in older adults. Resistance training specifically reduces senescent cell burden in skeletal muscle and preserves muscle fiber function through mechanisms that parallel the CDKN2B pathway.

AG carriers carry one protective G allele: physical function risk is intermediate, and the same principles apply with moderated urgency.

Interactions

rs10757278 and rs1333049 (9p21.3 haplotype): These are the primary CAD-associated SNPs at the 9p21 locus, in moderate-to-high LD with rs2811712. If you carry the risk haplotype at rs10757278 (G allele) in addition to AA at rs2811712, the combined burden on ANRIL/CDKN2B regulation is likely greater than either marker alone. The 9p21.3 block spans ~58 kb and multiple SNPs contribute to its overall phenotypic effect.

rs564398 (upstream of CDKN2A): Another 9p21 tag SNP associated with type 2 diabetes and glioma risk, which shares the ANRIL regulatory context. Combined risk at rs2811712 and rs564398 may implicate broader ANRIL dysregulation beyond the physical aging phenotype.

rs35767

IGF1 C-1245T

Strong Risk Factor

The Anabolic Switch — IGF-1 and Your Muscle-Building Potential

Insulin-like growth factor 1 (IGF-1) is one of the most powerful anabolic hormones in the human body. It activates the PI3K/Akt/mTOR pathway11 activates the PI3K/Akt/mTOR pathway
The canonical growth signaling cascade that controls muscle protein synthesis and hypertrophy
, stimulates satellite cell activation22 satellite cell activation
Muscle stem cells that divide and fuse to repair damage and create new muscle tissue
, and drives skeletal muscle hypertrophy in response to training. The rs35767 polymorphism sits in the promoter region of the IGF1 gene, 1,245 base pairs upstream of the transcription start site, where it regulates how much IGF-1 your body produces.

The A allele is associated with higher circulating IGF-1 levels compared to the C allele, and AA carriers tend to have greater muscle mass and superior athletic performance33 greater muscle mass and superior athletic performance
Particularly in power and combined power-endurance sports like decathlon
. This variant has emerged as one of the most replicated genetic markers for elite athletic performance.

The Mechanism

rs35767 is a regulatory variant located in the promoter region of the IGF1 gene on chromosome 12. The T-to-C substitution at position -1245 affects transcription factor binding and gene expression. Studies show the A allele leads to higher IGF-1 production, though the exact transcription factor interactions remain under investigation. Some research suggests the G allele may allow binding of C/EBPD transcription activator44 G allele may allow binding of C/EBPD transcription activator
A DNA-binding protein that regulates gene expression
, while other evidence indicates the A allele results in higher circulating levels through mechanisms that may involve altered promoter activity.

Once IGF-1 is secreted (primarily by the liver in response to growth hormone), it binds to IGF-1 receptors on muscle cells. This triggers a signaling cascade: PI3K converts PIP2 to PIP3, activating PDK1 and Akt. Akt then phosphorylates mTORC1, which activates ribosomal protein S6 and translation initiation factor eIF4E, ramping up protein synthesis. Simultaneously, Akt inhibits FoxO transcription factors, blocking the expression of muscle atrophy genes55 blocking the expression of muscle atrophy genes
E3 ubiquitin ligases like atrogin-1 and MuRF1 that tag muscle proteins for degradation
.

IGF-1 also activates muscle satellite cells—the stem cells responsible for muscle repair and growth. After intense exercise or muscle damage, satellite cells proliferate and differentiate into new myonuclei, contributing approximately half of the muscle mass gained during hypertrophy66 half of the muscle mass gained during hypertrophy
Based on studies using viral IGF-1 delivery in animal models
.

The Evidence

The rs35767 variant has been studied extensively in athletic populations. In a 2013 Israeli study of 87 power athletes and 78 endurance athletes77 2013 Israeli study of 87 power athletes and 78 endurance athletes
Including international and Olympic-level competitors
, the A allele was significantly more frequent in top-level power athletes compared to national-level athletes. Among the elite power cohort, 4.8% carried the TT genotype versus 0% in non-athletic controls—a striking overrepresentation.

A 2022 study of decathlon athletes88 2022 study of decathlon athletes
Decathlon demands both power and endurance across 10 events
found the AA genotype was significantly more prevalent among decathletes compared to other athlete groups, and AA carriers demonstrated superior speed performance. These findings align with the physiological role of IGF-1 in fast-twitch muscle fiber development and force production.

A 2024 meta-analysis99 2024 meta-analysis
Pooling data across multiple cohorts to increase statistical power
confirmed the A allele as a favorable genetic marker for both power and endurance athletic performance, supporting the variant's role across multiple training modalities.

At the molecular level, a 2014 study of European adults1010 2014 study of European adults
n=569 in discovery cohort
measured circulating IGF-1 and found that carriers of the GG genotype (equivalent to TT on the minus strand) had significantly higher IGF-1 levels (218 ng/ml) compared to AA carriers (190 ng/ml, p=0.007). The higher IGF-1 group also showed better insulin sensitivity, suggesting metabolic benefits beyond muscle growth.

However, not all effects are beneficial. A Japanese longitudinal cohort of 1,506 individuals1111 Japanese longitudinal cohort of 1,506 individuals
Followed for long-term health outcomes
found that AA carriers experienced faster decline in renal function over time compared to GG carriers, suggesting chronically elevated IGF-1 may have tradeoffs for kidney health.

Practical Actions

If you carry one or two A alleles, you have a genetic advantage for building muscle and responding to strength training. To capitalize on this:

Prioritize resistance training. Your elevated IGF-1 levels mean you're biochemically primed for hypertrophy. Focus on progressive overload—gradually increasing weight, volume, or intensity over time. AA carriers may see faster strength gains and better recovery from high-volume training compared to CC carriers.

Consume adequate protein. IGF-1 activates mTOR, the master regulator of protein synthesis, but mTOR needs amino acid availability to function. Aim for 1.6-2.2 g/kg body weight daily, with post-workout protein intake1212 post-workout protein intake
20-40g within 2 hours of training
to maximize the anabolic window when IGF-1 signaling is elevated.

Optimize sleep and recovery. Growth hormone (the primary driver of hepatic IGF-1 production) peaks during deep sleep. AA carriers producing more IGF-1 may benefit even more from adequate sleep (7-9 hours) for muscle repair and satellite cell activation.

Consider monitoring kidney function if you're TT. While the athletic benefits are clear, the Japanese cohort data suggests potential long-term renal effects. If you're a TT carrier pursuing intense athletic training, periodic monitoring of eGFR and creatinine may be prudent, especially as you age or if you have other kidney risk factors.

Interactions

rs35767 interacts with other variants in the IGF axis. rs71364461313 rs7136446
An intronic IGF1 variant
has also been associated with athletic performance and may compound with rs35767 to influence IGF-1 levels and muscle phenotype. Similarly, rs9729361414 rs972936
Another IGF1 intronic variant
affects IGF-1 expression and has been linked to neurological outcomes and muscle force production.

Beyond the IGF1 gene, interactions with the growth hormone receptor and myostatin pathway are likely. Carriers of both the IGF1 A allele and myostatin rare R allele1515 myostatin rare R allele
Loss-of-function variants in MSTN that reduce this muscle growth inhibitor
show even greater muscle mass and performance, suggesting an additive or synergistic effect.

For power athletes, the combination of rs35767 TT and ACTN3 RR1616 ACTN3 RR
Alpha-actinin-3, the "gene for speed"
may represent an elite genetic profile for explosive strength and sprint performance.