Skip to main content

The Impact of Gonadotropin-Releasing Hormone Analogues and Subsequent Cross-Sex Hormones on Bone Health: A Systemic Review

This section summarises the literature on the impacts of puberty blockers on bone health with a systemic review. Please cite as: Tegg, S., The Impact of Gonadotropin-Releasing Hormone Analogues and Subsequent Cross-Sex Hormones on Bone Health: A Systemic Review. Fully Informed. 2021.


Methods: A literature search of the Pubmed database for studies reporting bone mineral density (BMD) changes in adolescents receiving GnRHa and subsequent cross-sex hormones (CSH), or GnRHa and follow-up after treatment withdrawl was conducted. A random effects meta-regression was used to compute changes to BMD and BMD z-score (BMD-Z) across included studies. Pairwise T-tests of cohort level results were used to compare follow-up BMD results with baseline, CSH start, and healthy population references.

Results: Six cohort studies reporting changes in BMD while on GnRHa treatment for gender dysphoria. Four of the studies reported BMD changes under subsequent testosterone treatment in females, and three reported BMD changes of males under subsequent estrogen treatment. The total number of participants were 371 for GnRHa, 144 for testosterone, and 58 for estrogen. No studies examined BMD recovery or otherwise after GnRHa was stopped and endogenous sex hormone production began. In the random effects model, the BMD-Z of GnRHa-treated adolescents declined -0.83 standard deviations (SD) (p <.001). BMD-Z increased 0.53 SD under subsequent testosterone treatment and was lower than baseline by -0.22 SD, but this was not significant (p = .12). Under subsequent estrogen treatment BMD-Z increased 0.25 SD but this was not significant, and follow-up values were -0.29 SD below baseline values (p = .08).

Conclusion: Under GnRHa treatment adolescent BMD stabilises or declines slightly during a period when it is expected to increase. Consequently, BMD values compared to age-matched peers decreases. Subsequent testosterone treatment may recover BMD status in female subjects, but the evidence is mixed for recovery in male subjects treated with estrogen, and lacking for subjects who withdraw from hormone treatments. Patients and their parents should be advised of impacts to BMD and that the impacts may be long-term.

Introduction and Background

GnRha treatment induces a state of suspended sex hormone production and pubertal development (hypogonadotropic hypogonadism) . Clinicians treating adolescents with GnRHa (also known as "puberty blockers") propose that the treatment relieves distress associated with undesired pubertal development and will allow the adolescent's adult body to more closely resemble their identified sex.

For patients for whom gender dysphoria persists, CSH hormone treatment usually begins at age 16. CSH treatment directly changes some of the secondary sex characteristics to align with the opposite sex (body hair, fat distribution, vocal register) and can be permanent (Oliphant et al., 2018; Hembree et al., 2017).

In NZ, GnRHa treatment may continue through to age 20 (Oliphant, 2017). It is unclear how common the continuation of GnRHa treatment past age 16 is in other countries or whether this is a NZ-specific practice.

On March 1st, 2021, two Otago University academics, Dr Sue Bagshaw and Dr Jane Spittlehouse released a statement on the results of and unpublished student's summer 2020/2021 literature review. The literature review concerned the impacts and benefits of GnRHa for treating adolescents with gender dysphoria. In that statement, Bagshaw and Spittlehouse make claims regarding the impacts of GnRHa and subsequent cross-sex hormones (CSH), or treatment withdrawal on the bone health of adolescents.

"One exception is that use of puberty blockers does result in a decrease in bone mineral density, but this normalises when GnRH analogues are stopped and either a return to self-produced or cross-sex hormones are started."

The claims were paraphrased in a media article (Broughton, 2021):

"Bagshaw said reduced bone density was similar to the effect of the depo provera contraceptive injection, and was reversible."

In another article, similar claims were made by Dr Rachel Johnson (Chisholm, 2021):

Bone health is one of the areas we particularly focus on because we know that blockers do affect bone development. Research is showing that when you either stop the blocker or add in gender-affirming hormones then the bone density improves. Bone density is restored within approximately three years.

The position above is distinct from that of (a) clinicians at the The Royal Children’s Hospital in Melbourne, Australia who state that under subsequent CSH, BMD “may not reach the bone density levels of age and birth-assigned sex-matched controls.” (Notini et al., 2020, p. 5)

Or (b) clinicians at the Mayo Clinic in Rochester MN, USA “With initiation of [CSH], Z-scores improve but may not return to pretreatment Z-scores.” (Davidge-Pitts and Clarke, 2019, p. 40)

Further, and confusingly, a secondary statement summarising the same literature review makes a contradictory claim regarding the recovery of BMD for those who do not proceed to CSH (Spittlehouse et al., 2021):

The research indicates that there is a decrease in bone mineral density but that appears to normalise after puberty blockers are stopped and the new sex hormones are commenced (for those who choose to transition). There is a lack of evidence around the impacts of bone mineral density for those who do not receive new sex hormones. ” (emphasis added)

This article examines the accuracy of the above claims and attempts to clarify the apparent contradictions. GnRHa and CSH are controversial treatments. Puberty is an important period for bone mass accumulation. Adolescents who enter puberty at an older ages have persistently lower BMD than peers (Elhakeem et al, 2019). An induced pubertal blockade could therefore be expected to have similar effects. An adolescent patient treated with GnRHa has reported a high fracture rate (Bannerman, 2019). A second case study reports an adolescent with BMD -2 SD below the mean after 3 years of GnRHa treatment (Pang et al., 2020). A third case of long-term untreated hypergonadotropic hypogonadism has reported osteoporosis and vertebral fractures in middle age (Carsote et al., 2016).

In adults, CSH hormones increase BMD in the short-term (Wiepjes et al., 2017), but estrogen treatment in males appears to produce a slight decrease in BMD in the long-term (Delgado-Ruiz et al., 2019). However, it is not clear the extent to which lower levels of physical activity in the transgender male population confound these results.

The NHS has recently revised its advice on GnRHa (Kirkup, 2020). Following an ethical review, the Karolinska Institute no longer offers GnRHa or CSH to patients under 16 years of age, citing the risk of osteoporosis in later life (amongst other risks) (SEGM, 2021). Patients, and their parents have a legal right to clear and accurate information on the impacts of any treatment under consideration.

Clinical decision-making is not served by the polarised debate, nor the contradictory statements, that have emerged around this issue. A systematic review can help clarify the likelihood and magnitude of any bone health impacts.

Absolute vs Relative Changes in Bone Mineral Density

A potential source of confusion regarding adolescent bone health is the distinction between absolute changes in BMD versus changes in BMD relative to a healthy population reference. During child-hood, BMD increases until peak bone mass is reached in early adulthood. Puberty is the peak period for BMD accumulation (Figure 1.). According to Boot et al. (1997) puberty stage is the major independent determinant of BMD in girls . Stable absolute BMD in adolescence therefore represents a failure to accumulate adult BMD levels, and a decline in BMD relative to a healthy population reference.

Bone Mineral Density (BMD) by age
Figure 1. Bone Mineral Density by age. Data from Boot et al. (1997)

Therefore reported changes to absolute BMD can obscure adolescent bone health status, while relative measures provide a clearer picture of adolescent bone health under treatment (Klink et al., 2015).

BMD reference population comparisons are measured with Z-scores. A Z-score of 0 represents the mean (50th percentile) for that population. A Z-score of -1 represents BMD 1 standard deviation (SD) below the mean, or approximately the 16th percentile. In adults, a BMD Z-score between -1 and -2.5 is considered to be osteopenia, and below -2.5 (0.6th percentile) is considered osteoporosis. However, adolescent BMD Z-scores are interpreted differently. A Z-score below -2 is considered "low for age" and a diagnosis of paediatric osteoporosis relies on fracture history in addition to z-scores (Bachrach & Gordon, 2016).


The search strategy included articles that: (a) were intervention studies reporting absolute BMD or BMD-Z measures, (b) of gender dysphoric adolescents, (c) treated with puberty blockers, or both puberty blockers and subsequent cross-sex-hormones, (d) and written in English.

The search strategy excluded articles that were systemic reviews, editorials, opinion pieces, and articles on the treatment of central precocious puberty (CPP) with GnRHa. Children with CPP usually start GnRHa treatment at age ~8 and discontinue at age ~12 (Poomthavorn et al., 2011). Gender dysphoric adolescents usually start treatment at age 11-15 and discontinue at age ~16 (Joseph et al., 2019; Klink et al., 2015; Schagen et al., 2020). Only GnRHa treated gender dysphoric adolescents experience a condition analogous to “delayed puberty”. GnRHa-treated children with CPP do not appear to suffer BMD impacts (Guaraldi et al., 2016). The current study considers two groups separate for the purposes of assessing BMD impacts with distinct potential outcomes.

The PubMed database was searched with the following search strategy: {GD} defines a variable term with variations: "gender dysphoria", "gender identity", and "transgender". {puberty} defines a variable term with variations: “puberty” and “adolescent”. {CSH} defines a variable term with variations: “hormone”, “testosterone” and “estrogen”. Searches combined term variations into 15 searches using the following blueprint:

  • GnRHa AND bone AND {puberty} AND {GD} NOT precocious NOT CPP
  • {CSH} AND bone AND {GD}

Data analysis

The R package mixmeta was used to create a random effects meta-regression of treatment impacts on BMD-Z. The main model compared baseline and follow-up values. A secondary model adjusted for treatment duration. Data and analysis code are made available on a google sheet and repository.


The literature search resulted in 111 articles. Results were screened using the above inclusion criteria and narrowed the results to the six studies (Carmichael et al., 2021; Joseph et al., 2019; Klink et al., 2015; Schagen et al., 2020; Stoffers et al., 2019; Vlot et al., 2017). Table 1. presents the studies with a breakdown by participant cohort. Study cohorts are listed by first author, year of publication, treatment, number of participants at baseline, sex, mean age of particpants at the start of treatment, and treatment duration at the final reported follow-up.

Table 1. Study cohorts characteristics

StudyYearTreatmentn (start)SexMean age (start)Duration (months)

All six studies investigated changes to BMD and other bone markers on GnRHa treatment. Four of the studies also investigated BMD changes under subsequent testosterone treatment (Klink et al., 2015; Schagen et al., 2020; Stoffers et al., 2019; Vlot et al., 2017). Only three studies, all from the Netherlands, examined the subsequent changes in BMD after estrogen treatment (Klink et al., 2015; Schagen et al., 2020; Vlot et al., 2017).

Vlot et al. (2017) and Klink et al. (2015) were conducted by the same research team and 12 subjects overlap. No studies tracked BMD after a withdrawal of GnRHa and subsequent endogenous sex hormone production re-started.

Data extraction

BMD location, mean absolute BMD, and mean Z-score, sex and age of subjects at the treatment intervals including baseline, the number of subjects, the treatment type, treatment length, dosage, and variation measures were extracted. Treatments did not vary by dosage regime and was consequently discarded as a unit of interest.

Where the standard error was unavailable, standard deviation and confidence intervals were converted into standard errors using the appropriate formulas. Table 2. summarises the results of GnRHa treatment on absolute BMD and BMD-Z. A tilde (~) indicates the study found no significant differences. Down (↓) and up (↑) arrows indicate significant decreases or increases respectively. The number of arrows represents the significance threshold met by the reported P value:

  • ↓ / ↑ ≤ 0.05
  • ↓↓ / ↑↑ ≤ 0.01
  • ↓↓↓ / ↑↑↑ ≤ 0.001

To enhance table’s readability, symbols have been colour-coded blue (neutral) or red (negative) to indicate the impact on bone health depending on whether the measure is absolute or relative.

Table 2. Impacts of GnRHa treatment on BMD and BMD Z-Score

KlinkMSpine BMAD11/12~~
KlinkMSpine aBMD12/11~~
KlinkMHip BMAD12/10~~
KlinkMHip aBMD14/6~~
KlinkFSpine BMAD18~↓↓
KlinkFSpine aBMD18↓↓↓↓↓
KlinkFHip BMAD18~~
KlinkFHip aBMD18↓↓↓↓↓
VlotFHip BMAD10~~
VlotFHip BMAD23↓↓↓↓
VlotFSpine BMAD11~↓↓
VlotFSpine BMAD23↓↓↓↓
VlotMHip BMAD16~~
VlotMHip BMAD6~~
VlotMSpine BMAD15~↓↓
VlotMSpine BMAD5~~
CarmichaelM+FSpine BMD12↓?
CarmichaelM+FHip BMD12~↓?
StoffersFSpine BMD62↓↓↓↓↓↓
StoffersFL Hip BMD62↓↓↓↓↓↓
StoffersFR Hip BMD62↓↓↓↓↓↓
SchagenMSpine aBMD15
SchagenMSpine aBMD36
SchagenMHip aBMD15
SchagenMHip aBMD36~
SchagenMBody BMD15
SchagenMBody BMD36~
SchagenMSpine BMAD15~
SchagenMSpine BMAD36~
SchagenMHip BMAD15~~
SchagenMHip BMAD36
SchagenFSpine aBMD14
SchagenFSpine aBMD56
SchagenFHip aBMD14
SchagenFHip aBMD56
SchagenFBody BMD14
SchagenFBody BMD56
SchagenFSpine BMAD14~
SchagenFSpine BMAD56
SchagenMHip BMAD14
SchagenMHip BMAD56
JosephMHip BMD10~↓↓
JosephMSpine BMD10~↓↓↓
JosephMSpine BMAD10~↓↓↓
JosephFHip BMD21~↓↓↓
JosephFSpine BMD21~↓↓↓
JosephFSpine BMAD21~↓↓↓

Studies report results by cohort and measure, giving 47 cohort-measures across the 6 studies. Absolute BMD does not change significantly in 25, decreases in 14, and increases in 8 out of the cohort-measures. Thirty-seven BMD-Z cohort-measures decrease significantly, while eight show no change. For unexplained reasons, Carmichael et al. (2021) do not test the significance of the z-score measures, but do report a decrease (indicated with a ↓?). All cohort-measures with at least 21 participants reported a decrease in BMD Z-score. The main random effects model compared baseline and follow-up periods irrespective of treatment duration. The modelled change in BMD-Z across all studies was -0.83 (95% confidence interval -0.65 – -1.00, P <.001). The model predicts a mean BMD-Z of -0.24 at baseline and -1.06 after GnRHa treatment. The model was adjusted for sex but this was not significant. A secondary random effects model of the followup periods modeled the impact of treatment duration. An additional month of GnRHa treatment was correlated with an -.04 decline in BMD-Z but the P value (0.0827) did not reach the usual threshold for statistical significance.

Cross Sex Hormones

Table 3. compares BMD-Z under subsequent CSH with baseline (i.e. the start of GnRHa treatment), start of CSH treatment, and a reference population. Comparisons to the start of CSH treatment are those reported by the studies. Klink et al. (2015) also reports comparisons to baseline values. In all other baseline and reference cases a pairwise T-test is computed.

Table 3. Differences between BMD z-scores, baseline, start of testosterone treatment, and reference population values.

StudyMeasurenvs Baselinevs Startvs Reference
Schagenspine aBMD5--
Schagenhip aBMD5--
Schagenbody aBMD5--
Schagenspine BMAD5--
Schagenhip BMAD5--
Schagenspine aBMD37↓↓-
Schagenhip aBMD37--
Schagenbody aBMD37↓↓-
Schagenspine BMAD37--
Schagenhip BMAD37-~-
Stoffersspine BMD15~-
StoffersL hip BMD15↑↑-
StoffersL hip BMD15-
Vlothip BMAD10-↑↑-
Vlotspine BMAD11-↑↑-
Vlothip BMAD23--
Vlothip BMAD23-↑↑-
Klinkspine BMAD19-↑↑-
Klinkhip BMAD19~-
Klinkhip aBMD16-~-

BMD-Z increased under testosterone treatment in most cohort-measures and followup values were not significantly different from the reference population. In 6 out of 20 cohort-measures BMD-Z after testosterone treatment was still significantly below baseline values prior to GnRHa treatment. The pooled random effects model predicted a significant increase from -0.65 at the start of testosterone treatment to -0.12 at followup (p = .0002). Predicted followup BMD-Z was -0.22 SD below baseline but was not significant (p = .12).

Table 3. Differences between BMD z-scores, baseline, start of estrogen treatment, and reference population values.

StudyMeasurenvs Baselinevs Startvs Reference
Schagenspine aBMD15--
Schagenhip aBMD15--
Schagenbody BMD15↓↓↓
Schagenspine BMAD15--
Schagenhip BMAD15--
Schagenspine aBMD26↓↓~↓↓↓
Schagenhip aBMD26-~↓↓↓
Schagenbody BMD26↓↓↓~↓↓↓
Schagenspine BMAD26--
Schagenhip BMAD26-↓↓↓
Klinkspine BMAD14-~↓↓
Klinkspine aBMD13~↓↓↓
Klinkhip aBMD11-~↓↓
Vlothip BMAD16- (↓)~↓↓↓
Vlotspine BMAD15↓↓↓↓↓↓
Vlothip BMAD6-~-
Vlotspine BMAD5--

BMD-Z increased under estrogen treatment in 9 out of 17 cohort-measures but values were still significantly below the reference population in the majority of cohort-measures. In 5 cohort-measures BMD-Z after testosterone treatment was still significantly below baseline values prior to GnRHa treatment. The pooled random effects model showed a non-significant increase of 0.24 from -1.13 at the start of testosterone treatment to -0.89 at followup (p = .14). Modeled followup BMD-Z was -0.29 below baseline and this difference came close to statistical significance (p = .08).



BMD under GnRHa treatment stabilises at early adolescent level, declines, or accretes more slowly, when BMD would normally accrete rapidly. Any loss of BMD during adolescence is abnormal (Ferguson et al., 2019). In part due to the typically lower baseline BMD of gender dysphoric adolescents (Lee et al., 2020), an adolescent completing a 1-3 year course of GnRHa could be expected to have BMD around ~1 standard deviation below peers, or about 0.8 SD below baseline.

However, care should be taken with interpreting mean values as reported means obscure variation in outcomes. Biggs (2021) analysis of the raw values of Joseph et al (2019) highlights how a significant minority of patients had Z-scores below -2, qualifying them as "low for age". Studies did not track fractures and it is unclear if the treatment induced childhood osteoporosis in any subjects. Most subjects started GnRHa treatment at age 13-15 and were followed up 2-3 years later, limiting conclusions regarding subjects who start treatment earlier and continue for longer. Results suggest that longer treatment duration may lead to greater declines in BMD-Z. All studies lacked controls and lifestyle factors may have impacts on BMD. Standard practice is to counsel patients on the importance of diet and load-bearing exercise (Oliphant et al., 2018). However, the advice to perform load-bearing exercise may not be realistic for gender dysphoric male adolescents who are seeking a female body shape and may lack motivation to undertake a practice they perceive to be at odds with this goal.

Cross sex hormones and treatment withdrawal

BMD can partially or wholly recover to normal levels under subsequent testosterone treatment for the majority of female patients. However, when considering patients of both sexes and the range outcomes following GnRHa treatment, the Bagshaw-Spittlehouse statement that BMD "normalises when GnRH analogues are stopped and either a return to self-produced or cross-sex hormones are started" is not supported. Studies do not demonstrate a recovery of BMD-Z under subsequent estrogen treatment. BMD-Z may stagnate; or improve but may not return to baseline. Schagen et al (2019) reports that three male patients had a femoral neck (hip) BMD-Z less than -2 after three years of estrogen treatment. Three patients also had a spine BMD less than -2 after three years of CSH.
Evidence for a recovery of BMD after GnRHA withdrawal and without CSH is lacking and it is unclear how Bagshaw and Spittlehouse support the claim that BMD recovers in this circumstance.


Dr Bagshaw’s comparison of GnRHa to depo-provera is also puzzling. Depo-provera is a contraceptive. Consequently, depo-provera treated adolescents will be sexually active and generally older than those prescribed GnRHa for gender dysphoria. GnRHa treatment that starts before and continues through the peak bone mass accretion ages 14-16 may have distinct impacts on BMD compared to a treatment that starts at older ages. Pitts et al. (2012) assessed the impact of depo-provera on BMD and BMD-Z in 83 subjects with a median age of 16.4 at treatment start. In 15 subjects followed up after a further 1.5 years of depo-provera treatment BMD-Z declined significantly (p = .006) at the spine by -0.33 SD, and non-significantly (p = .33) by -0.14 SD at the hip. The former result is less than half the modelled impact of GnRHa treatment of similar duration. However, the small size and older age of subjects make it difficult to compare results between these treatments.


The evidence of impacts of GnRHa on bone health in combination with subsequent CSH is limited by the small number of studies, lack of controls and short-term followup. A meta-regression of limited available results shows a significant decline of BMD-Z in GnRHa treated adolescents, a partial or full recovery of BMD-Z in the majority of subsequently testosterone-treated females, and a stagnation or partial recovery of BMD-Z in estrogen-treated males. A literature search was unable to find reported outcomes for gender dysphoric adolescents who suspend hormone treatments altogether post-GnRHa.

It is not clear how Bagshaw, Spittlehouse, and Russell’s literature review arrives at a position of recovery of BMD under both estrogen and endogenous sex hormones post GnRHa treatment withdrawal. A position distinct from both (a) the results reported in this article, and (b), the assessment of other clinicians who work in this area ( Davidge-Pitts and Clarke, 2019; Notini et al., 2020). Where the evidence review team includes clinicians, statements of professional or reputational conflicts of interest are useful for assessing a potential source of bias. Disappointingly, the claims of recovery of BMD have been repeated in news articles. Patients, their parents, and other clinicians may rely on these sources for medical decision-making. Bagshaw, Spittlehouse, and Russel are advised to clarify this position with supporting evidence.