- Systematic review update
- Open access
- Published:
The best ovarian reserve marker to predict ovarian response following controlled ovarian hyperstimulation: a systematic review and meta-analysis
Systematic Reviews volume 13, Article number: 303 (2024)
Abstract
Background
One of the most challenging aspects of treating patients facing primary ovarian insufficiency, especially those eligible for controlled ovarian hyperstimulation (COH), is the assessment of ovarian function and response to stimulatory protocols in terms of the number of oocytes retrieved. The lack of consistency between studies regarding the best parameter for response evaluation necessitates a comprehensive statistical analysis of the most commonly utilized ovarian reserve markers (ORM). This systematic review and meta-analysis aims to establish the optimal metric for assessing ovarian reserve among COH candidates.
Methods
The PubMed/MEDLINE, Scopus, and ISI Web of Science databases were searched until July 2024, with no date or language limitations. The Newcastle–Ottawa scale was used to evaluate the validity of anti-Mullerian hormone (AMH), antral follicle count (AFC), follicle-stimulating hormone (FSH), and estradiol (E2) in patients receiving controlled ovarian hyperstimulation. Studies on the diagnostic accuracy of ovarian reserve markers in predicting ovarian response to controlled ovarian hyperstimulation in assisted reproduction technology (ART) candidates were reviewed. The diagnostic odds ratio (DOR) was determined using the Der Simonian-Laird random effects model meta-analysis to assess the likelihood of detecting low or high ovarian responses in COH candidates. Cochran’s Q, and I-squared, were used to analyze between-study heterogeneity.
Results
This systematic review and meta-analysis included 26 studies including 17 cohorts, 4 case controls, and 5 cross-sectional studies. AFC and AMH demonstrated significant diagnostic performance compared to FSH and E2 in poor and high response category. AMH slightly outperformed AMH and had the highest logarithm of DOR for detecting poor [2.68 (95% CI 1.90, 3.45)] and high ovarian response [2.76 (95% CI 1.57, 3.95)]. However, it showed a high between-study heterogeneity (I2 = 95.65, Q = 189.65, p < 0.05).
Conclusions
AFC and AMH were the most accurate predictors of poor and high ovarian response to controlled ovarian hyperstimulation. However, further research is needed to develop models assessing the combined impact of AMH and AFC on ovarian response prediction.
Systematic review registration
PROSPERO CRD42021245380.
Background
Over the last two decades, numerous ovarian reserve tests have been developed to assess oocyte reserve and quality, as well as to predict IVF (in vitro fertilization)/ICSI success in terms of oocyte yield and pregnancy. Many of these tests are now regularly performed on infertile patients undergoing ART [1, 2]. Finding out a patient’s reproductive potential, ovarian reserve, and response to COH are the main objectives. A commonly used word, ovarian reserve describes the overall pool of follicles in the ovaries, including both dormant and actively developing follicles [3].
Premature ovarian insufficiency (POI) is a prevalent reproductive endocrine condition characterized by the loss of ovarian function prior to the age of 40 [4]. The rate at which a woman’s ovarian reserve depletes is governed by a combination of genetic and environmental factors, and it usually diminishes permanently over time. The rate of follicular depletion accelerates around the age of 37–38 years [5]. Reduced ovarian reserves (ROR) is a term used to describe the rapid decline in ovarian follicular reserve in women in their early thirties, which is a known cause of infertility in this age group [6]. In most cases, ROR affects women in their late 30 s and early 40 s, but younger women may also be affected [7]. When compared to other women of the same age, these women tend to have lower fertility and a poor ovarian response (POR) to ovarian stimulation [8]. Young women should be educated about their future fertility potential and provided adequate advice on the best and most timely medical treatment to help them conceive [9]. Although age is the best predictor of poor oocyte quality, ovarian reserve markers (ORMs) are more commonly utilized as a surrogate for oocyte quantity [10].
Various ovarian reserve tests have been used to assess ovarian reserve and predict response to COH [11].
Basal FSH is one of the first parameters evaluated, and its increase is associated with insufficient ovarian response to ovarian stimulation. However, a normal FSH does not exclude inadequate response because its peak occurs relatively in the late phases of diminishing ovarian reserve [5]. Hence, basal FSH cannot be the single definitive test to identify poor responders [12]. E2 is a steroid sex hormone produced by ovarian follicles, the liver, the adrenal cortex, the breast, and adipose tissue [13]. It is often used to monitor ovarian reserve and detect hypoestrogenism and menopause in women with amenorrhea or menstrual dysfunction [14].
AFC and AMH are other ORMs that can be used separately or in tandem to assess ovarian response in ovarian stimulation protocols [15, 16]. AMH is a glycoprotein that belongs to the transforming growth factor-β superfamily and is one of the most important markers for detecting POR. It is exclusively produced by granulosa cells of small and large preantral as well as small antral follicles in women [17]. A decrease in AMH secretion has been identified as the first sign of diminished ovarian reserve [18]. AFC refers to the cumulative number of follicles observed via ultrasonography in both ovaries during the initial phase of the menstrual cycle, specifically days 2–4. Antral follicles are follicles that have a maximum mean diameter of 2–10 mm when measured in the two-dimensional plane [10]. AFC is widely considered to be the most dependable approach for evaluating the ovarian response to ovarian stimulation. Nevertheless, the outcome relies on the operator’s expertise and the precision of the ultrasonogram [19].
Tests such as the clomiphene citrate challenge test (CCCT), gonadotropin-releasing hormone (GnRH) agonist, and inhibin B have limited predictive value because they cannot directly quantify ovarian reserve [20]. Given the high cost and inconvenience of frequent hospital visits, these provocative tests (CCCT and GnRH agonist) have been largely abandoned. The basal tests FSH, AFC, and AMH are now the most commonly used markers in clinical practice [21].
Literature reviews fail to predict the accuracy of each ORM in IVF outcoms quantitatively, and the previous systematic reviews and meta-analyses revealed controversial results. It was determined that several ORMs (AMH, FSH, AFC, and E2) were predictive of ovarian response in IVF patients [1]. Despite their noninvasive nature and cost-effectiveness, ORMs were not advised for evaluating the response to the initial IVF cycle with maximum ovarian stimulation [1]. Afterwards, Borer et al. tested the predictive power of AMH and AFC in IVF candidates. Both markers were equally sensitive (82%), but AFC’s specificity (82%) was slightly higher than AMHs (76%) [22]. Another systematic review and meta-analysis investigated the potential of ORMs such as AMH, AFC, and FSH to predict ovarian hyperstimulation syndrome (OHSS) in women. By using AMH and AFC along with age, the predictive accuracy for identifying a high response was improved, as seen by an increased area under the receiver operating characteristic (ROC) curve (AUC) compared to using age alone [23].
La Marca et al. summarized AMH and AFC efficacy in detecting poor, normal, and high ovarian responses in patients receiving IVF after ovarian stimulation. They found that AFC and AMH, which are now the most accurate indicators of ovarian reserve, were suitable for creating personalized COH protocols. These sensitive indicators can be employed interchangeably in clinical settings to accurately predict the full range of ovarian response [16]. Also, in 2015, a different meta-analysis that evaluated the capacity of AMH to predict the success of blastocyst implantation and clinical pregnancy in individuals undergoing IVF/ICSI found out that while AMH can be beneficial in fertility consultations, it is unable to predict clinical pregnancy in those with reduced ovarian reserve [24].
However, the inconsistent data have made it unclear which variables are more effective at diagnosing ovarian reserve and predicting IVF success rates. Since the last systematic review in 2015, many studies have been conducted on this topic. Therefore, the goal of this meta-analysis is to enhance our understanding of the most reliable ovarian reserve marker (ORM) test for predicting IVF outcomes in patients with premature ovarian insufficiency (POI).
However, the inconsistent data made it unclear which of these variables is more effective at diagnosing ovarian reserve and forecasting IVF success rates. Many studies have been conducted since the previous systematic review on this topic, which was published in 2015. As a result, the goal of this meta-analysis is to improve our understanding of the best ORM test for predicting IVF outcomes in patients dealing with POI.
Methods
This meta-analysis was designed according to the recommended reporting items for systematic reviews and meta-analysis (PRISMA), statement [25]. The study protocol was registered in the International Prospective register of Systematic Reviews (PROSPERO) database (registration code: CRD42021245380). The PICO (population, intervention, comparison, outcome) framework was utilized to establish research questions and create a search strategy. The purpose of this systematic review and meta-analysis was to examine (C) ORMs and determine which ORM had the highest specificity and sensitivity to ovarian response (O) in IVF/ICSI candidates (P) receiving COH (I).
Search strategy and data gathering
Two researchers (F. S. and M. L.) separately conducted comprehensive searches of the literature using MeSH (Medical Subject Headings) and non-MeSH terms (Supplemental Table 1), in PubMed/MEDLINE, Scopus, and ISI Web of Science databases to find original papers published up to July 2024. After removing duplicates, all of the imported studies into EndNote software (version X7, USA) were thoroughly reviewed. Their bibliographies were then thoroughly examined to verify that no potentially relevant papers were overlooked.
Study design and eligibility criteria
Three authors (F. S., M. L., and S. O.) separately screened titles and abstracts of all the imported articles, selected those eligible for the meta-analysis, and discussed all controversies with the statistician (S. J.). Cohort, case–control, cross-sectional, and interventional studies that measured the diagnostic accuracy of ORMs to predict ovarian response to COH in ART candidates were included. During the screening, animal or in vitro studies, non-English papers, and case reports were dismissed. Following the screening process, we download the full text of the all the papers that were selected for full-text review. For research articles that were published on non-open-access journals, we emailed the corresponding author(s) and requested the full text.
The ovarian reserve was defined as the total ovarian follicular pool including both primordial and growing follicles [3]. Research without a gold standard, true positives (TP), true negatives (TN), false positives (FP), false negatives (FN), sensitivity [TP/(TP + FN), specificity [TN/(FP + FN), and the AUC for a particular ORM were also disqualified, as were publications without the information required to compute the aforementioned parameters. Furthermore, articles that employed ORM for other purposes, such as predicting live birth, cycle cancellation, and the frequency of viable pregnancies rather than ovarian response, were excluded. Articles about women with iatrogenic ovarian insufficiency were eliminated. Also, studies that evaluated the value of less popular tests such as the LH:FSH ratio or the ovarian index, which are not commonly used in clinical practice, were dismissed.
Definitions
Reduced ovarian response (ROR) happens when assisted reproductive techniques (ART) fail to yield a successful fertility [26]. It is characterized by a state of hypergonadotropic hypogonadism associated with oligomenorrhea or amenorrhea [27]. A common procedure used before ART is called controlled ovarian hyperstimulation (COH) which uses exogenous gonadotrophins to stimulate multiple follicular maturation [28]. COH induces the growth of numerous follicles by administering external gonadotropins. At the same time, GnRH agonists or antagonists are given to prevent the release of natural gonadotropins and avoid early ovulation. Ovarian response to COH procedures is graded as normal, low, or high based on the number of recovered oocytes [29]. IVF oocyte retrieval rates are typically used to categorize ovarian response to COH regimens. The extraction of 8 to 15 oocytes is indicative of a normal ovarian response. To minimize the danger of OHSS and maximize the odds of successful fertilization and embryo development, this range is considered optimum (1). In most cases, a low ovarian response is indicated when fewer than five oocytes are recovered. People in this category, who are commonly known as “poor responders,” may have a reduced ovarian reserve (2). Retrieving more than 15 oocytes is usually indicative of a high ovarian response, which increases the risk of OHSS, and needs to be managed carefully (3).
Data extraction and excluded literature
Three authors (F. S., M. L., and A. K.) extracted the required data from eligible articles separately, including first author and publication date, country of research, study design, sample size, population studied, age range, and ovarian reserve markers used. Two-by-two tables were also used to calculate sensitivity, specificity, TP, TN, FP, FN, AUC, and the optimal cut-off point for each ORM. The required data were estimated using the ROC curve for papers that did not include information on sensitivity or specificity in their text. The studies that reported their data per cycle instead of per patient were excluded unless cycle numbers matched the number of participants. The Microsoft Excel 2019 software was used for data extraction.
Quality assessment
For the quality assessment of the included studies, the QUADAS-2 tool, a widely recognized framework specifically designed to evaluate the methodological quality of diagnostic accuracy studies, was employed [30]. The QUADAS-2 tool examines four key domains: patient selection, index test, reference standard, and flow and timing, each evaluated for potential risk of bias and concerns regarding applicability of study’s findings to the systematic review question or broader clinical settings and populations. The patient selection domain assesses how participants were chosen, focusing on the appropriateness of inclusion criteria and the risk of selection bias. The index test domain evaluates how the diagnostic test was conducted and interpreted, including the use of blinding and whether pre-specified thresholds were applied. The reference standard domain examines the objectivity and consistency of the outcome measure used to classify the target condition. The flow and timing domain reviews the sequence of participant assessments, including the timing between the index test and reference standard and any exclusions that might impact the study results.
Statistical analysis
To assess each ORM, the “metafor” R package of the R program (R-4.4.0 for macOS) was employed for meta-analysis. The logarithm of diagnostic odds ratio (Ln DOR) with 95% confidence intervals for each ORM was measured by a random-effects model. DOR evaluates that how much the odds of having a poor or high ovarian response increase when each ORM is positive. The Ln DOR was utilized to enhance normality and variance stabilization and facilitate clearer interpretation of the pooled results on the original DOR scale. Two separate meta-analyses were performed to evaluate each ORM’s efficacy in predicting poor and high ovarian responses. The Ln DOR combines sensitivity and specificity to evaluate the diagnostic efficacy of each ORM. The DOR compares the odds of a poor or high response. An Ln DOR greater than 1 indicates the test can discriminate between cases, with higher values reflecting better accuracy. An Ln DOR of 1 suggests no discriminatory ability of the ORM. Together with AUC, Ln DOR provides a fuller view of any of the ORMs’ performance, with an Ln DOR of 3.0, signaling strong diagnostic potential. The meta-analysis included forest plots that visually displayed the Ln DOR for ORMs in predicting poor and high ovarian response. These plots illustrated the differences in diagnostic accuracy observed across the selected studies.
Furthermore, this meta-analysis aggregated the findings of several studies that evaluated the diagnostic accuracy of ORMs using receiver operating characteristic (ROC) curve analysis. These curves were used to plot the true positive rate (sensitivity) against the false-positive rate (1-specificity) for each study, providing a visual summary of the diagnostic accuracy across all included studies. We calculated the average area under the ROC curve (AUC) that measures how well any ORM diagnostic test distinguishes between poor and high responders. It comes from the ROC curve, which plots sensitivity versus 1-specificity at different thresholds. An AUC of 1.0 means perfect discrimination between cases, while 0.9–1.0 indicates excellent accuracy. An AUC of 0.8–0.9 shows good reliability with some overlap, 0.7–0.8 is acceptable but with limitations, and 0.5 means the test is no better than chance (similar to an Ln DOR = 1). Clinically, these AUC values help determine the effectiveness of ORMs with an AUC of 0.85, indicating good utility for patient outcomes. Also, we calculated the degree of heterogeneity between studies, using metrics such as Tau, Tau-squared, I-squared (I2), and Cochran’s Q statistic. The Begg’s funnel plot was used in this meta-analysis to visually identify publication bias.
Subgroup analyses were performed in both poor and high response category based on study design, study country, and sample size to find out whether any of these variables account for potential differences in diagnostic accuracy. These subgroup analyses helped to identify any variations in diagnostic performance across different study characteristics. For the subgroup meta-analysis of sample size, we used the median sample size of the included studies as a cut-off point to categorize studies into two groups: “low” and “high” sample size. This data-driven dichotomization ensures that the studies are evenly distributed across the two categories, allowing us to analyze the impact of sample size on the diagnostic accuracy of ovarian reserve markers (ORMs). Studies with sample sizes below the median value were classified as “low sample size,” while those above the median were categorized as “high sample size.” We performed mixed-effects model meta-analysis separately for both groups to assess whether sample size influenced the diagnostic outcomes of ORMs in predicting ovarian response to COH. The median value was selected based on the distribution of sample sizes across the studies, ensuring an objective and balanced categorization. This approach avoided any potential biases that may occur using arbitrarily defined thresholds.
Results
After evaluating the titles and abstracts of all the imported articles, 65 studies were selected for full-text review and data extraction. Then, 39 papers were dismissed due to the following reasons: 16 studies lacked the required data for meta-analysis [31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46], 6 articles assessed the efficacy of uncommon ORMs (ovarian index, repeated antral follicular counts, elevated day 3 FSH:LH ratio, repeated clomiphene citrate challenge test) [47,48,49,50,51,52], 4 papers assessed POI patients in the absence of COH setting [53,54,55,56] and 7 publications examined the accuracy of ORM in the diagnosis of other outcomes such as pregnancy and OHSS without evaluating the ovarian response [55, 57,58,59,60,61,62], 3 studies considered the number of cycles instead of patients in assessing ORMs [63,64,65], and 3 papers only assessed the ORMs’ predictability for normal response instead of high or low response [66,67,68].
Eventually, 26 relevant articles with 21,584 participants were selected for this systematic review and meta-analysis (Fig. 1). From these, 13 studies evaluated ORMs’ accuracy in predicting poor and high ovarian response [69,70,71,72,73,74,75,76,77,78,79,80,81] following COH, and one article only assessed high response [82]. We found 12 studies that estimated the predictive accuracy of ORMs during COH only in women with poor ovarian response [83,84,85,86,87,88,89,90,91,92,93,94]. Only one study calculated its data per treatment cycle instead of the per participant [84]; however, it attributed cycles to each participant since total cycles and participants almost matched.
Study characteristics
Selected studies for this systematic review and meta-analysis were published from 2005 until 2024. This meta-analysis evaluated 4 case controls [75, 76, 84, 85], 17 cohorts [69,70,71, 74, 78,79,80,81,82, 87,88,89,90,91,92,93,94], and 5 cross-sectional studies [72, 73, 77, 83, 86] (Table 1).
Assessment of risk of bias in eligible studies
The studies reviewed using the QUADAS-2 framework reveal varying levels of bias across different domains, particularly in patient selection and the index test interpretation. While many studies demonstrated low risk of bias in terms of reference standards and flow and timing, a significant number had moderate to high risk due to issues like unclear selection criteria and lack of blinding in test interpretation. However, the majority of studies showed low applicability concerns, indicating that their findings are relevant to clinical practice. Despite some methodological limitations, the overall body of research provides a credible foundation for understanding diagnostic tests in predicting ovarian response and OHSS, with caution needed when generalizing results to broader populations (Table 2).
Meta-analysis
In women characterized by either poor or high response to ovarian stimulation, both AMH and AFC demonstrated superior diagnostic performance compared to FSH and E2 though AFC appeared slightly more effective. Similarly, in high response category, in contrast, FSH and E2 exhibited lower diagnostic accuracy (Table 3). According to the forest plots of ORMs in both response categories, both AMH and AFC demonstrated narrower confidence intervals and more consistent effect sizes compared to FSH and E2. In the high response group, particularly, AFC showed marginally higher diagnostic performance than AMH (Figs. 2 and 3). The ROC curve analysis further substantiated these findings; both AMH and AFC demonstrated higher area under the curve (AUC) values, indicating superior diagnostic accuracy. However, slight variations were observed between the two markers, particularly in the high response group, where AFC demonstrated marginally higher performance (Fig. 4).
The analysis of heterogeneity within the poor response group revealed substantial variability across studies, as indicated by a significant Cochran’s Q statistic and its associated p-values. High I2 percentages further underscored the heterogeneity, suggesting that differences in study outcomes might influence the meta-analysis results. The T2 and T-values additionally highlighted that this variability was not merely due to random error but pointed to actual differences in effect sizes across studies. Unlike the poor response group, the high response group showed lower levels of variability among the studies, reflected by reduced I2 values. The Cochran’s Q statistic supported the consistency of the results across studies, suggesting robustness in the findings for this population (Table 3). Funnel plots also illustrate the potential for publication bias across the included studies in both response categories (Fig. 5).
Subgroup meta-analysis
Study design
The subgroup analysis for the poor response category demonstrated notable differences between cohort and cross-sectional study designs. In cohort studies, both AMH and AFC markers exhibited strong diagnostic performance compared to FSH and E2. This indicates that cohort studies, which track outcomes over time, may provide more reliable indicators of ovarian reserve. Conversely, the cross-sectional subgroup showed a reduced performance of these markers, especially for FSH, suggesting that the nature of cross-sectional studies might not capture the dynamic aspects of ovarian reserve as effectively. Among the ORMs in the poor response category, AMH’s diagnostic accuracy was most influenced by study design, showing greater reliability in cohort studies than in cross-sectional analyses. In the high response category, cohort studies again resulted in higher diagnostic accuracy for both AMH and AFC compared to cross-sectional studies, with AFC performing particularly well in the cohort subgroup. Therefore, study design significantly impacts diagnostic outcomes, with AMH’s accuracy being notably affected, as it was more consistent in cohort studies (Table 4).
Study country
Due to most subgroups containing only a single study, it was not possible to draw definitive conclusions from the subgroup meta-analysis based on the study country. Therefore, only countries that evaluated the diagnostic accuracy of ORMs to predict either poor or high ovarian response at least twice were included in this subgroup meta-analysis. The goal of this analysis was to determine whether the high rate of heterogeneity among studies was attributable to the study country.
In the poor response group, the USA conducted two studies on AMH, which showed statistically nonsignificant results (Ln DOR: 1.95, p = 0.13) and high heterogeneity (I2: 92.31%, p < 0.05). China assessed AMH in three studies and found significant diagnostic accuracy (Ln DOR: 2.47, p < 0.05) despite considerable heterogeneity (I2: 80.79%, p < 0.05), suggesting variability in study results. Similarly, Palestine evaluated AMH accuracy for poor response prediction, demonstrating meaningful accuracy (Ln DOR: 3.69, p < 0.05) while displaying high heterogeneity (I2: 96.95%, p < 0.05). The UK measured AMH predictability for poor response, with AFC showing meaningful overall diagnostic accuracy (Ln DOR: 3.07, p < 0.05) and no considerable heterogeneity (I2: 67.43%, p < 0.05). In the high response category, China assessed AMH in three studies, yielding significant findings (Ln DOR: 1.71, p < 0.05) and high between-study heterogeneity (I2: 84.75%, p < 0.05). However, according to two studies conducted in Iran, AMH had significant diagnostic precision for high response prediction (Ln DOR: 2.43, p < 0.05) with low variability among studies (I2: 0.00%, p = 0.47). These findings suggest that country-specific factors, including population characteristics and study methodologies, can contribute to the heterogeneity observed in the meta-analysis.
Sample size
In the subgroup meta-analysis based on sample size, studies were categorized as small or large sample size based on the median sample size of all included studies (Table 5). Studies with fewer participants than the median were categorized as “small sample size,” while those with more participants were classified as “large sample size.” This data-driven approach ensured a balanced comparison between groups. In the poor ovarian response category, studies with “high” sample sizes demonstrated significant diagnostic accuracy for AMH, AFC, and FSH, with strong statistical significance (p < 0.0001). In contrast, studies with small sample sizes showed less consistent results, particularly affecting the diagnostic accuracy of FSH, suggesting that smaller sample sizes may introduce variability in the outcomes. In the high ovarian response category, AMH and AFC retained significant diagnostic performance across both small and large sample size studies, but the results were more consistent in the “large” sample size group (p < 0.0001). Conversely, small studies indicate a lack of data for AMH and AFC and a negative influence on FSH results. This suggests that in the poor response category, smaller studies may lead to less reliable conclusions, particularly affecting the outcomes for FSH. For the high response category, AMH retains its significance in large studies (p < 0.0001) but shows a substantial drop in small studies, signifying the impact of sample size. AFC’s results are robust in large studies but not reported for small studies, again indicating sample size influence. FSH in large studies shows a nonsignificant trend, further compromised in small studies through negative results, suggesting less reliable outputs. Across both response categories, AMH and AFC are notably affected by sample size, with larger studies providing more consistent conclusions. Similarly, FSH shows variability across both large and small studies, indicating that smaller sample sizes might lead to inconclusive or skewed data. Overall, this subgroup analysis highlights the importance of sufficient sample size in obtaining reliable meta-analysis outcomes, emphasizing that smaller studies can contribute to misleading heterogeneity and less definitive results (Table 5).
Discussion
Through this meta-analysis of the most frequently employed ovarian reserve markers (ORMs), we identified the optimal parameters for accurately determining whether the oocyte yield was inadequate or excessive. In summary, AFC emerged as marginally superior to AMH as the best indicator of ovarian response to COH in both groups. Both FSH and estradiol showed almost similar ability to identify cases of poor and excessive ovarian response, with FSH slightly outperforming estradiol in both groups.
Consistent with our research, Rosen et al. observed that among many indicators including AMH, AFC, FSH, and E2, only AMH and AFC could accurately characterize the histological pattern of diminished ovarian reserve. Although AMH was more economical, AFC was more accurate and less invasive [95]. Studies that examined multiple ovarian reserve indicators and POI progression found that AMH has the highest diagnostic sensitivity for predicting the severity of POI. Furthermore, the combination of AMH and AFC proved to be the most reliable method for identifying POI in its early stages [96]. According to a recent analysis, AMH is one of the greatest functional ovarian reserve measures since it shows the number of developing follicles capable of ovulation. Furthermore, it can predict poor or high response in COH candidates and indicate menopausal state [97].
However, further research is required to determine the internal and environmental elements that interact with the synthesis and physiology of AMH, in order to enhance its effectiveness in therapeutic applications [97]. While age affects IVF outcomes in terms of ovarian response to GnRH protocols, AMH and AFC provide a more precise assessment of oocyte retrieval in COH. After evaluating the interaction of AFC and AMH, Keane et al. reported that both AFC and AMH demonstrated a positive relationship with oocyte number and, when combined, offered the highest accuracy for predicting ovarian response in IVF patients [98].
AMH is a member of the transforming growth factor β (TGF-β) family that acts via a specific receptor called AMH receptor type II [99]. During human embryogenesis, AMH signaling pathways play an important role in the maturation of GnRH neurons, as their abnormalities may lead to infertility later in life [100]. Anti-Müllerian hormone (AMH) is specifically released by developing follicles and can be used to accurately determine if the ovarian reserve is depleted. This hormone remains unaffected by the hypothalamic–pituitary–gonadal axis and has only minimal and insignificant changes during each ovarian cycle. During menopause, serum AMH levels progressively decline until they are undetectable [101]. AMH levels gradually rise as dormant follicles mature into growing follicles, eventually reaching the preantral and antral stages. Antral and preantral follicles secrete AMH, predominantly hindering the maturation of primordial follicles [101]. AMH also inhibits intraovarian pathways implicated in follicular atresia, as seen by its absence in follicles undergoing degeneration [102]. Hayes et al. found that while in vivo AMH inhibits follicular growth and ovulation in mouse models, it also prevents follicular degeneration by coordinating FSH function and follicular development through miR-181a and miR-181b miRNAs. Overall, AMH might have therapeutic benefits in mouse models through enhancing oocyte retrieval [103].
FSH has traditionally been the ovarian reserve biomarker of choice. Since the late 1980s, it has indicated hypothalamic-pituitary–gonadal axis functioning. The World Health Organization classified ovarian dysfunction based on serum FSH and estradiol levels [104]. Follicular impairment due to FSH receptor failure and steroid-cell autoantibodies produce gradual follicular pool degeneration in POI, which justifies the use of FSH in identifying these individuals [105]. However, FSH on day 3 of the menstrual cycle exhibited both inter- and intra-cyclic variations, which questioned its status as the preferred ORM, thus shifting the focus to other tests [106]. Similar to this study, all the mentioned AMH physiology and its association with antral follicles make AMH and AFC more eligible ORMs than FSH to predict the outcome in patients suffering from PCOS, POI, and other fertility-related disorders [107].
To our knowledge, this is the first systematic review and meta-analysis that has assessed all available information on ovarian reserve markers (ORMs) and how well they predict ovarian response in IVF patients. One of the primary strengths is the robust comparison of multiple markers across distinct subgroups classified by ovarian response, study design, and country, providing a comprehensive view of diagnostic efficacy. Much of the prior literature has concentrated on how ORMs relate to specific IVF outcomes. This work adds a new dimension to clinical assessment tools by quantifying the diagnostic performance of these indicators in predicting ovarian response. Clinicians can optimize medication dosages and minimize risks like ovarian hyperstimulation syndrome (OHSS) by tailoring COH procedures to individual patients when they know which ORMs (e.g., AMH, AFC) are the best predictors of ovarian response. By early intervention and consideration of alternate treatments like oocyte donation or modified stimulation protocols, patients whose ovarian responses are expected to be poor can be identified. By decreasing the need for many cycles and the costs that come with them, accurate prediction of ovarian response can lead to more efficient use of medical resources. In addition, reliable ovarian response prediction helps control patient expectations and decreases the emotional and financial strain associated with IVF therapy.
Since 2006, comprehensive reviews and meta-analyses have been performed to assess the efficacy of several ORMs in predicting ovarian response to hyperstimulation [1, 16, 22,23,24]. Incorporating the most current studies that analyzed multiple ORMs in IVF candidates strengthened this hypothesis. There were, however, certain caveats to our research. Due to a lack of relevant studies, this meta-analysis did not include other ovarian reserve tests including LH and inhibin B. Depending on the study, it may have been necessary to manually compute the sensitivity and specificity of an ovarian reserve marker using the ROC curve. Selected studies used a variety of AMH tests, which could increase the risk of bias in interpreting results. Also, disparities in research methods, sample populations, and assessment tools might account for this variability. Only studies published in English were included, which may introduce language bias and exclude significant research published in other languages. Some studies did not report sensitivity or specificity, necessitating the use of ROC curves for approximation, which may include estimation errors. The funnel plot analysis revealed potential publication bias, implying that papers yielding positive outcomes may be overrepresented. Furthermore, differences in ORM definitions and measures between studies may impact the accuracy and generalizability of the pooled values. The significant heterogeneity observed, especially in the poor response group, indicates variability in study outcomes that could affect the overall conclusions. Additionally, subgroup analyses based on study design and country revealed that diagnostic performance could be heavily influenced by these factors, with cohort studies generally providing more reliable indicators than cross-sectional studies. The potential for publication bias, as hinted at by funnel plots, also poses a limitation, possibly skewing the results towards more favorable outcomes reported in published studies. To address these limitations and build on the current findings, future studies should aim to standardize methodologies and reporting to reduce heterogeneity. Larger sample sizes in studies are crucial to provide more precise effect estimates and reduce the impact of random variations. Additionally, longitudinal cohort studies should be prioritized over cross-sectional designs to capture the dynamic aspects of ovarian reserve more accurately. Collaborative efforts across different countries can help mitigate the impact of country-specific factors and contribute to more generalizable conclusions. Finally, robust strategies to minimize publication bias, such as pre-registering study protocols and including unpublished data, can enhance the reliability and validity of future meta-analyses.
Conclusions
In summary, this systematic review and meta-analysis provide valuable insights into enhancing clinical management and evidence-based practices in IVF. AMH and AFC show the highest predictive accuracy for ovarian response in patients with low ovarian reserve, outperforming other markers. These insights can help optimize COH protocols and be beneficial for POI patients considering oocyte cryopreservation. Additionally, investigating the effectiveness of various hormonal and genetic indicators in predicting egg quality and quantity can reduce treatment failures and complications. Future research should focus on distinguishing ovarian response from the likelihood of successful pregnancy by creating sensitive assays to measure oocyte quality.
Data availability
The datasets used and analyzed during the current study are available from the corresponding author on reasonable request.
Abbreviations
- IVF:
-
In vitro fertilization
- ART:
-
Assisted reproductive technology
- ROR:
-
Reduced ovarian reserves
- POR:
-
Poor ovarian response
- ORM:
-
Ovarian reserve marker
- COH:
-
Controlled ovarian hyperstimulation
- FSH:
-
Follicle-stimulating hormone
- LH:
-
Luteinizing hormone
- E2:
-
Estradiol
- AFC:
-
Antral follicular count
- AMH:
-
Anti-Mullerian hormone
- CCCT:
-
Clomiphene citrate challenge test
- GnRH:
-
Gonadotropin hormone-releasing hormone
- ROC:
-
Receiver operating characteristic
- AUC:
-
Area under curve
- PRISMA:
-
Preferred Reporting Items for Systematic reviews and Meta-Analyses
- PROSPERO:
-
International Prospective Register of Systematic Reviews
- TP:
-
True positive
- TN:
-
True negative
- FP:
-
False positive
- FN:
-
False negative
- NOS:
-
Newcastle-Ottawa scale
- POI:
-
Premature ovarian insufficiency
References
Broekmans FJ, et al. A systematic review of tests predicting ovarian reserve and IVF outcome. Hum Reprod Update. 2006;12(6):685–718.
Wang X, et al. Evaluation of ovarian reserve tests and age in the prediction of poor ovarian response to controlled ovarian stimulation-a real-world data analysis of 89,002 patients. Front Endocrinol (Lausanne). 2021;12: 702061.
Medicine, P.C.o.t.A.S.f.R. Diagnostic evaluation of the infertile male: a committee opinion. Fertility and sterility. 2015;103(3):e18–25.
Chon SJ, Umair Z, Yoon M-S. Premature ovarian insufficiency: past, present, and future. Frontiers in cell and developmental biology. 2021;9: 672890.
Jirge PR. Poor ovarian reserve. J Hum Reprod Sci. 2016;9(2):63–9.
Man L, et al. Ovarian reserve disorders, can we prevent them? A review. Int J Mol Sci. 2022;23(23):15426.
Jirge PR. Poor ovarian reserve. Journal of human reproductive sciences. 2016;9(2):63.
Medicine, P.C.o.t.A.S.f.R. Fertility and sterility. Fertility and sterility. 2015;103(3):e9–17.
Koert, E., et al., Women’s perceptions of fertility assessment and counselling 6 years after attending a Fertility Assessment and Counselling Clinic in Denmark. Hum Reprod Open, 2020. 2020(4): p. hoaa036.
Tal R, Seifer DB. Ovarian reserve testing: a user’s guide. Am J Obstet Gynecol. 2017;217(2):129–40.
Jirge PR. Ovarian reserve tests. Journal of human reproductive sciences. 2011;4(3):108.
Galey-Fontaine J, et al. Age and ovarian reserve are distinct predictive factors of cycle outcome in low responders. Reprod Biomed Online. 2005;10(1):94–9.
Ryan KJ. Biochemistry of aromatase: significance to female reproductive physiology. Can Res. 1982;42(8 Supplement):3342s–4s.
Lee T-H, et al. Serum anti-Müllerian hormone and estradiol levels as predictors of ovarian hyperstimulation syndrome in assisted reproduction technology cycles. Hum Reprod. 2008;23(1):160–7.
Aflatoonian A, et al. Effects of intraovarian injection of autologous platelet-rich plasma on ovarian rejuvenation in poor responders and women with primary ovarian insufficiency. Reprod Sci. 2021;28(7):2050–9.
La Marca A, Sunkara SK. Individualization of controlled ovarian stimulation in IVF using ovarian reserve markers: from theory to practice. Hum Reprod Update. 2014;20(1):124–40.
La Marca A, et al. Anti-Müllerian hormone (AMH) as a predictive marker in assisted reproductive technology (ART). Hum Reprod Update. 2010;16(2):113–30.
Jamil Z, et al. Anti-Mullerian hormone: above and beyond conventional ovarian reserve markers. Dis Markers. 2016;2016(1):5246217.
La Marca A, et al. Serum anti-Mullerian hormone throughout the human menstrual cycle. Hum Reprod. 2006;21(12):3103–7.
Jayaprakasan K, et al. Establishing the intercycle variability of three-dimensional ultrasonographic predictors of ovarian reserve. Fertil Steril. 2008;90(6):2126–32.
Sharara FI, Scott RT Jr, Seifer DB. The detection of diminished ovarian reserve in infertile women. Am J Obstet Gynecol. 1998;179(3):804–12.
Broer S, et al. AMH and AFC as predictors of excessive response in controlled ovarian hyperstimulation: a meta-analysis. Hum Reprod Update. 2011;17(1):46–54.
Broer S.L, et al. Prediction of an excessive response in in vitro fetilization from patient characteristics and ovarian reserve test and comparison in subgroups: an individual patient data meta-analysis. Fertility and sterility. 2013;100(2):420-429. e7.
Tal, R., et al., AntiMüllerian hormone as predictor of implantation and clinical pregnancy after assisted conception: a systematic review and meta-analysis. Fertility and sterility, 2015. 103(1): p. 119–130. e3.
Moher D, et al. PRISMA statement. Epidemiology. 2011;22(1):128.
Cohen J, Chabbert-Buffet N, Darai E. Diminished ovarian reserve, premature ovarian failure, poor ovarian responder—a plea for universal definitions. J Assist Reprod Genet. 2015;32(12):1709–12.
Gunning M, et al. Premature ovarian insufficiency. Female Reprod Dysfunction. 2020:287–307.
Altmäe S, et al. Genetic predictors of controlled ovarian hyperstimulation: where do we stand today? Hum Reprod Update. 2011;17(6):813–28.
Cohlen BJ, et al. Controlled ovarian hyperstimulation and intrauterine insemination for treating male subfertility: a controlled study. Human reproduction (Oxford, England). 1998;13(6):1553–8.
Whiting PF, et al. QUADAS-2: a revised tool for the quality assessment of diagnostic accuracy studies. Ann Intern Med. 2011;155(8):529–36.
Amer Abed, F., R. Ezzat Maroof, and U.M.A. Al-Nakkash, Comparing the diagnostic accuracy of anti-Mullerian hormone and follicle stimulating hormone in detecting premature ovarian failure in Iraqi women by ROC analysis. Rep Biochem Mol Biol, 2019. 8(2): p. 126–131.
Lunding SA, et al. AMH as predictor of premature ovarian insufficiency: a longitudinal study of 120 Turner syndrome patients. J Clin Endocrinol Metab. 2015;100(7):E1030–8.
Alipour F, et al. Comparison of specificity and sensitivity of AMH and FSH in diagnosis of premature ovarian failure. Dis Markers. 2015;2015: 585604.
Reichman DE, Goldschlag D, Rosenwaks Z. Value of antimüllerian hormone as a prognostic indicator of in vitro fertilization outcome. Fertil Steril. 2014;101(4):1012-1018.e1.
Kallio S, et al. Anti-Mullerian hormone as a predictor of follicular reserve in ovarian insufficiency: special emphasis on FSH-resistant ovaries. Hum Reprod. 2012;27(3):854–60.
Nikbakht R, Borhani M, Sarvestani GY. Evaluation of ovarian reserve by measurement of the serum levels of anti-mullerian hormone and follicle-stimulating hormone in intracytoplasmic sperm injection cycles. International Journal of Fertility and Sterility. 2010;4(2):57–60.
Kaya C, Pabuccu R, Satiroglu H. Serum antimullerian hormone concentrations on day 3 of the in vitro fertilization stimulation cycle are predictive of the fertilization, implantation, and pregnancy in polycystic ovary syndrome patients undergoing assisted reproduction. Fertil Steril. 2010;94(6):2202–7.
Aslan K, et al. Which is more predictive ovarian sensitivity marker if there is discordance between serum anti-Müllerian hormone levels and antral follicle count? A retrospective analysis J Obstet Gynaecol. 2022;42(5):1461–6.
Barrière P, et al. Added value of anti-Müllerian hormone serum concentration in assisted reproduction clinical practice using highly purified human menopausal gonadotropin (HP-hMG). J Gynecol Obstet Hum Reprod. 2022;51(2): 102289.
Heidary Z, et al. The association of AMH level with the number and quality of oocytes in women undergoing IVF/ICSI: a single-center study. Journal of Reproduction and Infertility. 2024;25(1):38–45.
Jabarpour M, et al. High anti-Müllerian hormone strongly correlates with reproductive outcomes in women undergoing assisted reproduction. Clin Lab. 2023;69(12).
Permadi W, et al. Correlation of anti-Mullerian hormone level and antral follicle count with oocyte number in a fixed-dose controlled ovarian hyperstimulation of patients of in vitro fertilization program. International Journal of Fertility and Sterility. 2021;15(1):40–3.
Sinha S, Sharan A, Sinha S. Anti-Mullerian hormone as a marker of ovarian reserve and function. Cureus. 2022;14(9): e29214.
Sun PP, et al. The relationship between serum FSH level and ovarian response during controlled ovarian stimulation. Ginekol Pol. 2023;94(6):470–5.
Vagios S, et al. Response to ovulation induction treatments in women with polycystic ovary syndrome as a function of serum anti-Müllerian hormone levels. J Assist Reprod Genet. 2021;38(7):1827–33.
Zajicek M, et al. Evaluation of ovarian reserve in young females with non-iatrogenic ovarian insufficiency to establish criteria for ovarian tissue cryopreservation. Reprod Biomed Online. 2023;47(1):102–9.
El-Mazny A, Abou-Salem N. Anti-Müllerian hormone and antral follicle count for prediction of ovarian stimulation response in polycystic ovary syndrome. Gynecol Endocrinol. 2013;29(9):826–9.
Bancsi L, et al. Impact of repeated antral follicle counts on the prediction of poor ovarian response in women undergoing in vitro fertilization. Fertil Steril. 2004;81(1):35–41.
Mukherjee T, et al. An elevated day three follicle-stimulating hormone:luteinizing hormone ratio (FSH:LH) in the presence of a normal day 3 FSH predicts a poor response to controlled ovarian hyperstimulation. Fertil Steril. 1996;65(3):588–93.
Hendriks DJ, et al. Repeated clomiphene citrate challenge testing in the prediction of outcome in IVF: a comparison with basal markers for ovarian reserve. Hum Reprod. 2005;20(1):163–9.
Xu X, et al. Development and validation of a prediction model for unexpected poor ovarian response during IVF/ICSI. Front Endocrinol. 2024;15:1340329.
Yan Y, et al. Clinical features and management of suboptimal ovarian response during in vitro fertilization and embryo transfer: analysis based on a retrospective cohort study. Front Endocrinol. 2022;13:938926.
Jacobs MH, et al. A multicentre evaluation of the Elecsys ® anti-Müllerian hormone immunoassay for prediction of antral follicle count. Reprod Biomed Online. 2019;38(5):845–52.
Sahmay S, et al. Elevated LH levels draw a stronger distinction than AMH in premature ovarian insufficiency. Climacteric. 2014;17(2):197–203.
Fouda F, et al. Anti-mullerian hormone level is a reliable predictor for cycle cancellation in ICSI. Middle East Fertility Society Journal. 2010;15(3):194–9.
Rashad NM, et al. Anti-Müllerian hormone: predictor of premature ovarian insufficiency in Egyptian women with autoimmune thyroiditis. Middle East Fertility Society Journal. 2018;23(4):286–91.
Saikumar P, et al. Anti Mullerian hormone: a potential marker for recruited non growing follicle of ovarian pool in women with polycystic ovarian syndrome. J Clin Diagn Res. 2013;7(9):1866–9.
La Marca A, et al. Anti-Müllerian hormone-based prediction model for a live birth in assisted reproduction. Reprod Biomed Online. 2011;22(4):341–9.
Gleicher N, Weghofer A, Barad DH. Anti-Müllerian hormone (AMH) defines, independent of age, low versus good live-birth chances in women with severely diminished ovarian reserve. Fertil Steril. 2010;94(7):2824–7.
Burks HR, et al. Can highly sensitive antimüllerian hormone testing predict failed response to ovarian stimulation? Fertil Steril. 2015;104(3):643–8.
Lee TH, et al. Serum anti-Mullerian hormone and estradiol levels as predictors of ovarian hyperstimulation syndrome in assisted reproduction technology cycles. Hum Reprod. 2008;23(1):160–7.
Aydin T, et al. The association between anti-Müllerian hormone and IVF-ICSI outcome in poor responder patients performing long protocol. Clin Exp Obstet Gynecol. 2015;42(5):663–5.
Asada Y, et al. Performance of anti-Müllerian hormone (AMH) levels measured by Beckman Coulter Access AMH assay to predict oocyte yield following controlled ovarian stimulation for in vitro fertilization. Reproductive Medicine and Biology. 2019;18(3):273–7.
Bosch E, et al. Prediction of ovarian response using the automated Elecsys anti-Müllerian hormone assay in gonadotrophin-releasing hormone antagonist cycles. Reprod Biomed Online. 2023;46(2):295–301.
Peralta S, et al. Antral follicle count measured at down-regulation as predictor of ovarian response and cumulative live birth: single center analysis including 2731 long agonist IVF cycles. Gynecol Endocrinol. 2022;38(12):1079–86.
Hochberg A, et al. Significance of serum AMH and antral follicle count discrepancy for the prediction of ovarian stimulation response in Poseidon criteria patients. J Assist Reprod Genet. 2024;41(3):717–26.
Jabir Edan B, Mahdi Kadhim H, Resheed Behayaa H. Sensitivity and specificity of measuring anti-Müllerian hormone and follicle stimulating hormone levels in predicting response to ovarian stimulation in infertile women. J Babol Univ Med Sci. 2024;26:0.
Razafintsalama-Bourdet M, et al. Random antral follicle count performed on any day of the menstrual cycle has the same predictive value as AMH for good ovarian response in IVF cycles. J Gynecol Obstet Hum Reproduct. 2022;51(1):102233.
Zheng H, et al. Ovarian response prediction in controlled ovarian stimulation for IVF using anti-Mullerian hormone in Chinese women: a retrospective cohort study. Medicine (Baltimore). 2017;96(13): e6495.
Lerman T, et al. Ovarian response to 150 µg corifollitropin alfa in a GnRH-antagonist multiple-dose protocol: a prospective cohort study. Reprod Biomed Online. 2017;34(5):534–40.
Li R, et al. Anti-Müllerian hormone for prediction of ovarian response in Chinese infertile women undergoing IVF/ICSI cycles: a prospective, multi-centre, observational study. Reprod Biomed Online. 2016;33(4):506–12.
Aghssa MM, et al. Optimal cutoff value of basal anti-mullerian hormone in iranian infertile women for prediction of ovarian hyper-stimulation syndrome and poor response to stimulation female fertility. Reprod Health. 2015;12:1–8.
Mehrafza M, et al. Anti-müllerian hormone as a predictor of IVF treatment. Tehran University Medical Journal. 2012;70(6):371–7.
Nardo LG, et al. Circulating basal anti-Müllerian hormone levels as predictor of ovarian response in women undergoing ovarian stimulation for in vitro fertilization. Fertil Steril. 2009;92(5):1586–93.
Riggs RM, et al. Assessment of ovarian reserve with anti-Mullerian hormone: a comparison of the predictive value of anti-Mullerian hormone, follicle-stimulating hormone, inhibin B, and age. Am J Obstet Gynecol. 2008;199(2):202.e1-8.
Laqqan MM, Yassin MM. Predictive factors of ovarian response to GnRH antagonist stimulation protocol: AMH and age are potential candidates. Middle East Fertility Society Journal. 2021;26(1):1–11.
Peluso C, et al. Are ovarian reserve tests reliable in predicting ovarian response? Results from a prospective, cross-sectional, single-center analysis. Gynecol Endocrinol. 2021;37(4):358–66.
Neves AR, et al. The performance of the Elecsys® anti-Müllerian hormone assay in predicting extremes of ovarian response to corifollitropin alfa. Reprod Biomed Online. 2020;41(1):29–36.
Lee RWK, et al. Ovarian biomarkers predict controlled ovarian stimulation for in vitro fertilisation treatment in Singapore. Singapore Med J. 2020;61(9):463–8.
Laqqan MM, Yassin MM. Anti-Mullerian hormone and antral follicle count predict ovarian response in women less than 45 years following GnRH antagonist multiple-dose protocol. Asian Pacific Journal of Reproduction. 2022;11(5):208–16.
Sun X, et al. Comparison of the predictive capability of antral follicle count vs. the anti-Müllerian hormone for ovarian response in infertile women. Front Endocrinol. 2022;13:862733.
Izhar R, et al. Antral follicle count and anti-Müllerian hormone level as predictors of ovarian hyperstimulation syndrome in women with polycystic ovarian syndrome undergoing controlled ovarian stimulation. Journal of Ultrasonography. 2021;21(86): e200.
Aydın GA, et al. Assessment of the relationship of basal serum anti-mullerian hormone levels with oocyte quality and pregnancy outcomes in patients undergoing ICSI. Iranian Journal of Reproductive Medicine. 2015;13(4):231–6.
Durmusoglu F, et al. Combining cycle day 7 follicle count with the basal antral follicle count improves the prediction of ovarian response. Fertil Steril. 2004;81(4):1073–8.
Jayaprakasan K, et al. A prospective, comparative analysis of anti-Müllerian hormone, inhibin-B, and three-dimensional ultrasound determinants of ovarian reserve in the prediction of poor response to controlled ovarian stimulation. Fertil Steril. 2010;93(3):855–64.
Martínez F, et al. Is AMH useful to reduce low ovarian response to GnRH antagonist protocol in oocyte donors? Gynecol Endocrinol. 2013;29(8):754–7.
Baker VL, et al. Multicenter evaluation of the access AMH antimüllerian hormone assay for the prediction of antral follicle count and poor ovarian response to controlled ovarian stimulation. Fertil Steril. 2018;110(3):506-513.e3.
Muttukrishna, S., et al., Antral follicle count, anti-mullerian hormone and inhibin B: predictors of ovarian response in assisted reproductive technology? BJOG: An International Journal of Obstetrics and Gynaecology, 2005. 112(10): p. 1384–1390.
Kasapoglu I, et al. Are all antral follicles the same? Size of antral follicles as a key predictor for response to controlled ovarian stimulation. J Obstet Gynaecol. 2022;42(3):461–6.
Tremellen KP, et al. Anti-mullerian hormone as a marker of ovarian reserve. Aust N Z J Obstet Gynaecol. 2005;45(1):20–4.
Dermolo M, Ansa M, Siferih M. Ovarian response to controlled stimulation and its predictors in a limited-resource setting. BMC Women's Health. 2024;24(1):279.
Esteves SC, et al. Antral follicle count and anti-Müllerian hormone to classify low-prognosis women under the POSEIDON criteria: a classification agreement study of over 9000 patients. Hum Reprod. 2021;36(6):1530–41.
La Marca A, et al. Predicting the total number of retrieved oocytes following double ovarian stimulation (DuoStim). Hum Reprod. 2023;38(9):1784–8.
Laqqan MM, Yassin MM. Investigation of the predictive factors of diminished ovarian reserve in women aged less than 40 years and undergoing ICSI cycle. Reprod Sci. 2023;30(3):873–82.
Rosen MP, et al. A characterization of the relationship of ovarian reserve markers with age. Fertil Steril. 2012;97(1):238–43.
Jiao X, et al. Ovarian reserve markers in premature ovarian insufficiency: within different clinical stages and different etiologies. Front Endocrinol. 2021;12:601752–601752.
Moolhuijsen LME, Visser JA. Anti-Müllerian hormone and ovarian reserve: update on assessing ovarian function. J Clin Endocrinol Metab. 2020;105(11):3361–73.
Keane K, et al. Specific ranges of anti-Mullerian hormone and antral follicle count correlate to provide a prognostic indicator for IVF outcome. Reprod Biol. 2017;17(1):51–9.
Hart KN, et al. Structure of AMH bound to AMHR2 provides insight into a unique signaling pair in the TGF-β family. Proc Natl Acad Sci. 2021;118(26): e2104809118.
Malone SA, et al. Defective AMH signaling disrupts GnRH neuron development and function and contributes to hypogonadotropic hypogonadism. Elife. 2019;8:e47198.
Visser JA, et al. Anti-Müllerian hormone: an ovarian reserve marker in primary ovarian insufficiency. Nat Rev Endocrinol. 2012;8(6):331–41.
Visser JA, et al. Increased oocyte degeneration and follicular atresia during the estrous cycle in anti-Müllerian hormone null mice. Endocrinology. 2007;148(5):2301–8.
Hayes E, et al. Intra-cellular mechanism of anti-Müllerian hormone (AMH) in regulation of follicular development. Mol Cell Endocrinol. 2016;433:56–65.
Broer SL, et al. Anti-Müllerian hormone: ovarian reserve testing and its potential clinical implications. Hum Reprod Update. 2014;20(5):688–701.
Welt CK. Primary ovarian insufficiency: a more accurate term for premature ovarian failure. Clin Endocrinol. 2008;68(4):499–509.
Bukman A, Heineman M. Ovarian reserve testing and the use of prognostic models in patients with subfertility. Hum Reprod Update. 2001;7(6):581–90.
Sakaguchi K, et al. Relationships between the antral follicle count, steroidogenesis, and secretion of follicle-stimulating hormone and anti-Müllerian hormone during follicular growth in cattle. Reprod Biol Endocrinol. 2019;17(1):88.
Acknowledgements
Not applicable
Funding
None.
Author information
Authors and Affiliations
Contributions
FS conceptualized and designed the study, drafted and critically reviewed the manuscript, and approved the final manuscript as submitted. ML conceptualized and designed the study, critically reviewed the manuscript, and approved the final manuscript as submitted. ASA designed the study, critically reviewed the manuscript, and approved the final manuscript as submitted. SJ conceptualized the study, designed the study, critically reviewed the manuscript, and approved the final manuscript as submitted. AK, ZA, and HA drafted the manuscript and approved the final manuscript as submitted. SA drafted and critically reviewed the manuscript and approved the final manuscript as submitted. All authors read and approved the final manuscript.
Corresponding authors
Ethics declarations
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Salemi, F., Jambarsang, S., Kheirkhah, A. et al. The best ovarian reserve marker to predict ovarian response following controlled ovarian hyperstimulation: a systematic review and meta-analysis. Syst Rev 13, 303 (2024). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s13643-024-02684-0
Received:
Accepted:
Published:
DOI: https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s13643-024-02684-0