Semen
characteristics
WHO
1980
WHO
1987
WHO
1992
WHO
1999
WHO
2010a
Volume (mL)
ND
≥ 2
≥ 2
≥ 2
1.5
Sperm count (106/mL)
20–200
≥ 20
≥ 20
≥ 20
15
Total sperm count (106)
ND
≥ 40
≥ 40
≥ 40
39
Total motility (% motile)
≥ 60
≥ 50
≥ 50
≥ 50
40
Progressive motilityb
≥ 2c
≥ 25 %
≥ 25 % (grade a)
≥ 25 % (grade a)
32 % (a + b)
Vitality (% alive)
ND
≥ 50
≥ 75
≥ 75
58
Morphology (% normal forms)
80.5
≥ 50
≥ 30d
(14)e
4f
Leukocyte count (106/mL)
< 4.7
< 1.0
< 1.0
< 1.0
< 1.0
In its latest fifth edition (WHO 2010), the semen analysis reference values are markedly lower than those of previous editions. Much debate has taken place thereafter, and a series of reports has questioned the validity of the newly released reference values [5–9].
In this chapter, I discuss the controversy surrounding the new 2010 WHO criteria for semen analyses. First, I point out the importance and limitations of the routine semen analysis in the workup of male infertility . Then, I present the 2010 WHO cutoff values for human semen characteristics and how they compare with previous references. Third, I critically discuss the methods used for generating these new limits and present our hypotheses to explain these lowered limits. Subsequently, I analyze the likely effect of the 2010 WHO cutoff values on the clinical management of men with unexplained infertility. Finally, I propose a practical approach to report semen analysis results for those contemplating adopting the 2010 WHO cutoff values for semen characteristics.
Importance and Limitations of Semen Analysis for Male Infertility Evaluation
Semen analysis is the most widely used biomarker to predict the male fertility potential [10]. It provides information on the functional status of the seminiferous tubules, epididymis and accessory sex glands, and its results are often taken as a surrogate measure of a man’s ability to father a pregnancy. Routine semen analyses include: (a) physical characteristics of semen , including liquefaction, viscosity, pH, color, and odor; (b) specimen volume; (c) sperm concentration; (d) sperm motility and progression; (e) sperm morphology; (f) leukocyte quantification; and (g) fructose detection in cases where no spermatozoa are found and ejaculate volume is low [11].
Owed to its widespread availability, health care providers usually use semen analysis alone as the main marker to determine male partner referral for further investigation. However, semen characteristics that discriminate between infertile and fertile men are not well defined and results are normal in up to 40 % of those suffering from infertility [12–14]. Not only sperm production varies widely in same men but also conventional semen analysis neither tests for the diverse array of biological properties spermatozoa express as an eminently specialized cell nor accounts for putative sperm dysfunctions such as immature chromatin or fragmented DNA. In addition, there is a wide variation on how laboratories perform semen analysis . In this section, I will continue to discuss the major drawbacks of semen analysis for male infertility evaluation .
Biological Intraindividual Variability of Semen Parameters
The semen parameters from same individuals are highly variable. Many conditions including the duration of ejaculatory abstinence, activity of the accessory sex glands, analytical errors, and inherent biological variability account for such discrepancies [15–18]. In one study, the within-subject variability of 20 healthy subjects assessed over a 10-week follow-up ranged from 10.3 to 26.8 % [15]. Sperm concentration showed the highest within-subject variation (26.8 %), followed by morphology (19.6 %) and progressive motility (15.2 %) whereas vitality had the lowest variation (10.3 %). The utility of population-based reference values is related to the individual variability of a particular analyte. Reference values of analytes with attributable individuality, including the ones routinely assessed in the semen , are generally of limited utility. It means that individual subjects could present results that were very unusual for them, and such results might have been accounted when establishing the reference thresholds. For the aforesaid reasons and other uncontrolled factors such as the regression toward the mean, it is impossible to take the results of a single-semen specimen as a surrogate for a man’s ability to father a child unless when at extreme low levels [19]. Regression toward the mean is the phenomenon in which a variable would tend to be closer to the average on a second measurement if it was extreme in its first measurement. This uncontrolled factor should be contemplated when designing studies involving semen analysis because following an extreme random event the next random event is less likely to be extreme. It has been shown that sperm concentration and motility were significantly higher in the second test in men with previous abnormal semen analyses results [20]. Regression toward the mean can be reduced in its magnitude by using means of multiple samples (two or three in the case of semen analysis). Hence, it is prudent that clinicians request at least two semen specimens following 2–5 days of ejaculatory abstinence to allow a better understanding of the baseline semen quality status of a given individual [21–23].
Sperm Dysfunctions Not Tested in the Routine Semen Analysis
Up to 30 % of men with difficulties to father a child have no demonstrable abnormalities after an initial male infertility workup. Additional tests have been developed to unravel functional disorders and other sperm abnormalities that cannot be identified by conventional semen analysis [11, 24]. Some of these tests include the hypo-osmotic swelling test, computer-assisted sperm analysis, antisperm antibody test, sperm penetration assay, hemizona assay, reactive oxygen species (ROS) tests, and sperm chromatin integrity test [25]. Despite being available, there are inherent difficulties to set up these tests including cost of equipment and technical complexity. In addition, their predictive value in assessing the male fertility status is either variable or unknown [26]. Not surprisingly, many couples with unexplained infertility choose assisted reproduction techniques (ART) because of their widespread availability and overall success irrespective of the male infertility cause [27]. Yet, the assessment of sperm oxidative stress (OS) and DNA integrity has gained clinical importance in recent years. OS, which is present anywhere from 30 to 80 % in infertile men, is a result of the generation of ROS from contaminating leukocytes, defective sperm, and antioxidant depletion [28, 29]. ROS target sperm DNA molecules and ultimately affect the quality of the genetic material transmitted from the parents to the offspring. Damage to sperm DNA integrity can also result from apoptosis during spermiogenesis , alterations in chromatin remodeling during spermiogenesis, as well as exposure to environmental toxicants and gonadotoxins such as chemotherapy and radiotherapy [30]. Abnormal levels of DNA damage are observed in approximately 5 and 25 % of infertile men with normal and abnormal semen analysis, respectively [31–33]. Therefore, some authors propose that the assessment of both conditions might be included to the male infertility workup algorithm [34, 35].
Evidence of Poor Standardization in Semen Analysis Among Laboratories
Accuracy, the degree to which the measurement reflects the true value, and precision, the reproducibility of the results, are vitally important for clinicians who rely upon the values provided by the laboratory to direct the further work-up, diagnosis and counseling of the infertile male [36]. When both accuracy and precision are assured, the clinician is able to rely upon the semen analysis results to provide adequate counseling to the infertile couple. However, data from surveys of laboratory practice in the USA and the UK indicate that semen analysis techniques are still poorly standardized.
Among 536 clinical laboratories in the USA only about 60 % reported abstinence and indicated the criteria adopted for sperm morphology assessments. Moreover, fewer than half of them performed quality control for commonly assessed parameters such as sperm counts, motility, and morphology [37]. A survey involving 37 laboratories in the UK about the methods used to assess sperm morphology revealed that only 5 % complied with all WHO guidelines [38]. In the aforementioned study, participating laboratories had high interobserver variability when evaluating the same specimen. This data were corroborated by another study in which interlaboratory coefficient of variation was as high as 34 % for sperm concentration, 20 % for total sperm motility, 40 % for sperm vitality, and 70 % for sperm morphology (strict criterion) [39]. Discrepancies were also seen in laboratories enrolled in quality control programs, thus indicating that there is a need of global standardization among the laboratories and the providers of external quality control [40].
Owed to its complex nature, semen analysis should ideally be carried out in a dedicated andrology laboratory attired with experienced technicians, internal and external quality control, validation of test systems, quality assurance during all testing processes, and proper in place communication with clinicians and patients [41]. Despite being nonspecific for identifying male factor infertility etiologies, semen analysis is often the gateway test from which multiple expensive and often invasive treatments are based. Therefore, the importance of a reliable andrology laboratory cannot be underestimated .
The 2010 WHO Criteria for Semen Analysis
The WHO department of reproductive health and research workgroup made important changes in the 2010 laboratory manual for the examination of human semen and sperm-cervical mucus interaction [4]. While the WHO workgroup reviewed and updated in great detail all the methods delineated in previous manuals, it incorporated new protocols and tests. One of its main features was the inclusion of new references ranges and limits that were markedly lower than those reported in previous manuals.
Data characterizing the semen quality of fertile men provided the reference ranges for the manual [42]. For the first time, semen analysis results from recent fathers with known time-to-pregnancy (TTP), defined as months (or cycles) from stopping contraception to achieving a pregnancy, were analyzed. Raw data obtained from five studies of seven countries on three continents were pooled then assessed [43–48]. Approximately 1900 men who had fathered a child within 1 year of trying to initiate a pregnancy provided each one semen sample for sperm counts, motility, and volume assessments. Data on sperm morphology were extracted from four studies comprising approximately 1800 men whereas sperm vitality, assessed by the eosin–nigrosin method, was obtained from approximately 400 men of two countries [43, 45, 47, 48]. The mean ( ± SD) male age was 31 ( ± 5) years (range 18–53) and only ten men were over 45 years old. Participating laboratories practiced internal and external quality control and used standardized methods for semen analysis according to the WHO manual for the examination of human semen current at the time of the original studies [42].
The 95 % interval for sperm volume, count, motility, vitality, and morphology were generated and the one-sided lower reference limits (the fifth centile) proposed as the lower cutoff limits for normality [42]. It was then assumed that values below these limits would come from a different population. Of note, assessment of progressive motility according to grades, as recommended by the previous WHO manuals, was replaced by categorizing motile sperm as being “progressive” or “nonprogressive.” In addition, the strict criterion for morphology assessment was incorporated at last as the standard method. The lower limits of these distributions were lower than the values presented in previous editions except for the total sperm number per ejaculate (Table 3.1) [1–4]. Leukocyte reference values (< 1 × 106/mL)were not determined and remained the same as in previous manuals.
Controversies Surrounding the Validity of the 2010 WHO Thresholds
The lower reference limits in the 2010 WHO manual aimed to provide evidence-based thresholds that may aid clinicians in estimating the relative fertility of a given patient. Besides the aforesaid limitations of routine semen analysis in evaluating the male reproductive potential, methodological concerns arise from a careful examination of the studies that generated the current reference values. In a recent review, we critically analyzed these issues and concluded that it was unsound to assume that the 2010 reference standards represented the distribution of fertile men across the globe [6]. The group of studied men represented a limited population of individuals who live in large cities in the North hemisphere but for a small subset of men from Australia. Of note it was the absence of men from densely populated areas in Asia, Middle East, Latin America, and Africa, which represent the areas where most men live nowadays. This fact precludes the examination of regional and racial discrepancies that could account for semen quality variability. The selection criteria were arbitrary as stated by Cooper et al. “laboratories and data were identified through the known literature and personal communication with investigators and the editorial group of the fifth edition of the WHO laboratory manual” [42]. Not surprisingly, there was a significant overlap of authorship in the included studies. In addition, a single-semen specimen of each man was included for the pooled analysis, thus limiting the appraisal of the already discussed large intraindividual biological variability [6].
Some authors have claimed that the lowered 2010 WHO thresholds resulted from the declines in sperm count caused by endocrine disruptors and other environmental pollutants, such as insecticides and pesticides [49–51]. I, otherwise, conjecture that the observed discrepancies are likely to be associated with the patient selection criteria, the higher laboratory quality control standards and the methods used for semen assessment, such as the strict criterion for morphology determination. It means that methodology issues related to data generation might explain the discrepancies in the reference thresholds among WHO guidelines.
Collectively, these findings cast to doubt on the validity of the proposed reference range and cutoff limits about universally represent the distribution of semen results of fertile men.
Effects of the 2010 WHO Criteria for Semen Analysis in the Management of Male Infertility
Clinicians involved in the care of infertile couples still rely on the semen analyses results to determine a management plan. Abnormal semen parameters are taken into account not only to define male infertility but also to recommend further evaluation and treatment. One example is unexplained infertility which is based on the absence of female infertility , and the presence of at least two normal semen analyses and no identifiable causes after a thorough work-up including history, physical examination, and endocrine laboratory testing [24]. The adoption of the new WHO reference values will likely lead to more men being classified as “fertile”, which is of particular importance for gynecologists who rely on semen analysis alone as a surrogate measure for male fertility. In a recent study, up to 15 % of men with at least one parameter below the 1999 WHO reference values were reclassified as “normal” by having all parameters at or above the 2010 WHO thresholds [8]. We have also contemplated our own data involving 982 men seeking evaluation for infertility that had abnormal semen analysis results based on the 1990 WHO criteria. We found that approximately 39 % of these men would be reclassified as “normal” by the new 2010 criteria. Morphology itself accounted for over 50 % of the reclassifications (unpublished data). Patient referral for evaluation could then be postponed or not undertaken if fertility status would be based on semen analyses alone. Albeit it is ambiguous yet whether this reclassification will lead to a more cost-effective evaluation, it is also possible that it could delay the definitive diagnosis and management of the infertile couple and lead to a more pronounced infertility condition with ageing.