Skip to main content
Frontiers in Pediatrics logoLink to Frontiers in Pediatrics
. 2024 Jul 16;12:1380934. doi: 10.3389/fped.2024.1380934

Validity of self-assessment pubertal Tanner stages by realistic color images and Pubertal Development Scale in a longitudinal cohort study

Jie Luo 1,, Di Wu 1,2,, Yu Tian 1, Yujie Wang 1, Qin Zhang 1, Zongwei He 1, Hong Wang 1, Qin Liu 1,*
PMCID: PMC11286552  PMID: 39081927

Abstract

Introduction

To date, the reliability of pubertal development self-assessment tools is questioned, and very few studies have explored the comparison between these tools in longitudinal studies. Hence, this study aimed to examine the reliability of pubertal development self-assessment using realistic color images (RCIs) and the Pubertal Development Scale (PDS) in a longitudinal cohort study.

Methods

Our longitudinal study recruited 1,429 participants (695 boys and 734 girls), aged 5.8–12.2 years old, in Chongqing, China. We conducted two surveys, 6 months apart. Tanner stages were examined by trained medical students at each visit. RCIs and PDS scores were used to self-assess puberty at each visit. Agreement between physical examination and self-assessment was determined using weighted kappa (wk), accuracy, and Kendall rank correlation.

Results

The concordance of puberty self-assessment using RCIs at baseline and the first follow-up was almost perfect in girls and boys, wk >0.800 (p < 0.001). At baseline, the concordance of genital development self-assessment using RCIs was fair in boys, wk = 0.285 (p < 0.001), and that of boys’ pubic hair development self-assessment using RCIs was poor, wk = 0.311 [95% confidence interval (CI) −0.157 to 0.818]. The wk of the PDS was less than 0.300, except for breast development. The reliability and validity of the PDS in this study population were low, and the consistency of the PDS was not good.

Conclusions

The concordance of RCIs is better than that of the PDS. Pubertal development self-assessment using RCIs is reliable, while the reliability and validity of the PDS are unacceptable. Therefore, RCIs are recommended as a reliable pubertal development self-assessment tool to measure pubertal development for large-scale epidemiological investigations and long-term longitudinal studies in China.

Keywords: self-assessment, Tanner stages, pubertal, reliability, validity

1. Introduction

Puberty is a significant period of physiological changes and sexual maturity, marking the transition from childhood to adolescence. Extensive evidence indicates that abnormal development during this period is a contributing factor to obesity, hypertension, insulin resistance, type 2 diabetes, cardiovascular disease, cancers including breast cancer, endometrial cancer, and ovarian cancer, and emotional and behavioral problems (17). Therefore, monitoring pubertal development is significant for health during both childhood and adulthood.

Pubertal development requires accurate puberty measurements to determine temporal changes in pubertal milestone events. However, the measurement of pubertal development varies widely across studies, and there are many challenges in obtaining data for the assessment of pubertal development. The current gold standard for measuring puberty is the Tanner Sexual Maturation Scale (SMS) (8, 9). The SMS classifies development into five levels based on breast and pubic hair development for girls and genital and pubic hair development for boys: these levels include Tanner stage 1 (prepubertal), stage 2 (early pubertal), stage 3 (midpubertal), stage 4 (late pubertal), and stage 5 (postpubertal). The SMS requires the child to undress and be examined by a professional. Despite the accuracy provided by professionals, this method might be impractical in non-clinical settings, especially in large-scale epidemiological studies, due to its labor-intensive, material, financial, time-consuming nature, making it challenging to implement. In addition, because of the sensitivity of pubertal development assessment, children and parents may be less likely to agree to the SMS compared to the self-assessment of pubertal development. As a result, many studies rely on pictures or line drawings of Tanner stages that ask children or their parents to self-assess the children's pubertal development. The Pubertal Development Scale (PDS) (10) is another widely used tool for self-assessment of pubertal development, designed to avoid the sensitivity issues of visual sexual maturation charts.

Although pubertal development self-assessment using pictures and the PDS are widely used, their reliability has been questioned. On the one hand, the results of a systematic review and meta-analysis (11) of 22 studies showed moderate or substantial agreement between clinician assessment and self-assessment using pictures. However, a US study (12) showed that self-assessment of Tanner stages using pictures is not a reliable method. On the other hand, the consistency between self-assessment by the PDS and physical examination (PE) remains controversial in different studies. Although several studies (1317) indicated that the PDS could be used in large-scale epidemiological investigations of pubertal development, they reported varying levels of agreement—moderate agreement, fair agreement, and even slight agreement—between clinician assessment and self-assessment using the PDS. Hence, the reliability of pubertal development self-assessment tools is controversial, and the scientific accuracy and authenticity of the study results will be affected, although a large number of studies used pictures, especially the PDS, to self-assess pubertal development.

Moreover, few studies have compared the agreement between pictures and the PDS, which are both popular tools for self-assessment of pubertal development. Compared to pictures, the PDS is widely used, and it has been cited more than 2,500 times. While the reliability of the PDS is more controversial (1618), we believe that the PDS and pubertal development pictures have not been compared in a single study. If the concordance of pictures is better than that of the PDS, pictures could potentially replace the PDS as the preferred self-assessment tool of pubertal development, ensuring scientific and authentic results. Therefore, studies comparing pictures with the PDS should be carried out, as this can assist doctors, healthcare providers, epidemiologists, and other researchers in deciding a more appropriate pubertal development self-assessment tool.

At present, global trends indicate that adolescent puberty is generally occurring earlier, which is a problem that needs significant attention (19, 20). Given that puberty is a process requiring longitudinal tracking and involves psychological sensitivity of children during this period (21, 22), pubertal development examination by SMS may increase the psychological burden on adolescents. Furthermore, cross-sectional studies cannot cover the processes of pubertal development, such as pubertal trajectory and tempo, which are important for health services for children and adolescents. Assessing pubertal development every 6 months provides a more precise method for tracking pubertal changes and is widely conducted in studies such as the Danish National Birth Cohort (23). With little known about the longitudinal reliability of pubertal development self-assessment tools, more evidence is needed to longitudinally track the reliability of the self-assessment tools.

To date, the reliability of pubertal development self-assessment tools is questioned, and few previous studies have explored the comparison between pictures and the PDS in longitudinal studies. Hence, this study included the most widely used self-assessment tools to verify whether pictures with Tanner stages and the PDS can be used in cohorts through a longitudinal study in Chongqing, China.

2. Methods

2.1. Participants

The study participants were from an ongoing longitudinal cohort aimed at exploring adolescents’ pubertal development and its influencing factors and health implications. During the enrollment period since May 2014, 1,237 participants (542 girls and 695 boys) were enrolled from grades 1–4 in primary schools in an urban district of Chongqing, China. After 6 months, we added a further 192 girls at the first follow-up and thus officially recruited 1,429 participants (734 girls and 695 boys) for this study, aged 5.8–12.2 years old; informed consent was obtained from their parents and themselves. JLP District in Chongqing is a pilot district for urban and rural planning, and its demographic characteristics and economic status are similar to the characteristics of the main urban area of Chongqing. Therefore, this study adopted purpose-based sampling in the district and selected four primary schools in the district of Chongqing according to the differences in their geographical locations. A detailed description of this cohort study has been reported previously (24). The study was approved by the Medical Ethics Review Committee of Chongqing Medical University.

At baseline and the first follow-up, girls were examined for puberty outcomes, including breast and pubic hair development, while boys were examined for genital and pubic hair development by trained medical students. In addition, the Tanner stage was self-assessed by realistic color images (RCIs) and the PDS. The age, body weight, and height of the children were measured at baseline and the first follow-up. Father’s education, mother’s education, parental marital status, the number of left-behind children, and average monthly household income were surveyed at the first follow-up.

2.2. Physical examination

Pubertal development was assessed for boys and girls by trained medical school graduates during physical examinations according to Tanner stages (25). Before each investigation, every investigator received standard training from medical professors. In addition, each child's pubertal development was measured independently by two investigators for quality assurance. Tanner stages were determined as stage 1 (prepubertal), stage 2 (early pubertal), stage 3 (midpubertal), stage 4 (late pubertal), or stage 5 (postpubertal). PEs assessed pubertal development stages, including genital stages (G1–G5) for boys, breast stages (B1–B5) for girls, and pubic hair stages (P1–P5) for both genders (Supplementary Table S1).

2.3. Realistic color images

After completing the PDS and before undergoing the physical examination, puberty ratings were self-assessed using RCIs according to Tanner stages, as proposed by Carel and Leger in the New England Journal of Medicine in 2008 (26). Each child was presented with a gender-specific pubertal self-assessment questionnaire using realistic color images. After a short explanation of each stage of puberty by trained medical school graduates, the participants would read the questions along with the pictures and select the self-perceived stage that best represented their pubertal development by comparing their situation with the illustrations. Girls were asked to choose the most appropriate breast and pubic hair Tanner stage. Boys were asked to choose the most appropriate genital and pubic hair Tanner stage.

2.4. Pubertal Development Scale

Pubertal development was self-reported using the PDS before PE and RCIs. The PDS was developed to provide a continuous score of pubertal development and classified into a five-level category representing stages from prepubertal to postpubertal without physical examination or visually depicted sexual maturation stages (10). The PDS consists of five items: breast development and menstruation in girls; voice changes and facial hair growth in boys; and body hair growth, growth spurt, and skin changes in both sexes. Response options are as follows: not yet started (1), barely started (2), definitely started (3), and seems complete (4). In addition, girls’ menarche is a binary variable: no (0) or yes (1). The PDS scores can be used to classify individuals into five pubertal categories according to algorithms developed by Petersen et al. (1988) (10) and described by Carskadon and Acebo (27)—prepubertal, early pubertal, midpubertal, late pubertal, and postpubertal (in alignment with the original Tanner categories). The content of the PDS and the algorithm are provided in Table 1.

Table 1.

Composition of the PDS and the computation of the PDS using the algorithm.

Questions:
 1. Would you say that your growth in height:
 2. And how about the growth of your body hair? (“Body hair” means hair any place other than your head, such as under your arms.) Would you say that your body hair growth:
 3. Have you noticed any skin changes, especially pimples?
Form for boys:
 4. Have you noticed a deepening of your voice?
 5. Have you begun to grow hair on your face?
Form for girls:
 4. Have you noticed your breasts have begun to grow?
 5a. Have you begun to menstruate (started to have your period)?
 5b. If yes, how old were you when you started to menstruate?
Computation algorithm
Tanner stages Boys
Totaling body hair growth, voice changes, and facial hair growth
Girls
Totaling body hair growth, breast development, and menarche
1: Prepubertal 3 2 and no menarche
2: Early pubertal 4 or 5 (no 3-point responses) 3 and no menarche
3: Midpubertal 6, 7, or 8 (no 4-point responses) 3 and no menarche
4: Late pubertal 9–11 ≤7 and menarche
5: Postpubertal 12 8 and menarche

2.5. Statistical analysis

The weighted kappa (wk) statistic was performed using R4.2.2, and other statistical analyses were performed using SPSS 25. McDonald's omega (28) coefficients were used to measure the internal consistency and scale reliability of the PDS. The wk statistic and accuracy were used to evaluate the agreement between PE and RCIs and between PE and the PDS. The strength of agreement criteria for wk is as follows: <0.00, poor; 0.00–0.20, slight; 0.21–0.40, fair; 0.41–0.60, moderate; 0.61–0.80, substantial; >0.8, almost perfect (29). Agreement between PE and the RCI or the PDS was calculated using Crosstabs, and percent (%) accuracy calculated the precise agreement between the three measurements of pubertal development. The Kendall rank correlation was also used to measure the strength of the correlations between the three measurements of pubertal development. The Kendall rank correlation coefficient ranges between −1 (perfect inverse relationship between two variables) and +1 (perfect direct relationship), with zero indicating a lack of relationship. The t-test and chi-squared test were used to analyze the influencing factors of PDS concordance.

3. Results

3.1. Characteristics of study participants

At baseline, a total of 1,234 participants, including 539 girls and 695 boys, were recruited for this study. At the first follow-up visit, 1,429 participants, including 734 girls and 695 boys, had completed the survey. Age, BMI, body weight, body height, father’s education, mother’s education, parental marital status, the number of left-behind children, and average monthly household income of the study participants are given in Table 2.

Table 2.

Characteristics of study participants.

Variable Frequency, n (%) or mean (SD)
Girls (n0 = 539, n1 = 734) Boys = (n0 = 695, n1 = 695)
Age at baseline (years) 8.6 (1.2) 8.6 (1.2)
BMIa at baseline
 Thinness 45 (8.4) 54 (7.8)
 Normal 393 (73.0) 472 (68.2)
 Overweight and obesity 100 (18.6) 166 (24.0)
 Body weight at baseline (kg) 28.0 (6.8) 29.4 (7.6)
 Body height at baseline (cm) 130.9 (9.3) 131.3 (8.8)
 Age at the first follow-up (years) 9.3 (1.2) 9.2 (1.2)
BMIa at the first follow-up
 Thinness 12 (1.6) 23 (3.3)
 Normal 544 (74.2) 453 (65.2)
 Overweight and obesity 177 (24.1) 219 (31.5)
 Body weight at the 1st follow-up (kg) 31.6 (7.6) 32.7 (8.4)
 Body height at the 1st follow-up (cm) 134.4 (9.1) 134.3 (8.6)
Father's education
 Middle school and below 337 (45.9) 311 (44.9)
 High secondary school or above 397 (54.1) 382 (55.1)
Mother's education
 Middle school and below 360 (49.1) 334 (48.3)
 High secondary school or above 373 (50.9) 357 (51.7)
Parental marital status
 Divorced 69 (9.4) 69 (9.9)
 Married 662 (90.6) 625 (90.1)
Left-behind childrenb
 Yes 91 (12.4) 100 (14.4)
 No 641 (87.6) 593 (85.6)
Average monthly household income (RMB)
 <2,000 226 (30.9) 213 (30.6)
 2,001–4,000 329 (44.9) 320 (46.0)
 >4,000 177 (24.2) 162 (23.3)
a

BMI (body mass index): standards published by the National Health and Family Planning Commission of the People's Republic of China in 2014 (document no. WS/T 456-2014, screening standard for malnutrition of school-age children and adolescents) and in 2018 (document no. WS/T 586-2018, screening for overweight and obesity among school-age children and adolescents).

b

Left-behind children: Rural registered minors under the age of 16 years whose parents are both migrant workers or one is a migrant worker and the other has no guardianship ability and who cannot normally live with their parents.

3.2. Comparison of PE and self-assessment by RCIs

Figure 1 shows the concordance of PE and self-assessment by RCIs for breast and pubic hair development in girls and genital and pubic hair development in boys.

Figure 1.

Figure 1

Concordance of PE and RCI.

For girls, there was almost perfect concordance between PE and RCIs. At baseline, the wk (weighted kappa coefficient) was 0.860 (p < 0.001) for breast development and 0.887 (p < 0.001) for pubic hair development. At the first follow-up visit, the wk was 0.961 (p < 0.001) for breast development and 0.950 (p < 0.001) for pubic hair development.

For boys, there was almost perfect concordance between PE and RCIs at the first follow-up visit and fair concordance in genital development at baseline. At baseline, the wk was 0.285 (p < 0.001) for genital development and 0.331 (95% CI −0.157 to 0.818) for pubic hair development. At the first follow-up visit, the wk was 0.968 (p < 0.001) for genital development and 0.915 (p < 0.001) for pubic hair development.

The accuracy, underestimation, overestimation, and Kendall rank correlation coefficient between PE and self-assessment by RCIs for breast/genital development and pubic hair development are presented in Supplementary Table S2.

3.3. Comparison of PE and self-reporting by the PDS

The McDonald's omega coefficient for all PDS items was 0.426 in girls and 0.105 for boys.

For breast development in girls, the concordance between PE and self-reporting by the PDS was almost perfect at baseline (wk = 0.827, p < 0.001) and the first follow-up (wk = 0.891, p < 0.001). For pubic hair development in girls, there was slight concordance at baseline (wk = 0.104, p < 0.001) and the first follow-up (wk = 0.164, p < 0.001) (Figure 2).

Figure 2.

Figure 2

Concordance of PE and PDS.

For genital development comparison in boys, there was slight concordance (wk = 0.024, p < 0.05) at baseline and fair concordance (wk = 0.242, p < 0.001) at the first follow-up. For pubic hair development, there was poor concordance (wk = 0.001, p > 0.05) at baseline and slight concordance (wk = 0.081, p < 0.001) at the first follow-up (Figure 2).

The accuracy, underestimation, overestimation, and Kendall rank correlation coefficient between PE and self-reporting by the PDS for breast/genital development and pubic hair development are presented in Supplementary Table S3.

3.4. Concordance between PE and PDS in age-stratified analysis

Age, BMI, father’s education, mother’s education, parental marital status, the number of left-behind children, and average monthly household income were included as influence factors in the analysis of the concordance between PE and the PDS. We found that age was the only factor affecting the poor concordance of the PDS in both genders (Supplementary Tables S4 and S5). The age of girls was categorized as <8 and ≥8 years old. The age of the boys was categorized as <9 and ≥9 years old (30). Ages classified as <8 years had slight concordance in girls for breast development, while ages classified as ≥8 years had perfect concordance. Ages classified as <8 and ≥8 years had slight concordance in girls for pubic hair development (Figure 3 and Supplementary Table S6). Ages classified as <9 and ≥9 years had slight concordance in boys for genital and pubic hair development (Figure 3 and Supplementary Table S7).

Figure 3.

Figure 3

Concordance of PE and PDS in age-stratified analysis.

4. Discussion

To our knowledge, this is the first study to discuss the longitudinal reliability of pubertal development self-assessment using RCIs and the PDS and to compare the concordance of RCIs with the PDS through a longitudinal study. The current study has several key findings. First, our study suggests that self-assessment by RCIs for puberty development is a reliable and valid tool in longitudinal analysis. Second, the reliability and validity of the PDS were low, and the consistency of the PDS was not good. Third, the reliability of self-assessment by RCIs for puberty development is better than that by the PDS. Finally, the only factor affecting the poor concordance of the PDS in both sexes was age. These findings might assist doctors, healthcare providers, epidemiologists, and other researchers in deciding a more appropriate pubertal development self-assessment tool.

Our results demonstrate that RCIs are a reliable and valid pubertal development self-assessment tool. Like our results, studies from Thailand (31), Iran (32), and China (33, 34) have shown that such images can be used as a reliable tool to assess pubertal development in both sexes. Of these four studies, two included line drawings with brief explanatory text and two included colored images of Tanner stages; the colored images showed better concordance. Therefore, RCIs could be a better alternative pubertal assessment tool for extensive epidemiological research, (34) and might avoid the tension resulting from SMS examinations to improve the consent rate and reduce the dropout rate in cohorts.

Different from previous studies, we analyzed the dynamic changes in RCI self-rated reliability through longitudinal rather than cross-sectional data. Our longitudinal analysis suggested that it is more difficult for boys in early puberty to self-assess pubertal onset than girls and that boys tend to overestimate their genital development. Similarly, a Japanese study (35) showed that self-assessment of the onset of breast development using illustrations of Tanner stages in full color was consistent with assessment by PE in girls but not in boys. Moreover, a Danish study reported that boys overestimated their pubertal stage when using illustrations of pubertal stages (36). Yet, we noticed that the consistency of boys’ pubertal self-assessment using RCIs at the first follow-up visit was almost perfect. This might be because we explained puberty knowledge during our follow-up and boys became more knowledgeable about pubertal development, making the RCIs more reliable. We will continue to explore how sexual and reproductive health education enhances adolescents’ pubertal health and epidemiology studies into puberty. Therefore, for large-scale epidemiological studies requiring follow-up over many years, RCIs are a reliable alternative tool when SMS examinations are not feasible, saving manpower, budget, and time.

In addition to RCIs, we conducted the PDS reliability analysis using our cohort. Notably, our study suggests that the reliability and validity of the PDS are very low. Even so, the concordance of breast development self-assessment by the PDS was only almost perfect at baseline and the first follow-up visit, which might be because breast development is a representative indicator of pubertal development in girls (20). Studies from Bond et al. (15) and Pompeia et al. (14) reported that self-assessment of pubertal development using the PDS is useful when physical examination is not possible and images are unavailable. Also, the concordance of the PDS was between fair and moderate in the two studies. Such a conclusion could be challenged because the concordance of the PDS was not almost perfect or substantial. Another study (17) reported that the PDS was reliable and generally tracks with Tanner stages. However, it was only reliable when combining pubertal scores into three stages of development, and the original self-assessed Tanner stages by the PDS were fairly consistent. This means that the PDS is not applicable for measuring important indicators of pubertal development, such as pubertal onset and tempo, and cannot help doctors, healthcare providers, epidemiologists, and other researchers to accurately measure the pubertal development process. Hence, it is not known whether the concordance of the PDS is acceptable, and even if it is acceptable, the evaluation criteria are not uniform.

A study from Norris and Richter (18) did not recommend the PDS as a reliable self-assessment tool for pubertal development in South Africa, which is identical to our results. Another Chinese study (16) showed that Cronbach's alpha of the PDS was 0.80 for girls and 0.66 for boys in the Chinese version of the PDS. Although the study reported that the PDS was reliable, its reliability in boys was low. Also, there was only moderate concordance in self-rated pubertal development using the PDS. The translation and applicability of the PDS among Chinese children should be further investigated, combining our results of low reliability and validity and the influencing factors of PDS concordance. Currently, the PDS is widely used in large-scale epidemiological investigations (3740). However, many studies have failed to find that the exact correspondence to Tanner stages between the PDS and PE is reliable (13, 17, 18). Therefore, the reliability of the PDS needs further study. Based on the results, our next step is to try out health education interventions and revise or develop a Pubertal Development Scale suitable for the local population.

5. Strengths and limitations

Our study had several strengths. First, we used a longitudinal study to assess the dynamic changes in the reliability of self-assessment tools for pubertal development. At present, most studies on the reliability of self-assessment tools for pubertal development are cross-sectional studies, which do not prove their applicability to long-term longitudinal studies. Second, we compared the agreement and reliability of self-assessment tools with the PDS. This comparison can help doctors, healthcare providers, epidemiologists, and other researchers decide a more appropriate pubertal development self-assessment tool. Third, current studies demonstrating good reliability of pubertal self-assessment tools predominantly involve white/Caucasian populations. There is limited research, including our study in China, on the reliability and validity of the PDS is relatively low, providing potential research directions in the region.

Several limitations should be acknowledged. First, although our study explored the reliability of pubertal self-assessment stools longitudinally, we conducted only two surveys that did not cover the whole of puberty, which limits the inferences of our results. Second, since each item can have an aggregate effect with the other items of the PDS and the Cronbach's alpha coefficient is very low, McDonald's omega was used to evaluate the internal consistency reliability of the PDS. However, the results showed that the reliability and validity of the PDS were very poor, which made the consistent results of the PDS in our study population unreliable. The possible reasons include the following: (1) Chinese children have relatively little understanding of pubertal development; (2) most of the children in this study are in the early stage of pubertal development (mainly stages 1, 2, and 3), with very few children in stages 4 and 5. In the next step of our research, we will conduct intervention studies on health education and revise or retranslate the Chinese version of the PDS.

6. Conclusions

The reliability of pubertal development self-assessment using RCIs is acceptable, while the reliability and validity of the PDS are low. This finding can assist doctors, healthcare providers, epidemiologists, and other researchers in deciding a more appropriate pubertal development self-assessment tool. Consequently, RCIs, rather than the PDS, are recommended as a reliable pubertal development self-assessment tool to measure pubertal development for large-scale epidemiological investigations and long-term longitudinal studies in China.

Acknowledgments

The authors thank all the children and their parents for their participation and the teachers in four primary schools and eight middle schools for their kind support. The authors also thank all the students who participated in the data collection.

Funding Statement

The authors declare financial support was received for the research, authorship, and/or publication of this article.

This research was supported by the National Science Fund Project (Grant No. 81973067), the National Youth Science Fund Project (Grant No. 81502825), the Venture & Innovation Support Program for Chongqing Overseas Returnees, China (Grant No. cx2018105), the Basic and Frontier Research Project of Chongqing Science and Technology Commission (Grant No. cstc2013jcyjA10001), and the Chongqing Medical University Future Medical Innovation Team Support Program (Grant No. W0054).

Abbreviations

RCIs, realistic color image; PDS, Pubertal Development Scale; PE, physical examination; SMS, Tanner Sexual Maturation Scale.

Data availability statement

The datasets presented in this article are not readily available because the data are stored and used among Chongqing Medical University research teams, and we are working on data sharing policies and a website for collaborators' data access. The datasets used or analyzed during the current study are available from the corresponding author on reasonable request. Requests to access the datasets should be directed to QL, liuqin@cqmu.edu.cn.

Ethics statement

The studies involving humans were approved by the Medical Ethics Review Committee of Chongqing Medical University. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation in this study was provided by the participants’ legal guardians/next of kin.

Author contributions

JL: Formal Analysis, Investigation, Writing – original draft, Writing – review & editing, Conceptualization, Data curation. DW: Writing – original draft, Writing – review & editing, Data curation, Formal Analysis, Investigation. YT: Data curation, Investigation, Writing – review & editing. YW: Data curation, Investigation, Writing – review & editing. QZ: Data curation, Investigation, Writing – review & editing. ZH: Data curation, Investigation, Writing – review & editing. HW: Conceptualization, Writing – review & editing. QL: Conceptualization, Funding acquisition, Investigation, Project administration, Supervision, Writing – review & editing.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fped.2024.1380934/full#supplementary-material

Table1.docx (54.6KB, docx)

References

  • 1.Anderson GM, Hill JW, Kaiser UB, Navarro VM, Ong KK, Perry JRB, et al. Metabolic control of puberty: 60 years in the footsteps of Kennedy and Mitra’s seminal work. Nat Rev Endocrinol. (2024) 20(2):111–23. 10.1038/s41574-023-00919-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Guo JZ, Wu QJ, Liu FH, Gao C, Gong TT, Li G. Review of Mendelian randomization studies on endometrial cancer. Front Endocrinol (Lausanne). (2022) 13:783150. 10.3389/fendo.2022.783150 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Shamshirian A, Heydari K, Shams Z, Aref AR, Shamshirian D, Tamtaji OR, et al. Breast cancer risk factors in Iran: a systematic review & meta-analysis. Horm Mol Biol Clin Investig. (2020) 41(4):20200021. 10.1515/hmbci-2020-0021 [DOI] [PubMed] [Google Scholar]
  • 4.Ren Y, Zou H, Zhang D, Han C, Hu D. Relationship between age at menarche and risk of glucose metabolism disorder: a systematic review and dose-response meta-analysis. Menopause. (2020) 27(7):818–26. 10.1097/GME.0000000000001529 [DOI] [PubMed] [Google Scholar]
  • 5.Mendle J, Ryan RM, McKone K. Early menarche and internalizing and externalizing in adulthood: explaining the persistence of effects. J Adolesc Health. (2019) 65(5):599–606. 10.1016/j.jadohealth.2019.06.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Zhang Z, Hu X, Yang C, Chen X. Early age at menarche is associated with insulin resistance: a systemic review and meta-analysis. Postgrad Med. (2019) 131(2):144–50. 10.1080/00325481.2019.1559429 [DOI] [PubMed] [Google Scholar]
  • 7.Bubach S, De Mola CL, Hardy R, Dreyfus J, Santos AC, Horta BL. Early menarche and blood pressure in adulthood: systematic review and meta-analysis. J Public Health (Oxf). (2018) 40(3):476–84. 10.1093/pubmed/fdx118 [DOI] [PubMed] [Google Scholar]
  • 8.Marshall WA, Tanner JM. Variations in the pattern of pubertal changes in boys. Arch Dis Child. (1970) 45(239):13–23. 10.1136/adc.45.239.13 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Marshall WA, Tanner JM. Variations in pattern of pubertal changes in girls. Arch Dis Child. (1969) 44(235):291–303. 10.1136/adc.44.235.291 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Petersen AC, Crockett L, Richards M, Boxer A. A self-report measure of pubertal status: reliability, validity, and initial norms. J Youth Adolesc. (1988) 17(2):117–33. 10.1007/BF01537962 [DOI] [PubMed] [Google Scholar]
  • 11.Campisi SC, Marchand JD, Siddiqui FJ, Islam M, Bhutta ZA, Palmert MR. Can we rely on adolescents to self-assess puberty stage? A systematic review and meta-analysis. J Clin Endocrinol Metab. (2020) 105(8):2846–56. 10.1210/clinem/dgaa135 [DOI] [PubMed] [Google Scholar]
  • 12.Desmangles JC, Lappe JM, Lipaczewski G, Haynatzki G. Accuracy of pubertal Tanner staging self-reporting. J Pediatr Endocrinol Metab. (2006) 19(3):213–21. 10.1515/JPEM.2006.19.3.213 [DOI] [PubMed] [Google Scholar]
  • 13.Campisi SC, Humayun KN, Wasan Y, Soofi S, Islam M, Lou W, et al. Self-assessed puberty is reliable in a low-income setting in rural Pakistan. J Pediatr Endocrinol Metab. (2020) 33(9):1191–6. 10.1515/jpem-2020-0246 [DOI] [PubMed] [Google Scholar]
  • 14.Pompéia S, Zanini GAV, Freitas RS, Inacio LMC, Silva FCD, Souza GR, et al. Adapted version of the Pubertal Development Scale for use in Brazil. Rev Saude Publica. (2019) 53:56. 10.11606/s1518-8787.2019053000915 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Bond L, Clements J, Bertalli N, Evans-Whipp T, McMorris BJ, Patton GC, et al. A comparison of self-reported puberty using the Pubertal Development Scale and the Sexual Maturation Scale in a school-based epidemiologic survey. J Adolescence. (2006) 29(5):709–20. 10.1016/j.adolescence.2005.10.001 [DOI] [PubMed] [Google Scholar]
  • 16.Chan NP, Sung RY, Nelson EA, So HK, Tse YK, Kong AP. Measurement of pubertal status with a Chinese self-report Pubertal Development Scale. Matern Child Health J. (2010) 14(3):466–73. 10.1007/s10995-009-0481-2 [DOI] [PubMed] [Google Scholar]
  • 17.Koopman-Verhoeff ME, Gredvig-Ardito C, Barker DH, Saletin JM, Carskadon MA. Classifying pubertal development using child and parent report: comparing the Pubertal Development Scales to Tanner staging. J Adolesc Health. (2020) 66(5):597–602. 10.1016/j.jadohealth.2019.11.308 [DOI] [PubMed] [Google Scholar]
  • 18.Norris SA, Richter LM. Are there short cuts to pubertal assessments? Self-reported and assessed group differences in pubertal development in African adolescents. J Adolesc Health. (2008) 42(3):259–65. 10.1016/j.jadohealth.2007.08.009 [DOI] [PubMed] [Google Scholar]
  • 19.Curtis VA, Allen DB. Male pubertal timing—boys will be men, but when? JAMA Pediatr. (2019) 173(9):819–20. 10.1001/jamapediatrics.2019.2306 [DOI] [PubMed] [Google Scholar]
  • 20.Eckert-Lind C, Busch AS, Petersen JH, Biro FM, Butler G, Bräuner EV, et al. Worldwide secular trends in age at pubertal onset assessed by breast development among girls: a systematic review and meta-analysis. JAMA Pediatr. (2020) 174(4):e195881. 10.1001/jamapediatrics.2019.5881 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Lunddorf LLH, Ramlau-Hansen CH, Arendt LH, Patton GC, Sawyer SM, Dashti SG, et al. Characteristics of puberty in a population-based sample of Danish adolescents. J Adolesc Health. (2024) 74(4):657–64. 10.1016/j.jadohealth.2023.10.005 [DOI] [PubMed] [Google Scholar]
  • 22.Cheng HL, Harris SR, Sritharan M, Behan MJ, Medlow SD, Steinbeck KS. The tempo of puberty and its relationship to adolescent health and well-being: a systematic review. Acta Paediatr. (2020) 109(5):900–13. 10.1111/apa.15092 [DOI] [PubMed] [Google Scholar]
  • 23.Thomsen AML, Ramlau-Hansen CH, Olsen J, Brix N, Andersen AN, Lunddorf LLH, et al. The influence of parental age on timing of puberty: a study in the Danish national birth cohort. Scand J Public Health. (2022) 50(5):629–37. 10.1177/14034948211019794 [DOI] [PubMed] [Google Scholar]
  • 24.Xi X, Wu D, Wu W, Zhou Y, Zhang Q, Wang Y, et al. The influence of the trajectory of obesity indicators on the age of pubertal onset and pubertal tempo in girls: a longitudinal study in Chongqing, China. Front Public Health. (2023) 11:1025778. 10.3389/fpubh.2023.1025778 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Tanner JM. Normal growth and techniques of growth assessment. Clin Endocrinol Metab. (1986) 15(3):411–51. 10.1016/S0300-595X(86)80005-6 [DOI] [PubMed] [Google Scholar]
  • 26.Carel JC, Leger J. Clinical practice. Precocious puberty. N Engl J Med. (2008) 358(22):2366–77. 10.1056/NEJMcp0800459 [DOI] [PubMed] [Google Scholar]
  • 27.Carskadon MA, Acebo C. A self-administered rating scale for pubertal development. J Adolesc Health. (1993) 14(3):190–5. 10.1016/1054-139X(93)90004-9 [DOI] [PubMed] [Google Scholar]
  • 28.Malkewitz CP, Schwall P, Meesters C, Hardt J. Estimating reliability: a comparison of Cronbach’s α, McDonald’s ωt and the greatest lower bound. Soc Sci Hum Open. (2023) 7(1):100368. 10.1016/j.ssaho.2022.100368 [DOI] [Google Scholar]
  • 29.Landis JR, Koch GG. The measurement of observer agreement for categorical data. Biometrics. (1977) 33(1):159–74. 10.2307/2529310 [DOI] [PubMed] [Google Scholar]
  • 30.Lin S, Yang W, Yu J. Guideline for clinical diagnosis and treatment of pediatrics of traditional Chinese medicine sexual precocity (amendment). J Pediatr Tradit Chin Med. (2016) 12(03):1–5. 10.16840/j.issn1673-4297.2016.03.01 [DOI] [Google Scholar]
  • 31.Jaruratanasirikul S, Kreetapirom P, Tassanakijpanich N, Sriplung H. Reliability of pubertal maturation self-assessment in a school-based survey. J Pediatr Endocrinol Metab. (2015) 28(3–4):367–74. 10.1515/jpem-2014-0053 [DOI] [PubMed] [Google Scholar]
  • 32.Rabbani A, Noorian S, Fallah JS, Setoudeh A, Sayarifard F, Abbasi F. Reliability of pubertal self assessment method: an Iranian study. Iran J Pediatr. (2013) 23(3):327–32. [PMC free article] [PubMed] [Google Scholar]
  • 33.Chan NP, Sung RY, Kong AP, Goggins WB, So HK, Nelson EA. Reliability of pubertal self-assessment in Hong Kong Chinese children. J Paediatr Child Health. (2008) 44(6):353–8. 10.1111/j.1440-1754.2008.01311.x [DOI] [PubMed] [Google Scholar]
  • 34.Sun Y, Tao FB, Su PY. Self-assessment of pubertal Tanner stage by realistic colour images in representative Chinese obese and non-obese children and adolescents. Acta Paediatr. (2012) 101(4):e163–6. 10.1111/j.1651-2227.2011.02568.x [DOI] [PubMed] [Google Scholar]
  • 35.Saito-Abe M, Nishizato M, Yamamoto-Hanada K, Yang L, Fukami M, Ito Y, et al. Comparison of physician- and self-assessed pubertal onset in Japanese children. Front Pediatr. (2023) 11:950541. 10.3389/fped.2023.950541 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Rasmussen AR, Wohlfahrt-Veje C, Tefre de Renzy-Martin K, Hagen CP, Tinggaard J, Mouritsen A, et al. Validity of self-assessment of pubertal maturation. Pediatrics. (2015) 135(1):86–93. 10.1542/peds.2014-0793 [DOI] [PubMed] [Google Scholar]
  • 37.Li R, Lopez DA, Gupta M, Palermo TM. Pubertal development and pain incidence and characteristics in children: a 1-year prospective cohort study of a national sample. Pain. (2023) 164(12):2725–36. 10.1097/j.pain.0000000000002969 [DOI] [PubMed] [Google Scholar]
  • 38.Sehovic E, Zellers SM, Youssef MK, Heikkinen A, Kaprio J, Ollikainen M. DNA methylation sites in early adulthood characterised by pubertal timing and development: a twin study. Clin Epigenetics. (2023) 15(1):181. 10.1186/s13148-023-01594-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Castiello F, Suárez B, Beneito A, Lopez-Espinosa MJ, Santa-Marina L, Lertxundi A, et al. Childhood exposure to non-persistent pesticides and pubertal development in Spanish girls and boys: evidence from the INMA (environment and childhood) cohort. Environ Pollut. (2023) 316(Pt 2):120571. 10.1016/j.envpol.2022.120571 [DOI] [PubMed] [Google Scholar]
  • 40.Torvik FA, Flatø M, McAdams TA, Colman I, Silventoinen K, Stoltenberg C. Early puberty is associated with higher academic achievement in boys and girls and partially explains academic sex differences. J Adolesc Health. (2021) 69(3):503–10. 10.1016/j.jadohealth.2021.02.001 [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Table1.docx (54.6KB, docx)

Data Availability Statement

The datasets presented in this article are not readily available because the data are stored and used among Chongqing Medical University research teams, and we are working on data sharing policies and a website for collaborators' data access. The datasets used or analyzed during the current study are available from the corresponding author on reasonable request. Requests to access the datasets should be directed to QL, liuqin@cqmu.edu.cn.


Articles from Frontiers in Pediatrics are provided here courtesy of Frontiers Media SA

RESOURCES