Introduction Inadequate care during early childhood can lead to long-term deficits in skills. Parenting programmes that encourage investment in young children are a promising tool for improving early development outcomes and long-term opportunities in low-income and middle-income regions, such as rural China.
Methods We conducted a systematic review and a meta-analysis to investigate the prevalence of early developmental delays and stimulating parenting practices as well as the effect of parental training programmes on child development outcomes in rural China. We obtained data in English from EconPapers, PubMed, PsycARTICLES, Cochrane Library, Web of Science and Scopus (Elsevier) and in Chinese from China National Knowledge Infrastructure, Wanfang Data and VIP Information. We conducted frequentist meta-analyses of aggregate data and estimated random-effects meta-regressions. Certainty of evidence was rated according to the Grading of Recommendations Assessment, Development and Evaluation approach.
Results We identified 19 observational studies on the prevalence of developmental delays and stimulating parenting practices for children under 5 years of age (n=19 762) and ten studies on the impact of parental training programmes on early child development (n=13 766). Children’s risk of cognitive, language and social-emotional delays in the rural study sites (covering 14 provinces mostly in Central and Western China) was 45%, 46%, and 36%, respectively. Parental training programmes had a positive impact on child cognition, language and social-emotional development.
Conclusion There is evidence to suggest that early developmental delay and the absence of stimulating parenting practices (ie, reading, storytelling and singing with children) may be prevalent across rural, low-income and middle-income regions in Central and Western China. Results support the effectiveness of parental training programmes to improve early development by encouraging parental engagement.
Trial registration number This study was registered with PROSPERO (CRD42020218852).
- systematic review
- child health
- public health
- other infection
- or injury
Data availability statement
Data are available on request. The Stata code and data permissible for sharing will be made available on request from DE (email@example.com).
This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/.
Statistics from Altmetric.com
If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.
What is already known?
Parenting programmes that encourage caregiver psychosocial stimulation investment in young children are a promising tool for improving development outcomes and long-term opportunities in life in low-income and middle-income regions, such as rural China.
The prevalence of developmental delay, its associations with stimulating parenting practices (ie, reading, storytelling and singing with children), and impacts of parenting training programmes in poor, rural communities of China have not been well documented.
Previous global reviews of child development and related interventions reported that developmental delays in China’s rural population exist and can be improved with parental training programmes. However, no previous study has sought to quantify the overall impacts of parental training interventions.
What are the new findings?
We conducted a systematic review and meta-analysis on early developmental delay, parenting practices and impacts of parental training programmes in rural China.
We find that between 36% and 46% of young children below the age of 5 in the rural study sites which are nearly all in low-income and middle-income regions in Central and Western China are developmentally delayed. This high prevalence of delay coincides with evidence that three in four caregivers do not engage their young children in interactive reading, storytelling or singing activities.
Parental training programmes that focus on child psychosocial stimulation can have positive impacts on child cognition, language and social-emotional scores (of 0.26 SD (95% CI: 0.18 to 0.35 SD), 0.17 SD (95% CI: 0.06 to 0.28 SD) and 0.14 SD (95% CI: 0.03 to 0.24 SD), respectively). One of the underlying mechanisms of the success of parental training programmes appears to be increasing caregiver engagement in stimulating parenting practices and increasing the parenting knowledge of caregivers (0.39 SD (95% CI: 0.24 to 0.54 SD) and 0.20 SD (95% CI: 0.11 to 0.28 SD), respectively).
What do the new findings imply?
Early cognitive, language and social-emotional delay may be prevalent in rural communities in Central and Western China.
Early developmental delays are partly due to infrequent engagement of caregivers in stimulating parenting practices such as reading or storytelling with children.
Parenting interventions can improve child development outcomes. Future randomised trials are needed to inform the design and implementation of programmes at a larger scale.
Investment in early childhood development (ECD) has lasting returns for both individuals and nations.1–6 Across all low-income and middle-income countries (LMICs), however, it is estimated that up to 43% of children below the age of 5 are at risk for ECD delays.7 Although China is one of the most rapidly developing nations in the world, it is still considered an LMIC,8 and research has found major gaps in economic development and human capital between urban and rural areas of China.9 10 Although China as a whole is classified by the World Bank as an upper-middle-income country, more than 70% of young children are born and raised in rural communities with living standards on par with lower-middle-income economies.11 12 Given this large share of children who are growing up in rural communities, the ability of these children to acquire skills may have important implications for the nation’s transition into a high-income, high-wage, skilled-labour economy.
Research on ECD in rural China has grown in recent years. In this emerging body of work, there are three main types of studies: (1) those that measure the prevalence of developmental delays13–15; (2) empirical work that identifies the sources of delays, including research that documents the share of caregivers who invest in stimulating parenting and a quality home environment16 17 and (3) randomised controlled trial (RCT)-based evaluations of interventions designed to increase parental knowledge and engagement as a way to improve child development (Qian et al, 2020, unpublished manuscript).18 Despite the high quality of these empirical studies, all have been regional in focus. There have been no large-scale efforts to examine ECD and parental investment across the large number of communities in dispersed geographical areas that characterise rural China. Such research is needed to determine the overall nature of ECD and parenting practices across rural communities in China and to offer policy directions for future investments in rural ECD.
The goal of this study is to conduct a meta-analysis of all empirical studies focused on ECD in rural and migrant communities in China. We aim to achieve three specific objectives: first, to document the prevalence of developmental delays among young children (below the age of 5) across rural communities in China; second, to identify the prevalence of stimulating parenting practices among rural families; and third, to assess the effectiveness of interventions that aim to increase engagement in stimulating parenting practices, increase parenting knowledge and lower risks of developmental delay. Overall, we hope to provide a holistic snapshot of ECD challenges in rural China and to systematically consider the evidence in regard to effective solutions to those challenges.
Search strategy and selection criteria
To meet our study objectives, we conducted systematic searches of the literature in three areas: first, studies that present empirical findings on levels of cognitive, language and social-emotional development in rural China; second, studies that present empirical findings on stimulating parenting practices in rural China; and third, studies that present empirical impacts of interventions designed to improve ECD by improving parental investments (behaviours) in rural China. In conducting systematic reviews, we adhered to the Preferred Reporting Items for Systematic Reviews and Meta-Analysis guidelines. We searched six academic databases for studies published in English: EconPapers, PubMed, PsycARTICLES, Cochrane Library, Web of Science and Scopus (Elsevier). In addition, we searched three databases for studies published in Mandarin: China National Knowledge Infrastructure, Wanfang Data and VIP Information. The details of each systematic review are described below.
For our systematic search of empirical findings on the prevalence of ECD delays and parenting practices in rural China, we followed a two-step protocol. In the first step, we searched for empirical primary data studies in rural China published over the past 20 years (15 November 2000–15 November 2020). Our keywords included the terms: (child* OR infant*) AND “rural China” AND “early childhood” (see online supplemental appendix 1 for the full search strategy). To search empirical evidence of developmental delays, we added: AND (“development* delay” OR “cognit* delay” OR “language delay” OR “ling* delay” OR “mot* delay” OR “emotion* delay”). For our search of evidence on caregiving practices, we added: AND (parenting OR caregiving). In the next step, two independent screeners reviewed the titles and abstracts of the papers to check whether the studies satisfied our inclusion criteria. We used four inclusion criteria: (1) the study is an empirical primary data study; (2) the study concerns developmental delays and parent–child interactive caregiving practices for healthy children under age 5 (not severely malnourished; prematurely born; or suffering from a severe disease, mental trauma or disability); (3) the study is situated in rural China and (4) the study contains at least one outcome measure of infant cognition, language or social-emotional delay or the caregiver’s engagement in reading, storytelling or singing activities with the child. We restrict our attention to studies that use standardised instruments, such as the Bayley Scales of Infant Development (BSID) and the Ages and Stages Questionnaires (ASQ), for assessment of developmental delay and the Family Care Indicators (FCI) survey to evaluate the prevalence of stimulating parenting practices. In this paper, we do not include studies that assess early motor delay, as only motor development of the rural Chinese population has been shown to be normal (with an approximate 15% delay) and not a target of parenting intervention programmes.19 20
For our third systematic search of the literature, we aimed to gather all empirical evidence on the impacts of interventions designed to improve ECD by improving parental investments (behaviours) in rural and migrant areas of China. We searched for impact evaluations of ECD experiments in rural China published over the past 20 years (15 November 2000–15 November 2020). Our keywords included the search: (intervention* OR trial* OR experiment* OR RCT*) AND (stimulation OR nutrition) (see online supplemental appendix 1 for the full search strategy). Note that we included the search terms (stimulation OR nutrition) because we aimed to include multiple types of parenting training programmes (ie, programmes that provide parents with training on psychosocial stimulation and/or nutrition), but we did not aim to search for nutrition supplementation or micronutrient fortification programmes. In the next step, we screened the titles and abstracts of the papers to check whether the studies satisfied our inclusion criteria. We used five inclusion criteria: (1) the study is an empirical primary data study; (2) the study concerns a RCT that evaluates a parental training programme that involves children under age 5 (who are not severely malnourished; prematurely born; or suffering from a severe disease, mental trauma or disability) and their primary caregiver; (3) the study is situated and conducted in rural China; (4) the study contains at least one outcome measure of children’s cognition, language or social-emotional development and (5) the study reports the means or SDs of development outcomes (to facilitate comparison across studies).
In addition to these searches, we consulted reference lists of comprehensive reviews and contacted experts on the organising committee of The 1000 Day Initiative for information and leads on any unpublished studies and data. The 1000 Day Initiative is a coalition of experts who work on ECD issues within China, with members of the organising committee who belong to top Chinese research institutions, including Peking University, Tsinghua University, Shanghai Jiao Tong University, and the National Center for Women and Child Health within the Chinese Center for Disease Control and Prevention. Finally, if data (eg, the prevalence of delay) were missing from any of the studies we found, we contacted the principal investigator of the study for additional information.
We first examine the prevalence of ECD delays reported in studies across rural communities in China. We define developmental delay as a cognitive, language or social-emotional development score of 1 or more SDs below the mean of a reference population whose developmental trajectory is expected to be normal (ie, a child population in developed regions who was not prematurely born, severely malnourished, or severely diseased; see online supplemental appendix 2 for more detail). This is in line with the guidelines of the BSID (the gold standard of ECD measurement), which conventionally defines children with a score of more than 1 SD below the normative sample mean as mildly delayed in their development.21 Using 1 SD below the normative mean as a cut-off to capture all severities of developmental impairment is also in line with definitions from the Global Research on Developmental Disabilities Collaborators and the American Association on Intellectual and Developmental Disabilities.22 Many academic studies have used 1 SD below the normative mean as their cut-off as well.23 24 We report and compare the shares of each study sample who is delayed in cognitive, language and social-emotional development. We conduct a meta-analysis of the prevalence of delay by pooling prevalence of delays across studies using a DerSimonian and Laird random-effects model.
In addition to presenting risks of delay among our sampled children, we also investigate parental engagement in stimulating caregiver practices in rural China. We use indicators of the FCI survey, a validated survey instrument developed by UNICEF to evaluate the quality of the home environment (see table 1A).25 26 We report on the share of parents in each study who engaged in interactive reading, storytelling or singing in the days prior to the survey. These three indicators of the FCI survey are selected because they are most commonly reported in the literature on cognitively stimulating parenting practices in rural China. In addition, we compute aggregate shares of rural caregivers in China who engage in stimulating parenting practices. We conduct a meta-analysis of the prevalence of certain parenting practices by pooling the prevalence of parenting practices across studies using a DerSimonian and Laird random-effects model.
Third, for each of the impact evaluation studies identified by our systematic search, we present the impact on children’s cognitive, language and social-emotional development. To facilitate the comparison of treatment impacts across studies, we again conduct a frequentist meta-analysis of the estimated treatment impacts expressed in standardised mean differences, using a DerSimonian and Laird random-effects model. Of the studies that yield significant impacts on one or more indicators of child development, we then identified features common to all or most of the interventions to draw policy lessons for implementers and future researchers.
We use the risk of bias tool for prevalence studies developed by Hoy et al to assess the risk of bias for prevalence studies of developmental delay and parenting practices.27 We use the Effective Public Health Practice Project quality assessment tool to assess the risk of bias for RCTs.28 Two reviewers independently rated the risk of bias of each study. Any disagreement was resolved by consensus with a third member of the review team. We then synthesised data in tabular formats. Further, we graded the overall certainty of evidence on the effectiveness of parental training programmes using the Grading of Recommendations Assessment, Development and Evaluation (GRADE) approach.
Finally, we assess heterogeneity in risk of delay and treatment impacts across studies by programme curricula and delivery mode, measurement tool and geographical location. We use frequentist meta-analysis to assess heterogeneity in prevalence rates of delay and estimated treatment impacts expressed in standardised mean differences based on a DerSimonian and Laird random-effects model. The statistical analysis was conducted with Stata version 16.1.
Role of the funding source
There was no funding source for this study.
We identified 19, 12 and 10 papers on the prevalence of developmental delay, the prevalence of stimulating parenting practices, and the impact of parental training on child development outcomes, respectively (see figure 1A–C). Table 1 provides an overview of the included prevalence studies (table 1A) and evaluations of RCTs (table 1B). The risk of bias was generally low to moderate after considering the observational design of the included studies (see online supplemental tables 3.1,3.2).
Table 1A shows that data for all studies included in our meta-analysis of prevalence of delay and stimulating parenting practices were collected within the past 20 years. Of the studies, 13 were published within the past 15 years, and 6 are still unpublished. They evaluate 19 different samples from low-income and middle-income rural communities in 14 provinces mostly in Central and Western China: Anhui, Beijing (a rural migrant community), Guangdong, Guizhou, Hebei, Henan, Jiangxi, Ningxia, Qinghai, Shaanxi, Shanxi, Sichuan, Xinjiang and Yunnan. Hence, when we refer to ‘study sites’ in this study, we mean study sites in low-income and middle-income rural regions or migrant communities in Central and Western China. All 19 762 children were below age 5 at the time of data collection. Online supplemental table 7.1 lists each study, the province in which it took place, the implementing organisation, and any involvement of authors of this review.
Table 1B provides an overview of the 10 studies included in our meta-analyses of the impacts of parental training programmes on child development and parenting practices. Baseline data were collected within the past 20 years. Four of the studies were published within the past 15 years, and six are still unpublished. They evaluate nine different RCTs implemented in low-income and middle-income rural communities in six provinces mostly in Central and Western China: Anhui, Gansu, Hebei, Henan, Shaanxi and Yunnan. All 13 766 children who were involved in one of these studies were below 4 years of age at baseline (most of the children were below 30 months of age at baseline). In each of these RCTs, parental training on psychosocial stimulation (and, in some cases, nutrition) was delivered during one-on-one sessions over a period of 6 months up to 2 years. The frequency of the one-on-one training sessions varied from weekly to once every 6 months. One-on-one sessions were delivered by parenting trainers at home or in ECD centres that were installed at centrally located venues. If an ECD centre was installed, then intervention households received unlimited access to the open play areas and group training sessions organised at the centres (Bai et al, 2020; Zhong et al, unpublished manuscripts). All studies with outcome measures of cognition, language or social-emotional development or parenting practices or parenting knowledge are included in the respective meta-analysis.
Figure 2 presents estimates of ECD delays among infants and toddlers in the pooled sample. The share of infants and toddlers with cognitive delays ranges from 22% to 57% (see figure 2, top panel). The weighted average across all studies shows that, of the 19 762 infants and toddlers in the pooled sample, 44.8% (95% CI: 39.8% to 49.9%) were at risk of cognitive delay. The average risk of language delay in each study ranges from 26% to 72%, and the weighted average of the reported studies shows a language delay risk of 45.6% (95% CI: 38.2% to 53.1%) (see middle panel). Risk of social-emotional delays were slightly lower, ranging from 14% to 59%, and the weighted average shows that 36.5% (95% CI: 31.1% to 41.9%) of samples display social-emotional delays (see figure 2, bottom panel).
Figure 3 shows the prevalence of stimulating parenting practices among caregivers across the set of pooled studies. Between 2% and 76% of caregivers had read a book to their child over the past 3 days, with only 23.3% (95% CI: 9.4% to 37.2%)of caregivers, on average, having read a book to their child across the studies (see figure 3, top panel). The share of caregivers who sang songs with their children in the past 3 days ranged between 19% and 85%, with a weighted average of 44.8% (95% CI: 29.5% to 60.1%) across all datasets (see figure 3, middle panel). Between 9% and 76% of caregivers told stories to their child in the past 3 days, and the weighted average of all datasets shows that only 25.2% (95% CI: 9.5% to 40.9%) of caregivers had told a story to their child in the past 3 days (see figure 3, bottom panel).
Figure 4 portrays the intervention effects of the reviewed parental training programmes on child cognitive development and parenting practices (reading, storytelling and singing with children). All of the 10 RCT evaluations include an outcome measure of cognitive development scores (see top panel of figure 4), and all find significant short-run improvements in cognition. The size of the impact on cognition ranges from 0.11 SD to 0.75 SD The mean standardised effect size of child cognition is 0.26 SD (95% CI: 0.18 to 0.35 SD). Further, the bottom panel of figure 4 shows that seven studies reported impacts of parenting training interventions on a summary measure of parental engagement in stimulating parenting practices, with a mean effect size of 0.39 SD (95% CI: 0.24 to 0.54 SD).
Treatment impacts on child language and social-emotional development and parental knowledge and beliefs are summarised in online supplemental figures 5.1 and 5.2. Impacts on child language and social-emotional development are each measured in nine of the 10 intervention studies. For both outcome measures, impacts are less pronounced than those for child cognition. Three studies had a significant impact on language development, and two had a significant impact on social-emotional development. The mean standardised effect size is 0.17 SD (95% CI: 0.06 to 0.28 SD) for language development and 0.14 SD (95% CI: 0.03 to 0.24 SD) for social-emotional development. Online supplemental figure 5.2 shows that six of the intervention studies reported intervention impacts on parenting knowledge and beliefs, with a mean effect size of 0.20 SD (0.11–0.28 SD).
Based on a GRADE summary analysis, we conclude with high certainty that parental training programmes can improve cognitive outcomes of children under age 5 in rural Chinese communities (see first row of online supplemental table 4.1). Further, such training programmes can, with moderate certainty, raise language and social-emotional development of young children (see rows 2 and 3 of online supplemental table 4.1). Finally, we conclude with moderate certainty that the parenting programmes led to increases in stimulating parenting practices and parenting knowledge (see rows 4 and 5 of online supplemental table 4.1). This evidence indicates that, after receiving training for improving the skill development of their young children, parents are likely to both learn and to practice what they learn.
Note that one (unpublished) study by Wang et al17 included in our meta-analysis, evaluates the persistence of treatment effects 2.5 years after programme completion. This study provides important evidence in support of persistent treatment impacts on child cognition, parental investment and parental beliefs and knowledge in the longer run.
We also conduct an analysis of heterogeneity in prevalence and child development delays and estimated treatment impacts by programme curriculum, delivery mode, measurement tool and geographical location. We find that centre-based and home-based programmes both have a positive and significant impact on child cognitive development (see online supplemental figure 6.4). The average impact of centre-based programmes, however, is smaller than the average impact of home-based programmes (ie, 0.19 SD (95% CI: 0.11 to 0.27 SD) and 0.34 SD (95% CI: 0.21 to 0.48 SD), p value of between-group heterogeneity <0.05). The remainder of the results of the heterogeneity analysis are presented and discussed in detail in online supplemental appendix 6.
In this study, we aggregate all empirical studies of ECD in rural China, providing a holistic snapshot of ECD challenges in low-income and middle-income rural communities in Central and Western China, as well as the effectiveness of interventions designed to address these challenges. A major finding is evidence that suggests that early developmental delays may be prevalent across rural communities oin Central and Western China. The studies spanned 14 provinces, covering all of China’s major geographical regions. They included samples from villages, townships, and migrant communities and focused on healthy children under 5 years old. A major strength of this study is that most of the included studies evaluated population-based random samples, which affirms the representativeness of the study samples. All of the included studies identified rates of developmental delays that were higher than what would be expected from a healthy population. The average prevalence of cognitive delay across all studies, based on a random-effects meta-analysis, was 45%. This is comparable to that observed in other LMICs (42%).29 Because delays at this age have been linked to lifelong impacts on school performance, educational attainment, job market outcomes and even criminal activity, these high risks of delay should be viewed as a key area for intervention to promote long-term individual and social well-being.
A second major finding is that rural parents infrequently engage in stimulating parenting practices, such as reading, storytelling and singing with their children. On average, 23% of families read with their children on a regular basis, and 25% tell stories to their children. Frequent engagement in singing is somewhat higher, at 45%. These prevalence levels are consistent with those of Walker et al30 and somewhat lower than those of Nonoyama-Tarumi and Ota,31 two studies conducted in developing settings around the world. The low prevalence of parent-child engagement can be seen as a reflection of the economic inequality that demarcates rural and urban areas.32 The human development literature is unanimous in establishing that households with higher incomes and in more developed regions on average invest more time and resources in their children.33
There are several factors that may limit caregivers in the rural study sites from frequently engaging in stimulating parenting practices, including financial stress, time constraints, knowledge constraints and mental health problems of the caregiver. A handful of quantitative and qualitative studies in rural China have found knowledge constraints to be the underlying primary factor for the low prevalence of stimulating parenting practices in rural China. Although rural households typically have fewer economic resources than urban households, two studies found that the majority of rural households have sufficient economic resources to invest in their children’s development as well as a desire to do so.16 34 Similarly, a recent study found that many rural caregivers reported having sufficient leisure time that could be used for interactive play with their children.34 In contrast, there is evidence that rural caregivers do not have salient knowledge of how to effectively engage in stimulating parenting practices and often lack reliable sources of information about parenting.16 34 This may be compounded by mental health issues among rural caregivers in China, which several studies have found to be prevalent and linked to reduced ECD outcomes.35–38
In light of these problems, 10 studies have completed rigorous evaluations of ECD interventions among the 0–5 age group. These interventions are designed to improve ECD primarily by targeting caregiver knowledge and skills through one-on-one parental training. We find with high certainty that these interventions improve early cognitive development (see online supplemental table 4.1). We find with moderate certainty that they improve early language and social-emotional development as well as increase parenting knowledge and engagement in stimulating parenting practices. These findings are consistent with findings from other low-resource settings globally.20 39–48 These impacts suggest that increases in parental investment in stimulating parenting practices, such as reading, storytelling and singing, and changes in parenting knowledge are important mechanisms behind the impacts on child development.
Several key lessons emerge from the evaluation studies included in our meta-analysis. First, we find that one-on-one parental training is effective for improving early cognitive development in rural China. Each of the interventions included in our analysis contained some version of one-on-one training (see table 1B), and all found positive and significant impacts on children’s cognitive development (see top panel of figure 4). Six different curricula were used; delivery modes included in-home and centre-based parental training sessions; and programmes were variously implemented by local village women, volunteers, doctors and family planning officials. The main commonality among the interventions was the use of a one-on-one training component. We did not identify any evaluations of group-based interventions in rural China, another delivery format that has been used in ECD interventions internationally.49–52 The results of our meta-analysis suggest, however, that, among the many different approaches to crafting an effective ECD intervention in rural China, one-on-one parental training appears to be a common element.
A second major lesson learnt from our meta-analysis of ECD interventions is that, although centre-based and home-based interventions have positive impacts on ECD, home-based interventions seem to have a larger impact on child cognition (0.34 SD vs 0.19 SD; see online supplemental figure 6.4). This can be partly explained by the fact that the most vulnerable children, who would benefit the most from this type of intervention, are more likely to miss out on a centre-based programme than on a home-based programme.53 To understand the reason, we contrast findings from the unpublished study of Zhong et al, a centre-based intervention, and Sylvia et al,54 a home-based intervention. The two studies were conducted by the same research group; the interventions used one-on-one parental training, following the same curriculum, and both were implemented in the same region of the country. The main difference was the location of the training: One was delivered in the home, and the other was delivered in a parenting centre. In the home-based intervention, the highest impacts are observed among children with the lowest levels of parental investment at baseline. In contrast, in the centre-based intervention, children with the lowest levels of parental investment self-select out of the intervention due to hurdles such as high opportunity cost of parents’ time to visit the centre or limited access to reliable and effective transportation. The findings of these two studies are consistent with the research finding that participation in centre-based programmes is widely unequal in developing settings.31 55
We acknowledge limitations to our work. First, we define developmental delay as a cognitive, language or social-emotional development score of 1 or more SDs below the mean of a reference population whose developmental trajectory is expected to be normal (see online supplemental appendix 2 for more detailed information). Although many other academic studies also use 1 SD below the normative mean as their cut-off,23 24 others may use different cut-offs, so caution should be used when comparing our results with those from other settings. Second, the reviewed studies use various measurement tools to assess the prevalence of ECD delay. The BSID is a comprehensive diagnosis tool for developmental progress, whereas the ASQ-3 test is a short screening tool for the detection of children at risk of delay. Studies have found that the ASQ-3 may be a less accurate measurement tool for the detection of developmental delay, although evidence on whether the ASQ-3 test overestimates or underestimates the prevalence of developmental delay as compared with BSID assessments is mixed.56–58 Our measurement-based heterogeneity analysis finds that the point estimates of the prevalence of cognition and language delay are slightly lower in studies that use the ASQ-3 as compared with studies that use the BSID (37.0% vs 48.2% and 35.8% vs 52.7%; see online supplemental figures 6.7 and 6.8, respectively), but we do not detect a difference in the observed prevalence of social-emotional delay when using ASQ-3, ASQ:SE, or BSID (40.5% vs 32.3% vs 36.1%; online supplemental figure 6.9). Note that, regardless of the measurement tool used, the prevalence of delay is high. Further, based on the results presented in online supplemental appendix figures 6.10–6.12, we find no systematic differences in estimated treatment impacts on early cognitive, language and social-emotional development across studies that used the ASQ-3 or the BSID.
We also acknowledge the inclusion of several unpublished papers and reports in our analyses. Because these studies have not yet undergone peer review, there may be unintentional errors or biases in their data, although it should be noted that we found low levels of bias in all of our formal bias checks. As argued by Dwan et al,59 empirical research shows that results in published studies are more likely to be positive and statistically significant than are results in unpublished studies. Moreover, publishing studies with insignificant impacts is more difficult and may take longer, which may lead to a ‘pipeline bias.’ This type of publication bias is a threat to the validity of systematic reviews. In line with routine practice in meta-analyses, we use a visual analysis of a funnel plot, the trim-and-fill method, and Egger’s test of small-study effects to test for publication bias.59–61 We find no evidence of publication bias when we combine the published with the unpublished studies (see online supplemental appendix 7 for a detailed discussion). Hence, we concluded that the benefits of including unpublished studies outweigh the limitations.
In conclusion, the reviewed evidence shows consistently that early developmental delay and the absence of stimulating parenting practices (ie, reading, storytelling and singing with children) may be prevalent across rural communities that are similar to populations in the study sites in Central and Western China. Our meta-analysis, however, has uncovered several gaps in the literature with regard to the potential role that intervention programmes can play. First, although we find that parenting training programmes have large and consistent impacts on children’s cognitive development, we also find that they have smaller and less-consistent impacts on social-emotional and language development. Therefore, future in-depth evaluations of potential impacts on these development areas, including studies that examine the mechanisms behind the observed impacts on these dimensions of development, could provide useful information to guide future intervention designs. Second, it remains unclear to what extent intervention programmes are able to reach the most vulnerable families, who are likely to benefit the most from this type of intervention programme. For example, could a clinic-based setting take advantage of the existing infrastructure and leverage existing social trust to host an effective and inclusive intervention? How can practitioners help to stem attrition from ECD interventions? Limited global evidence suggests that the most marginalised populations are more likely to miss out on clinic-based and centre-based interventions due to limiting factors, such as a remote living location or time constraints.53 55 Further research on inclusiveness, participation constraints and optimal programme targeting can shed light on how we can provide more children, including the most disadvantaged ones, with a fair start in life.
Data availability statement
Data are available on request. The Stata code and data permissible for sharing will be made available on request from DE (firstname.lastname@example.org).
We would also like to acknowledge the effort of several of the groups that were part of the research efforts, especially those that helped in designing, supporting and/or implementing the interventions, including UNICEF, OneSky, China Development Research Foundation, Save the Children, and others.
DE and QJ are joint first authors.
Handling editor Seye Abimbola
Contributors DE and QJ contributed equally to this study. DE conceived the study, collected the data, verified the underlying data, conducted the statistical analysis, interpreted the data and drafted the manuscript. QJ conducted the statistical analysis, interpreted the data, and provided critical edits to the manuscript. HX contributed to the statistical analysis, verified the underlying data, interpreted the data, conducted the bias assessment, and provided critical edits to the manuscript. YueZ contributed to the statistical analysis, verified the underlying data, interpreted the data, and provided critical edits to the manuscript. YTZ contributed to the statistical analysis, verified the underlying data, interpreted the data, and provided critical edits to the manuscript. YXZ reviewed and provided critical edits to the manuscript. BL reviewed and provided critical edits to the manuscript. S-ED collected the data, conducted the bias assessment, interpreted the data and helped to draft the manuscript. YQ conducted the statistical analysis, interpreted the data, and provided critical edits to the manuscript. NW conducted the statistical analysis, interpreted the data, and provided critical edits to the manuscript. HJ conducted the bias assessment, interpreted the data and provided critical edits to the manuscript. JC provided critical edits to the manuscript. XW conducted the statistical analysis, interpreted the data, and provided critical edits to the manuscript. LW conducted the statistical analysis, interpreted the data and provided critical edits to the manuscript. RL reviewed and provided critical edits to the manuscript. GL conducted the statistical analysis, interpreted the data, and provided critical edits to the manuscript. JX conducted the statistical analysis and interpreted the data. ML conducted the statistical analysis, interpreted the data, and provided critical edits to the manuscript. YH conducted the statistical analysis, interpreted the data and provided critical edits to the manuscript. WS conducted the statistical analysis, interpreted the data and provided critical edits to the manuscript. ZL conducted the statistical analysis, interpreted the data and provided critical edits to the manuscript. YuZ conceived the study, interpreted the data, and provided critical edits to the manuscript. SS conceived the study, interpreted the data and provided critical edits to the manuscript. YM conducted the statistical analysis, interpreted the data, and provided critical edits to the manuscript. AM interpreted the data, conducted the bias assessment and drafted the manuscript. SR conceived the study, collected the data, interpreted the data and provided critical edits to the manuscript.
Funding DE acknowledges support from KU Leuven, Long-term structural funding-Methusalem funding by the Flemish Government and by the Fonds voor Wetenschappelijk Onderzoek-Vlaanderen (FWO), and the Fonds de la Recherche Scientifique-FNRS under EOS Project No. G0G4318N (EOS ID 30784531).
Competing interests We have listed the institutions or organisations that were in charge of programme implementation and the involvement of coauthors of this manuscript in reviewed intervention studies in online supplemental appendix table 7.1.
Provenance and peer review Not commissioned; externally peer reviewed.
Supplemental material This content has been supplied by the author(s). It has not been vetted by BMJ Publishing Group Limited (BMJ) and may not have been peer-reviewed. Any opinions or recommendations discussed are solely those of the author(s) and are not endorsed by BMJ. BMJ disclaims all liability and responsibility arising from any reliance placed on the content. Where the content includes any translated material, BMJ does not warrant the accuracy and reliability of the translations (including but not limited to local regulations, clinical guidelines, terminology, drug names and drug dosages), and is not responsible for any error and/or omissions arising from translation and adaptation or otherwise.