Discussion
In this paper, we present a longitudinal study of sustained checklist implementation in Madagascar—to our knowledge, this is one of the first longitudinal large-scale evaluations of checklist use in a LMIC setting to date. Twelve to 18 months after a 3-day training course, there was widespread use of the checklist with 74% of participants still using the checklist. Since the checklist has a dose-dependent effect, compliance rates of less than 100% can still result in improved patient outcomes.6 9 11 Our results are comparable to studies in high-income countries. In England, Russ et al
27 reported that on average only two-thirds of items on the checklist were verified, sign out was not completed in 39% and team members were absent in more than 40% of cases. In New Zealand, the average percentage of checklist items completed ranged from 40% to 69%, and the operating room team engagement was often incomplete.33
The WHO checklist encourages procedural compliance to basic safety processes and aims to improve operating room safety by improving teamwork and communication. Teamwork and communication are known to influence patient outcome,34–37 and it is these aspects of the checklist that are proposed explanations for the success of the checklist in reducing mortality and morbidity by almost 50%.38 Disengaged or cynical use of the checklist may be detrimental.14 Thus, evaluations of checklist implementation must measure both procedural compliance and team behaviour15 and fidelity of checklist use (ie, use of the checklist in the spirit and manner with which it was designed).13 WHOBARS is specifically designed to measure the behavioural aspects of checklist utilisation. In our study, 7 out of 11 hospitals had a high WHOBARS (>5.3 (75%)) and 2 out of 11 had WHOBARS <3.5 (50%). High WHOBARS suggests effective team behaviour and constructive engagement during checklist implementation. Checklist use did not correlate with hospital size, surgical volume, WHOBARS, increased personal satisfaction or reduced stress at work, but was associated with an improved understanding of patient safety. However, most participants personally reported in questionnaires that checklist use had increased their understanding of patient safety and personal satisfaction with their work (87% and 83%, respectively), as well as improving their teamwork, communication and organisation. One explanation for this may be that the checklist requires a team approach not an individual one. Therefore, even though checklist use has a positive individual impact, that is not enough to significantly influence checklist use overall. WHOBARS scores were calculated using a small sample size at each hospital (only 1–3 observations) and half were calculated based on simulation observations rather than real operating room scenarios. This may have compromised the validity of the WHOBARS measurements and weakened any predicted effect on checklist use. A post hoc sensitivity analysis with WHOBARS excluded did not substantially change the correlations of the other variables.
Participants reported improved teamwork and communication, better organisation and preparation and a greater trust and confidence in each other and with patients. This level of positive impact of checklist use on individuals may partially account for the sustained effects since the benefits are tangible at a personal level and that may provide motivation to overcome local challenges. The most commonly reported challenge (6/14 hospitals) was lack of staff during emergency surgery. To overcome this challenge, many participants described trying to do what they could, or starting the surgery and then catching up with items on the checklist in retrospect. Operating room staff persisted in trying to overcome the difficulties perhaps because they could perceive the benefits of checklist use both personally (83%) and for patient safety (87%). This contrasted with two hospitals where dominant and influential team members demonstrated a negative or cynical attitude to the checklist. In these hospitals, staff felt powerless to do anything and said that the checklist had had a negative effect. This reinforces reports that negative or cynical checklist use can be detrimental.14
In focus group discussion, pulse oximetry, counting needles, swabs and instruments and timing of antibiotic prophylaxis were reported as the biggest changes in practice, with most hospitals incorporating these procedures into routine practice. The use of pulse oximetry shows the largest sustained improvement in practice since over 50% of hospitals did not have a pulse oximeter at the start of the project,39 but received a donated pulse oximeter as part of checklist training.26 Counting was not performed prior to the training and was taught as part of the checklist course.26 At 4 months post-training, checklist use was associated with counting instruments but commonly reported difficulties with counting included inability to recall the names of instruments and a lack of personnel to do the counting. One explanation for the association of checklist use with counting (at 4 months) was that if participants were motivated enough to overcome the difficulties of learning the names of the instruments, they may be motivated enough to use the checklist. By contrast in this study at 12–18 months post-training, no one reported difficulties with recalling names of instruments presumably because they had persisted and now knew all the names. Also lack of personnel to perform the surgical count was only a challenge in emergency surgery at 12–18 months because systems and processes had been worked out to incorporate counting into routine daytime work without increasing the number of team members.
The follow-up rate at 12–18 months in our longitudinal study was 37% (158/427 originally trained), compared with 47% (183/427) at 4 months.26 This is higher than other LMIC surgical evaluation studies that have reported follow-up rates of 17%–44%40–48 and slightly lower than response rates of 38%–70% for surveys and self-reporting studies in high-income countries.49 50 LMICs studies generally have lower follow-up rates than high-income countries due to challenges such as non-functioning telephone numbers and email addresses, inadequate record-keeping, transport costs and difficulties accessing rural locations.
This study has a number of limitations. Checklist use at the procedural level was self-reported and may be open to subjective bias, recall bias, under-reporting in the hope of getting further training or over-reporting to create a falsely good impression (ie, social desirability bias). WHOBARS was observed only for one or two interventions in the operating room in half of the hospitals and in up to three simulations in the remainder. For WHOBARS, a sample size of 9 is recommended to show differences between hospitals, but we did not have time to complete nine observations. However, we did not aim to make comparisons between hospitals using WHOBARS but rather to use WHOBARS to assess the behavioural aspect as well as the procedural aspect of checklist utilisation. The focus groups were not recorded and transcribed verbatim due to resource constraints and may be open to subjective recorder bias. Focus groups were heterogeneous and therefore nurses and other staff may have been compromised by social pressure and a hierarchical authority culture from speaking openly in front of surgeons. From our observations, whether or not nurses were compromised from speaking out depended on the surgical team dynamics and the culture of the hospital. In some hospitals, nurses were very vocal and willing to speak out and became ‘checklist champions’ and but in others they were quieter and it was very difficult to engage them in the focus group discussion. Only two-thirds of the original hospital sites were visited and even though no hospital had received further checklist training in the interim, there may have been other factors in the interim that effected operating room procedures, practice and culture outside of our control or knowledge. Follow-up rate from the original training sample of 427 participants was only 37%, but this is comparable to surgical outcome studies in LMICs.40–48 We are unable to contextualise this further because we do not know what per cent of the total surgical staff this represents. As described elsewhere,25 during the original training, entire perioperative teams were asked to be present and hospitals did not schedule non-emergency surgery during the 3-day training. This resulted in the majority of the perioperative staff attending the training. Yet, for the 4-month follow-up and for this study, no surgeries were postponed and no participation incentives were offered, which may have reduced the follow-up rate. Further, when hospital directors made the initial contact with participants prior to our visit, they may have assumed only a few participants would be sufficient to report back for the group as a whole.
Our study also has a number of strengths. To the best of our knowledge, this is the first longitudinal study of national checklist implementation in a LMIC. The study was designed to test the hypothesis that rapid (using a 3-day course) nationwide checklist implementation is not only possible but also has a sustainable impact. We measured procedural compliance and team behaviours as part of the study and triangulated these results with qualitative data from focus groups to give insight into the personal impact of the checklist on staff satisfaction as well as organisational culture. Our future aims are to identify the specific implementation strategies and outcomes associated with success in order to inform national implementation plans elsewhere.
In conclusion, our study shows that 12–18 months after a 3-day training course administered to all the regional hospitals in Madagascar, 74% of participants were still using the checklist, 83% reported that checklist use improved their work satisfaction and 64% (7/11) of hospitals had WHOBARS >75% indicating very good team engagement and communication during checklist administration. An improved general understanding of patient safety was predictive of checklist use but hospital size and surgical volume were not. Further research is needed to evaluate this 3-day checklist course and blended educational implementation model in other countries.