SP methodology: strengths and limitations
SPs are people recruited from the local community and extensively trained to present the same prespecified condition to various providers. For instance, an SP may be trained to portray angina, reporting to the doctor with ‘crushing chest pain’ when he woke up and accurately responding to questions and examinations that the doctor then performs. This method is fundamentally different from other proposed quality measures, both in the richness of the data and its ability to avoid typical biases or confounding issues arising from patient sorting and casemix.
Consider, for example, four traditional methods of quality measurements for healthcare: (1) interviewing patients after they receive services (exit interviews), (2) interviewing providers to assess their knowledge (provider interviews and vignettes), (3) analysing data from claims or medical records (record abstraction), and (4) observing patient–provider interactions (direct patient observation).6–12 We briefly discuss the strengths and limitations of these methods alongside the SP method below but refer the reader to a more thorough discussion in section 1 and table 1.1 in the online supplementary 1.
Relative to these methods, the use of SPs confers five distinct advantages. First, even when substantial information is collected through patient observations or exit surveys, at best the observers see what the doctor recommends. They cannot truly know what the patient is suffering from. We can, therefore, examine specific overall metrics of care (eg, consultation time or antibiotic use), but since we cannot assume whether more or less of each type would be ‘better’, we cannot ascertain the effectiveness, let alone appropriateness, of this care. In contrast, because researchers designed an SP case with angina symptoms, for example, they can thus correctly infer that a patient given antibiotics and sent home was incorrectly managed. Researchers can, therefore, assess the care received, including misdiagnosis, overtreatment and undertreatment, against prespecified benchmarks for the condition of interest.
Second, the use of SPs allows researchers to minimise measurement issues related to patient sorting and casemix, which confound observed relationships in administrative and patient data. This comes from the fact that the same SP can visit multiple providers with an identical presentation. Third, since providers are unaware when they are interacting with an SP, biases from the Hawthorne effect and social desirability whereby providers change their behaviour or survey responses because they know they are being observed or surveyed are avoided.10 Fourth, the ability to design many aspects of the condition presented—down to the way the SP dresses, carries himself and behaves—allows researchers to tailor the mix of conditions presented to a given context and a given research question. When combined with specific research designs and sampling techniques, the SP method can help answer important questions that have proven difficult or impossible to tackle with data from real patients. Fifth, the SP method is necessary to identify the gap between what providers know and what they do in practice (ie, the know-do gap).
These advantages of the SP method have vastly expanded the scale and scope of SP studies beyond the first population-based study in a large, representative sample of providers in India.13 SP protocols have now been developed for a range of medical conditions and implemented in multiple countries around the world and have proven to be a better fit for studies where researchers are interested in (1) understanding clinical practice (rather than knowledge, which are better measured through medical vignettes) and (2) quantifying the extent of overtreatment and undertreatment (currently the only method that produces reliable estimates) based on specific tracer conditions. For example, SPs have been used to answer questions related to quality differences between providers in the public and private sectors, whether providers treat men and women differently, and the impact of medical training on quality of care.12 14 15 Further, provider behaviour can be compared across different methods, adding further insights into the determinants of quality.
Nevertheless, the SP method has its limitations and challenges. For instance, the method is limited to a one-time interaction with a provider and has not yet been validated for multiple, sequential visits to the same provider as may be required for a chronic condition. The SP method is also not feasible for conditions that require physical signs to be evident (eg, trauma and pregnancy). As for assessing quality of care for childhood conditions, the SP method has been used with and without real children present, which requires different, yet detailed precautions.16 Further, the SPs are trained to present their symptoms and history in a manner that should not lead the provider in a wrong direction, but in real-life situations, many patients have difficulty presenting the symptoms and history of their disease in a clear manner, and the provider could be misled because patients give a very confused picture of their symptoms. Even further, the identity of SPs themselves may lead to different estimates–quality estimates from a more educated SP population may not be generalisable for less educated patients. Ethical considerations are also paramount, as both providers and SPs could be subject to harm in the conduct of the research if initial scoping of the setting and proper mitigation strategies are not put in place.
The richness of the data from SPs and the ability of the SP method to account for multiple biases in observational studies yields unique information that can save lives. With this in mind, research teams will want to assess the costs and trade-offs of implementing SPs relative to other data collection methods. We have found that developing the capacity to implement an SP study is the most costly investment (in terms of time, effort and finances) of the SP method. Once this capacity has been developed, with sufficiently large samples, the costs of an SP survey compared with collecting observational data with other methods are roughly equivalent or even lower. This is because, even though the initial SP set-up and training costs are high, the cost of each interaction is relatively low. (For a richer discussion on actual costs, we refer the reader to pp. 47–48 of the online supplementary file, where we discuss costs per interaction across studies.)
Recognising that the rapid expansion of quality measurement using SPs has brought with it increasing demands on the method and a new set of questions, this practice paper has two main goals. The main article presents a regression-based framework for quality of care research using the SP method. Using examples from published studies and ongoing research, we illustrate how SP studies and the data they generate can be designed to answer a variety of descriptive and causal research questions that go beyond basic quality measurement. We discuss issues that have arisen with SP measurement, including those that are yet to be satisfactorily resolved. We highlight that many of these problems are common to all quality measurement methods, but become especially salient because of the richness of the data that SPs provide. The main article thus seeks to answer: ‘Can an SP study contribute to my research question of interest, and if so, what issues should I be aware of?’
We complement the main article with a comprehensive online supplementary file that includes our SP Toolkit and Manual, which has been developed through successive iterations across multiple SP studies dating back to 2008. The online supplementary file includes detailed discussions of ethical issues, institutional review boards (IRB) concerns, costs, SP recruitment and training, and real examples of questionnaires and data structures. It accumulates the wisdom of teams as well as multiple IRB and ethics committees to help conduct SP studies in a valid and ethically robust manner from designing the study and obtaining ethical approval to field implementation. Specifically designed as an updated ‘How To’ guide for those planning an SP study in the field, it answers the question: ‘Having decided to do an SP study, how should I actually implement it?’
We make three further observations. First, these resources complement our recent practical overview of SP implementation in the field.17 Second, this article’s focus is on the econometrics and statistics of the SP method with most examples drawn from our own studies in low-income and middle-income countries (LMICs). This focus reflects our own expertise and familiarity when it comes to quality measurement in healthcare and therefore the areas where we can provide the most value. Third, SPs are new to healthcare quality measurement in LMICs, but are a well-developed tool in research on discrimination in housing and labour markets where they are referred to as ‘audit studies’. For those interested in the general issues with such approaches, we refer readers to the overview by Bertrand and Duflo.18