
Making sense of the evidence in population health intervention research: building a dry stone wall
  1. David Ogilvie1,
  2. Adrian Bauman2,
  3. Louise Foley1,
  4. Cornelia Guell3,
  5. David Humphreys4,
  6. Jenna Panter1
  1. MRC Epidemiology Unit, University of Cambridge, Cambridge, UK
  2. School of Public Health, The University of Sydney, Sydney, New South Wales, Australia
  3. European Centre for Environment and Human Health, University of Exeter, Truro, UK
  4. Department of Social Policy and Intervention, University of Oxford, Oxford, UK

  Correspondence to Dr David Ogilvie; david.ogilvie@mrc-epid.cam.ac.uk

Abstract

To effectively tackle population health challenges, we must address the fundamental determinants of behaviour and health. Among other things, this will entail devoting more attention to the evaluation of upstream intervention strategies. However, merely increasing the supply of such studies is not enough. The pivotal link between research and policy or practice should be the cumulation of insight from multiple studies. If conventional evidence synthesis can be thought of as analogous to building a wall, then we can increase the supply of bricks (the number of studies), their similarity (statistical commensurability) or the strength of the mortar (the statistical methods for holding them together). However, many contemporary public health challenges seem akin to herding sheep in mountainous terrain, where ordinary walls are of limited use and a more flexible way of combining dissimilar stones (pieces of evidence) may be required. This would entail shifting towards generalising the functions of interventions, rather than their effects; towards inference to the best explanation, rather than relying on binary hypothesis-testing; and towards embracing divergent findings, to be resolved by testing theories across a cumulated body of work. In this way we might channel a spirit of pragmatic pluralism into making sense of complex sets of evidence, robust enough to support more plausible causal inference to guide action, while accepting and adapting to the reality of the public health landscape rather than wishing it were otherwise. The traditional art of dry stone walling can serve as a metaphor for the more ‘holistic sense-making’ we propose.

  • prevention strategies
  • public health
  • intervention study
  • systematic review

This is an open access article distributed in accordance with the Creative Commons Attribution 4.0 Unported (CC BY 4.0) license, which permits others to copy, redistribute, remix, transform and build upon this work for any purpose, provided the original work is properly cited, a link to the licence is given, and indication of whether changes were made. See: https://creativecommons.org/licenses/by/4.0/.


Summary box

  • Systematic reviews and guidance development groups frequently conclude that the available evidence about the effects of population health interventions is too diverse, flawed or inconclusive to support a more general conclusion about what should be done.

  • In spite of all the developments in quantitative methods for primary research and evidence synthesis, we struggle to derive meaningful generalisable inferences from the evaluation of interventions in arenas such as the food, transport or welfare systems to guide and support public health action.

  • We respond to a long-standing call for more ‘holistic sense-making’ in this arena by proposing a more eclectic, flexible and reflexive approach to building and interpreting the evidence.

  • We show how a spirit of pragmatic pluralism might be channelled into constructing ‘dry stone walls’ of evidence, robust enough to support more plausible causal inference to guide action, while accepting and adapting to the reality of the public health landscape rather than wishing it were otherwise.

  • We should look beyond simple notions of ‘interventions’, search for patterns and embrace the mess in evidence synthesis in order to better understand what makes for an effective public health strategy.

Introduction

Effectively tackling population and planetary health challenges such as climate change or diabetes requires us to address the fundamental, upstream determinants of behaviour and health in populations.1 This may sometimes entail contentious policies such as diverting funds from other priorities or constraining people’s freedoms, which ought to be guided by the best available scientific evidence. To this end, it is increasingly accepted that we should advocate, fund and strengthen the evaluation of interventions in arenas such as the food, transport or welfare systems, often in the form of natural experiments.2

However, as the ongoing drip-feed of contested and contradictory research findings in respect of coronavirus pandemic control measures has illustrated, merely increasing the supply (and rigour) of primary studies is not enough.3 Governments have to make decisions all the time. The pivotal link between research and policy or practice should be the cumulation of insight from multiple studies in some form of evidence synthesis,4 but systematic reviews and guidance development groups frequently conclude that the available evidence about the effects of population health interventions is too diverse, flawed or inconclusive to support a more general conclusion about what should be done.2

One reason for this is that studies conducted in ‘real-world’ settings are often critiqued for a lack of internal validity in comparison with randomised trials in more controlled settings. This may be compensated for by greater external validity—the likelihood of producing practice-based evidence that might be successfully translated to the systems in which others work.5 However, the fact that these studies are produced in particular settings is also the main apparent impediment to their generalisability. Interventions to change such things as how products are taxed, how cities are laid out or how society supports people in old age inevitably take place in particular places with particular characteristics, which vary widely across the globe and even within countries. How might we do a better job of deriving meaningful generalisable inferences from studies like this to guide and support public health action in other places?

Promising solutions or false refuges

Feeding the meta-analytical machine: piling the stack and singing in harmony

Thrombolysis was not routinely used to treat heart attack until the late 1980s. If the available trials had been combined in a meta-analysis, however, its effectiveness would have been established beyond reasonable doubt by 1973.6 Precedents like this suggest that the solution to a lack of evidence is simply to conduct more intervention studies in more places, on the basis that once we have a tall-enough stack of good-enough papers to populate a meta-analysis, we will know.
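
As a concrete illustration of this logic, the sketch below performs a minimal fixed-effect cumulative meta-analysis of the kind underpinning the thrombolysis example, pooling trials in chronological order and recomputing the combined estimate as each new trial arrives. The trial data are invented for illustration and bear no relation to the actual thrombolysis trials.

```python
# A minimal sketch of cumulative meta-analysis: trials are pooled in
# chronological order using inverse-variance (fixed-effect) weighting, and
# the pooled estimate is recomputed as each new trial is added.
# All trial data below are invented for illustration.
import math

# (year, log odds ratio, standard error) for a series of hypothetical trials
trials = [
    (1970, -0.40, 0.35),
    (1971, -0.10, 0.30),
    (1972, -0.35, 0.25),
    (1973, -0.30, 0.20),
    (1977, -0.25, 0.15),
]

cum_w = cum_wy = 0.0
for year, log_or, se in sorted(trials):
    w = 1.0 / se**2             # inverse-variance weight
    cum_w += w
    cum_wy += w * log_or
    pooled = cum_wy / cum_w     # fixed-effect pooled log odds ratio so far
    pooled_se = math.sqrt(1.0 / cum_w)
    lo, hi = pooled - 1.96 * pooled_se, pooled + 1.96 * pooled_se
    print(f"{year}: pooled OR = {math.exp(pooled):.2f} "
          f"(95% CI {math.exp(lo):.2f} to {math.exp(hi):.2f})")
```

With each added trial the confidence interval narrows, which is precisely how a cumulative analysis can reveal that a question was settled years before practice changed.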

In practice, however, many systematic reviews of population health improvement strategies have been more successful in delineating what we do not know than in identifying unequivocally effective interventions.2 Often, what has prevented the formulation of clear answers is not so much a lack of studies as the lack of a way of reconciling the diversity of their study designs, limitations, interventions and contexts.7 One apparent solution is to limit meta-analysis to a set of more statistically comparable studies, but this risks perpetuating an evaluative bias in favour of an intervention ‘monoculture’ that may or may not include the most promising strategies.8 Another way of dodging the challenge is to split the problem into more and more discrete and evaluable chunks. These may eventually tell us the effect of doing X, but however refined the answers to this kind of ‘splitting’ question turn out to be, they are not sufficient to address the more pressing ‘lumping’ question for public health: how can we best achieve Y?9 10

If population-level intervention studies were to use a common set of exposure and outcome measures, this would make meta-analysis more feasible. Important progress has been made in this respect, for example in physical activity epidemiology.11 One might envisage some form of multicentre study in which more-or-less comparable interventions were introduced (or not) in different places and evaluated along the lines of a cluster randomised controlled trial. However, the Achilles heel of this vision is the qualifier ‘more-or-less comparable’. Some interventions, such as screening programmes, might be designed and implemented in a sufficiently similar way for this kind of multicentre evaluation.12 For upstream interventions in complex systems, however, the harmonised measurement of exposures (to interventions) and intermediate and final outcomes involving multiple causal pathways is challenging. Consider, for example, the array of measures of pricing, product formulation, purchasing, consumption, diet, health, and potentially confounding background trends that are needed to properly quantify the intended and unintended impacts of introducing a national levy on sugar-sweetened drinks.13 Negotiating the harmonised implementation of truly comparable interventions in multiple jurisdictions and beyond the control of researchers may be even less feasible.

Broadening the scope: building the panopticon and modelling the solutions

If empirical intervention studies are so difficult to design, implement or combine in meta-analysis, why not make more use of observational and simulation methods? The growth of ‘big data’ and interest in the ‘quantified self’ now offer unprecedented possibilities to gather enormous quantities of information, whether from surveillance systems such as traffic cameras or portable devices unobtrusively capturing continuous geographical, physiological and other signals from individuals. This torrent of data has led us towards a contemporary version of Bentham’s panopticon, telling observers exactly what people are or have been doing, where, when and even with whom. Datasets of this breadth, depth and precision make it possible to investigate associations with an unprecedented degree of statistical power and analytical complexity, and one might assume that so long as sufficiently rich data are available to populate such analyses, more and more secure causal inference will follow. The results can also be used as inputs to tools such as system dynamics modelling—to simulate the consequences of altering upstream determinants of health, identify new intervention points and explore to what extent the outcomes observed in one system may be generalisable to others.14
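
As a stylised illustration of the simulation approach just mentioned, the sketch below integrates a toy stock-and-flow model in which an upstream ‘environment’ lever scales the rate at which people take up an active behaviour. The model structure and every parameter are invented; real system dynamics models of upstream determinants are far richer.

```python
# A toy stock-and-flow model in the spirit of system dynamics modelling:
# one stock (the proportion of the population who are 'active') with uptake
# and relapse flows, where an upstream environmental lever scales the uptake
# rate. All parameters are invented for illustration.

def simulate(environment_score, years=10, dt=0.1):
    """Euler integration of a one-stock model; returns final proportion active."""
    active = 0.30                        # initial proportion active
    base_uptake, relapse = 0.05, 0.10    # per-year flow rates (invented)
    for _ in range(int(years / dt)):
        uptake_flow = base_uptake * environment_score * (1 - active)
        relapse_flow = relapse * active
        active += (uptake_flow - relapse_flow) * dt
    return active

# Compare the simulated consequences of altering the upstream determinant
for env in (1.0, 1.5, 2.0):  # status quo vs progressively more supportive environments
    print(f"environment score {env}: {simulate(env):.1%} active after 10 years")
```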

However, enthusiasm for increasing computational complexity in the search for causal inference from observational or simulation data should be tempered with the recognition that design-based inference is generally considered stronger than model-based inference.15 In other words, we should attend at least as much to investigating situations in which different groups are exposed to different exogenous factors (interventions, or at least determinants of change) as we do to refining ways of eliciting ‘causal’ evidence from other datasets. Even if well-founded concerns about the representativeness and privacy implications of relying on ‘big data’ can be addressed, the resulting associational cornucopia is unlikely to help much if it contributes merely to producing ‘ever more sophisticated answers to the wrong questions’.16
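
To make the contrast between design-based and model-based inference concrete, here is a minimal difference-in-differences sketch of the kind a natural experimental study might use: the comparison area absorbs the background trend, so the design itself, rather than statistical adjustment alone, underwrites the causal claim. All numbers are invented.

```python
# A minimal difference-in-differences sketch: design-based inference from a
# hypothetical natural experiment in which one area gains new infrastructure
# and a comparison area does not. All numbers are invented for illustration.

# Mean outcome (e.g., minutes of walking per week) before and after
exposed_before, exposed_after = 120.0, 150.0
control_before, control_after = 118.0, 125.0

change_exposed = exposed_after - exposed_before   # includes background trend
change_control = control_after - control_before   # background trend only

# The comparison group absorbs the secular trend; the design does the work
did = change_exposed - change_control
print(f"Exposed change: {change_exposed:+.0f}; control change: {change_control:+.0f}")
print(f"Difference-in-differences estimate: {did:+.0f} minutes/week")
```

By contrast, purely associational analyses, however large the dataset, may still deliver sophisticated answers to the wrong questions.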

For example, hundreds of studies now tell us that more walking is reported in areas where it is easier and safer for people to walk and there are places for them to walk to.17 However, precisely quantifying dose–response relationships of this kind does not necessarily explain how to address the problem of comparative inactivity, just as proving the aetiological case against tobacco did not explain how to reduce the prevalence of smoking.18 We should therefore not assume that the answers to the question of what we should do will be found by searching for statistical associations that only become noticeable in extremely large samples. Cohort and surveillance data collected for other purposes can certainly be used to investigate the effects of interventions,19 but no matter how intensively people’s health, behaviour and environments are quantified in observational studies, it may be a category error to assume that this will necessarily explain whether or how public health strategies actually work (or not). Epidemiology is only one of the tools in the box.20

Is the craft of evidence synthesis fit for purpose?

The point of evidence synthesis is surely to derive more generalisable causal inference. In spite of the academic language, this is as much an applied problem as an abstract, theoretical problem; to put it another way, what can transport planners in Birmingham learn from what their counterparts did in Bogotá?5

If cumulating evidence from multiple studies can be thought of as analogous to building a wall, then the ‘solutions’ outlined above can be regarded as ways of increasing the supply of bricks (the number of pieces of information), the similarity (statistical commensurability) of the bricks or the strength of the mortar (meta-analytical or other statistical methods for holding them together). These are helpful if the aim is to build a larger and stronger conventional wall, formed of neat rows of bricks of roughly the same shape and size.

Important and useful as all these approaches are, they have the potential to distract us from the real problem. Conventional brick walls work best on flat, smoothly prepared ground. Many contemporary public health challenges seem more akin to herding sheep in a mountainous landscape characterised by steep slopes, rocky outcrops and boggy ground. In this terrain, the more artisanal, bespoke and traditional solution of the dry stone wall may be more useful (figure 1). Dry stone walling is a way of transforming a pile of stones, which at first glance do not fit together, into something new and useful. Each stone is considered in its own right and assigned a unique place in the wall. No mortar is required, because careful thought is given to how all the pieces can be related to form a robust structure that is more than the sum of its parts. The art can be learnt, but it requires a level of flexibility and ingenuity that cannot readily be codified. It can therefore stand as a metaphor for the ‘holistic sense-making’ required of the evidence in population health intervention research.21 How might we better harness our research skills and technologies—ancient and modern—to build evidential structures more suited to the terrain we inhabit?

Building a dry stone wall of evidence

Looking beyond ‘interventions’

For most public health interventions—even well-established population screening programmes—the only honest answer to the question ‘Does it work?’ is ‘It depends’.14 22 In most cases, the questioner will need to clarify what they mean by work (in what terms?), what they mean by it (what, exactly, is the intervention?) and indeed what they mean by does (which implies a generalisable inference). Sometimes, what people really mean is ‘Will it work?’—a predictive question—or ‘Did it work?’—an empirical but also a particular question, perhaps better formulated as ‘What happened?’10

Why is this so difficult? Most public health interventions are at least somewhat unique to their context, which ought to be taken into account in their evaluation, and many can also be seen as interventions in complex systems.10 12 22 However, they do not necessarily take place in a context, at least not in the sense that a new clinical procedure might be introduced in a certain hospital or healthcare system. Rather than exerting an effect within a ‘moderating’ context, it may be more helpful to see these interventions as targeting and altering the context in which people live and make choices.10 23 24

But if everything is complex and context ‘is’ the intervention, what exactly might we seek to generalise from one instance to another? We tend to assume that interventions are things that work in and of themselves and might be universally generalisable, like Newton’s laws of motion.25 26 In practice, however, we find ourselves struggling to make sense of an apparently incommensurable body of evidence. By adopting a ‘naive and misplaced commitment to the reproducibility of the complex’, we may have unwittingly set ourselves an unrealistic challenge of identifying generalisable interventions as such.22 27 28 What if we were to release ‘our search for universal generalisability in favour of more modest, more contingent, claims’?22 Among other things, this would entail relaxing our grip on the notion of generalising the effects of interventions based on their forms or their ‘active components’, and turning our attention instead to their functions—the processes and changes they evoke—or indeed their ‘spirit’.12 26

To take a well-known clinical example, some reviews have found that the type of psychotherapy offered to a patient with depression makes little difference to the outcome.29 What to do, then? Others, taking a different approach to analysis, have found that the key lies in the quality of the therapeutic relationship established rather than the particular techniques used.30 Reasoning by analogy, we theorised that in another arena—promoting active travel—the myriad forms of intervention might be underlain by a more limited number of critical functions, such as increasing accessibility or safety, that are generalisable in principle but might be achieved in different ways in different situations.31 Again, this is as much to do with practical strategies as it is to do with theories, and some public health guidance and government policy already implicitly reflect this way of thinking with references to high-level principles, variable interpretations and the like (examples: table 1).

Table 1

Examples of generalisable principles reflected in policy and practice guidance
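
As a stylised sketch of what synthesising by function rather than form might look like in practice, a review team could code each evaluated intervention to the higher-level functions it was intended to serve and tally the evidence at that level. The study labels, coding scheme and simple vote-counting below are invented for illustration; a real synthesis would weigh the evidence far more carefully.

```python
# A stylised sketch of synthesising evidence by intervention *function*
# rather than form. Study labels, the coding scheme and the simple
# vote-counting are all invented for illustration.
from collections import defaultdict

# Each study evaluated a particular intervention form, which reviewers
# have coded to one or more higher-level functions it was intended to serve
studies = [
    ("Study A", "new cycle path",
     ["increasing accessibility", "improving safety"], "supports"),
    ("Study B", "traffic calming scheme",
     ["improving safety"], "supports"),
    ("Study C", "bus route change",
     ["increasing accessibility"], "does not support"),
]

# Tally evidence for and against each theorised function across all forms
tally = defaultdict(lambda: {"supports": 0, "does not support": 0})
for study, form, functions, verdict in studies:
    for function in functions:
        tally[function][verdict] += 1

for function, counts in tally.items():
    print(f"{function}: {counts['supports']} finding(s) in support, "
          f"{counts['does not support']} against")
```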

Searching for patterns

If this idea has traction, we will need to expand the scope and flexibility of our repertoire in evidence synthesis in order to derive more concrete and defensible inferences about what to do for public health. Rather than assessing whether interventions of a particular type ‘work’ in an overall sense, this will entail aggregating evidence for and against theories about intervention functions by combining information from studies conducted in different situations, including studies that were not explicitly designed with this in mind.7 14 27 32 33

How might we do this? We could accept the limitations of relying so heavily on testing binary statistical hypotheses about singular study outcomes,2 14 and turn our attention to seeking ‘inference to the best explanation’—that which provides the greatest understanding.18 34 35 We could use intervention theory to predict patterns that might be observed in a variety of data, and then assess the concordance between the observed patterns and the theoretical expectation patterns—testing theories rather than interventions.33 36 We could go further by systematically considering alternative potential explanations for the patterns we observe, doing our best to confirm or disconfirm these, and reaching a conclusion as to the most plausible causal inference from the overall pattern of findings. The approach is most easily illustrated within a single study, an evaluation of new transport infrastructure that was not designed to test any singular overarching hypothesis (case study: table 2).

Table 2

Case study of the dry stone wall principle applied to an intervention study
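
In the same spirit as the case study in table 2, the sketch below reduces the pattern-matching logic to its bare bones: intervention theory predicts an ordering of effects across strata (here, by distance from hypothetical new infrastructure), and concordance with that ordering, rather than a single binary hypothesis test, carries the inferential weight. All strata and estimates are invented.

```python
# A bare-bones sketch of pattern matching: intervention theory predicts an
# ordering of effects across strata, and we assess concordance between the
# observed estimates and that expectation rather than testing one binary
# hypothesis. All strata and effect estimates are invented for illustration.

# Theoretical expectation: the effect declines with distance from the intervention
expected_order = ["<1 km", "1-2 km", "2-4 km", ">4 km"]

# Observed effect estimates by stratum (e.g., change in weekly active travel time)
observed = {"<1 km": 21.0, "1-2 km": 14.5, "2-4 km": 6.0, ">4 km": 1.5}

# Concordance: how many adjacent comparisons match the predicted ordering?
adjacent_pairs = list(zip(expected_order, expected_order[1:]))
matches = sum(observed[near] > observed[far] for near, far in adjacent_pairs)
print(f"{matches}/{len(adjacent_pairs)} adjacent comparisons match the expected pattern")
```

A fuller analysis would confront the same data with rival expectation patterns, such as no gradient or a reversed one, and ask which explanation best accounts for the overall pattern of findings.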

One might counter that the principle of comparing observed and expected data applies equally to the paradigm of the randomised controlled trial. While this is true, testing theories in the way we describe entails a more radical challenge to established notions of a hierarchy of study design. For example, it is often understood that quantitative methods are for testing hypotheses, whereas qualitative (and some quantitative) methods play more subservient roles such as generating hypotheses, developing interventions or assessing their acceptability.32 One might also counter that approaches such as process evaluation, or realist evaluation and synthesis, already offer ways of investigating causal mechanisms.7 24 27 37 While this is true, the higher-order intervention functions and data patterns we are talking about are likely to reflect multiple underlying mechanisms25 and others have argued that more diverse lines of evidence should be combined and brought to bear on the challenge of overall causal inference. These might combine a variety of quantitative sources of causal estimation with a variety of quantitative and qualitative sources of causal explanation such as causal process observations.15 22 34 Simple examples from historical and contemporary communicable disease control illustrate this principle (table 3).

Table 3

Examples of arguments for convergent lines of evidence in communicable disease control

Embracing the mess

Some public health strategies will inevitably be more successful than others, and every ‘solution’ has the potential to generate more problems. Such uncertainty is—or at least should be—what drives scientific enquiry in the first place.38 Rather than hoping for ‘a neat, coherent story’ of clear-cut outcomes from evaluation, therefore, in most cases we should expect confusing, divergent, mixed or unexpected patterns of results.21 Far from denoting that an intervention or evaluation has failed, these shed light on what really happened, whether we like it or not.10

While the metaphor of the dry stone wall can be applied at the level of the individual study, as shown in table 2, the mess of this apparent dissonance may often be better resolved by cumulating evidence from multiple studies over time within an intervention research programme or a systematic review. Our final case study illustrates how we applied the principles of linking diverse sources of evidence on causal estimation and causal explanation to identifying patterns and testing theories about intervention functions across a cumulated body of work on infrastructure to support active travel (table 4).31 It draws on a variety of research methods, resting on different philosophical assumptions, in pursuit of ‘clarification and insight, for which a more interpretive and discursive synthesis is needed’.3 9

Table 4

Case study of the dry stone wall principle applied to a systematic review

Conclusions

Many readers engaged with conducting, synthesising or applying the findings of population health intervention research are likely to agree with the editors of the Cochrane Handbook, who recently wrote that in spite of all the developments in quantitative methods for evidence synthesis, it is frequently still not possible for these ‘to provide insight beyond a commentary on what evidence has been identified’.7

We need to find a better way; otherwise, merely piling up more studies may leave us confronting another kind of stack—endlessly circling the runway of a conclusion on which we never seem to have clearance to land. In this paper we have responded to a long-standing call for more ‘holistic sense-making’ in this arena. We have outlined a strategy of constructing ‘dry stone walls’ of evidence: pluralist mosaics whose strength derives from the complementarity of their components rather than being found in spite of it.39 This approach has the potential to be robust enough to support more plausible inference to guide action, while accepting and adapting to the reality of the public health landscape rather than wishing it were otherwise.

This will entail facing up to the challenge of working as ‘scholars, rather than just researchers’21—that is, artisanal dry stone wallers rather than bricklayers. We are advocating not a new method of evidence synthesis as such, but a more eclectic, flexible and reflexive approach; ‘not the abandonment of more reductive lines of research but the enlargement of these’40 with the more thoughtful and practical application of theory to generating practice-based evidence in public health. Ironically, it may only be by combining growing quantitative sophistication with the least technologically dependent research method of all—the anthropological tradition of the ethnographic observation of people and societies—that we will really understand what makes for an effective public health strategy.

References

Footnotes

  • Handling editor Seye Abimbola

  • Twitter @dbogilvie, @adrianbauman, @loudoestweet, @connyguell, @dkhumphreys, @jennapanter

  • Contributors DO conceived the original idea and drafted the manuscript. AB provided critical feedback during the initial drafting. LF, CG, DH and JP undertook the studies used as worked examples in tables 2 and 4 in collaboration with DO; and together with AB provided critical feedback during later drafting and contributed to the final version of the manuscript. DO is the guarantor.

  • Funding DO and JP are supported by the Medical Research Council (Unit Programme number MC_UU_12015/6). LF is funded by the National Institute for Health Research (NIHR) Global Health Research Group and Network on Diet and Activity, for which funding from NIHR is gratefully acknowledged (grant reference 16/137/34). CG is funded by the Academy of Medical Sciences and the Wellcome Trust (Springboard—Health of the Public 2040, grant reference HOP001/1051). The paper was initially developed in the course of a visiting appointment as Thought Leader in Residence at the School of Public Health at the University of Sydney, for which the intellectual environment and financial support provided by the Prevention Research Collaboration is gratefully acknowledged. It was further developed under the auspices of the Centre for Diet and Activity Research (CEDAR), a UKCRC Public Health Research Centre of Excellence at the University of Cambridge, for which funding from the British Heart Foundation, Economic and Social Research Council, Medical Research Council, National Institute for Health Research and the Wellcome Trust, under the auspices of the UK Clinical Research Collaboration, is gratefully acknowledged (grant reference MR/K023187/1).

  • Disclaimer The views expressed in this publication are those of the authors and not necessarily those of the National Health Service (NHS), the NIHR, the Department of Health and Social Care or any other funder.

  • Competing interests None declared.

  • Patient consent for publication Not required.

  • Provenance and peer review Not commissioned; externally peer reviewed.

  • Data availability statement There are no data in this work.