Intended for healthcare professionals

Research

A meta-analysis of the diagnostic performance of the direct agglutination test and rK39 dipstick for visceral leishmaniasis

BMJ 2006; 333 doi: https://doi.org/10.1136/bmj.38917.503056.7C (Published 05 October 2006) Cite this as: BMJ 2006;333:723
  1. François Chappuis, lecturer (francois.chappuis{at}hcuge.ch)1,
  2. Suman Rijal, professor2,
  3. Alonso Soto, investigator3,
  4. Joris Menten, statistician4,
  5. Marleen Boelaert, professor4
  1. 1 Geneva University Hospitals, Travel and Migration Medicine Unit, rue Micheli-du-Crest 24, 1211 Geneva 14, Switzerland
  2. 2 BP Koirala Institute of Health Sciences, Dharan, Nepal
  3. 3 Hospital Nacional Hipolito Unanue, Universidad Peruana Cayetano Heredia, Lima, Peru
  4. 4 Institute of Tropical Medicine, Antwerp, Belgium
  1. Correspondence to: F Chappuis
  • Accepted 11 July 2006

Abstract

Objective To compare the performance of the direct agglutination test and rK39 dipstick for the diagnosis of visceral leishmaniasis.

Data sources Medline, citation tracking, January 1986 to December 2004.

Selection criteria Original studies evaluating the direct agglutination test or the rK39 dipstick with clinical visceral leishmaniasis as target condition; adequate reference classification; and absolute numbers of true positive, true negative, false positive, and false negative observations available or derivable from the data presented.

Results 30 studies evaluating the direct agglutination test and 13 studies evaluating the rK39 dipstick met the inclusion criteria. The combined sensitivity estimates of the direct agglutination test and the rK39 dipstick were 94.8% (95% confidence interval 92.7% to 96.4%) and 93.9% (87.7% to 97.1%), respectively. Sensitivity seemed higher and more homogenous in the studies carried out in South Asia. Specificity estimates were influenced by the type of controls. In phase III studies carried out on patients with clinically suspected disease, the estimated specificity of the direct agglutination test was 85.9% (72.3% to 93.4%) and of the rK39 dipstick was 90.6% (66.8% to 97.9%).

Conclusion The diagnostic performance of the direct agglutination test and the rK39 dipstick for visceral leishmaniasis is good to excellent and seem comparable.

Introduction

Visceral leishmaniasis, or kala-azar, is a disease caused by a bloodborne protozoan. The disease, fatal if untreated, affects up to 500 000 people yearly, mostly in poor rural areas of east Africa (Sudan, Ethiopia, Kenya, Uganda), South Asia (India, Bangladesh, Nepal), and Latin America (Brazil).1 Pentavalent antimonials are the most widely used drugs for visceral leishmaniasis. Treatment, requiring painful intramuscular or intravenous injections for 30 days, carries a significant toxicity and is expensive as branded formulations. Accurate diagnosis of this condition is therefore vital to avoid under-diagnosis or over-diagnosis. Microscopical examination of spleen aspirates is sensitive and specific but requires expertise to carry out the aspiration safely and to read the slides accurately. Examination of bone marrow or lymph node aspirates is equally specific but less sensitive.w1

Serological tests have been developed to replace parasitological methods for the diagnosis of visceral leishmaniasis in the field. The direct agglutination test was first described by Allain and Kagan and adapted by El Harith et al.2 w2 w3 The test is semiquantitative and uses microplates with V shaped wells in which increasing dilutions of patient's serum or blood is mixed with stained killed promastigotes of Leishmania donovani. If antibodies to the protozoan are present, agglutination is visible with the naked eye. The test has been validated in several endemic areas and is being used for diagnosis of visceral leishmaniasis in countries such as Sudan.w4 More recently a serological test was developed against a recombinant antigen derived from a 39 amino acid repeat in Leishmania chagasi (rK39). It proved to be accurate when used as an enzyme linked immunosorbent assay35 and was later developed in a simpler dipstick format. Patient's serum or blood is added to a strip and migrates towards the fixed rK39 antigen. A reagent reveals the presence of specific antibodies on the rK39 antigen line. The results are qualitative (positive or negative) and are available within 20 minutes. Since 1998 several brands of dipsticks have been studied in different endemic areas using various study designs.

Several authors propose the direct agglutination test and the rK39 dipstick as the most appropriate serological tests for the diagnosis of visceral leishmaniasis in the field, but an independent and global assessment of their performance is needed. Also, as visceral leishmaniasis is an infectious disease presenting with substantial variability between regions, it is crucial to examine the extent and sources of heterogeneity in published results of serological test performance and the reasons for the heterogeneity. We carried out a meta-analysis of the diagnostic performance of the direct agglutination test and the rK39 dipstick.

Materials and methods

We searched Medline through PubMed for articles published from January 1986 to December 2004 that reported on the diagnostic accuracy of the direct agglutination test and the rK39 immunochromatographic assay used on serum or blood samples for visceral leishmaniasis. The search terms were “visceral leishmaniasis”, “kala-azar”, “L.donovani“, “L.infantum“, and “L.chagasi” combined with “diagnostic accuracy”, “sensitivity”, “specificity”, “validation”, and “diagnostic performance”. This search generated 654 papers, which we subsequently combined with the search terms “Direct Agglutination Test or DAT” and, in a second step, with “rK39 and immunochromatographic test”, “immunochromatographic assay”, and “ICT or dipstick”. We obtained additional articles by citation tracking of review articles and original articles.

We included original studies only. Other inclusion criteria were current clinical visceral leishmaniasis as the target condition (not leishmanial infection or past visceral leishmaniasis); human participants; the absolute numbers of true positive, false negative, true negative, and false positive observations available or derivable from the data presented; and the reference classification judged adequate to correctly classify the target condition. We included studies that complied with these four criteria, independent of other characteristics of design quality and reporting. We excluded studies that evaluated the tests in patients coinfected with HIV and studies of the rapid version of the direct agglutination test (the fast agglutination screening test).6

Data extraction

We used a standard form to extract data on type of study (phase I, II, or III trials), Leishmania species (L donovani, L infantum, or L chagasi), country in which the study was carried out, characteristics of the participants, study design, and test results. Data were entered in a spreadsheet (Microsoft Excel).

The number and type of participants was recorded and categorised as confirmed cases or controls. Several controls were distinguished: healthy non-endemic controls (healthy people living in an area with no transmission of Leishmania and no exposure by travel); healthy endemic controls (people living in an endemic region but without signs or symptoms of leishmaniasis); controls with potentially cross reacting diseases (patients with confirmed disease that might give a false positive reaction to a serological test); and controls (patients with the same clinical syndrome as confirmed cases with visceral leishmaniasis ruled out by a confirmatory test with a high negative predictive value such as on spleen aspirate).

In many articles the numbers of true positive, false negative, true negative, and false positive observations were available. If not, we derived the numbers from the marginal totals and the reported sensitivity and specificity.

Assessment of study quality

Two independent readers assessed the papers according to the quality assessment of studies of diagnostic accuracy approach.7 Disagreements were resolved by a third reader who reassessed problematic papers. The quality assessment of studies of diagnostic accuracy tool lists 14 items that together assess the quality of a study (see tables A-C on bmj.com). Important criteria are a representative spectrum of patients, correct classification of disease status, complete description of reference test and index test, blinding of the readers of the serological test to the result of the reference test and vice versa, and a low potential for verification bias. Such bias may exist if the decision to carry out the standard examination depends on the results of the examination under investigation—that is, if verification with parasitology occurred more often in patients with positive serology results.

Data analysis

We used standard formulas to calculate sensitivity, specificity, and diagnostic odds ratio for each study. Wilson's score method was used to calculate confidence intervals for sensitivities and specificities,8 and the normal approximation with continuity correction for the log of the odds ratio for confidence intervals for the diagnostic odds ratio.9

We explored the relation between sensitivity and specificity by plotting the log odds of a positive test result in diseased participants (cases) against the log odds of a negative test result in non-diseased participants (controls). Publication bias was assessed by graphing the estimated sensitivity and specificity against the number of cases and controls in each study.

Meta-analyses for sensitivity and specificity were carried out using logistic regression models accounting for overdispersion.10 This model weights each study proportional to its sample size while allowing for heterogeneity between studies by inclusion of a dispersion variable. The robustness of this statistical approach was explored by comparison with other statistical methods and simulation. The meta-analysis of the diagnostic odds ratio was carried out using the DerSimonian and Laird random effects model, with studies weighted using the Mantel-Haenszel method.11 To assess the heterogeneity of studies we carried out separate meta-analyses in subgroups stratified by study phase, sample size, study quality, geographical region, species of Leishmania, type of direct agglutination test antigen, brand of dipstick, and type of controls.

Statistical analyses were carried out with the open source statistical language and environment R 2.0.1.12 We used the generalised linear models module (procedure “glm” using the quasibinomial distribution) to fit over-dispersed logistic regression models and the R meta-analysis library (procedure “metabin”) to fit the DerSimonian and Laird random effects model.

Results

Thirteen studies evaluating the rK39 dipstick and 30 evaluating the direct agglutination test were included in the meta-analysis.w1- w38 The 13 dipstick studies included a total of 1119 patients with visceral leishmaniasis and 2676 controls, and the 30 studies of the direct agglutination test included 1698 patients with visceral leishmaniasis and 3876 controls. The ideal method for diagnosis of visceral leishmaniasis in all studies was a positive result on microscopical examination of lymph node, bone marrow, or spleen aspirate. Diagnosis was also confirmed by detection of parasites in culture (n = 4 studies) or by polymerase chain reaction (n = 1) or a good clinical response to antileishmanial drugs in patients with clinically suspected disease who had positive serology results (n = 4). Results of the quality assessment of studies of diagnostic accuracy evaluation are on bmj.com. No evidence of publication bias was observed. Owing to the lack of correlation between the observed sensitivities and specificities (data not shown), the individual sensitivities and specificities for diagnostic accuracy were determined.

Sensitivity, specificity, and diagnostic odds ratio

The estimated sensitivity of both tests was relatively consistent across studies, indicating that a summary of the data in a single estimate with 95% confidence interval (obtained by a statistical method that allows for some variability in results across cases) is appropriate.

Figures 1 and 2 show the results of individual and combined sensitivity estimates for the tests: the estimated sensitivity for the rK39 dipstick was 93.9% (95% confidence interval 87.7% to 97.1%). Only one study seemed inconsistent with this estimate (67.2%, 54.1% to 78.2%).w20 The estimated sensitivity for the direct agglutination test was 94.8% (92.7% to 96.4%). This estimate and confidence intervals were generally consistent with the individual study estimates.

Fig 1
Fig 1

Estimates of sensitivity and specificity (95% confidence intervals) of rK39 dipstick

Fig 2
Fig 2

Estimates of sensitivity and specificity (95% confidence intervals) of direct agglutination test

Some heterogeneity in sensitivity of the tests seemed to be related to the geographical location of the study (tables 1 and 2). Sensitivity seemed higher and more homogenous in studies carried out in South Asia.

Table 1

Sensitivity and specificity of rK39 dipsticks in various subgroups

View this table:
Table 2

Sensitivity and specificity of direct agglutination test in various subgroups

View this table:

The estimated specificity of both tests was heterogeneous across studies (figs 1 and 2), indicating that a summary of the data in a single estimate with 95% confidence interval is not appropriate.13 Specificity estimates were highly influenced by the type of controls (tables 1 and 2). Studies using healthy controls showed high specificities. The estimated specificity in participants with clinically suspected disease in phase III studies of the rK39 dipstick was 90.6% (95% confidence interval, 66.8% to 97.9%) and of the direct agglutination test was 85.9% (72.3% to 93.4%).

For most studies the diagnostic odds ratio exceeded 100; in participants with clinically suspected disease the diagnostic odds ratio was generally lower than in other participants. In participants with clinically suspected disease from phase III studies, the combined estimate of the diagnostic odds ratio for the rK39 dipstick was 128 (26 to 629) and for the direct agglutination test was 76 (13 to 433).

Discussion

This meta-analysis showed good to excellent diagnostic performance of the rK39 dipstick and the direct agglutination test for visceral leishmaniasis in populations not known to be infected with HIV. The overall sensitivity of both tests was excellent but was lower in studies from East Africa than in those from South Asia. Several studies have already commented on the lower sensitivity of rK39 based dipsticks in Sudan.14 w20 For reasons that remain unclear, Sudanese patients seem to develop lower titres of antibodies against rK39 than do Indian patients (S G Reed, personal communication, 2006).

The estimated specificity of the rK39 dipstick and the direct agglutination test was too heterogeneous across studies to allow for a reliable measurement of a single estimate. We thus stratified the meta-analysis by different subgroups, showing that the type of controls significantly influenced the estimates for specificity. Studies that included healthy controls showed very high specificity. The estimate of the specificity of the rK39 dipstick and direct agglutination test that is most relevant for clinical practice is obtained in phase III studies—that is, those recruiting patients with clinically suspected disease consecutively and prospectively in a representative clinical setting. Our meta-analysis includes only four such studies for both the rK39 dipstick and the direct agglutination test, resulting in specificity estimates with wide confidence intervals: rK39 dipstick 90.6% (66.8% to 97.9%) and direct agglutination test 85.9% (72.3% to 93.4%).w16 w19 w25 w26 w34 Some degree of underestimation of these specificity estimates might have occurred, as spleen puncture (the most sensitive reference standard) was not systematically carried out among patients with suspected disease in most of these phase III studies.w4 w16 w19 w25 w34 Previous decision analysis has shown that a serological test with these high sensitivity and “acceptable” specificity levels is a better alternative for case detection algorithms than are tests with insufficient sensitivity, such as bone marrow or lymph node smears.15

By focusing the meta-analysis on sensitivities and specificities, we did not take into account the effect of heterogeneity introduced by variations in diagnostic thresholds. Different diagnostic thresholds to define positive and negative test results might have been used across the studies. Diagnostic thresholds might differ as a result of explicit factors such as differences in titration cut off points for the direct agglutination test or due to unknown factors such as differing reactivity of antigen batches or variations between observers.16 We did not, however, detect any correlation between sensitivity and specificity, indicating that variations due to diagnostic thresholds were insignificant compared with other sources of heterogeneity between the studies.

Our meta-analysis supports the incorporation of these diagnostic tests in country policies. As the performance of the direct agglutination test and the rK39 dipstick is comparable, the choice should be made on the basis of other criteria such as cost, feasibility, and sustainability. rK39 dipsticks are cheaper and easier to use than the direct agglutination test, but a long term commitment of the current dipstick manufacturers is needed as two companies have already stopped production. The brand of dipstick did not significantly influence the pooled sensitivity and specificity estimates in our analysis. Nevertheless, dipsticks from different brands or from different generations of the same brand are likely to show variable diagnostic performance. Moreover, our meta-analysis did not include the results of recent studies carried out in East Africa and Asia with the DiaMed Opti-Leish IT (DiaMed AG, Switzerland), a newly manufactured rK39 dipstick that may perform better than other dipstick formats in East Africa.14 17 18 Therefore we advise that each new brand of rK39 dipstick should be individually validated in each endemic region before being integrated in a diagnostic algorithm.

The development of the direct agglutination test and, more recently, the rK39 dipstick has brought a major improvement in the diagnosis of visceral leishmaniasis in the field. These tests urgently need to be more widely used in the field. Nevertheless, the limited sensitivity of the rK39 dipstick in Sudan and the suboptimal specificity of these serological tests when used for patients with clinically suspected visceral leishmaniasis necessitate the research and development of better diagnostic tools for this fatal disease.19 Moreover, a systematic adoption of quality criteria for diagnostic accuracy studies,8 20 would not only lead to more reliable results but also to quicker and more precise recommendations for policy makers.

What is already known on this topic

The direct agglutination test and rK39 dipsticks have been proposed as the most appropriate serological tests for diagnosis of visceral leishmaniasis in the field

A global evaluation of these tests, including an assessment of the heterogeneity of their performance, is lacking

What this study adds

The direct agglutination test and the rK39 dipstick showed similar diagnostic performance for visceral leishmaniasis, ranging from good to excellent

The performance of dipstick tests for diagnosis in Sudan needs to be improved

Footnotes

  • Embedded ImageAdditional tables and references w1-w38 are on bmj.com

    We thank Isabelle Roger for her independent review of data extracted from included papers. The study was part of a PhD thesis for FC.

  • Contributors FC and MB wrote the protocol with input from all authors, did the initial search and study selection, data abstraction, and analysis. FC wrote the paper and is the guarantor. SR helped to develop the protocol, contributed to study selection, data analysis and interpretation, and edited the final manuscript. AS and JM reviewed data analysis, did the statistical analysis, and reviewed the manuscript. MB supervised all phases of the study including the writing of the manuscript.

  • Funding None.

  • Competing interests None declared.

  • Ethical approval Not required.

References

  1. 1.
  2. 2.
  3. 3.
  4. 5.
  5. 6.
  6. 7.
  7. 8.
  8. 9.
  9. 10.
  10. 11.
  11. 12.
  12. 13.
  13. 14.
  14. 15.
  15. 16.
  16. 17.
  17. 18.
  18. 19.
  19. 20.
View Abstract