Evaluation of a computer-assisted data entry procedure (including Teleform) for large-scale mailed surveys

https://doi.org/10.1016/S0010-4825(03)00012-XGet rights and content

Abstract

Computer-assisted data entry procedures for self-completion population surveys are previously unreported. We used Teleform for a questionnaire to registrants (aged 50+) with general practices in North Staffordshire, UK (n=8995). The survey achieved a 77% response rate. 98% (n=6647) of returned questionnaires were scannable. These questionnaires were checked (and corrected) before scanning for entries that may be inaccurately read by the software. Three per cent of 1020 scanned questionnaires checked (0.041% of all questionnaire items) had a data entry error. Systematic processes enable accurate survey data transfer to be achieved with Teleform. However, many administrative and technical issues need addressing.

Introduction

Self-completion surveys are a key method in assessing, for example, outcomes of treatment programmes or healthcare needs. When undertaking surveys, medical researchers are concerned with minimising data error. Data error affects the study's internal validity and may compromise the findings and implications for clinical practice or healthcare policy. Recent technological advances have resulted in the development of computer-assisted data entry systems and hence the availability of new methods for processing survey data. Computer-assisted data entry systems offer an alternative to traditional methods of survey data entry (manual and double manual) and can be achieved by installation of scanners and specialist forms processing software for optical mark reading (OMR) and intelligent character recognition (ICR). Paper based questionnaires are processed through a scanner, the data is interpreted by the data entry software, verified by the operator and exported to a database for analysis.

There is limited evidence about the use of computer-assisted data entry for large-scale self-completion population surveys [1], [2]. A review of the medical literature revealed 10 articles that made reference to automatic data entry systems in other research or clinical settings. There is conflicting evidence within these papers about the benefits of computer-assisted data entry over manual data entry regarding, for example, the accuracy of the data. It is also difficult to compare findings as different attributes (number of questionnaires, questionnaire items, or individual variables) have been measured, in different settings with different levels and type of resources. Most published studies in the literature have used Teleform (Cardiff Software). These studies have not, however, used Teleform in a large-scale self-completion community survey. We have used Teleform software (Teleform Standard version 6.1) during a postal survey about knee pain and general health in the general population. We report, therefore, on the error rates found within our scanned data, the pros and cons of using this approach, and on the practical experience of using the new system.

Section snippets

Teleform

Teleform is a software application that enables the creation of forms (e.g. questionnaires) for collecting data and reads the returned data using a scanner. After processing the data, Teleform can automatically export this into other databases for analysis [3].

Teleform has three modules. Firstly, Teleform Designer is used for the questionnaire and database design. Questions can be designed in a number of formats. “Choice” and “Entry” fields enable respondents to tick boxes or fill in bubbles.

Survey response rate

Questionnaires using automatic forms processing can be designed to look very similar to traditionally designed questionnaires. They differ, however, in three ways, as they require:

  • cornerstone markings to enable the software to locate and then read marks from the page,

  • a numeric code in one corner that holds reference information about the form,

  • “constrained print” and “comb fields” to record accurately numerical and string variables.

These features are automatically generated by the software and

Discussion

There are no published studies that have used computer-assisted data entry systems for large-scale self-completion health surveys. Studies that have used computer-assisted data entry systems elsewhere report conflicting evidence on their benefits. We have used a computer-assisted data entry system during a postal survey about general health and knee pain. The pros and cons of implementing this new approach in this setting are discussed below.

Summary

We have implemented a computer-assisted data entry system and used Teleform software to process self-completed questionnaire data. Our questionnaire was of quite complex design and was completed (unassisted) by men and women (aged 50 and over) in the community. We implemented integrated data checking and scanning procedures and found a very low questionnaire item data entry error rate (overall error rate of 0.041%). This error rate is consistent with studies that have used Teleform in other

Acknowledgements

The authors would like to acknowledge the NHS Executive (West Midlands) New Blood Fellowship committee for awarding funding for this project and the Haywood Rheumatism Research and Development Foundation (HRRDF) (North Staffordshire) who supported the survey through a project grant. The authors would like to thank all of the people in Primary Care Sciences Research Centre who helped with the administration of the knee pain survey. Special thanks go to Joanne Bailey for organisation and Paul

References (16)

  • C.K. Jorgensen et al.

    Validation of automated forms processing, A comparison of Teleform with manual data entry

    Comput. Biol. Med.

    (1998)
  • A. Bowling

    Research Methods in Health

    (1997)
  • J.H. Abramson et al.

    Survey Methods in Community Medicine

    (1999)
  • Cardiff Software Inc., Teleform Standard User Guide Version 6.1, San Marcos,...
  • C. Jinks et al.

    A brief screening tool for knee pain in primary care. 1. Validity and reliability

    Rheumatology

    (2001)
  • J. Ware et al.

    SF 36 Health Survey. Manual and Interpretation Guide

    (1993)
  • A.S. Zigmond et al.

    The hospital and anxiety depression scale

    Acta Psychiatry Scand.

    (1983)
  • N. Bellamy, WOMAC Osteoarthritis Index, A Users Guide, Health Services Centre, McMaster University, London,...
There are more references available in the full text version of this article.

Cited by (17)

  • Assessment of the impact of rapid syphilis tests on syphilis screening and treatment of pregnant women in Zambia

    2015, International Journal of Gynecology and Obstetrics
    Citation Excerpt :

    Ten data collectors from Kalomo District were trained and tested in research ethics, standardized data extraction, and management techniques prior to data collection. Data capture was done using TeleForms (Hewlett-Packard, Palo Alto, CA, USA) [14]. Pregnant women attending their first ANC visit during the observation period were randomly selected from the MOH-approved Safe Motherhood (SMH) register using a sequential skip pattern dependent on ANC volume at the facility.

  • Evaluation of a Teleform-based data collection system: A multi-center obesity research case study

    2014, Computers in Biology and Medicine
    Citation Excerpt :

    Others have investigated the data fidelity resulting from recognition software [2–8]. These studies have identified factors such as illegible handwriting, incomplete or faint markings, and responses placed outside of data entry fields as common issues leading to data inaccuracies [2,4–6]. However, none of these investigations to our knowledge have reported use of customizable scripts.

  • The impact of fine-tuning of optical recognition system on database reliability

    2012, Computers in Biology and Medicine
    Citation Excerpt :

    The rate of errors is related to the specific method and should be investigated in detail before data acquisition. A well setted ORS seems to offer the opportunity to reach a final accuracy comparable to DDE, the rate of error with the use of Teleform [3,4,21] ranging between 0.04% [4] and 0.07% [3], a single study performed with a system developed in Formic reporting an error rate of 0.02% [2]. A 0.1% absolute rate of error is usually considered as acceptable cut-point [24].

View all citing articles on Scopus

This study was funded by the NHS Executive (West Midlands) as part of a New Blood Research Training Fellowship. The Haywood Rheumatism Research and Development Foundation (HRRDF) (North Staffordshire) funded the survey costs.

View full text