Mining Trauma Injury Data with Imputed Values

Kay Penny*, Thomas Chesney

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

3 Citations (Scopus)


Methods for analyzing trauma injury data with missing values, collected at a UK hospital, are reported. One measure of injury severity, the Glasgow coma score (GCS), which is known to be associated with patient death, is missing for 12% of patients in the dataset. To include these patients in the analysis, three different data imputation techniques are used to estimate the missing values. The imputed datasets are analyzed by an artificial neural network and by logistic regression, and the results are compared in terms of sensitivity, specificity, positive predictive value and negative predictive value. Although there is little distinction between results for the three imputation methods on the overall dataset, the hot-deck imputation method appears to give more accurate results than the model-based or propensity score imputation methods when the comparison is restricted to patients with imputed GCS values. Results show that imputation does not reduce the overall predictive accuracy following a data-mining analysis, demonstrating that all cases may be included when undertaking analysis of these trauma injury data. Copyright © 2009 Wiley Periodicals, Inc., A Wiley Company
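The four comparison measures named in the abstract can be computed from the counts of a two-by-two confusion matrix. The following is a minimal sketch, assuming that patient death is treated as the positive class; the function name and the example counts are hypothetical and are not taken from the paper.

```python
def diagnostic_measures(tp, fp, tn, fn):
    """Return sensitivity, specificity, PPV and NPV from confusion-matrix counts.

    tp: deaths correctly predicted as deaths (assumed positive class)
    fp: survivors incorrectly predicted as deaths
    tn: survivors correctly predicted as survivors
    fn: deaths incorrectly predicted as survivors
    """
    return {
        "sensitivity": tp / (tp + fn),  # proportion of actual positives detected
        "specificity": tn / (tn + fp),  # proportion of actual negatives detected
        "ppv": tp / (tp + fp),          # proportion of positive predictions that are correct
        "npv": tn / (tn + fn),          # proportion of negative predictions that are correct
    }


# Hypothetical counts for illustration only, not results from the study.
measures = diagnostic_measures(tp=40, fp=10, tn=90, fn=20)
for name, value in measures.items():
    print(f"{name}: {value:.3f}")
```

Computing the same four measures for each imputed dataset, and separately for the subset of patients whose GCS was imputed, would allow the kind of comparison the abstract describes.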
Original language: English
Pages (from-to): 246-254
Journal: Statistical Analysis and Data Mining
Issue number: 4
Publication status: Published - 24 Sept 2009

