- Research article
- Open Access
- Open Peer Review
Validity of International Classification of Diseases (ICD) coding for dengue infections in hospital discharge records in Malaysia
BMC Health Services Researchvolume 18, Article number: 292 (2018)
Hospitalization due to dengue illness is an important measure of dengue morbidity. However, limited studies are based on administrative database because the validity of the diagnosis codes is unknown. We validated the International Classification of Diseases, 10th revision (ICD) diagnosis coding for dengue infections in the Malaysian Ministry of Health’s (MOH) hospital discharge database.
This validation study involves retrospective review of available hospital discharge records and hand-search medical records for years 2010 and 2013. We randomly selected 3219 hospital discharge records coded with dengue and non-dengue infections as their discharge diagnoses from the national hospital discharge database. We then randomly sampled 216 and 144 records for patients with and without codes for dengue respectively, in keeping with their relative frequency in the MOH database, for chart review. The ICD codes for dengue were validated against lab–based diagnostic standard (NS1 or IgM).
The ICD-10-CM codes for dengue had a sensitivity of 94%, modest specificity of 83%, positive predictive value of 87% and negative predictive value 92%. These results were stable between 2010 and 2013. However, its specificity decreased substantially when patients manifested with bleeding or low platelet count.
The diagnostic performance of the ICD codes for dengue in the MOH’s hospital discharge database is adequate for use in health services research on dengue.
Dengue has become a global public health concern. Epidemiologic measures of the burden of dengue such as its incidence and prevalence are important for policy-making and monitoring the progress of disease control. WHO reported the global incidence of dengue has increased 30-fold in the past 50 years and estimated some 50 to 100 million new infections occurred annually , causing about 20,000 deaths . These estimates are largely based on dengue notification data reported to national surveillance systems, which are widely used as a proxy measure of dengue incidence [3,4,5,6,7].
However, for dengue where the majority of infected individuals are asymptomatic and may suffer no or little adverse health consequences, estimates of incidence and prevalence are measures of disease frequency rather than measures of the disease burden. Case fatality rate of dengue remains low with an average of 2.4 deaths over 100,000 notified case in Malaysia , but symptomatic dengue infection is significantly associated with considerable morbidity. Hospitalization due to an acute illness like symptomatic dengue infection is an important measure of morbidity. Dengue hospitalization is a critical driver of economic cost of the illness to society, especially in a country endemic of dengue. Apart from the medical costs, hospitalization also leads to loss in economic productivity arising of sick workers taking sick leaves or parents taking time off to care for their sick children. Nonetheless, health services utilization associated with dengue illness, and hospitalization in particular, remains poorly characterized, though there have been many hospital based, mostly single-center, studies on dengue [9,10,11,12,13,14,15,16,17]. Few studies have make use of administrative data which are available from hospital discharge and health insurance claims databases  in part because the validity of discharge diagnoses codes is uncertain. We therefore undertook this study to evaluate the accuracy of discharge diagnoses coded according to the International Classification of Diseases, Tenth Revision, Clinical Modification (ICD-10-CM) codes for dengue. Validation of these codes is necessary to promote the wider use of hospital discharge and insurance claims data for health services research on dengue.
This validation study extracted data from the available hospital discharge records and retrospective review of medical records. The Medical and Research Ethics Committee approved the study (NMRR-15-452-25,624).
Hospital discharge records
The Health Informatics Centre (HIC) of the Ministry of Health (MOH) Malaysia maintains a data warehouse containing data on hospital discharges from both public and private hospitals in the country. Discharge diagnoses are coded according to the International Classification of Diseases, Tenth Revision, Clinical Modification (ICD-10-CM). Using this database, we randomly extracted 3219 discharge records across all age groups from seven hospitals for the years 2010 and 2013. These are tertiary hospitals located in Peninsular Malaysia. The hospitals were conveniently sampled according to their geographical locations. Three of them are located at the Central (Kuala Lumpur Hospital, Tengku Ampuan Rahimah Hospital and Subang Jaya Medical Centre); two are located at the North (Seberang Jaya Hospital and Sultanah Bahiyah Hospital); one located at the South (Sultanah Aminah Hospital); and another one located at the East Coast of Peninsular Malaysia (Tengku Ampuan Afzan Hospital). Records coded with dengue include A90 for dengue fever [classical dengue] and A91 dengue haemorrhagic fever. Codes for records without dengue such as B34.9, A83, R50, A92 to A99 were categorized as non-dengue. Refer to Additional file 1 for detailed descriptions of these ICD-10-CM codes. We deliberately restricted selection of non-dengue records to these codes (B34.9, A83, R50, A92 to A99) because these discharges that are most likely a priori to be misclassified as false-positive for dengue.
Review of medical records and laboratory data
We abstracted data from medical and laboratory records for these 3219 discharge records from the seven hospitals. We obtained information on patients’ demographics, dates of admission and discharge, disease severity and laboratory results for dengue non-structural protein 1 (NS1) antigen immunoassay and dengue immunoglobulins M (IgM) assay. Records of patients coded with dengue constituted 2.9% of HIC database while records coded with either B34.9, A83, R50, or A92 to A99 constituted 2.0%. Estimates of PPV and NPV are sensitive to the prevalence of the dengue. We therefore randomly sampled 216 records for patients with codes for dengue and 144 records without dengue codes, in keeping with their relative frequency in the database. Only records with complete medical and laboratory data for evaluation are included in the sampling.
Diagnostic performance of ICD-10-CM codes for dengue
We compared discharge diagnoses ICD codes for dengue against the standard diagnostic tests based on dengue NS1 and IgM. Both NS1 and IgM might not identify all dengue cases presenting to the hospitals but they are sufficiently accurate to be recommended for routine diagnostic use [19, 20]. NS1 and IgM are the only two diagnostic tests routinely available in almost all hospitals in Malaysia. Public health surveillance in Malaysia is also based on these two tests to ascertain occurrence of dengue in the population. More accurate diagnostic methods such as plaque reduction neutralisation test, polymerase chain replication-based assay or virus isolation  are either non-routinely performed or not available at all hospitals in Malaysia. For this study we defined a record of having a true diagnosis of dengue if showed a positive result for NS1 and/or IgM. We then calculated the sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) of ICD-10-CM codes for dengue against this lab–based diagnostic standards.
The background demography, clinical manifestations and clinical outcome of sampled cases were described. All categorical variables were presented as frequencies and percentages; while continuous variables were expressed as means with standard deviation (SD) or medians with interquartile ranges (IQR). The background characteristics between patients with and without ICD-10-CM codes for dengue were examined with Chi-square test for categorical variables, and Student’s T-test or Mann-Whitney U tests for numerical variables. The two-sided statistical significance level was set at 0.05. We also estimated the binomial exact 95% confidence intervals (CIs) for the sensitivity, specificity, PPV and NPV of ICD-10-CM codes for dengue.
Table 1 shows the characteristics of the patients with and without ICD-10-CM codes for dengue whom were included in this study. Their mean age were 37 and 28 years respectively. About half the patients were diagnosed in year 2010, and the other half in 2013. Bleeding, visceral signs, low platelet count, raised blood hematocrit were more commonly present among patients with ICD-10-CM codes for dengue. Organ impairment however were found equally between the two groups. Table 2 summarizes the number of cases with positive result for NS1 and/or IgM among patients with and without ICD-10-CM codes for dengue. 87% of those with the dengue codes had positive laboratory results (NS1 or IgM) for dengue while only 8% of those without dengue codes had positive results.
Table 3 shows the diagnostic performance of the ICD-10-CM codes for dengue. The sensitivity was 94% (188/ 200) and specificity 83% (132/160), giving an overall accuracy of 88.9% (320/ 360). The positive Likelihood ratio (LR) was 4.7 and negative LR 0.075. This provides adequate evidence for the diagnostic validity of ICD dengue codes. Its PPV was 87% (188/ 216) and NPV 92% (132/ 144). The diagnostic performance of dengue codes did not change between 2010 and 2013 (Table 3). Neither did it differ between the sexes. However, the specificity of ICD-10-CM codes for dengue decreased substantially when the patients presented with bleeding manifestations or low platelet count.
Administrative data such as hospital discharge and health insurance claims databases have infrequently been used for health services research on dengue despite their considerable strengths which included nationally representative sample to allow generalizability, larger sample size and low cost. The main disadvantage of administrative data is uncertain validity of the ICD codes used to identify patients with dengue. This study was undertaken to address this weakness and is the first to investigate the validity of ICD-10-CM codes for dengue. Overall we demonstrated adequate diagnostic evidence to support the use of ICD-10-CM codes in identifying hospitalized patients with positive NS1 or IgM test for dengue.
The sensitivity of ICD-10-CM codes for dengue is high (94%), but its specificity is more modest at 83% (17% false positive ratio). The lower specificity could have two explanations. Firstly, despite advances in dengue diagnostic tests , the diagnosis of dengue in current practice is still clinically based rather than entirely dictated by positive NS1 or IgM lab results . This is further supported by our findings of lower specificities when patients manifested with bleeding or low platelet count, both commonly associated with dengue. In other words, patients are more likely to be clinically diagnosed of having dengue even when their NS1 or IgM tests were negative. Secondly, our records selection process also contributed to this modest specificity result. We have deliberately restricted selection of records without ICD codes for dengue to B34.9, A83, R50, A92 to A99 codes because they are most likely a priori to be misclassified as false-positive, thus lowering the specificity estimates. If we would have randomly selected all records without ICD codes for dengue as controls, we would then find nearly all to be true negative given the low prevalence (< 3%) of dengue among all hospitalized subjects, thus inflating the estimate of specificity.
Our results have several implications. For the use of administrative data in descriptive studies to estimate the incidence or prevalence of patients hospitalized for dengue illness, the high sensitivity of A90–91 codes ensures that most of the dengue patients will be identified. However, its moderate specificity (83%) will inflate the estimate of the true frequency of dengue in the population, with high number of false positives. Even moderate loss in specificity can lead to substantial over-estimation especially when the incidence or prevalence of the condition under investigation is low. Our finding that the validity of the ICD-10-CM codes has been stable over time is reassuring for studies on the secular trend in dengue incidence or prevalence.
For analytic studies, the ICD-10-CM codes can be used to sample cases of dengue and non-dengue controls to investigate outcome of subjects’ exposure to dengue. Assuming non-differential misclassification (ie, classification by ICD-10-CM codes is independent of the outcome of interest) and assuming having dengue increases risk of the outcome, the modest specificity (83%) of A90–91 codes will cause cases to have more false positive subjects among them to dilute the risk estimate of the case group, while the high sensitivity (94%) will minimize the number of false negative subjects in the control group, thus lessening its risk inflation. The end result will be a biased estimate of the relative risk towards null.
Limitations of study
As this study involved only seven hospitals, the findings cannot be generalized to all hospitals in Malaysia. In addition, NS1 and/or IgM were used as the diagnostic standards to validate the ICD-10-CM codes for dengue. In this case, both NS1 and IgM are not perfect tests, so some of the dengue cases may be missed. Nevertheless, both NS1 and IgM are the only diagnostic tests that are available in almost all hospitals in Malaysia. Lastly, as this study involved only data from 2010 and 2013, the observed trend of ICD-10-CM validity may not be entirely true for the data of other years.
We conclude that the hospital discharge database maintained by the Ministry of Health (MOH) Malaysia is potentially a useful data resource to support health services research for dengue in Malaysia. The adequate diagnostic performance of the ICD-10-CM coding in the national hospital discharge database in identifying patients with dengue makes it particularly acceptable for use in research on dengue.
International Classification of Diseases, 10th revision
Immunoglobulins M assay
Ministry of Health
Negative predictive value
Non-structural protein 1 antigen
Polymerase Chain Reaction
Positive predictive value
World Health Organization
WHO. Neglected tropical diseases, Dengue. [cited 2016 November 15th]. Available from: http://www.searo.who.int/entity/vector_borne_tropical_diseases/data/data_factsheet/en/. Accessed 15 Nov 2016.
WHO. Global strategy for dengue prevention and control. Geneva: World Health Organization; 2012.
Beatty ME, Letson GW, Margolis HS. Estimating the global burden of dengue. Am J Trop Med Hyg. 2009;81(1):231.
Hotez PJ, Alvarado M, Basa ´ne˜z M-G, Bolliger I, Bourne R, et al. The global burden of disease study 2010: interpretation and implications for the neglected tropical diseases. PLoS Negl Trop Dis. 2014;8:e2865.
Arima Y, Matsui T. (2011) Epidemiologic update of dengue in the western Pacific region. Western Pac Surveill Response J 2. 2010:4–8. https://doi.org/10.5365/WPSAR.2011.2.2.005. Accessed 30 Nov 2016.
World Health Organization: Western Pacific region (2012) dengue: dengue in the western Pacific region. Available: http://www.wpro.who.int/emerging_diseases/Dengue/en/index.html. Accessed 30 Nov 2016.
Mia MS, Begum RA, Er AC, Abidin RD, Pereira JJ. Trends of dengue infections in Malaysia, 2000–2010. Asian Pac J Trop Med. 2013;6:462–6.
MOSTI. iDengue [cited 2016 November 30th]. Available from: http://idengue.remotesensing.gov.my/idengue/page2.php?kandungan=content/statistik.pdf.
Sam S-S, Omar SFS, Teoh B-T, Abd-Jamil J, AbuBakar S. Review of dengue hemorrhagic fever fatal cases seen among adults: a retrospective study. PLoS Negl Trop Dis. 2013;7(5):e2194. https://doi.org/10.1371/journal.pntd.0002194.
Anders KL, Nguyet NM, Chau NVV, Hung NT, Thuy TT, Lien LB, Farrar J, Wills B, Hien TT, Simmons CP. Epidemiological factors associated with dengue shock syndrome and mortality in hospitalized dengue patients in ho chi Minh City. Vietnam Am J Trop Med Hyg. 2011;84:127–34.
Wichmann O, Hongsiriwon S, Bowonwatanuwong C, Chotivanich K, Sukthana Y, Pukrittayakamee S. Risk factors and clinical features associated with severe dengue infection in adults and children during the 2001 epidemic in Chonburi, Thailand. Tropical Med Int Health. 2004;9:1022–9.
Kittigul L, Pitakarnjanakul P, Sujirarat D, Siripanichgon K. The differences of clinical manifestations and laboratory findings in children and adults with dengue virus infection. J Clin Virol. 2007;39(2):76–81. Epub 2007 May 15
Chuang VW, Wong TY, Leung YH, Ma ES, Law YL, Tsang OT, Chan KM, Tsang IH, Que TL, Yung RW, Liu SH. Review of dengue fever cases in Hong Kong during 1998 to 2005. Hong Kong Med J. 2008;14:170–7.
Wang CC, Lee IK, Su MC, Lin HI, Huang YC, Liu SF, Wu CC, Lin MC. Differences in clinical and laboratory characteristics and disease severity between children and adults with dengue virus infection in Taiwan, 2002. Trans R Soc Trop Med Hyg. 2009;103:871–7.
Hon KL, Nelson EA. Gender disparity in paediatric hospital admissions. Ann Acad Med Singap. 2006;35:882–8.
Kabra SK, Jain Y, Pandey RM, Madhulika ST, Tripathi P, Broor S, Seth P, Seth V. Dengue haemorrhagic fever in children in the 1996 Delhi epidemic. Trans R Soc Trop Med Hyg. 1999;93:294–8.
Hammond SN, Balmaseda A, Perez L, Tellez Y, Saborio SI, Mercado JC, Videa E, Rodriguez Y, Perez MA, Cuadra R, Solano S, Rocha J, Idiaquez W, Gonzalez A, Harris E. Differences in dengue severity in infants, children, and adults in a 3-year hospital-based study in Nicaragua. Am J Trop Med Hyg. 2005;73:1063–70.
Uhart M, Blein C, L’Azou M, Thomas L, Durand L. Costs of dengue in three French territories of the Americas: an analysis of the hospital medical information system (PMSI) database. Eur J Health Econ. 2016;17:497–503.
Peeling RW, Artsob H, Pelegrino JL, Buchy P, Cardosa MJ, Devi S, Enria DA, Farrar J, Gubler DJ, Guzman MG, Halstead SB, Hunsperger E, Kliks S, Margolis HS, Nathanson CM, Nguyen VC, Rizzo N, Vázquez S, Yoksan S. Evaluation of diagnostic tests: dengue. Nat Rev Microbiol. 2010 Dec;8(12 Suppl):S30–8.
WHO. Dengue guidelines for diagnosis, treatment, prevention and control. 3rd ed. Geneva: World Health Organization; 2009.
The authors would like to extend their sincere gratitude and appreciation to Ms. Teo JS and Ms. Lena Yeap for their efforts in managing and analyzing the data. We would like to thank the Director of Sultanah Aminah Hospital, Tengku Ampuan Afzan Hospital, Kuala Lumpur Hospital, Tengku Ampuan Rahimah Hospital, Seberang Jaya Hospital, Sultanah Bahiyah Hospital and Subang Jaya Medical Centre for their approval in data collection from respective hospitals. We would also like to thank Dr. Md. Khadzir bin Sheikh Ahmad, Deputy Director of Health Informatics Centre for his support in this study; and the Director General of Health for the approval in publishing the study. This study could not have been completed without the helps and assistance given by the Clinical Research Centre of respective hospitals and Mr. Yen from Subang Jaya Medical Centre. We would also like to thank Dr. Hor Chee Peng for proof reading the manuscript. Lastly, we wish to thank all those whose names are not mentioned here who render their excellent service especially during the data collection.
This study is funded by the Ministry of Health Malaysia.
Availability of data and materials
The datasets used and/or analysed during the current study are available from the corresponding author upon request.
All authors conceptualized the study. TOL designed the study and performed data analysis and interpretation. YLW, KYL, SFZMA involved in project management and data collection. All authors are subject matter expertise and wrote the original draft. All authors read and approved the final manuscript.
Ethics approval and consent to participate
This study is based on available hospital discharge records and retrospective review of medical records. The Ministry of Health’s (MOH) Medical and Research Ethics Committee formally waived the requirement for individual informed consent and approved the study.
Consent for publication
Not applicable. The manuscript does not report on any individual participant’s data in any form (images, videos, voice recordings etc).
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
ICD-10-CM Diagnosis Codes. (DOCX 13 kb)