Research article | Open | Open Peer Review | Published:
Calculating hospital length of stay using the Hospital Episode Statistics; a comparison of methodologies
BMC Health Services Researchvolume 17, Article number: 347 (2017)
Accurate calculation of hospital length of stay (LOS) from the English Hospital Episode Statistics (HES) is important for a wide range of audit and research purposes. The two methodologies which are commonly used to achieve this differ in their accuracy and complexity. We compare these methods and make recommendations on when each is most appropriate.
We calculated LOS using continuous inpatient spells (CIPS), which link care spanning across multiple hospitals, and spells, which do not, for six conditions with short (dyspepsia or other stomach function, ENT infection), medium (dehydration and gastroenteritis, perforated or bleeding ulcer), and long (stroke, fractured proximal femur) average LOS. We examined how inter-area comparisons (i.e. benchmarking) and temporal trends differed. We defined a classification system for spells and explored the causes of differences.
Stroke LOS was 16.5 days using CIPS but 24% (95% CI: 23, 24) lower, at 12.6 days, using spells. Smaller differences existed for shorter-LOS conditions including dehydration and gastroenteritis (4.5 vs. 4.2 days) and ENT infection (0.9 vs. 0.8 days). Typical patient pathways differed markedly between areas and have evolved over time. One area had the third shortest stroke LOS (out of 151) using spells but the fourth longest using CIPS. These issues were most profound for stroke and fractured proximal femur, as patients were frequently transferred to a separate hospital for rehabilitation, however important disparities also existed for conditions with simpler secondary care pathways (e.g. ENT infections, dehydration and gastroenteritis).
Spell-based LOS is widely used by researchers and national reporting organisations, including the Health and Social Care Information Centre, however it can substantially underestimate the time patients spend in hospital. A widespread shift to a CIPS methodology is required to improve the quality of LOS estimates and the robustness of research and benchmarking findings. This is vital when investigating clinical areas with typically long, complex patient pathways. Researchers should ensure that their LOS calculation methodology is fully described and explicitly acknowledge weaknesses when appropriate.
Within the UK, hospital bed capacity has come under increasing pressure from the dual threat of growing demand within emergency departments  and increasing discharge delays . Reductions to hospital length of stay (LOS) could release pressure on beds, provide a timely boost to deteriorating hospital finances , and improve patient outcomes (e.g. reduced infections ). Benchmarking, where hospitals or regions are compared to identify opportunities for LOS reductions, could be undermined by inaccuracies in the way LOS is commonly calculated and reported. Accurate LOS calculations are crucial for a variety of other audit and research purposes including forecasting patient flow, designing interventions to reduce discharge delays, and evaluating policy impact.
Within England, LOS measurement is primarily driven by the way hospital care is reported . Inpatient treatment is recorded by hospitals, collated by the Health and Social Care Information Centre (HSCIC), and released as part of the Hospital Episode Statistics (HES). HES data are widely used by publicly-funded and commercial organisations, including the National Health Service (NHS), to better understand and improve hospital care. HES are recorded at the finished consultant episode (FCE) level, which represents the time spent under the care of a single consultant. These are frequently joined together to create spells [6,7,8,9,10], the time spent within a single hospital (which may include multiple FCEs), or continuous inpatient spells (CIPS) [11,12,13,14,15], the entire period of inpatient care (which may include spells at multiple hospitals).
FCEs and spells are susceptible to vagaries in the way hospitals organise their care, and in particular their propensity to transfer patients between consultants or to new hospitals. Theoretically, CIPS overcome these limitations and provide a more reliable measure of LOS, however creating these requires episode-level data, substantial computational power and experienced analysts. For this reason organisations often default to a spell-based analysis , however the impact of this decision on study findings remains unclear. An improved understanding of the bias of spell-based LOS could increase the quality of data provided to policymakers, lead to more robust decisions, and improved patient outcomes.
In this paper we empirically investigate the magnitude of differences between using a CIPS- and spell-based methodology when calculating LOS nationally, benchmarking across areas, and investigating temporal trends. We define a classification system for spells and use this to explore the causes of differences.
This study was completed as part of a wider programme of work investigating geographic variation in unplanned ambulatory care sensitive condition (ACSC) admission rates. ACSCs are those where admission can potentially be avoided through improved community or primary care . Our original study included 28 common ACSCs however, for simplicity, we focussed this study on a subset of six. We selected two conditions with a short LOS (dyspepsia or other stomach function, ENT infection), medium LOS (dehydration and gastroenteritis, perforated or bleeding ulcer) and long LOS (stroke, fractured proximal femur). We identified admissions for each condition using ICD-10 diagnosis codes from previous work (Appendix) .
We used the HES admitted patient care dataset to identify admissions between 1st April 2007 and 31st March 2012 . We joined FCEs to create spells, and then joined spells to create CIPSs using a unique patient identifier. CIPS spanning over the extract end date (31st March 2012) were censored or omitted from the HES dataset entirely. Therefore we excluded all episodes in the 90 days prior to the extract end date and all CIPS (and their constituent episodes and spells) lasting more than 90 days. Stays censored by death were also excluded as they do not represent a complete hospital stay. Our analysis compared LOS across primary care trusts (PCTs). Until 2013 there were 151 PCTs in England which were responsible for commissioning most of the healthcare for their populations. PCTs have now been replaced by clinical commissioning groups (CCGs) which perform a broadly similar role but place an increased emphasis on the role of general practitioners.
Patient pathways can be complicated. To better understand the reasons for differences between a CIPS- and spell-based analyses we developed a classification system which categorised spells into four mutually exclusive and exhaustive categories. For ease of understanding we focus on a calculation of LOS for unplanned stroke admissions below, however the arguments are identical for other conditions.
Admission spell: The first spell within a CIPS. It encompasses the time between a patient being first admitted to hospital for unplanned stroke care until they are either discharged or transferred to another hospital.
Transfer spell: A subsequent spell after a patient is transferred to a different hospital to receive unplanned stroke care.
Rehabilitation spell: A subsequent spell after a patient is transferred to a different hospital to receive planned stroke care.
New condition spell: A subsequent spell after a patient is transferred to a different hospital to receive treatment for a non-stroke problem.
For example a patient with acute stroke might be first admitted to a local hospital (admission spell), transferred to a stroke unit for acute treatment (transfer spell), transferred back to a local hospital for rehabilitation (rehabilitation spell) and have an adverse event (e.g. fall) requiring transfer to an acute hospital (new condition spell). Figure 1 displays these graphically and highlights the major weaknesses of a spell-based methodology. Among four hospital stays lasting an identical amount of time (20 days), the estimated mean unplanned LOS using spell-based methods differs substantially depending on the patient’s pathway. As there is no linkage across hospitals in a spell-based analysis, time spent in ‘rehabilitation’ and ‘new condition’ spells is excluded entirely. Therefore patients admitted for unplanned stroke care, and then transferred to another hospital for rehabilitation or non-stroke care, may have only a fraction of their total hospital stay included in the LOS calculation (Spell 3 & 4, Fig. 1). Furthermore, whilst time spent in ‘transfer’ spells is included, these are treated as new admissions causing the mean LOS to be decreased by at least one half (Spell 2, Fig. 1). LOS is accurately measured when using CIPS regardless of the patient’s pathway, and is therefore regarded as the ‘gold-standard’ methodology.
We calculated the number of unplanned hospital stays, total bed days and mean LOS under a CIPS- and spell-based methodology for each year of the study. We calculated the percentage difference between a CIPS and spell analysis for each of these metrics. Using both methods, we also ranked PCTs from highest (longest mean LOS) to lowest (shortest mean LOS). We calculated the median and maximum absolute difference in rankings. To better understand the differences between a CIPS and spell analysis we calculated the proportion of hospital time spent in ‘admission’, ‘transfer’, ‘rehabilitation’ and ‘new condition’ spells at both the national and PCT level. As preliminary analysis revealed that time spent in rehabilitation spells was the most important driver of differences we explored how this differed across PCTs and evolved over time. We estimated 95% confidence intervals using non-parametric bootstrapping.
Stroke and fractured proximal femur mean LOS was 23.8% (95% CI: 23.2, 24.3) and 19.3% (95% CI: 18.8, 19.8) shorter when calculated using spells rather than CIPS (Table 1). Much smaller, although still important, differences of between 3.7% and 5.5% were found for other conditions. Differences mainly resulted from a lower number of bed days in the spell analysis and, more specifically, the exclusion of a substantial amount of time spent in rehabilitation (Table 2). For example, the 140,712 stroke bed days (19.6% of CIP total) spent in rehabilitation accounted for the vast majority (94%) of the 150,345 bed day disparity between and a CIPS- and spell-based analysis. Double counting of hospital admissions played a relatively minor role in driving differences in mean LOS as ‘transfer’ spells were rare across all conditions.
For fractured proximal femur and stroke, conditions with the longest LOS, there was little accord in the rankings of PCTs when using a CIPS- or spell-based methodology to calculate LOS (Fig. 2). The median difference for fractured proximal femur was 40 ranks, with extremely large disparities for individual PCTs (Fig. 2). For example, one PCT (PCT A) had the third highest rank (longest mean LOS) for fractured proximal femur when using a CIPS methodology yet the forth lowest (shortest LOS) under a spell framework representing a gulf of 145 (95% CI: 122, 148) ranks. Differences between rankings based on CIPS and spells were generally driven by extreme variability in the proportion of time spent in rehabilitation spells across PCTs (Fig. 3). In PCT A 61% of fractured proximal femur hospital stays were spent in rehabilitation meaning that a spell-based analysis provided an extreme underestimate of mean LOS compared to using CIPS (10.6 days vs. 28.6 days). In contrast, there were no transfers recorded among patients admitted for fractured proximal femur in another PCT (PCT B) meaning that LOS was identical regardless of how it was calculated. The median difference was seven ranks or less for the other conditions included in our study, however important differences in excess of 30 ranks, and up to 102 ranks for perforated or bleeding ulcer, still existed for some PCTs (Fig. 2).
In general, the discrepancy between a CIPS- and spell-based analyses increased over the study period (Fig. 4). For example, the difference in stroke LOS was 16.4% (95% CI: 15.9, 16.9) during 2007/8 and 23.8% (95% CI: 23.1, 24.3) during 2011/12. Similarly, the discrepancy for fractured proximal femur was 16.2% (95% CI: 15.9, 16.6) during 2007/8 and increased to 19.3% (95% CI: 19.0, 19.7) during 2011/12. In both cases these disparities where driven by a sharp increase in the number of rehabilitation spells. This was most pronounced among stroke patients where the proportion of time spent in rehabilitation increased by over 50% during the study period from 13.0% in 2007/8 to 19.6% in 2011/12. The difference between a CIPS- and spell-based analyses was relatively consistent across time for other conditions where patients were are typically treated within a single hospital (e.g. perforated or bleeding ulcer).
Statement of principal findings
Measuring length of stay using spells can lead to substantial underestimates of nearly 25% for some conditions. The typical patient pathway often differs between areas. Under a spell-based analysis this can impair benchmarking and lead regions to appear efficient simply because they transfer a large proportion of patients for rehabilitation. In general, the time spent in rehabilitation spells has increased over time which could undermine examination of temporal trends in LOS under a spell-based analysis. Each of these issues were most profound for stroke and fractured proximal femur, as patients were frequently transferred to a separate hospital for rehabilitation, however important disparities also existed for conditions with simpler pathways (e.g. ENT infections, dehydration and gastroenteritis).
Strengths and weaknesses
This analysis addresses an important, but often overlooked, methodological issue when using the HES dataset to calculate LOS nationally, compare between regions, or investigate temporal trends. By including a diverse range of conditions we have identified the circumstances under which these biases are largest, and when they are perhaps tolerable. We have used bootstrapping methods to calculate sampling distributions around key parameters (e.g. ranking of PCTs) which provides an objective measure of uncertainty.
The main weakness in our study lies in its potential lack of generalisability beyond those using the HES dataset; nevertheless it seems likely that similar issues will exist in any country where administrative data is collected within hospitals and used to guide decision making. The HES dataset is widely used for research and audit purposes  meaning that our findings have extremely important implications for NHS policymaking. Our decision to exclude spells censored by death may have introduced a small bias into our results. It is possible that more advanced competing-risk survival models could be used to overcome this . The spell classifications used within our analysis may be too simplistic to differentiate between the myriad of pathways a patient my take during a hospital stay. Future investigation into the causes and consequences of variable hospital pathways may require a more comprehensive system. For example, it may be useful to delineate ‘new condition’ spells related to medical error from those which are unpreventable.
Comparison with other studies
To our knowledge this is the first study to compare LOS using CIPS and spells. A previous study  found substantial differences in admission counts when they were calculated using FCEs and spells. Although that study did not investigate LOS, it seems highly likely that such analysis would have demonstrated disparities in LOS. In agreement with our findings, one study has highlighted vast differences among hospitals in the use of rehabilitation centres for patients admitted with hip fracture .
Implications for clinicians and policymakers
Accurate calculation of LOS is extremely important for a wide range of audit and research purposes. Benchmarking has been identified as a key tool to drive productivity savings in the National Health Service , however our analysis demonstrates this can be completely undermined when using spell LOS. This could severely limit the ability of NHS organisations to identify and act on improvement opportunities. Cross-sectional studies investigating the effect of patient or hospital characteristics on LOS have been commonly used to identify the most important drivers of LOS, and develop interventions to reduce discharge delays. However these factors (e.g. condition volume , clinical guidelines ) could appear to be strongly associated with spell LOS, when a relationship doesn’t actually exist with the total time spent in hospital, if they are correlated with the probability of hospital transfer. Similarly, before and after studies have been commonly utilised to investigate the effect of healthcare policies (e.g. payment by results , centralisation of stroke care ) on LOS however, when using a spell-based methodology, changes in LOS could be due to evolving patient pathways (e.g. more transfers for stroke rehabilitation) rather than any true difference in the time spent within hospital. It is unlikely that the deficiencies of a spell-based analysis could be overcome by statistical adjustment, as the causes of differing hospital pathways are likely to be complex and, in many cases, intangible. Our analysis empirically describes the potential bias of a spell-based analysis for the first time, and should provide a stimulus for improved methodological rigour.
Despite the pitfalls of calculating LOS using spells, this methodology is widely employed by NHS organisations and academic researchers. The HSCIC, which is the national provider of data to analysts and commissioners, presents national-level LOS using spells . Perhaps of even greater concern, given the results in our study, is that several NHS benchmarking tools including the NHS Better Care, Better Value Indicators , NHS Compendium of Information , and the RightCare Atlas of Variation  base at least some of their outputs on spell LOS. Inaccurate benchmarking analysis could lead to vital improvement opportunities being missed, or costly investigations being launched to solve problems that don’t actually exist. The academic literature also contains many studies using spell LOS which could undermine their conclusions [6,7,8,9,10]. For example, a study finding an association between the introduction of payment by results and reduced spell LOS  might be confounded by an increasing proportion of hospital transfers over time. Similarly, another study finding inter-hospital variation in four common types of surgery may simply reflect differences in patient pathways rather than true disparities in the time spent in hospital . Several studies did not provide sufficient detail on the methodology used to calculate LOS [28,29,30] which prevents readers from determining the robustness of their results.
Our results highlight the need for a step change in how LOS is calculated and reported. National data providers, such as the HSCIC, have sufficient resources to routinely report CIPS-based LOS and should switch to this methodology. Higher quality data could lead to more robust decisions and improved patient outcomes. Similarly, publishers of LOS benchmarking tools should ensure these are based on CIPS as spell-based comparisons are unreliable, even for conditions where care is typically provided by a single hospital. At the very least, spell-based LOS comparisons should explicitly acknowledge the weaknesses of this approach and advise caution when interpreting the results. It is perhaps understandable that small research teams with limited resources sometimes forgo the complex procedure of creating CIPS, and instead opt to use spell LOS. Our results suggest that this may be defensible providing they do not compare across areas and are solely interested in clinical areas where care is typically provided within a single hospital. Such analyses should always be accompanied by a report on the proportion of spells which end with a hospital transfer. However, CIPS-based analysis is always preferable and should be conducted when possible.
Accurate calculation of LOS is extremely important for a wide range of audit and research purposes. However commonly used spell-based methodologies may omit important parts of the patient’s pathway and undermine analyses aiming to calculate LOS nationally, benchmark across areas, and investigate temporal trends. National reporting organisations and researchers should calculate LOS using CIPS, particularly when investigating clinical areas with complex patient pathways, or conducting benchmarking. Researchers should ensure that their LOS calculation methodology is fully described and explicitly acknowledge weaknesses where appropriate. Future investigation into the causes and consequences of variable hospital pathways is required to understand its impact on healthcare costs and patient outcomes.
Ambulatory care sensitive condition
Clinical commissioning group
Continuous inpatient spells
Ear, nose and throat
Finished consultant episode
Hospital episode statistics
Health and social care information centre
Length of stay
National health service
Primary care trust
A&E Attendances and Emergency Admissions 2015-16 (Monthly) [https://www.england.nhs.uk/statistics/statistical-work-areas/ae-waiting-times-and-activity/statistical-work-areasae-waiting-times-and-activityae-attendances-and-emergency-admissions-2015-16-monthly-3/]. Accessed 3 April 2016.
House of Commons Library. Delayed transfers of care in the NHS. 2015.
NHS Trust Development Authority and Monitor. Quarterly report on the performance of the NHS provider sector: 9 months ended 31 December 2015. 2016.
Graffunder EM, Venezia RA. Risk factors associated with nosocomial methicillin-resistant Staphylococcus aureus (MRSA) infection including previous use of antimicrobials. J Antimicrob Chemother. 2002;49(6):999–1005.
Sinha S, Peach G, Poloniecki JD, Thompson MM, Holt PJ. Studies using English administrative data (Hospital Episode Statistics) to assess health-care outcomes--systematic review and recommendations for reporting. Eur J Public Health. 2013;23(1):86–92.
Street A, Gutacker N, Bojke C, Devlin N, Daidone S. Health services and delivery research. In: Variations in outcome and costs among NHS providers for common surgical procedures: econometric analyses of routinely collected data. Southampton (UK): NIHR Journals Library; 2014.
Martin S, Street A, Han L, Hutton J. Have hospital readmissions increased in the face of reductions in length of stay? Evidence from England. Health Policy. 2016;120(1):89–99.
Bell D, Lambourne A, Percival F, Laverty AA, Ward DK. Consultant input in acute medical admissions and patient outcomes in hospitals in England: a multivariate analysis. PLoS ONE. 2013;8(4):e61476.
Whitston M, Chung S, Henderson J, Young B. What can be learned about the impact of diabetes on hospital admissions from routinely recorded data? Diabet Med. 2012;29(9):1199–205.
Farrar S, Yi D, Sutton M, Chalkley M, Sussex J, Scott A. Has payment by results affected the way that English hospitals provide care? Difference-in-differences analysis. BMJ. 2009;339:b3047.
Morris S, Hunter RM, Ramsay AIG, Boaden R, McKevitt C, Perry C, Pursani N, Rudd AG, Schwamm LH, Turner SJ, et al. Impact of centralising acute stroke services in English metropolitan areas on mortality and length of hospital stay: difference-in-differences analysis. BMJ. 2014;349:g4757.
Bottle A, Mozid A, Grocott HP, Walters MR, Lees KR, Aylin P, Sanders RD. Preoperative stroke and outcomes after coronary artery bypass graft surgery. Anesthesiology. 2013;118(4):885–93.
Sanders RD, Bottle A, Jameson SS, Mozid A, Aylin P, Edger L, Ma D, Reed MR, Walters M, Lees KR, et al. Independent preoperative predictors of outcomes in orthopedic and vascular surgery: the influence of time interval between an acute coronary syndrome or stroke and the operation. Ann Surg. 2012;255(5):901–7.
James G, Hugh G, Rita S, Luigi S. Long term care provision, hospital length of stay and discharge destination for hip fracture and stroke patients. In.: Centre for Health Economics. York: University of York; 2013.
Gill PJ, Goldacre MJ, Mant D, Heneghan C, Thomson A, Seagroatt V, Harnden A. Increase in emergency admissions to hospital for children aged under 15 in England, 1999-2010: national database analysis. Arch Dis Child. 2013;98(5):328–34.
Purdy S, Griffin T, Salisbury C, Sharp D. Ambulatory care sensitive conditions: terminology and disease coding need to be more specific to aid policy makers and clinicians. Public Health. 2009;123(2):169–73.
Hospital Episode Statistics [http://www.hscic.gov.uk/hes]. Accessed 3 April 2016.
Brock GN, Barnes C, Ramirez JA, Myers J. How to handle mortality when investigating length of hospital stay and time to clinical stability. BMC Med Res Methodol. 2011;11:144.
Aylin P, Williams S, Bottle A, Jarman B. Counting hospital activity: spells or episodes? BMJl. 2004;329(7476):1207.
The National Hip Fracture Database. The National Hip Fracture Database National Report 2012 - Supplement. 2012.
Reducing Length of Stay Supporting Information [http://www.productivity.nhs.uk/Reports/IndicatorSupportingInformationPdf?indicatorId=615&practiceCode=National&percentileId=2&yearQtrId=18]. Accessed 3 April 2016.
Conway PH, Keren R. Factors associated with variability in outcomes for children hospitalized with urinary tract infection. J Pediatr. 2009;154(6):789–96.
Price LC, Lowe D, Hosker HS, Anstey K, Pearson MG, Roberts CM, British Thoracic S, the Royal College of Physicians Clinical Effectiveness Evaluation U. UK National COPD Audit 2003: Impact of hospital resources and organisation of care on patient outcome following admission for acute COPD exacerbation. Thorax. 2006;61(10):837–42.
Hospital Episode Statistics, Admitted Patient Care, England - 2012-13: Metadata [http://content.digital.nhs.uk/catalogue/PUB12566]. Accessed 8 May 2016.
NHS Better Care, Better Value Indicators [http://www.productivity.nhs.uk/]. Accessed 3 April 2016.
Dataset: Mean Length of Stay for Inpatients with Long Term Neurological Conditions [https://indicators.ic.nhs.uk/webview/]. Accessed 3 April 2016.
RightCare. NHS Atlas of Variation in Healthcare 2015: Indicator Metadata. Map 84. 2015.
Liddle AD, Judge A, Pandit H, Murray DW. Adverse outcomes after total and unicompartmental knee replacement in 101 330 matched patients: a study of data from the National Joint Registry for England and Wales. Lancet. 2014;384(9952):1437–45.
Judge A, Chard J, Learmonth I, Dieppe P. The effects of surgical volumes and training centre status on outcomes following total joint replacement: analysis of the Hospital Episode Statistics for England. J Public Health. 2006;28(2):116–24.
Cheung CR, Smith H, Thurland K, Duncan H, Semple MG. Population variation in admission rates and duration of inpatient stay for bronchiolitis in England. Arch Dis Child. 2013;98(1):57–9.
None to declare.
Availability of data and materials
Pseudonymised Hospital Episode Statistics (HES) data were provided to the University of Bristol, School of Social and Community Medicine by The Health & Social Care Information Centre (The HSCIC) under data re-use agreement RU919. HES data are copyright © 2013, Re-used with the permission of The HSCIC. All rights reserved. No additional data will be made available.
All authors conceived the study. JB conducted the analysis and drafted the manuscript. WH and SP critically revised the article for intellectual content. All authors read and approved the final manuscript.
The authors declare that they have no competing interest.
Consent for publication
Ethics approval and consent to participate
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Included ACSCs and ICD-10 codes used to define them. List of ICD-10 codes used to identify admissions for each condition within the analysis.