Episode10.doc version of October 19, 2000
A Mathematical Theory For Identifying and Measuring Severity of Episodes of Care
Farrokh Alemi, Ph.D.
Valentin Prudius
Health System Management Program
College of Nursing and Health Science
George Mason University
Fairfax VA
This is confidential information based on patent application 10/054,706 filed on 1/24/2002 by George Mason University. We grant permission to individual scientists within university, Federal and State governments settings to use these procedures free of licensing fees for the purpose of evaluating its effectiveness.
Acknowledgment
The methodology for identifying episodes of care was constructed while Farrokh Alemi was at George Mason University and collaborating with P.J. Maddox Ph.D.
Abstract
Objectives: We propose and test a method for constructing episode of care from administrative databases.
Subjects: We created a measure for severity of episodes of illness for 565 randomly chosen developmentally delayed children who were enrolled in the Medicaid program of one Southeastern State.
Study Design: Regression analysis was conducted to test the percent of variance explained by our proposed mathematical model in cost of care.
Data Collection: Data included both hospitalizations and clinic visits obtained from Medicaid programs from one Southeastern State.
Methods: For each patient, the likelihood that two diagnoses might be part of the same episodes was calculated based on the similarity of the diagnoses and the time between them. Similarity of two diagnoses was estimated by the rate of concurrence of two diagnoses within 30 days for the same patient. The estimated likelihood ratios were used to classify patients into episodes. The severity of an episode was calculated as a Muliplicative Utility function of severity of each diagnosis. To evaluate our method of constructing episodes, we regressed cost of care on the number of episodes, average severity of the episodes, and the interaction between number and average severity of the episodes.
Results: The number of episodes (alpha = 0.001), the average severity of the episodes (alpha = 0.001), and the product of the two (alpha = 0.001) had statistically significant relationships to average cost of the case. The three variables together explained 53% of variation in cost of care.
Conclusions: These data suggest that our proposed mathematical approach is reasonable and produces severity scores that are predictive of objective criteria such as cost of care. The paper ends with an algorithm that can be used on any encounter database that reports time of diagnoses and diagnoses of the patient.
Key Words: Episodes of Care, Administrative Databases, Severity of Illness, Utilization of Services, Multi-attribute Utility models.
Background
This paper provides a mathematical model for identifying episodes of care and measuring the severity of an episode. Measures of episodes of care, in general, and the approach proposed here, in particular, can be used to set capitation rates [[1]] or to profile clinicians’ performance [[2]]. Numerous approaches to measuring episodes of care exist [[3]]. Examples include Prospective Risk Adjustment [[4]], Ambulatory Visit Groups [[5]], Disease Staging [[6]], Products of Ambulatory Care [[7]], Ambulatory Diagnosis Groups [[8]] and Ambulatory Care Groups [[9]]. In addition to broad approaches to measurement of episodes of illness, many have developed disease specific episodes of care [[10], [11], [12], [13], [14]]. There have been a number of review articles about episodes of care [e.g., [15]]. Given the wide range of approaches to measurement of episodes of care, the natural question to ask is why we need yet another approach? Three reasons have motivated us to seek a new approach to measuring episodes of care.
First, we provide a mathematical model for measuring episodes of care. No other approach does so. Most existing approaches to measuring episodes of care do not describe the internal procedures used for measuring severity or identifying episodes of care. Some commercial approaches consider such information as business secrets that should not be disclosed. Even when they do describe the internal mechanism of the approach, all rely on heuristics that make clinical sense but do not provide a mathematical theory for the relation between the variables used in constructing episodes of care. Thus, researchers face a black box -- the content of which they know little about or is based on heuristics that they cannot easily modify and reapply. In the absence of a theory, it is difficult to learn from one study how better measures can be constructed. Each study and each approach exists on its own merits and fails to contribute to the other. Then researchers tend to compete on claims of accuracy rather than to build on each other's work. As a result, while many approaches exist, cumulative progress in the field – where in one investigator builds on another person’s approach, has been limited. The mathematical theory proposed in this paper allows us to accumulate information and improve our understanding of how severity of episodes of care should be measured. Future researchers can change the theory to arrive at more accurate predictions. As new insights are found, the theory is modified and knowledge is accumulated.
Second, our proposed approach does not classify diagnoses into clusters of diseases before identifying episodes of care. All existing approaches are built on the concept of classifying possible diagnoses into a few clusters and then findings rules for creating episodes for these clusters. Schneeweiss and colleagues reported that 92 diagnosis clusters make up 86 percent of all ambulatory visits [[16]]. Others have expanded this set to 125, with varying levels of severity and different time periods, during which the diagnoses in the cluster will belong to the same episode [[17]]. We propose an approach that does not attempt to reduce the large set of possible diagnoses into a smaller set of clusters. Reductionist approaches, by definition, give up important nuances in order to have a manageable set of diagnoses. For example, infections often follow wounds and therefore may be considered part of the same episode. But an otitis media, even though an infection of the ear, could not possibly be part of an episode of trauma to the leg. Defining all infections as one cluster of diagnoses forces investigators to ignore important differences that might exist between types of infections. In our approach, all operations are defined on individual diagnoses without need to pre-set these diagnoses into broad clusters. Sometimes classification of diseases into clusters is based on the etiology of the disease, leading in our view to counter intuitive classifications. An episode of trauma may include fracture to the knee as well as fracture of the leg – even though the knee and leg are different problems. Congestive heart failure may be part of episode of myocardial infarction even though one involves the heart the other the lung. Two very dissimilar diagnoses may be part of the same episode, even though these diagnoses do not describe the same illness.
Third, the objective of our proposed approach, contrary to some existing approaches [e.g., [18], [19], [20]], is not to create homogenous resource use episodes. Thus, not all follow-up visits are part of the same episode even though they may all be short visits and therefore have similar resource use. In our approach the nature of the diagnosis, not the intensity of visits, is the basis of classifying visits into episodes. For example, follow-up visit for myocardial infarction is part of an MI episode and a follow-up visit for trauma is part of trauma episode. Intensity-based measures cannot be used for evaluating whether the numbers of visits are appropriate. In essence, they are fee schedules, except that these fee schedules are based on groups of visits or diagnoses and not single visit diagnosis. We propose a relation-based episode classification system that remedies this important shortcoming. It can be used to judge appropriateness of number of visits.
We start our description of the methods with a few definitions. An episode of care is a group of diagnoses on the same patient that describes the course of a given illness. Note that this definition does not depend on the nature of services delivered, the doctor delivering services, or the site of services [[21]]. Nor, contrary to others [[22]], does this definition assume that services are temporally contiguous. Thus, it allows for episodes to be overlapping; for example, a patient may have an acute exacerbation of their chronic diabetes and experience an episode of upper respiratory infection.
Episodes are characterized by what we call "an anchor diagnosis." This is the diagnosis that gives its name to the episode. Episodes have starting (sometimes referred to as trigger diagnosis [[23]]) and stopping points that may be different from the anchor diagnosis. Episodes are characterized by a rate of progression, a peak severity during the course of episode, and morbidity and mortality outcomes. One episode, for example, may have a rapid onset, progress to a very serious condition, and then lead to death. Another episode may have a slow onset and never become serious.
Pia= function {Tia, Sia}
Where the similarity between the diagnosis “i” and diagnosis “a” is Sia; and number of days between diagnosis “i” and diagnosis “a” is Tia and calculated as:
Tia = Ta – Ti Tia > 0
Note that the probability of being part of the same episode, Pia, should be directly related to similarity of two diagnoses, Sia, and inversely related to, Tia, the time between the two diagnoses. A specific mathematical function that preserves these two relationships is:
Pia= aSia / (1+bTia)
In the above equation, a and b are constants.
When a patient presents with several diagnoses, then the probability that any two of the diagnoses may belong to an episode is calculated using the above formula. Later, these pair-wise probabilities of belonging to the same episode are used to classify diagnosis into groups -- using one of many widely available classification methods. For a specific example of a classification algorithm see Appendix at end of this paper.
Severity of an episode
Diagnoses differ in terms of their severity. We show severity of diagnosis “i” as Sevi and calculate the overall severity of an episode by the following mathematical formula:
Overall severity of episode = 1- pi (1 - Sevi)
There are many different mathematical formulas for aggregating severity of individual diagnosis to severity of an episode. The most common approach is to add or average the severity scores for each diagnosis. Adding scores is not appropriate, as episodes with few severe diagnoses would be scored lower than episodes with many non-severe diagnoses. Averaging is also not appropriate, as patients who have two diagnoses, one severe and the other not, will be rated lower than patients with just the severe diagnosis. Instead of adding or averaging the scores, we prefer using the above Multiplicative model. For example, if a patient has two diagnoses, one with severity score 0.9 and another with severity score 0.5, then the overall severity of the episode is calculated as:
Overall severity for the patient = 1 -(1-0.9)*(1-0.5) = 0.95
Compared to the adding or the averaging formula, the multiplicative formula has several advantages: The influence of severe diagnoses on the overall score are not diluted by non-severe diagnoses and merely increasing the number of diagnoses will not necessarily result in high overall severity scores.
evaluatioN of Proposed Approach and Measurement of parameters
We created a measure for severity of episodes of illness for developmentally delayed children who were enrolled in the Medicaid program of one Southeastern State. Developmentally delayed children use health services extensively. To reduce computational difficulties and without loss of generality, we randomly sampled 565 patients among the 3250 patients in the database.
The data included both in-patient and outpatient Medicaid payments for the patient. The in-patient portion included both the health professionals billing and the institution's bills. On average, the State paid $9,296 per patient per year. The standard error of the cost was $2,238, reflecting large variation in cost of care across patients. Cost ranged from low of $29 (reflecting patients enrolled for portion of the year) to $884,967 per year.
Estimating the time between two diagnoses, Tia, is easy and can be read directly from the database by taking the absolute value of the difference in dates of the two diagnoses. Estimating the similarity of the two diagnoses, Sia, was more difficult. A surrogate measure of similarity of two diagnoses is the number of times the two diagnoses co-occur within a specific time frame. The implicit assumption is that complications and related problems tend to occur in clusters.
We calculated a score proportional to the likelihood that two diagnoses belong to the same episode by the formula provided earlier. We used this score to classify diagnoses into episodes in such a manner that diagnoses within one episode were more similar than diagnoses in different episodes. Appendix A gives a detailed example of how diagnoses were classified. The mean number of episodes was 147 (standard error = 320). Patients differed considerably in the number of episodes they had.
We calculated the severity of each diagnosis as the average amount paid for the diagnosis. Severity and costs are not always related, especially when patients die before expensive services can be delivered. But in our database no patient passed away. Therefore, cost may have been a reasonable surrogate measure of severity.
To test the accuracy of our measures of episodes of care, we regressed cost of care on severity of the episode, number of episodes and interaction between number of episodes and severity of episodes. We measured cost of care by the amount the State paid for each encounter. Since patients' eligibility for Medicaid changes frequently, the amount paid by the State is only an approximate measure of total cost of care of the patient. To have one estimate of severity for a patient, we averaged the severity scores for each patient across all their episodes during the year. The averaged severity score ranged from 0.01 to 0.27. The mean was 0.03 (standard error = 0.001).
Table 5 summarizes regression results. The dependent variable was "the amount paid by the State". All three independent variables -- "the average severity of the episodes", "the number of episodes of the patient" and “the interaction between the severity and the number of episodes” -- were statistically significant predictors of the dependent variable at alpha levels lower than 0.001. The R-Squared adjusted by degrees of freedom was 53%.
Data showed that episodes of care can be constructed from encounter databases. Furthermore, the proposed measure of episode of care explained a large percentage of variance in cost of care. The magnitude of the percent of variance explained by the measures reported here is of special interest. Many measures of severity and case mix report R2 values less than 10%.[24] Because our approach explains a large percent of the variance, our confidence in the validity of our measure of severity of episodes is increased.
One may expect the performance of the approach developed in this paper to deteriorate when parameters of the model are estimated from one database and applied to another unrelated database. Nevertheless the magnitude of percent of variations in objective data explained by our approach is so high that we are hopeful that even with drops in performance, our approach will remain relatively more accurate than many existing approaches.
The approach proposed here can be used to construct episodes of care for specific diseases. Thus, if one investigator is interested in episodes for diabetes and another is interested in episodes of cancer, both can use the algorithm proposed here by pre-selecting patients with a particular disease.
The most appealing part of the proposed approach is the ease with which the approach can be integrated with existing databases. The proposed mathematical model works on any administrative database, which has information on date of visit and diagnosis. Any person familiar with database operations can implement it. In addition, electronic medical record companies can use the algorithm proposed here to embed methods of analyzing performance of clinicians within their electronic record systems.
1) Start with a Table of patient unique identification number, diagnosis and time of diagnosis. Here is a small example database:
Time (dd/mm/yy) |
Patient ID |
Diagnosis |
01/01/01 |
1001 |
A |
12/01/01 |
1001 |
B |
22/01/01 |
1002 |
A |
12/01/01 |
1002 |
B |
22/01/01 |
1003 |
C |
02/02/01 |
1001 |
D |
02/02/01 |
1002 |
B |
12/02/01 |
1003 |
D |
13/02/01 |
1003 |
B |
01/05/01 |
1002 |
C |
2) Create a query identifying for any pair of diagnoses the number of unique patients for whom the two diagnoses co-occur within 30 days. Note that the co-occurrence of diagnosis "a" and "b" does not depend on the order of which one comes first. Here is how the query will look like for the above example data:
First diagnosis |
Second diagnosis |
Co-occurrences |
|
First diagnosis |
Second diagnosis |
Co-occurrences |
A |
A |
2 |
|
C |
A |
0 |
A |
B |
2 |
|
C |
B |
1 |
A |
C |
0 |
|
C |
C |
2 |
A |
D |
1 |
|
C |
D |
1 |
B |
A |
2 |
|
D |
A |
1 |
B |
B |
2 |
|
D |
B |
2 |
B |
C |
1 |
|
D |
C |
1 |
B |
D |
2 |
|
D |
D |
2 |
3) For each patient conduct the following analysis:
a) For the patient, when the same diagnosis occurs at two different time periods, rename the diagnoses into unique names -- usually a combination of the name and date of diagnosis. For example patient 1002 has the following data when renamed:
Time (dd/mm/yy) |
Patient ID |
Diagnoses |
12/01/01 |
1002 |
B1201 |
22/01/01 |
1002 |
A |
13/02/01 |
1002 |
B1302 |
01/05/01 |
1002 |
C |
b) For the patient, measure the absolute value of the length of time between any pair of diagnoses for the patient, refer to this as time between any two diagnoses. For example for patient 1002 the time between two different diagnoses will be:
First diagnosis |
Second diagnosis |
Time |
|
First diagnosis |
Second diagnosis |
Time |
A |
B1201 |
10 |
|
B1302 |
A |
21 |
A |
B1302 |
21 |
|
B1302 |
B1201 |
31 |
A |
C |
38 |
|
B1302 |
C |
17 |
B1201 |
A |
10 |
|
C |
A |
38 |
B1201 |
B1302 |
31 |
|
C |
B1201 |
48 |
B1201 |
C |
48 |
|
C |
B1302 |
17 |
c) For the patient, look up the similarity of any pair of different diagnoses they have from step "2" and divide this by absolute value of the time between the two diagnoses, from step "b". Refer to this as the score. For example for the patient 1002 the results will be:
First diagnosis |
Second diagnosis |
Time |
|
First diagnosis |
Second diagnosis |
Time |
A |
B1201 |
2/10= .20 |
|
B1302 |
A |
2/ 21=.10 |
A |
B1302 |
2/21= .10 |
|
B1302 |
B1201 |
2/ 31=.06 |
A |
C |
0/ 38=0 |
|
B1302 |
C |
1/17= .06 |
B1201 |
A |
2/10=.20 |
|
C |
A |
0/ 38=0 |
B1201 |
B1302 |
2/31= .06 |
|
C |
B1201 |
1/ 48=.02 |
B1201 |
C |
1/48= .02 |
|
C |
B1302 |
1/ 17=.06 |
d) For the patient, standardized the score so that it ranges between 1 and zero by subtracting the minimum value from each score and dividing the results by the difference of maximum and minimum score. Refer to this as standardized score. For the patient 1002 the standardized score is as follows:
First diagnosis |
Second diagnosis |
Time |
|
First diagnosis |
Second diagnosis |
Time |
A |
B1201 |
1.0 |
|
B1302 |
A |
.50 |
A |
B1302 |
.48 |
|
B1302 |
B1201 |
.30 |
A |
C |
.00 |
|
B1302 |
C |
.30 |
B1201 |
A |
1.0 |
|
C |
A |
.00 |
B1201 |
B1302 |
.32 |
|
C |
B1201 |
.10 |
B1201 |
C |
.10 |
|
C |
B1302 |
.30 |
e) Classify different diagnoses into episodes by using the standardized score. The following is one classification procedure that could be used:
i) Combine the two diagnoses with maximum standardized score into one episode if the value of the standardized score is higher than a pre-set cutoff -- usually 0.5.
ii) Create a new diagnosis to represent the two diagnoses that were combined into an episode. Calculate the standardized score for this new diagnosis by averaging the standardized score of its two components.
iii) Exclude the diagnoses that have already been combined into new diagnoses from further analysis and repeat steps starting from step "i".
For example, the data for case 1002 will follow these steps:
Ø Maximum is 1 therefore diagnoses A and B1201 are combined.
|
A |
B1201 |
B1302 |
C |
A |
|
1.0 |
.48 |
.00 |
B1201 |
1.0 |
|
.32 |
.10 |
B1302 |
.50 |
.32 |
|
.30 |
C |
.00 |
.10 |
.30 |
|
Ø A new diagnosis is created named AB1201 and standardized scores for the new diagnosis are calculated as the average of its component:
|
A |
B1201 |
AB1201 |
B1302 |
.50 |
.32 |
(.5+.32)/2 |
C |
.00 |
.10 |
(.00+.10)/2 |
Ø The diagnosis already combined into an episode are excluded from further analysis and the steps are repeated and a new maximum of 0.41 is found.
|
B1302 |
C |
AB1201 |
B1302 |
|
.30 |
.41 |
C |
.30 |
|
.05 |
Ø The new maximum is not higher than the cutoff of 0.5. Therefore, no other diagnoses are combined into new episodes.
Ø The result of the calculation for patient 1002 was three episodes:
1. The combination of diagnosis A and diagnosis B on 12/01/01.
2. Diagnosis B on 13/02/01 by itself.
3. Diagnosis C by itself.
Note that diagnosis B on 13/02/01 was not combined with diagnosis B on 12/01/01 even though both are the same diagnosis.
An Access program that achieves this algorithm follows.
Table 1: Regression of "Amount paid by the State" on severity and number of episodes
|
Coefficients |
P-value |
Intercept
|
-7297 |
0.003 |
Average severity of episodes
|
-33.58 |
0.000 |
Number of episodes
|
444971 |
0.000 |
Product of number of episodes and average severity of episodes |
756 |
0.000 |
Adjusted R Squared = 53.11%
|
Number of observations = 565 |
References
[1] Brailer DJ, Kroch EA. Member risk adjustment for ambulatory episodes of care. Health Care Manag Sci 1999 Jul; 2 (3):125-36.
[2] Miller LG. Provider profiling: advancing to episodes of care. Physician Exec 1995 Oct; 21 (10): 40-1.
[3] Hornbrook MC, Hurtado AV, Johnson RE. Health care episodes: Definitions, measurement and use. Medical Care Review, 1985, 42(2): 163-218.
[4] Goldfield N, Averill R, Eisenhandler J, Hughes JS, Muldoon J, Steinbeck B, Bagadia F. The prospective risk adjustment system. J Ambulatory Care Manage 1999 Apr; 22 (2): 41-52.
[5] Schneider KC, Lichtenstein JL, Freeman JL, Newbold RC, Fetter RB, Gottlieb L, Leaf P, Portlock CS. "Ambulatory visit groups: An outpatient classification system." Journal of Ambulatory Care Management, 1988, 11(3): 1-12.
[6] Rosen AK, Houchens RL, Gibson TB, Mayer-Oakes A. Developing episodes of care for adult asthma patients: a cautionary tale. Am J Med Qual 1998 Spring;13(1):25-35.
[7] Tenan PM, Fillmore HH, Caress B, et al. PACs: Classifying ambulatory care patients and services for clinical and financial management. Journal of Ambulatory Care Management 1988, 11 (3), 36-53.
[8] Weiner JP, Starfield BH, Steinwachs DM, Mumford LM. A case mix system for application to ambulatory care quality assurance and utilization review. Presented at the American Medical Review Research Center's Annual Research Symposium, Washington 1988,
[9] Starfield B, Weiner J, Mumford L, Steinwachs D. Ambulatory care groups: a categorization of diagnoses for research and management. Health Services Research 1991 Apr;26(1):53-74.
[10] Schulman KA, Yabroff KR, Kong J, Gold KF, Rubenstein LE, Epstein AJ, Glick H. A claims data approach to defining an episode of care. Health Serv Res 1999 Jun; 34 (2): 603-21.
[11] Mehta SS, Suzuki S, Glick HA, Schulman KA. Determining an episode of care using claims data. Diabetic foot ulcer. Diabetes Care 1999 Jul;22(7):1110-5.
[12] Rosen AK, Mayer-Oakes A . Developing a tool for analyzing medical care utilization of adult asthma patients in indemnity and managed care plans: can an episodes of care framework be used?Am J Med Qual 1998 Winter;13 (4): 203-12.
[13] Moscovice I. A method for analyzing resource use in ambulatory care settings. Medical Care 1977, XV (12): 1024-1044.
[14] Gold M. Effects of hospital based primary care setting on internists treatment of primary care episodes. Health Services Research 1981, 16 (4): 383-405.
[15] Rosen AK, Mayer-Oakes A. Episodes of care: theoretical frameworks versus current operational realities. Jt Comm J Qual Improv 1999 Mar; 25 (3): 111-28.
[16] Schneeweiss R, Rosenblatt RA, Cherkin DC, Kirkwood CR, Hart G. Diagnostic clusters: A new tool for analyzing the content of ambulatory medical records. Medical Care 1983, XXI (1): 105-122.
[17] Cave DG. Profiling physician practice patterns using diagnostic episode clusters. Medical Care 1995, 33 (5): 463-486.
[18] Sala J, Riba MD, Manresa F. A way of obtaining isoresource consumption care episodes. J Ambulatory Care Manage 1999 Apr;22 (2): 53-62.
[19] Stimson DH, Charles G, Rogerson CL. Ambulatory care classification systems. Health Serv Res 1986 Feb;20(6 Pt 1):683-703
[20] Rogerson CL, Stimson DH, Simborg DW, Charles G. Classification of ambulatory care using patient-based, time-oriented indexes. Med Care 1985 Jun; 23(6): 780-8.
[21] Pineault R. The effect of prepaid practice on physicians utilization behavior. Medical Care 1976; 14: 121-36.
[22] Young KM, Fisher CR. Medicare episodes of illness: a study of hospital, skilled nursing facilities, and home health agency care. Health Care Financing Review 1980; 2: 1-23.
[23] Morris SS, Cousens SN, Lanata CF, and Kirkwood BR. Diarrhea -- defining the episode. International Journal of Epidemiology, 1994, 23 (3): 617-623.
[24] Ettner SL, Frank RG, McGuire TG, Newhouse JP, Notman EH . Risk adjustment of mental health and substance abuse payments. Inquiry 1998 Summer;35(2):223-39.