What is your intuition? Do those analysis results look too good to be true? Do they just not make sense? Do the findings contradict another piece of data you learned about recently?
Whenever you have those doubts, you may want to go back to this basic question: what did you measure? How did you know the data you chose were a good representation of the real-world phenomenon you wanted to assess? This article is about finding representative measures.[1]The other angle of representativeness is whether the group of people, products, situations etc. you are measuring is expected to be very similar to the group you want to assess or make decisions for. … Continue reading
Consider this example from health care:
Who is eligible for high-risk care programs?
In many communities, health providers, such as hospitals, physicians and outpatient service providers, formally work together as a “health system.” Health systems often enroll patients with complex health needs in high-risk care programs.
These programs help patients better manage their health needs. Additionally, they have been shown to reduce the number of emergency room visits and hospitalizations, therefore saving costs.
But who decides if a patient has complex health needs? The primary care physician? What if the patient does not have one? What if that doctor doesn’t have access to the patient’s health history across providers and therefore lacks the bigger picture?
How do you measure health needs?
Health insurance companies come closest to having a complete picture. They have your entire claims history. Using this data, you can develop an algorithm that will identify the health conditions, doctor visitation patterns etc. that point to severe health problems. Once a patient shows this pattern, you can auto-enroll them in the high-risk care program.
However, to develop this algorithm you must describe “severe health problems” or “health needs”, the real-world phenomenon you want the algorithm to identify, in a language it can understand. You need a good data representation or good measure for health needs.
In this case, the algorithm’s developers chose to use health care cost. Why? It is easy to obtain since insurance companies know what they paid for treatments. Also, unsurprisingly, when you see high healthcare bills, you find that these patients indeed have poor health.
The wrong measure
What sounds like a logical and elegant solution at first, turned out to be a problem.[2]You can find a summary of the study here, including the reference to the actual study: https://review.chicagobooth.edu/economics/2019/article/how-racial-bias-infected-major-health-care-algorithm The algorithm’s recommendations led to a disproportionately lower enrollment of Black people with evidently complex health needs.
How? Many health care systems have historically spent relatively less money on the same health conditions when exhibited by Black people versus white people. As such, health care cost, the measure, turned out to be a poor global representation of actual health needs, the real-world situation they tried to assess.
When a measure is representative it means it moves in a predictable direction as the real-world behavior changes. Health needs grow, health care cost grows.
There is a pattern you can see if you graphed the data. And, importantly, the pattern holds for the entire group of people, products, events or whatever you want to draw conclusions about. On the latter point health care cost as an indicator of health needs failed.
Clarify ambiguous measures
Let’s look at some popular measures from different domains and their technical definitions: