Confounding correlations: Enlightenment and noise in the age of so much data
Data, data, everywhere — but how to make sense of it all? In a playful new interactive data visualization, we look at the challenges of finding meaning among the correlations.
In an age of expanding data and ever more powerful tools to process and visualize it, it is increasingly difficult to filter out the noise and find meaning in a sea of numbers. With this piece, Arcadia vice president of product development Nick Stepro has fun with this concept by running over 1,000 combinations of metrics through a correlation matrix. The links he visualizes range from the obvious to the completely unexpected.
Among the obvious findings: risk scores, healthcare expenditures and inpatient utilization are all positively correlated, with costs and inpatient utilization showing the strongest link. Less obvious: counties with a higher rate of women physicians are more densely populated and have lower per-capita Medicare spending and fewer knee replacements per 1,000. Nurse practitioners are more common in states with less college attendance and are also associated with higher rates of colonoscopies.
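For readers curious how a correlation matrix surfaces links like these, here is a minimal sketch in Python. It is not the code behind the visualization. It assumes the Pearson coefficient as the correlation measure (the piece does not say which was used), and the metric names and values are invented purely for illustration.

```python
from itertools import combinations
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


def correlation_pairs(metrics):
    """Correlate every pair of metrics and rank pairs by strength (|r|)."""
    pairs = [
        (a, b, pearson(metrics[a], metrics[b]))
        for a, b in combinations(sorted(metrics), 2)
    ]
    return sorted(pairs, key=lambda p: abs(p[2]), reverse=True)


# Toy values invented for illustration only; not drawn from CMS or Census data.
toy_metrics = {
    "risk_score": [0.9, 1.0, 1.1, 1.2, 1.3],
    "cost_per_capita": [8000, 8600, 9100, 9900, 10400],
    "knee_replacements_per_1000": [5.0, 4.6, 4.9, 4.1, 4.3],
}

ranked = correlation_pairs(toy_metrics)
for a, b, r in ranked:
    print(f"{a} vs {b}: r = {r:+.2f}")
```

With n metrics there are n(n-1)/2 distinct pairs, so a few dozen metrics are already enough to produce more than 1,000 combinations. At that scale some strong-looking correlations will appear by chance alone, which is the noise the piece pokes fun at.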
D3.js SVG, with Illustrator
Data from CMS Public Use File, US Census American Community Survey, and CMS NPPES NPI Registry