Skip to main content

Simpson's Paradox

 


My random reading in matters mathematical has brought me to Simpson's Paradox, the idea that a clear trend in a group of data can disappear, or even reverse, when different groups of data that taken severally exhibit the same trend are aggregated. 

It is named for Edward Simpson, who described the effect in "The Interpretation of Interaction in Contingency Tables" in the Journal of the Royal Statistical Society in 1951.

Colin Blyth rediscovered this paper and gave the phenomenon the name Simpson's Paradox in a 1972 paper.

It was once referenced in an episode of the prime-time cartoon The Simpsons because ... well, the name.

Wikipedia has a good discussion.

This is not really a "paradox" at all, though the name will probably stick.  It wouldn't impress Zeno, or whoever started the thing about the lying Cretan. But the phrase "Simpson's odd-seeming phenomenon" doesn't seem very resonant. 

Good brief discussion in pp. 70 - 72 of Titelbaum, Fundamentals of Bayesian epistemology.

The following example comes from Titelbaum's book. It involves two basketball pros active in the 2016-2017 season in the NBA: James Harden of the Houston Rockets and DeMar DeRozan of the Toronto Raptors. 

1. Harden made a higher percentage of his two-point shots than DeRozan.

2. Harden also made a higher percentage of his three-point shots than DeRozan.

Given both of those facts, you probably would not expect this:

3. Harden made a smaller percentage of his overall shots than DeRozan. 

Aggregation can kill a trend. The forest can be very different from the trees.  Insert some other cliche here. I know, right?  

Comments

  1. Facts are pretty crucial to getting to the crux of anything. If one bases a conclusion on less than all of them, one's *conclusion* is no more than an educated inference. Inference does not=reality in such matters. I would hold that the "paradox" is not a paradox at all---only an incomplete assessment. Jack Webb, in *Dragnet*: gimme the facts, m'am, just the facts*.
    [I used to do some administrative law work]

    ReplyDelete
  2. This is, arguably, diissociated from paradox...maybe. Today, I read a post on "elite, social justice activism". That struck me as odd. At best. The terms, *elite*, and, *social justice * are mutually contradictory, if not outright antagonistic, seems to me. When did elitists become advocates of social justice? Answer: when phrases such as *effective altruism* and a more-or-less similar characterization of philanthropy surfaced, d'aurtrement (spelling?), they are not. I could not log a comment on the blog, without committing money. Ding! Lost my attention, tout de suite. Et, non, mon tablet, il ne parle pas Francais, o, Espanol. Love your posts, ami.
    Carry on.

    ReplyDelete

Post a Comment

Popular posts from this blog

A Story About Coleridge

This is a quote from a memoir by Dorothy Wordsworth, reflecting on a trip she took with two famous poets, her brother, William Wordsworth, and their similarly gifted companion, Samuel Taylor Coleridge.   We sat upon a bench, placed for the sake of one of these views, whence we looked down upon the waterfall, and over the open country ... A lady and gentleman, more expeditious tourists than ourselves, came to the spot; they left us at the seat, and we found them again at another station above the Falls. Coleridge, who is always good-natured enough to enter into conversation with anybody whom he meets in his way, began to talk with the gentleman, who observed that it was a majestic waterfall. Coleridge was delighted with the accuracy of the epithet, particularly as he had been settling in his own mind the precise meaning of the words grand, majestic, sublime, etc., and had discussed the subject with William at some length the day before. “Yes, sir,” says Coleridge, “it is a maj...

The Lyrics of "Live Like You Were Dying"

Back in 2004 Tim McGraw recorded the song "Live Like You were Dying." As a way of marking the one-decade anniversary of this song, I'd like to admit that a couple of the lines have confused me for years. I could use your help understanding them. In the first couple of verses, the song seems easy to follow. Two men are talking, and one tells the other about his diagnosis. The doctors have (recently? or a long time ago and mistakenly? that isn't clear) given him the news that he would die soon. "I spent most of the next days/Looking at the X-rays." Then we get a couple of lines about a man crossing items off of his bucket list. "I went sky diving, I went rocky mountain climbing, I went two point seven seconds on a bull named Fu Man Chu." Then the speaker -- presumably still the old man -- shifts to the more characterological consequences of the news. As he was doing those things, he found he was loving deeper and speaking sweeter, and givin...

Five Lessons from the Allegory of the Cave

  Please correct me if there are others. But it seems to be there are five lessons the reader is meant to draw from the story about the cave.   First, Plato  is working to devalue what we would call empiricism. He is saying that keeping track of the shadows on the cave wall, trying to make sense of what you see there, will NOT get you to wisdom. Second, Plato is contending that reality comes in levels. The shadows on the wall are illusions. The solid objects being passed around behind my back are more real than their shadows are. BUT … the world outside the the cave is more real than that — and the sun by which that world is illuminated is the top of the hierarchy. So there isn’t a binary choice of real/unreal. There are levels. Third, he equates realness with knowability.  I  only have opinions about the shadows. Could I turn around, I could have at least the glimmerings of knowledge. Could I get outside the cave, I would really Know. Fourth, the parable a...