Editing 1950: Chicken Pox and Name Statistics

Jump to: navigation, search

Warning: You are not logged in. Your IP address will be publicly visible if you make any edits. If you log in or create an account, your edits will be attributed to your username, along with other benefits.

The edit can be undone. Please check the comparison below to verify that this is what you want to do, and then save the changes below to finish undoing the edit.
Latest revision Your text
Line 8: Line 8:
  
 
==Explanation==
 
==Explanation==
This is another comic with one of [[Randall|Randall's]] [[:Category:Fun fact|fun facts]].
 
 
 
In this comic statistics are used to point out some non-intuitive correlations.  The first panel sketches out the prevalence of {{w|Chickenpox|chicken pox}} by age in the {{w|United States}}. <br />
 
In this comic statistics are used to point out some non-intuitive correlations.  The first panel sketches out the prevalence of {{w|Chickenpox|chicken pox}} by age in the {{w|United States}}. <br />
 
As the graph indicates, prior to the introduction of the {{w|Varicella vaccine|varicella vaccine}} in the United States, it was an exceptionally common childhood illness, with almost 100% of the population experiencing it at some point. The illness is highly memorable (since the symptoms last for days and are intensely uncomfortable) and noticeable (since the characteristic blisters are distinctive and difficult to hide), meaning that it was once a common experience that people expected to both experience and see in their peers. <br />
 
As the graph indicates, prior to the introduction of the {{w|Varicella vaccine|varicella vaccine}} in the United States, it was an exceptionally common childhood illness, with almost 100% of the population experiencing it at some point. The illness is highly memorable (since the symptoms last for days and are intensely uncomfortable) and noticeable (since the characteristic blisters are distinctive and difficult to hide), meaning that it was once a common experience that people expected to both experience and see in their peers. <br />
Line 18: Line 16:
 
The final panel points out that these trends, taken together, generate the interesting effect that you can, in some cases, estimate the odds of someone having had chicken pox, based solely on their first name. Having a name like "Brian" or "Sarah" raises the odds that you're over 30, which raises the odds that you had chicken pox. People named "Harper" or "Jaxon" are almost certainly young enough to have grown up with the vaccine in broad use.  These time-based trends predict both the odds of a person having had the illness personally, and the odds that they grew up in a time when infections were common and generally expected.  
 
The final panel points out that these trends, taken together, generate the interesting effect that you can, in some cases, estimate the odds of someone having had chicken pox, based solely on their first name. Having a name like "Brian" or "Sarah" raises the odds that you're over 30, which raises the odds that you had chicken pox. People named "Harper" or "Jaxon" are almost certainly young enough to have grown up with the vaccine in broad use.  These time-based trends predict both the odds of a person having had the illness personally, and the odds that they grew up in a time when infections were common and generally expected.  
  
The cartoon demonstrates the correlative fallacy, i.e. what can go wrong if one attempts to draw conclusions based on a random comparison of two variables, as described by the famous saying: "{{rw|Correlation_does_not_imply_causation|Correlation does not imply causation}}". In this case, there's a real correlation between names and the incidence of a particular disease. A superficial reading could suggest that either certain names make people prone to the disease, or that the disease, in some way, impacts a person's name. The real cause of this correlation is simply that certain trends just happen to coincide, causing them to statistically correlate without either variable having a real causal affect on the other.  
+
The cartoon demonstrates a particular type of correlative-based fallacy, i.e. what can go wrong if one attempts to draw conclusions based on a random comparison of two variables. In this case, there's a real correlation between names and the incidence of a particular disease. A superficial reading could suggest that either certain names make people prone to the disease, or that the disease, in some way, impacts a person's name. The real cause of this correlation is simply that certain trends just happen to coincide, causing them to statistically correlate without either variable having a real causal affect on the other.  
  
 
The citations are real articles. The first citation ''DOI:10.15585/mmwr.mm6534a4'' is on the {{w|Centers for Disease Control and Prevention|Center for Disease Control}} (CDC) web site at [https://www.cdc.gov/mmwr/volumes/65/wr/mm6534a4.htm] and the second citation ''DOI:10.1016/j.vaccine.2012.05.050'' is an article in Vaccine at [https://www.sciencedirect.com/science/article/pii/S0264410X12007761?via%3Dihub]. Both articles describe the effects of the vaccine for varicella which is the virus that causes chicken pox and shingles (also known as herpes zoster).
 
The citations are real articles. The first citation ''DOI:10.15585/mmwr.mm6534a4'' is on the {{w|Centers for Disease Control and Prevention|Center for Disease Control}} (CDC) web site at [https://www.cdc.gov/mmwr/volumes/65/wr/mm6534a4.htm] and the second citation ''DOI:10.1016/j.vaccine.2012.05.050'' is an article in Vaccine at [https://www.sciencedirect.com/science/article/pii/S0264410X12007761?via%3Dihub]. Both articles describe the effects of the vaccine for varicella which is the virus that causes chicken pox and shingles (also known as herpes zoster).
Line 25: Line 23:
  
 
==Transcript==
 
==Transcript==
:[The first panel shows graph with only the x-axis labeled and with seven labeled ticks. The y-axis has three ticks with percentage labels. A red line rises from bottom to top as it goes from left to right. There is an arrow pointing at the graph with a label. Above the left part of the red line there is a title and beneath that a reference to the source of the data in gray font.]
+
:[The first panel shows graph with only the x-axis labeled and with seven labeled ticks. The y-axis has three ticks with percentage labels. A red line rises from bottom to top as it goes from left to right. There is an arrow pointing at the graph with a label. Above the left part of the red line there is a title and beneath that a reference to the source of the data in gray font:]  
:[According to the graph, the percentage is close to 0 for ages below 15 and close to 100 for ages above 30.]
 
 
:Fraction of kids your age who got chicken pox
 
:Fraction of kids your age who got chicken pox
 
:<span style="color:gray">(Very rough US estimates based on DOI:10.15585/mmwr.mm6534a4 and DOI:10.1016/j.vaccine.2012.05.050)</span>
 
:<span style="color:gray">(Very rough US estimates based on DOI:10.15585/mmwr.mm6534a4 and DOI:10.1016/j.vaccine.2012.05.050)</span>
Line 35: Line 32:
 
:<span style="color:gray">Vaccine introduced in 1995</span>
 
:<span style="color:gray">Vaccine introduced in 1995</span>
  
:[The second panel shows a graph with only the x-axis labeled and with seven labeled ticks. The graph has six gray lines with labels on them. To the right is a title and beneath that a reference to the source of the data in gray font.]
+
 
:[According to the graphs, Sarah and Brian are more popular names for older age groups.]
+
:[The second panel shows a graph with only the x-axis labeled and with seven labeled ticks. The graph has six gray lines with labels on them. To the right is a title and beneath that a reference to the source of the data in gray font:]
 
:Relative popularity of several names in your age group
 
:Relative popularity of several names in your age group
 
:<span style="color:gray">(Source: ssa.gov)</span>
 
:<span style="color:gray">(Source: ssa.gov)</span>
 +
:Your age: 5&nbsp;&nbsp;10&nbsp;&nbsp;15&nbsp;&nbsp;20&nbsp;&nbsp;25&nbsp;&nbsp;30&nbsp;&nbsp;35
 
:Harper
 
:Harper
 
:Sarah
 
:Sarah
Line 44: Line 42:
 
:Brian
 
:Brian
 
:Brooklyn
 
:Brooklyn
:Logan
+
:Logan.
:Your age: 5&nbsp;&nbsp;10&nbsp;&nbsp;15&nbsp;&nbsp;20&nbsp;&nbsp;25&nbsp;&nbsp;30&nbsp;&nbsp;35
 
  
 
:[The third panel shows a list of names with a percentage next to them. Above the list is a title and beneath that a statement in gray font:]
 
:[The third panel shows a list of names with a percentage next to them. Above the list is a title and beneath that a statement in gray font:]
Line 69: Line 66:
 
[[Category:Charts]]
 
[[Category:Charts]]
 
[[Category:Fun fact]]
 
[[Category:Fun fact]]
[[Category:Kids]]
 

Please note that all contributions to explain xkcd may be edited, altered, or removed by other contributors. If you do not want your writing to be edited mercilessly, then do not submit it here.
You are also promising us that you wrote this yourself, or copied it from a public domain or similar free resource (see explain xkcd:Copyrights for details). Do not submit copyrighted work without permission!

To protect the wiki against automated edit spam, we kindly ask you to solve the following CAPTCHA:

Cancel | Editing help (opens in new window)