Difference between revisions of "3262: Sports Commentary"

Explain xkcd: It's 'cause you're dumb.
Jump to: navigation, search
(Explanation)
(Explanation: Title text missing in explanation. Has mentioned in incomplete reason and put in a place holder for where the text should be explained.)
Line 10: Line 10:
  
 
==Explanation==
 
==Explanation==
{{incomplete|This page was created at a statistically insignificant time, but it is the FIRST PAGE TO START WITH 3262. Don't remove this notice too soon.}}
+
{{incomplete|This page was created at a statistically insignificant time, but it is the FIRST PAGE TO START WITH 3262. Don't remove this notice too soon. There is no mention of the title text.}}
  
 
{{w|P-hacking}} is the academically problematic practice of attempting to come up with a question for which the data offers a significant ''p''-value (probability value), a subject [[882: Significant|previously covered]] in comic form. This is in contrast to correct scientific analysis, in which a realistic question is formulated clearly and then answered (or shown to be unjustified) with data.
 
{{w|P-hacking}} is the academically problematic practice of attempting to come up with a question for which the data offers a significant ''p''-value (probability value), a subject [[882: Significant|previously covered]] in comic form. This is in contrast to correct scientific analysis, in which a realistic question is formulated clearly and then answered (or shown to be unjustified) with data.
Line 19: Line 19:
  
 
Randall satirizes this with an example in which the restriction uses very specific criteria largely irrelevant to gameplay patterns in order to narrow down the subgroup sample size to a measly two games. The 0-2 record (there were two situations considered as comparable, and neither of them resulted in the result hoped for in this current case) reflects random noise much more than any significant insight. As well as being irrelevant to gameplay, their ''p''-hacking also makes the game sound like jargon, which can be confusing and difficult to understand. This is ironic given a sports commentator's job is supposed to be to explain the situation they are fronting, rather than making them more vague and incomprehensible. However, this may be the inevitable response to being left in front of the camera during breaks in play, or even during periods of gameplay that are nominally unremarkable — feeling the pressure to say ''something'', they will draw upon ever more obscure and irrelevant details to justify their (or their off-screen advisors') efforts and expertise to entertain and inform the viewing public.
 
Randall satirizes this with an example in which the restriction uses very specific criteria largely irrelevant to gameplay patterns in order to narrow down the subgroup sample size to a measly two games. The 0-2 record (there were two situations considered as comparable, and neither of them resulted in the result hoped for in this current case) reflects random noise much more than any significant insight. As well as being irrelevant to gameplay, their ''p''-hacking also makes the game sound like jargon, which can be confusing and difficult to understand. This is ironic given a sports commentator's job is supposed to be to explain the situation they are fronting, rather than making them more vague and incomprehensible. However, this may be the inevitable response to being left in front of the camera during breaks in play, or even during periods of gameplay that are nominally unremarkable — feeling the pressure to say ''something'', they will draw upon ever more obscure and irrelevant details to justify their (or their off-screen advisors') efforts and expertise to entertain and inform the viewing public.
 +
 +
The title text... Needs to be explained here!
  
 
{{cot|A breakdown of the commentary's statement}}
 
{{cot|A breakdown of the commentary's statement}}

Revision as of 12:40, 24 June 2026

Sports Commentary
The plural of anecdote may not be data, but the singular of data is anecdote.
Title text: The plural of anecdote may not be data, but the singular of data is anecdote.

Explanation

Ambox warning blue construction.png This is one of 44 incomplete explanations:
This page was created at a statistically insignificant time, but it is the FIRST PAGE TO START WITH 3262. Don't remove this notice too soon. There is no mention of the title text. If you can fix this issue, edit the page!

P-hacking is the academically problematic practice of attempting to come up with a question for which the data offers a significant p-value (probability value), a subject previously covered in comic form. This is in contrast to correct scientific analysis, in which a realistic question is formulated clearly and then answered (or shown to be unjustified) with data.

A common way of doing p-hacking is analyzing subgroups to attempt to find significance when the full dataset does not yield statistically significant results; for instance, arbitrarily restricting the analysis of medical data to male subjects to derive a significant p-value when the inclusion of female subjects would have changed the conclusion. There are actual biological reasons why treatments may work differently between the different groups, and other reasons why female subjects may be less suitable participants in the trial, but a post facto decision to only present the 'male data' would be problematic. Similarly, looking at many other retrospective distinctions and then choosing to present only the possibly-random patterns that stood out, and ignoring all those that did not, would be questionable.

Sports commentators are known to do a form of p-hacking in which they recall facts regarding past performance, and sometimes they are made to sound more significant by choosing only such 'facts' that coincide heavily with the situation developing in front of them. By using overly narrow superlatives, a severe form of narrowing down of applicability (which was previously covered), it also realistically reduces any real confidence that such a dwindling number of precedents are a useful predictor of how the upcoming event will turn out.

Randall satirizes this with an example in which the restriction uses very specific criteria largely irrelevant to gameplay patterns in order to narrow down the subgroup sample size to a measly two games. The 0-2 record (there were two situations considered as comparable, and neither of them resulted in the result hoped for in this current case) reflects random noise much more than any significant insight. As well as being irrelevant to gameplay, their p-hacking also makes the game sound like jargon, which can be confusing and difficult to understand. This is ironic given a sports commentator's job is supposed to be to explain the situation they are fronting, rather than making them more vague and incomprehensible. However, this may be the inevitable response to being left in front of the camera during breaks in play, or even during periods of gameplay that are nominally unremarkable — feeling the pressure to say something, they will draw upon ever more obscure and irrelevant details to justify their (or their off-screen advisors') efforts and expertise to entertain and inform the viewing public.

The title text... Needs to be explained here!

This comic was published 11 days into the 2026 FIFA World Cup. The World Cup was also the subject of 3260: Messi, published the previous Wednesday. Sports commentary was also the subject of 904: Sports.

Transcript

Ambox warning green construction.png This is one of 28 incomplete transcripts:
Don't remove this notice too soon. If you can fix this issue, edit the page!
[Cueball and Ponytail are sitting at a table, looking at the wall behind them. On the wall is a screen showing a soccer field with some mostly unreadable score information above it. The only readable information is that the score is 2-1.]
Cueball: They could be in trouble. Over the last 36 years, they've gone 0 for 2 when they've scored in the 37th minute to lead 2-1 against a team whose country comes before theirs alphabetically.
[Caption below comic:]
I wish sports commentators hadn't discovered p-hacking.

comment.png  Add comment      new topic.png  Create topic (use sparingly)     refresh discuss.png  Refresh 

Discussion

Did this example actually happen? 47.151.65.120 04:33, 23 June 2026 (UTC)

This comic reminds me of 1122: Electoral Precedent and 2383: Electoral Precedent 2020. Generalizing coincidences.

I am not a native English speaker. What does " they've gone 0 for 2" mean? Obviously it cannot be the score, since they are already leading 2-1? Or does this refer to a previous match? And on a more general note, I am really surprised to discover the second football themed comic strip in a few days. OK it's the World Cup, but I always thought that Randall doesn't really care about sports? --92.209.171.90 08:37, 23 June 2026 (UTC)

I am a native English speaker, but it was also a bit impenetrable to me. In part, perhaps, because it was intended to sound impenetrable (as part of the joke). But, even if not, it may be because it's using Americanized sports-talk phrasing that just isn't (yet!) used so much in my more native Anglicised commentaries that I'm used to.
However, I think they're saying that "in the two specific occasions in which all those other conditions occur, they won in neither of them".
A simpler version being perhaps to state that a given team/player has gone nought-for-two in previous matches with their current opponent(s). The results of those contests might have been anything (the winner having gone to 3-2 after penalties, 6-love/6-love/6-love, a par-4 advantage or getting them all out for 178 — depending upon the sport), it's just the win/lose (or win/not-win) count thats "0 for 2".
But this is a case of Overly Narrow Superlative (overlapping with P-Hacking), making it a dubious analysis. Starting with ignoring all the games there are in which a given svoreline was not achieved in a particular minute of play. I think part of this set-up is the difference between Gridiron 'football'/"hand-egg" having tons of points scored, whereas this football (Soccer) often turns on comparatively low scores that (one-nil can be a worthy and entertaining win/loss, and even a no-score-draw might have been fun to watch if your side isn't in desperate need for a win). These commentators, or at least the US audience they're commentating to, are used to spieling things about "the last time they were down on the forty-yard line in the fifth quarter, with two home runs and a shot from the free-throw line in hand..." (look, I know I don't know what they'd really say, to any accuracy, there was no point even trying!), at least to fill in the copious down-time/time-out pauses. (Which isn't actually as easy with low-scoring but more ever-moving 'soccer', where there's often much to be said about current player and ball movements almost all the time; although a five-day international cricket test match(!) commentary on the radio does rather famously lapse into 'filler' like discussing the nice cake that was sent to them by a listener, in the gaps between balls being bowled...)
Sorry, that was a long and convoluted paragraph. (But then, so was the Explanation, before I decided to say this down here. I hope it's been tweaked since then. I'm only really guessing about the Leftpondian commentator-speak being parodied here, and ball-sports aren't really my main interest in the sprorting sphere itself. (But, regarding balls that aren't themselves spheres, I'd happily discuss Rugby League or Rugby Union, and why they're 'better'... though I would totally acknowledge Aussie Rules as a class of its own as far as such contact-sports go.)
HTH, HAND. 82.132.236.84 10:08, 23 June 2026 (UTC)
I'm also English, and it's totally alien to me too. GSLikesCats307 (talk) 11:53, 23 June 2026 (UTC)
I don't know what prompted the rant above, but if you don't care to read it, "going 0 for 2" means having 0 successes out of 2 chances. In the context of this commentary, it's referring to winning 0 games out of the 2 games that meet the criteria. It's not intended to sound impenetrable; it's a common phrasing.163.116.145.33 13:37, 23 June 2026 (UTC)
I don't know what prompted you to think it was a rant. It's certainly quite lengthy (in the context of discussion comments here - not in the grand scheme of things), but that's not really the definition of a rant. 82.13.184.33 14:57, 23 June 2026 (UTC)
Because it was lengthy, but it almost entirely ignored the question, focusing instead on your opinion on sports and their commentary styles. Going off on one vs. ranting...it's a blurry distinction. You absolutely did the former; possibly the latter. 0 for 2 (pronounced as a letter "O") simply means zero victories for two games played. It's not obscure terminology. Yorkshire Pudding (talk) 15:50, 23 June 2026 (UTC)
Yes, it is obscure. The only context in which “0 for 2” makes sense to me is in cricket: 0 runs scored, 2 batsmen out. The 0 of / out of / from 2 described above is just not something which would occur to me, and it's largely because of that ‘for’. If it's common, why have I never heard it on (for example) Match of the Day? Randomnonsense (talk) 16:44, 23 June 2026 (UTC)
It's hard to argue that a usage common to the US - the largest English-speaking country in the world - Canada (https://www.google.com/search?q=%22went+0+for%22+site%253Athestar.com), and Australia (https://www.google.com/search?q=%22went+0+for%22+site%3Aabc.net.au, including their coverage of American sports leagues, local sports leagues, international tennis, and international cricket) is obscure. Your personal knowledge is not a good metric for obscurity. 163.116.145.44 18:50, 23 June 2026 (UTC)
It's clearly not a primarily UK-English construction, but it's nevertheless the sort of thing that this UK-native copy editor would not be concerned about should it come up in a publication for a UK audience. It might be whined about as an "Americanism" by the sort of people who make those sorts of complaints, but they would recognise it as a phrase they're supposed to get huffy about, rather than being baffled by it. Faux baffled, maybe...
It isn't obscure though. Yorkshire Pudding (talk) 10:27, 24 June 2026 (UTC)
It didn't focus on my opinion - it wasn't my comment. I agree it was long and arguably somewhat digressive. But that still isn't the definition of a rant. A rant implies some degree of anger or worked-up-ness, and it didn't have either of these. It was even apologetic for its own length and convolutedness. 82.13.184.33 08:26, 24 June 2026 (UTC)
Precisely what Yorkshire Pudding said; you could have stopped after your third paragraph, or really even after your second, and you wouldn't have lost any relevant information. I'll add that the rantiness is enhanced by saying things like Gridiron 'football'/"hand-egg", and the very "sportsball"-coded "the last time they were down on the forty-yard line in the fifth quarter, with two home runs and a shot from the free-throw line in hand...," which, apart from their irrelevance to the topic, have a very superior air. 163.116.145.44 16:51, 23 June 2026 (UTC)
"you could have stopped after your third paragraph, or really even after your second" - I couldn't, because I didn't write it. 82.13.184.33 08:26, 24 June 2026 (UTC)
Indeed, I did. You just had the misfortune to be in the same general IP block as me, then (and maybe me now, too, but I won't know that until I post, or perhaps preview, and render my signature-tildes accordingly). Also, you seemed to appreciate my style more than others, so doubly guilt-by-association.
Explanation for everyone else: I didn't want to be as dry and dull as just tersely snapping back "it means <foo>, of course", perhaps with an implied (but not intended) "...as should be obvious with but a moment's thought. How well I managed that is open for discussion.
The "But this is a case..." perhaps should have been given its own space (it's what I was thinking of saying, before I realised I could also try to helpfully clarify). And I should know by now (with forty years of netizenship under my belt) that my attempts at playful self-deprecation might not necessarily be read in the tone of humour that I thought I was writing in. Note to self: go back to using excessive smileys, rather than rationing them. Then deal with the complaints that I over-smiley! :(
This is, of course, me speaking for myself, in my own voice, here in the Talk section. Knowing that everyone else can safely ignore me ifI'm just being boring.
But if I make a tonal/terminological error in the Explanation (or, at best, an ambiguity), it's open to editing. And editing it down. So if I just write something like "colour" (by accident, usually, not ignorance... though something like "on accident" sounds/reads so strange to me that I might tend to presume that "by accident" is no less valid under US grammar) then please change it to "color" (but it wasn't a 'typo', just a misregionalised/-ized slip of the fingers). And true overexplaining can always be cut out/down. But also reinstated, if that's in turn considered over-cut.
All this is now firmly meta-territory. But it looked like it needed clarification (and a direct apology from me to 82.13.*, when I was coming in via 82.132.*) even if the price for that was a drift through what I think is a bit of comforting drawl about the periphery information, to lighten the mood again, but may possibly be just as unwelcome a block of contribution. So apologies for where my judgement in that turned out to be grossly wrong. ("Get a blog", you say? BTDTGTTS, but this is just for here and now.)
We now return you to your regularly scheduled program(/programme). 82.132.237.27 12:31, 24 June 2026 (UTC)
Worth noting I think that it's a common AMERICAN phrasing. It's hardly ever used in England. 2a0a:ef40:5b7:1c01:dedd:33f3:54a1:6705 (talk) 17:22, 23 June 2026 (UTC) (please sign your comments with ~~~~)

Closest match I can find is Germany - Curacao but there Germany took the lead in the 38th minute (not the 37th). I leave the deep dive on Germany's record against teams alphabetically before them when they have taken the lead 2-1 in the 37th/38th minute to someone else...

And, of course, Germany destroyed Curaçao 7-1, just like they did to Brazil (which is also alphabetically before Germany!) 12 years prior Wilh3lm (talk) 12:38, 23 June 2026 (UTC)
As I recall, the Germans had already scored 7 before the Brazilians scored, and quite a few people independently came up with Ger-many Braz-nil… Randomnonsense (talk) 16:48, 23 June 2026 (UTC)
I did a limited look into possible edge-cases, but didn't get anywhere so far as to confirm all minutes-of-goals for those. Concentrated on (fairly recent) games with more than a 2-2 final score, as the requirements are that they must have at least been 2-1 and then at least got levelled up by the lagging side, before the possible tie-breaking penality shootouts.
I suppose I could have first narrowed it down to every game with a reported 37th minute goal (given the rarity of that exact event, by apparently common agreement) then tallying up the precise game-state at that time, plus the final result. But I was looking into the other fine details first. 81.179.200.152 23:14, 23 June 2026 (UTC)

Don't assume "bad" p-hacking without looking closer at the data: Sometimes what looks like P-hacking is really finding previously-unseen patterns. If you have a drug trial on a drug that you have no reason to think will show gender differences and you are asking "is this drug better than existing drugs" and the results are inconclusive, then you do "p-hack" subgroups and find that in males between the ages of 18 and 50 it demonstrates superior results, you MAY be cherry-picking results or you MAY have found a hidden pattern. Assuming your sub-group size isn't ridiculously small, you can legitimately claim that you need more funding for a follow-up study or at least a follow-up analysis of this subgroup in previous studies of the same drug. 150.221.155.241 13:35, 23 June 2026 (UTC)

Yes, more data is usually the solution. The comic deliberately uses an extremely small dataset. You can make up almost any hypothesis and find 2-3 datapoints that fit it. Barmar (talk) 14:24, 23 June 2026 (UTC)
Or, perhaps more commonly, if you have a sufficiently large dataset you can mine through it and come up with two or three interesting-looking 'hypotheses' that it'll appear to support, even if you didn't have any to start with. 82.13.184.33 15:02, 23 June 2026 (UTC)
The difference is that taking a lot of data and looking at many possible patterns to that data is likely to reveal artefacts of mere chance.
Considering a single possible pattern and looking to see if it is justified is (even if, as an individual pattern, exactly as much a coincidental artefact or not) useful. So one might legitimately be able to suggest that the male subjects may show a legitimate long-term improvement to a treatment (possibly either because it works better with male hormones, or because there aren't the same underlying hormonal variations across each month, or just because it works better for different body-mass/fat distributions that are more typically male than female), or perhaps vice-versa (that it actually works more usefully for females, again for such reasons).
But filtering inconclusive results through many possible sorting algorithms and extracting 'nuggets' of apparent significance devalues those nuggets. Especially if it gets more complexly combinatorial. Imagine the number of criteria you might have considered, together and individually, to establish anything like "the treatment was twice as effective as the control treatment in males less than 30 with an older brother and females over 38 whose job is in education". How you'd even rationalise/explain such a complicated cause/effect relationship is one thing. That you've probably discarded so many other datums you had (out of hundred subjects, you're decided to ignore maybe half the people because of being under-/over-age, in this gendered thing, cutting back far more than that for the family/employment requirements and then of the remainder (probably somewhat <10, I'd guess, depending on where you actually got the initial subject-pool) and then, for such a comparison to stand up, the non-discards must be further split between on-treatment and whatever off-treatment/old-treatment you're comparing to, is an issue of practicality which is the other issue. (At least until you come up with a decent reason to recruit these subsets explicitly for a follow-up study, assuming the relationship doesn't just vanish/revert-to-the-mean when you do as it was pure chance after all.)
Realistically, you should weight the apparent significance of 'results' according to how many results you actively looked for. (e.g. had fifty possible things, divide the significance of any positive 'match' by 50. Although I can see reasons why division by 50**2 or ln(50) would be better, depending upon the scenario relevant to this kind of siftingt.) But better just to avoid that.
Which might be a big problem in AI-derived research results. You're asking an algorithm to make an uncounted large number of separate interpretations (including of exactly what question it is that you're asking) and then it's returning the 'nicest' results that seemingly fulfil its mysterious 'training' insofar as they aim to be the single/several results from the largely randomised treatments that are judged to be the least incomprehensible. 81.179.200.152 20:00, 23 June 2026 (UTC)

The saddest part of all this is that Randall, in the title text of this comic, might be one of the last persons in the English-speaking world to recognize that "data" is a plural noun, the singular of which (icymi) is "datum". 147.81.27.244 15:38, 23 June 2026 (UTC)

The Latin word "data" is a plural now, but the English word "data" is [+ sing/pl verb], and in most writing is used as a plural. It's really only used as a singular in scientific and technical contexts. Borrowed English words don't follow Latin rules, and "data" is not alone in this. Meetings have an agenda, even though in Latin an agenda is many agendum. https://www.merriam-webster.com/grammar/some-data-about-datum. --49.255.140.250 01:38, 24 June 2026 (UTC)
I can assure you that Randall knows full well how "data" operates and that's part of the joke. If all you have is a single data point, you can extrapolate whatever you want from it; which makes it no better than noise or anecdote. 74.202.210.170 19:05, 23 June 2026 (UTC)

I feel like there's more to be said about the specific version of p-hacking that sports commentators do. A few aspects I haven't seen in the explanation that are often tells of sports commentary p-hacking - and that Randall parodies here explicitly:

  • a very specific period, such as "the last 36 years". This usually means that right before that cutoff date, the pattern does not hold at all.
  • data point scarcity (specific to the World Cup and end-of-season playoffs): maybe the team didn't reach this stage of the competition at every opportunity

Also some more subtle omissions in this kind of commentary, which are also common:

  • ignoring circumstances: 36 years ago, the team or country may have been in a completely different state. Specifically for the World Cup, 36 years ago there was one Czechoslowakia, there were 2 Germanies, Uzbekistan wasn't a sovereign state, Curaçao didn't have a true national team, etc.
  • ignoring player turnover/churn: most of the current players were probably not on the team even 8 or 12 years ago (if this is specific to World Cup games), let alone their coaching staff. Blagae (talk) 08:16, 24 June 2026 (UTC)

This isn't really an Overly Narrow Superlative, as tvtropes defines it (or as the plain English of the term implies) - it is overly narrow, but it isn't superlative. The announcers aren't claiming that this is the worst/best team when faced with this hyper-specific scenario. They're just stating the facts that they are 0-2 in the scenario. It could possibly fit if it were phrased as being *similar* to that trope, but it doesn't fit as is. The subsequent paragraph implies that the point is to narrow the relevant data points to 2 games; I would suggest that the point is to narrow the relevant data points to something that gives an impressive result (like never winning in that situation, or never losing). 2 games isn't the intent, it's the consequence. I also don't think there is jargon there - all of the terminology would be familiar to a casual viewer of the sport; even people unfamiliar would be unlikely to be fazed by any, except perhaps the "scoring in the 37th minute". And "As an equivalent example, in the 2022 World Cup, 14 group stage games (out of 48) and 9 knockout stage games (out of 16) may have at some point reached a 2-1 scoreline for one or other team, depending upon the order the respective teams' goals occurred" - presumably that's based on a count of games in which at least one team has reached 2 goals, and the other has reached at least 1 goal, so they *may* have been 2-1 at some point. But surely there's enough information out there to identify which games *actually* were 2-1 at some point. I'd make changes, but given that I've butted heads here with the person I suspect wrote most of that, it's maybe better that someone else do it.163.116.145.44 18:40, 24 June 2026 (UTC)

I think you butted heads with someone else, but it's me that only went so far to get to the "may have been" stage (I was mostly looking for various other details, across many other past World Cups). The decision to present 2022 as an example was after I'd stopped looking at specific tournaments. I could go back in a check this for sure, but had rather consciously left it as an exercise for the next editor who cared enough to dive into the same sort of infodumps. As mentioned in comments. Please, if you get there before I do, solidify it in your own way.
I'm also absolutely not confused by "in the 37th minute", that's a fairly common (stylee of) match-report phrase. At least in British commentaries. Because stoppages are rare, in our football, it's a handy metric that doesn't need to keep track of too many unscheduled pauses in playing-time.
Other bits of the phrase sound more American (the "oh-for-two"), though are creeping into British parlance so perhaps aren't totally incomprehensible here. Can't speak for the rest of the anglosphere (beyond Canada, being already experiencing the North American traditions) or non-anglospheric readers of all kinds, though.
I had pondered the EverythingExceptMostThings link, instead. But it seemed to fit as the ONS type more. "This fact I am giving has solid gold evidence behind it (though it is actually based upon very little of that evidence, it's perfect as far as it goes)." Meh, it could easily be rewritten as you suggest if it pains any for it to be otherwise. 81.179.200.152 19:52, 24 June 2026 (UTC)


There was a funny comparison of football/soccer sports commentary now and then by the German comedian Urban Priol: Ernst Huberty in the 1970ies in a slow, somniferous voice: "Vogts ... to Beckenbauer ... Beckenbauer ... to Overath ... back to Vogts ... got the ball ... on to Hoeness ... Hoeness to Müller ... Müller ... Müller ... goal!" and a recent commentary, a flood of words with lots of irrelevant details: " bla bla bla ... I can tell you all these details because nothing important happens on the field ... bla bla bla ... oups, now we missed the goal" ;-) 2A01:599:447:B511:6D96:2BD4:9643:DA1C 19:11, 24 June 2026 (UTC)
      comment.png  Add comment