Editing Talk:2533: Slope Hypothesis Testing

Jump to: navigation, search
Ambox notice.png Please sign your posts with ~~~~

Warning: You are not logged in. Your IP address will be publicly visible if you make any edits. If you log in or create an account, your edits will be attributed to your username, along with other benefits.

The edit can be undone. Please check the comparison below to verify that this is what you want to do, and then save the changes below to finish undoing the edit.
Latest revision Your text
Line 20: Line 20:
 
:: You seem to be the only person so far who's learned in academia why this is wrong.  Is the current state of the article correct? [[Special:Contributions/172.70.114.3|172.70.114.3]] 14:31, 26 October 2021 (UTC)
 
:: You seem to be the only person so far who's learned in academia why this is wrong.  Is the current state of the article correct? [[Special:Contributions/172.70.114.3|172.70.114.3]] 14:31, 26 October 2021 (UTC)
 
: The scientific error isn't quite what people are saying it is. The issue here is not "reusing a single test score" or an issue with non-normality of errors, the issue is that the data are *nested* within participants and that isn't being accounted for. There are fairly standard ways of managing this, at least in the social science literature (and these ways are statistically valid), most commonly the use of multilevel modeling (also known as hierarchical linear modeling). This accounts for the correlated nature of the errors. Now, even using the right method they're not going to attain statistical significance, but at least they aren't making a statistical mistake.
 
: The scientific error isn't quite what people are saying it is. The issue here is not "reusing a single test score" or an issue with non-normality of errors, the issue is that the data are *nested* within participants and that isn't being accounted for. There are fairly standard ways of managing this, at least in the social science literature (and these ways are statistically valid), most commonly the use of multilevel modeling (also known as hierarchical linear modeling). This accounts for the correlated nature of the errors. Now, even using the right method they're not going to attain statistical significance, but at least they aren't making a statistical mistake.
:: You can look at it in different ways. Obviously the error they made in their research was sampling the same people repeatedly and assuming that could increase the significance of their result. But mathematically, the analysis fails because the errors are not normally distributed. The math doesn't care about how you gather data. If the random variables ε is normally distributed, you can calculate confidence intervals for the parameters and for predictions exactly. The sum of squared residuals will have a chi square distribution. In particular, this means the estimator for beta will be normally distributed, and therefore the standard error in the point estimator will have a (scaled and translated) t distribution. Knowing this, you can use the CDF for the t distribution to compute the p-value exactly. To explain what's wrong with this, we need to spot which assumption was in error. Here, the false assumption was the normality of ε. Even if ε is not normally distributed, as long as it has finite mean and variance, the mean of n independent samples will converge in distribution to z as n goes to infinity. But here, the samples are not independent. So even if you sampled these same kids infinitely many times, the distribution of errors will not converge to the normal distribution, so the assumption of normality remains false. (BTW, this is the same IP as before, just from my phone this time.) [[Special:Contributions/172.70.134.131|172.70.134.131]] 19:40, 27 October 2021 (UTC)
+
:: You can look at it in different ways. Obviously the error they made in their research was sampling the same people repeatedly and assuming that could increase the significance of their result. But mathematically, the analysis fails because the errors are not normally distributed. The math doesn't care about how you gather data. If the random variables ε is normally distributed, you can calculate confidence intervals for the parameters and for predictions exactly. Even if it isn't normally distributed, the central limit theorem guarantees that the sum of squared residuals will have a chi square distribution. In particular, this means the estimator for beta will be normally distributed, and therefore the standard error in the point estimator will have a (scaled and translated) t distribution. Knowing this, you can use the CDF for the t distribution to compute the p-value exactly. To explain what's wrong with this, we need to spot which assumption was in error. Here, the false assumption was the normality of ε. Even if ε is not normally distributed, as long as it has finite mean and variance, the mean of n independent samples will converge in distribution to z as n goes to infinity. But here, the samples are not independent. So even if you sampled these same kids infinitely many times, the distribution of errors will not converge to the normal distribution, so the assumption of normality remains false. (BTW, this is the same IP as before, just from my phone this time.) [[Special:Contributions/172.70.134.131|172.70.134.131]] 19:40, 27 October 2021 (UTC)
  
 
I don't think the title text speakers are unidentified, I'm pretty sure it's a direct continuation of the dialogue in the last panel. [[User:Esogalt|Esogalt]] ([[User talk:Esogalt|talk]]) 04:11, 26 October 2021 (UTC)
 
I don't think the title text speakers are unidentified, I'm pretty sure it's a direct continuation of the dialogue in the last panel. [[User:Esogalt|Esogalt]] ([[User talk:Esogalt|talk]]) 04:11, 26 October 2021 (UTC)

Please note that all contributions to explain xkcd may be edited, altered, or removed by other contributors. If you do not want your writing to be edited mercilessly, then do not submit it here.
You are also promising us that you wrote this yourself, or copied it from a public domain or similar free resource (see explain xkcd:Copyrights for details). Do not submit copyrighted work without permission!

To protect the wiki against automated edit spam, we kindly ask you to solve the following CAPTCHA:

Cancel | Editing help (opens in new window)

Template used on this page: