<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
		<id>https://www.explainxkcd.com/wiki/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Benjaminhwilliams</id>
		<title>explain xkcd - User contributions [en]</title>
		<link rel="self" type="application/atom+xml" href="https://www.explainxkcd.com/wiki/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Benjaminhwilliams"/>
		<link rel="alternate" type="text/html" href="https://www.explainxkcd.com/wiki/index.php/Special:Contributions/Benjaminhwilliams"/>
		<updated>2026-04-15T20:12:12Z</updated>
		<subtitle>User contributions</subtitle>
		<generator>MediaWiki 1.30.0</generator>

	<entry>
		<id>https://www.explainxkcd.com/wiki/index.php?title=2731:_K-Means_Clustering&amp;diff=305538</id>
		<title>2731: K-Means Clustering</title>
		<link rel="alternate" type="text/html" href="https://www.explainxkcd.com/wiki/index.php?title=2731:_K-Means_Clustering&amp;diff=305538"/>
				<updated>2023-01-30T18:36:46Z</updated>
		
		<summary type="html">&lt;p&gt;Benjaminhwilliams: Use em dashes for subordinate clause; use singular 'criterion' in place of plural 'criteria'.&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{comic&lt;br /&gt;
| number    = 2731&lt;br /&gt;
| date      = January 30, 2023&lt;br /&gt;
| title     = K-Means Clustering&lt;br /&gt;
| image     = k_means_clustering_2x.png&lt;br /&gt;
| imagesize = 320x385px&lt;br /&gt;
| noexpand  = true&lt;br /&gt;
| titletext = According to my especially unsupervised K-means clustering algorithm, there are currently about 8 billion types of people in the world.&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
==Explanation==&lt;br /&gt;
{{incomplete|Created by EITHER 8 BILLION OR 3 TYPES OF BOTS - Please change this comment when editing this page. Do NOT delete this tag too soon.}}&lt;br /&gt;
&lt;br /&gt;
{{w|K-means_clustering|''k''-means clustering}} is a method of categorising ''n'' vectors into ''k'' clusters. For example, we might categorise a population by two metrics, and want to best categorise this scatter graph into the distinct populations, algorithmically drawing {{w|Voronoi cell}}s to decide the within-cluster variances. &lt;br /&gt;
&lt;br /&gt;
Ponytail's determination that there are three clusters is unsurprising if she herself falls into the category of those who use K=3 as a fixed value, which will inevitably result in three data clusters regardless of actual distribution. The qualitative interpretation of the other two categories — that is, what placement in the other two categories means — is unclear as Ponytail's analysis is either using a binary criterion (whether or not one sorts data into three groups) as the basis for sorting people into three categories, or is a black box using unknown criteria and she has only been able to determine that her own group shares the tendency to group things into threes. &lt;br /&gt;
&lt;br /&gt;
The title text refers to a K-means algorithm with the opposite problem, with no reduction of K value to converge any two human beings into a common cluster based on shared traits. This is humorous because it would make such a clustering useless for the purposes for which a K-Means Clustering is typically used, such as of making insurance risk pools or targets of advertisement campaigns.&lt;br /&gt;
&lt;br /&gt;
Interestingly, by including the entire human population, the algorithm should be immune to bias in creating its input data. However, since every human is unique,{{cn}} the only way to have the clusters converge is to &amp;quot;throw out&amp;quot; some traits of humans as unimportant. This may be objectionable to humans who disagree with that assessment. In contrast, in a supervised algorithm, the training data is tagged with traits that the trainers seek. These traits could be applied in a manner that is socially unacceptable, and lead to AI behavior that reflects the biases of the trainers.&lt;br /&gt;
&lt;br /&gt;
==Transcript==&lt;br /&gt;
{{incomplete transcript|Do NOT delete this tag too soon.}}&lt;br /&gt;
:Ponytail is presenting on a stage, pointing a screen with a stick. The writings and possible figures on the screen are illegible.&lt;br /&gt;
&lt;br /&gt;
:Ponytail: Our analysis shows that there are three kinds of people in the world: Those who use ''k''-means clustering with ''k''=3, and two other types whose qualitative interpretation is unclear.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{comic discussion}}&lt;br /&gt;
[[Category:Comics featuring Ponytail]]&lt;/div&gt;</summary>
		<author><name>Benjaminhwilliams</name></author>	</entry>

	</feed>