Talk:1313: Regex Golf

Explain xkcd: It's 'cause you're dumb.
Revision as of 17:29, 4 November 2022 by PoolloverNathan (talk | contribs)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search

This is fairly simple fun little one.

Regex is sort for regular expressions. A regular expression is a series of characters that denotes a search criteria. For example, you could write a regular expression that would search for anything that looks like an address (a la comic 208).

Regex golf is a game in which you attempt to write a regular expression that will search through a list of items and bring back only those items that meet a certain criteria, but not anything else. The joke is that regular expressions are used to search text, but themselves are text strings. This means that you could write a regular expression that would look for another regular expression. You can then apply ad infinitum, and the universe implodes or something.

--Holshy (talk) 05:40, 6 January 2014 (UTC)

The last panel includes, of course, a regex "/(meta-)*regex golf/," which represents the phrase "regex golf" preceded by the phrase "meta-" up to infinite times.

As a punchline, it also refers to Jamie Zawinski's well-known quote about regex,

Some people, when confronted with a problem, think "I know, I'll use regular expressions." Now they have two problems.

Thus, the punchline is that the addition of meta layers to regex golf generates more problems for the programmer, but that was also the setup of the comic. So either the punchline is really weak—worth a chuckle if you got the above two references—or I missed the joke. 06:22, 6 January 2014 (UTC)

Could anybody comment on the first regex? Do I get it right that beyond others it will match all strings that contain a "b"? I can hardly believe that is not the case for any star trek subtitle... 06:54, 6 January 2014 (UTC)

This is the case for all Star Trek Subtitles. Wikipedia's list of movies had no b. It'll match anything containing a word ending in m, any word beginning with n or t that is not the first word, or any word with a b. No Trek movies match. Oddly, so far as I can figure out, the regex in the first panel is wrong, in that it doesn't match the second Star Wars movie at all. And before you tell me prequels don't count, the sole purpose of "m " is to match The Phantom Menace. 07:10, 6 January 2014 (UTC)

Attack of[ t]he Clones (to be read plainly, not as a regular expression). 07:29, 6 January 2014 (UTC)

Ah, I thought it was The Clone Wars. 15:36, 6 January 2014 (UTC)

So, if I add an "e" to the "tn" and delete the "|b" I'm a better golf player than her? 08:23, 6 January 2014 (UTC)

Or you could just move the "b" into the "tn" group. --11:08, 6 January 2014 (UTC)

I got a sneak preview of this comic at about 6:34 first it appeared to be random text in a irc message, but with this comic it now makes sense to me. Verticalbar (talk) 09:31, 6 January 2014 (UTC)

Regex golf (c.f. Perl golf) is a programming competition / is a pastime of finding regular expression that matches one set of strings while matching none of the other set. See for example --JakubNarebski (talk) 11:03, 6 January 2014 (UTC)

The title text isn't exactly true... I haven't tried everything, but that regex doesn't match "gerald ford" at all. 11:23, 6 January 2014 (UTC)

Gerald Ford wasn't elected, he became President following Nixon's resignation. 12:12, 6 January 2014 (UTC)

Inspired by, a reader built a page where the objective is to make a regular expression to match all Star Wars and no Star Trek: 14:00, 6 January 2014 (UTC)

I added a list of all US elected presidents and the part of the title regex they match. I used a python script to generate it, with input from here, then I removed all presidents that do not match after finding they really weren't elected. There may still be superflous ones, that weren't elected but do match the regex, please check. 14:29, 6 January 2014 (UTC)

Does anyone understand the final "No, I had those already"? Is it a reference to regexes in some way or could it be something like that there are infinite problems in life, even when not doing (Meta-)*-Regexes? -- 20:32, 6 January 2014 (UTC)

According to Peter Norvig (Director of research at google), one of the Regular Expression of Randall is wrong as demonstrated here : Mbussonn (talk) 20:47, 6 January 2014 (UTC)

It's happening. -- 11:39, 7 January 2014 (UTC)

"No one wins at [^ ]+ golf." 09:50, 7 January 2014 (UTC)

Gee, would that be "No one wins at \S+ golf."? IronyChef (talk) 23:57, 9 January 2014 (UTC)

Why does this say that it is Case Sensitive. As far as I can tell it would not work if that were true. 02:28, 7 February 2014 (UTC)

"Note that if one included the animated film “Star Wars: The Clone Wars” it would be matched by “ [tn]”." - I don't see how this is true, since the T is at the beginning of the subtitle. If this matched, then surely so would all of the original series Star Trek films. (talk) (please sign your comments with ~~~~)

"I got infinite problems and a bitch ain,t one" 15:50, 29 August 2014 (UTC) (talk) (please sign your comments with ~~~~)

Looks like the algorithm is a bit outdated. It fails to match The Force Awakens but matches Beyond-- 17:57, 5 September 2015 (UTC)

For the 2016 election, the regex predicts that a Democrat (either) will beat Donald Trump, who will win the Republican primaries. (talk) (please sign your comments with ~~~~)

I like that linked article, even though I'm not really into programming. Just noticed Norvig misspells Randall's last name as Monroe instead of Munroe. 03:42, 15 March 2016 (UTC)

How would Trump work with this? EDIT: Hillary works but Trump doesn't. 00:23, 14 November 2016 (UTC)

The article says that the Presidents Regex is now impossible to update after Trump's win over Hillary. However, if Hillary were to win in a future election, it would work again as per the rule stated above the list, wouldn't it? -- 09:26, 3 April 2017 (UTC)

This isn't true either - there was already a presidential loser whose surname was Clinton (DeWitt Clinton, 1812). So presumably Hillary Clinton is likewise not considered in terms of regex eligibility. -- 23:05, 13 April 2017 (UTC)

For the Star Wars/Star Trek golf, including the new films, I've got /m | [tn]|ba|a[sw]/. Can anyone do better? -- Misterblue28 (talk) (please sign your comments with ~~~~)

Including Star Wars films up to The Rise of Skywalker, I get /ke|a.t.|n.*h/. -- 20:46, 20 May 2019 (UTC)

Does this work for Trump v Hillary? (talk) (please sign your comments with ~~~~)

I was just wondering the same thing. Pretty sure it's now literally impossible, since you'd have to both match AND exclude "Clinton". 13:15, 2 October 2018 (UTC)

Regex golf with transcript[edit]

So I decided to play regex golf with the transcript (after learning regex) and it was kinda fun. Here's my list of lines to match:

[Megan is sitting at a laptop. Cueball is standing behind her.]

Megan: /m | [tn]|b/ matches Star Wars subtitles but not Star Trek.

[A close-up of Megan at her laptop.]

Cueball (offscreen): Uh oh...

[Megan typing at her laptop.]

[Cueball facepalming.]

[Another closeup of Megan at her laptop.]

Cueball: Now you have infinite problems.

Here's my list of lines not to match:

Regex golf:

Megan: You try to match one group but not the other.

Cueball: Cool.

Meta-regex golf:

Megan: So I wrote a program that plays regex golf with arbitrary lists...

Meta-meta-regex golf:

Megan: ...But I lost my code, so I'm grepping for files that look like regex golf solvers.

...And beyond:

Megan: Really, this is all /(meta-)*regex golf/.

Megan: No, I had those already.

And here's my (very bad) regex. \[|ms|h\. (Hey, I learned yesterday) 16:40, 17 October 2019 (UTC)

Wanted to drop this hairball from earlier:


PoolloverNathan[talk]UTSc 17:29, 4 November 2022 (UTC)