Open round | 20 points | 50.00% | Problem statement | Official solution | Tags: Computational
We need to reverse-engineer the algorithm of each computer.
Now for C2. Here are a few hypotheses that don't work:
Our biggest problem is why it accepts "The linguist visited the spy." but rejects "The woodcarver visited the programmer.", when none of the words are unknown and both structures are highly regular. If it's not the word and not the structure, then it must be how the words are placed into the structure. Indeed, for a pattern like "The X Y the Z.", "linguist" has appeared as "X" (once), "visited" has appeared as "Y" (7 times), and "spy" has appeared as "Z" (twice), but "woodcarver" has never appeared as "X". Therefore, C2 takes distribution into account. To verify this hypothesis, we must count the distribution of each word in the training set.

Note that it also accepts "The main concierge saw the blacksmith.", which has an extra adjective, so the distribution is not about a word's absolute index in the sentence but about its position relative to landmarks. Let's suppose that C2 learned the following templates (I'm assuming it learned that the same categories B, C, and D can be reused across templates, instead of learning one category per template; this turned out to be a correct, or at least harmless, assumption):
| A | B | C | D | E |
|---|---|---|---|---|
| cheerful famous happy main talented | ballerina calligrapher cartoonist concierge detective haberdasher linguist programmer spy watchmaker | met saw visited | astronaut ballerina blacksmith linguist programmer spy woodcarver yodeler | asleep famous happy tall knowledgeable |
Indeed, we can verify that all sentences accepted by C2 follow one of these templates, and all sentences rejected by C2 violate these templates. Therefore:
31. "tall" not in A; (a) = U
32. Follows template 2; (b) = G
33. Doesn't follow a template; (c) = U
34. "yodeler" not in B; (d) = U
35. Follows template 1; (e) = G
36. Follows template 2; (g) = G
37. "talented" not in E; (j) = U
38. "cartoonist" not in D; (m) = U
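The checks above can be sketched mechanically. The category table is copied from above; the template shape encoded here, "The (A) B C the D.", is my own reconstruction from the accepted examples ("The linguist visited the spy.", "The main concierge saw the blacksmith."), so treat it as a placeholder for whatever C2 actually learned rather than the real grammar.

```python
# Category table from the text; TEMPLATE_1 is a hypothetical
# reconstruction of the attributive frame "The (A) B C the D."
CATS = {
    "A": {"cheerful", "famous", "happy", "main", "talented"},
    "B": {"ballerina", "calligrapher", "cartoonist", "concierge",
          "detective", "haberdasher", "linguist", "programmer",
          "spy", "watchmaker"},
    "C": {"met", "saw", "visited"},
    "D": {"astronaut", "ballerina", "blacksmith", "linguist",
          "programmer", "spy", "woodcarver", "yodeler"},
    "E": {"asleep", "famous", "happy", "tall", "knowledgeable"},
}

def matches(sentence, slots):
    """Check a sentence against a template given as a list of slots.
    A slot is a literal word, a category letter, or "A?" for optional."""
    words = sentence.rstrip(".").lower().split()

    def go(i, j):
        if j == len(slots):
            return i == len(words)
        slot = slots[j]
        optional = slot.endswith("?")
        name = slot.rstrip("?")
        ok = i < len(words) and (
            words[i] in CATS[name] if name in CATS else words[i] == name
        )
        if ok and go(i + 1, j + 1):
            return True
        return optional and go(i, j + 1)

    return go(0, 0)

TEMPLATE_1 = ["the", "A?", "B", "C", "the", "D"]
print(matches("The linguist visited the spy.", TEMPLATE_1))          # True
print(matches("The main concierge saw the blacksmith.", TEMPLATE_1)) # True
print(matches("The woodcarver visited the programmer.", TEMPLATE_1)) # False
```

The last rejection falls out for the reason given above: "woodcarver" is in D but not in B.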
In D2, "asleep" only appeared in position E; "happy" appeared in both B and E; "main" only appeared in position B. C2 judged both 39 and 40 G, so HIDDEN_WORD_1 = happy. It judged 41 U, which means HIDDEN_WORD_2 cannot occur in position E, so HIDDEN_WORD_2 = main. It judged 44 U, which means HIDDEN_WORD_3 cannot occur in position B, so HIDDEN_WORD_3 = asleep.
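This deduction can be run mechanically. The attested positions come from the text; the per-item encoding of C2's verdicts (which slot each test item put the hidden word in) is my own reconstruction, so the item groupings are assumptions.

```python
# Attested positions of each candidate word, from the distribution above.
positions = {"asleep": {"E"}, "happy": {"B", "E"}, "main": {"B"}}

def consistent(word, verdicts):
    # A word should be judged G in a slot iff it was attested there.
    return all((slot in positions[word]) == (v == "G")
               for slot, v in verdicts.items())

# Hypothetical encoding: items 39/40 place the word in B and E (both G),
# item 41 places it in E (U), item 44 places it in B (U).
for label, verdicts in [("HIDDEN_WORD_1", {"B": "G", "E": "G"}),
                        ("HIDDEN_WORD_2", {"E": "U"}),
                        ("HIDDEN_WORD_3", {"B": "U"})]:
    print(label, "=", [w for w in positions if consistent(w, verdicts)])
```

Only one candidate survives for each hidden word, matching the answers above.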
For me, "happy" is definitely both attributive and predicative; "main" is mostly only attributive (I can think of very limited contexts in which "he is main" is legal, usually as slang); "asleep" is mostly only predicative (the attributive version is "sleeping"). So I give the same judgments as C2 and differ from C3 on two.
For D4, here are the ciphered texts and their contexts:
The problem setup says that all these words have something to do with morphology. Therefore "QGTRHUU" has something to do with "small"; naturally the "TRHUU" part stands for "small", making "QG" stand for "en" (i.e., make), which is consistent with the later mention of "enlarge".[^1] Now we can just substitute these deciphered letters into the other words and keep guessing as more of each word is revealed:
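The mechanical part of this guessing loop is a partial substitution. A minimal sketch using only the mappings deduced so far (TRHUU = "small", QG = "en"); undeciphered letters stay uppercase so the gaps remain visible:

```python
# Letter mappings deduced above: TRHUU -> "small", QG -> "en".
key = {"T": "s", "R": "m", "H": "a", "U": "l", "Q": "e", "G": "n"}

def decipher(word):
    # Substitute known letters; leave unknown cipher letters as-is.
    return "".join(key.get(ch, ch) for ch in word)

print(decipher("QGTRHUU"))  # ensmall
```

Running every remaining ciphertext through `decipher` after each new guess is exactly the "keep guessing as more is revealed" step.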
[^1]: Funny aside: in my CS data structures class, the functions to shrink/grow a hash map are actually called "ensmallen"/"embiggen". I can never get over the double causative morphology.