Open round | 15 points | 12.50% | Problem statement | Official solution | Tags: Writing system
First note that the 16th century and 20th century writing systems are identical, except the latter adds some diacritics, so the latter contains more information than the former. This is useful for G3.
Since the 20th century system contains strictly more information, we should try to match it with IPA (instead of 16th), since the IPA is also rich in information. Below, I align them by consonants/vowels. "tl", "ch", and "tz" are well-known to be complex consonants, so I've grouped them. In case where the vowels don't match exactly (there are two vowels in the 20th century system but only one in IPA), I'm including the previous or next consonant as well, because the missing vowel may have been absorbed into either of the neighboring sounds. For example, "hui" becomes /wi/, which may either because h → /w/, ui → /i/, or because hu → /w/, i → /i/. "#" indicates word boundary.
Here are all unique correspondences:
Notice a few patterns for the diacritics: horizontal bars like ā and ē indicate lengthening; grave accents like ì and à and circumflex like ê indicate glottalization. We have a few complex segments left to dissect:
Note three things:
This strongly suggests that "uh", "hu", "qu", "uc", "cu" are all complex segments that rewrite to a single consonant:
This is already sufficient for G1, because we have a 1-to-1 mapping from 20th century to IPA. For G1, we first need to look up each 16th century word in the 20th century system, because the latter contains more information such as length and glottalization.
a. y e # n i hu ā l l â → /j e # n i w aː l l aʔ/
b. a n t o c n ī hu ā n i n → /a n t o k n i w aː n i n/
c. qu i n # ì cu ā c → /k i n # iʔ kʷ aː k/
In G2, we need to do the reverse, which is a bit more involved because currently we can have the same IPA sound rewrite to multiple 20th century segments:
For each one, it means there must exist some context that allows us to deterministically pick one of the alternatives, so we just need to list out all the contexts.
For glottalization, the pattern is that the circumflex only appears at the word end, while the grave accent only appears within a word or at the start. For /w/, /k/, and /kʷ/, it has to do with whether a vowel comes after: whenever there's a vowel after (including both before and after), "hu", "qu", and "cu" are used; otherwise, "uh", "c", and "uc" are used.
Therefore:
a. /w eː w eʔ/ → "hu ē hu ê"
b. /tʃ o k o l aː tɬ/ → "ch o c o l ā t l"
c. /m i k tɬ aː n t eː kʷ tɬ i/ → "m i c tl ā n t ē uc t l i"
Finally for G3, we've been doing this all along: it's possible to rewrite back and forth between the 20th century system and IPA, so they are bijective. 16th century system is a lossy translation and cannot be deterministically rewritten back.