NACLO 2026 - Problem HA Tale of Three Scripts

Open round | 15 points | N/A | Problem statement | Official solution | Tags: Writing system

It's relatively easy to tell the three scripts apart. From H3, we know that "π”€π”šπ”π”Žπ”“π”ˆ" is Elbasan. From H4, we know that "π—Žπ—’π—‡π—‰π—’" is Todhri1. Just look for similar letters, especially "π”š" and "𐔓".

Latin1. mbasi
3. njeriu
10. Γ«shtΓ«
13. gΓ«zojnΓ«
14. kohΓ«n
15. pΓ«rsa
A. njohja
C. lindur
D. kundΓ«r
E. shtetet
I. burrave
L. gjithΓ«
Elbasan2. π”π”Ÿπ”šπ”€π” π”‡
4. π”π”π”…π”Ÿπ”™
8. π”’π”π”™π”–π”Œπ”‡π”“
B. 𐔓𐔑𐔍𐔝𐔇𐔝
F. π”•π”‡π”™π”π”Ÿ
G. π”Šπ”ˆπ”£π”–π”Žπ”“π”ˆ
M. 𐔏𐔖𐔒𐔁𐔇𐔠𐔇
O. π”ˆπ”œπ”π”ˆ
Todhri5. π—”π—šπ—ƒπ—‰π—₯𐗉
6. π—”π—€π—‡π—Šπ—
7. π—Žπ—’π—£π—Š
9. π—™π—šπ—‘π—€
11. 𐗧𐗒𐗒𐗉𐗒
12. 𐗑𐗉𐗒𐗉𐗒
H. 𐗃𐗁𐗒
J. π—ƒπ—π—šπ—π—‰π—˜
K. π—”π—šπ—π—Šπ—˜
N. π—›π—Šπ—π—Ÿπ—€

The scripts look more or less all alphabetic, although they probably don't correspond glyph-to-glyph, because we have H. 𐗃𐗁𐗒 but no word on the left hand side has only 3 glyphs. However this still means we can match letter by letter. (Remember Armenian?)

First notice 10. Γ«shtΓ« and O. π”ˆπ”œπ”π”ˆ. These are the only two words that start and end with the same glyph, so they must match. So in Elbasan: π”ˆ = Γ«, π”œ = sh, 𐔝 = t (I'm assuming left-to-right; it will be justified very soon). Therefore, 13. gΓ«zojnΓ« = G. π”Šπ”ˆπ”£π”–π”Žπ”“π”ˆ, giving us π”Š = g, 𐔣 = z, 𐔖 = o, π”Ž = j, 𐔓 = n. It's reassuring seeing how many of these glyphs look similarβ€”this means we are on the right track. No other Latin-Elbasan correspondences are apparent.

In the left column we have two other Latin words containing "Γ«": 14. kohΓ«n and 15. pΓ«rsa, but no other Elbasan words in the right column contain π”ˆ, so these two words have to match to Todhri. One has "Γ«" in the penultimate place, the other in the second place. We quickly locate K. π—”π—šπ—π—Šπ—˜ and N. π—›π—Šπ—π—Ÿπ—€. Therefore, in Todhri, 𐗛 = p, π—Š = Γ«, 𐗝 = r, π—Ÿ = s, 𐗀 = a, 𐗔 = k, π—š = o, 𐗐 = h, π—˜ = n (again, assuming left-to-right).

In the right column we have two other Latin words containing "Γ«": D. kundΓ«r and L. gjithΓ«, but again no other Elbasan words in the left column contain π”ˆ, so again look in Todhri for π—Š. We find 6. π—”π—€π—‡π—Šπ— and 7. π—Žπ—’π—£π—Šβ€”one with π—Š in the penultimate position and the other in the last position, and with 6 beginning with "k" and ending with "r"; everything is very nicely consistent. This tells us that in Todhri, 𐗀 = un, 𐗇 = d, π—Ž = gj, 𐗒 = i, 𐗣 = th.

Now we've exploited all the "Γ«" correspondences. We go back to finding similar glyph sequences. Notice we have three words that end with "ABA" on both sides: 5. π—”π—šπ—ƒπ—‰π—₯𐗉, 11. 𐗧𐗒𐗒𐗉𐗒, 12. 𐗑𐗉𐗒𐗉𐗒; E. shtetet, B. 𐔓𐔑𐔍𐔝𐔇𐔝, M. 𐔏𐔖𐔒𐔁𐔇𐔠𐔇. Because in B "𐔝𐔇𐔝" and M "𐔇𐔠𐔇", we have "𐔇" occurring either as "A" or "B", they must correspond to "𐗉π—₯𐗉" and "𐗒𐗉𐗒" in some other order. This leaves E "tet" to correspond to "𐗒𐗉𐗒" (one of 11 or 12). Therefore in Todhri, 𐗒 = t, 𐗉 = e.

In J. π—ƒπ—π—šπ—π—‰π—˜, we have a word ending in "n". We know that "n" is "𐔓" in Elbasan, and no other Latin word on the left ends in "n", so 8. π”’π”π”™π”–π”Œπ”‡π”“ = J. π—ƒπ—π—šπ—π—‰π—˜. We already know that π—π—šπ—π—‰π—˜ = "rohen" by all the work above, so in Elbasan, 𐔙 = r, π”Œ = h, 𐔇 = e.

Now go back to the three "ABA" words. They have now become: 5. ko𐗃eπ—₯e, 11. 𐗧itet, 12. 𐗑etet; E. shtetet, B. n𐔑𐔍tet, M. 𐔏o𐔒𐔁e𐔠e. Therefore 5 = M, 11 = B, 12 = E. In Elbasan, 𐔍 = i, 𐔏 = k. In Todhri, 𐗑 = sht. (We try not to make Elbasan-Todhri correspondences to avoid complicating things.)

In 4. π”π”π”…π”Ÿπ”™, the word ends in "r". The only word on the right ending in "r" is C. lindur. So in Elbasan, 𐔐 = l, 𐔅 = nd, π”Ÿ = u.

Now we don't have many words left and most of the vowels figured out, so we can match them based on vowel sequences.

Let's list out the alphabet we have so far:

LatinElbasanTodhriLatinElbasanTodhri
a𐔀𐗀as𐗁
b𐔁gjπ—Ž
e𐔇𐗉hj𐗑
Γ«π”ˆπ—Šmb𐗃
gπ”Šnd𐔅𐗇
hπ”Œnj𐔕𐗙
i𐔍𐗒n?𐔓𐔑𐗧
jπ”Žrrπ”š
k𐔏𐗔shπ”œ
l𐔐sht𐗑
nπ”“π—˜th𐗣
oπ”–π—š
p𐗛
r𐔙𐗝
sπ—Ÿ
t𐔝𐗒
uπ”Ÿπ—€
v𐔠π—₯
z𐔣

(I've reanalyzed "π—”π—€π—‡π—Šπ—" as "k-u-nd-Γ«-r" instead of "k-un-d-Γ«-r". Although both are compatible with the data since "𐗀" and "𐗇" only appeared once, we have Elbasan for "nd" but not "un", so the "nd" bigram seems more justified.)

E2, E3, E4 are just table lookup. Note that "π—Žπ—’π—‡π—‰π—’" contains the "𐗇" character which I've revised to be "nd" instead of just "d". I don't know if treating it as just "d" is allowed, since it's also compatible with the input data.

Footnotes

  1. The Todhri script was added to Unicode in Unicode 16 (2024), so at the time of writing, very few fonts support it. I had to use Noto Serif's Todhri font. ↩