• Shihali@sh.itjust.works
    link
    fedilink
    English
    arrow-up
    1
    ·
    6 days ago

    There is one big group of losers from UTF-8: Eastern Europeans. Greek and Cyrillic need two bytes per letter and there’s no way around it in Unicode, while national code pages only used one byte per letter.

    On the other hand, unless you have a big strictly monolingual database (or strictly national language/English), it seems worth taking the hit to size for the flexibility.