UTF-8 is a Brilliant Design — Vishnu's Pages

cm0002@lemmy.world · 6 days ago

Shihali@sh.itjust.works · 6 days ago

There is one big group of losers from UTF-8: people writing in Greek or Cyrillic, mostly in Eastern and Southeastern Europe. Those letters need two bytes each in UTF-8 and there’s no way around it, while the national code pages (KOI8-R, Windows-1251, ISO 8859-7 and the like) used only one byte per letter.

On the other hand, unless you have a big database that is strictly monolingual (or strictly national language plus English), the flexibility seems worth the hit to size.
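
For anyone who wants to see the size hit in numbers, here is a minimal Python sketch. The helper name `encoded_sizes` and the sample strings are made up for illustration; `cp1251` and `iso8859_7` are the standard Python codec names for Windows-1251 and ISO 8859-7.

```python
# Compare how many bytes the same text takes in UTF-8
# versus a legacy single-byte code page.
def encoded_sizes(text: str, legacy_codec: str) -> tuple[int, int]:
    """Return (utf8_byte_count, legacy_byte_count) for text."""
    return len(text.encode("utf-8")), len(text.encode(legacy_codec))

russian = "Привет"        # 6 Cyrillic letters
greek = "αβγδ"            # 4 Greek letters
mixed = "<p>Привет</p>"   # 7 ASCII characters + 6 Cyrillic letters

print(encoded_sizes(russian, "cp1251"))    # (12, 6)
print(encoded_sizes(greek, "iso8859_7"))   # (8, 4)
print(encoded_sizes(mixed, "cp1251"))      # (19, 13)
```

Pure Cyrillic or Greek text doubles in size, but as soon as ASCII markup, digits, or English is mixed in, the overhead drops well below 2x, since ASCII stays at one byte per character in UTF-8.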