UTF-8
- Variable number of bytes per codepoint
- U+0000…U+007F encoded exactly the same as ASCII
- U+0080…U+07FF 2 bytes per codepoint
- U+0800…U+FFFF 3 bytes per codepoint
- U+10000…U+10FFFF 4 bytes per codepoint
- How?
- You won’t believe this one simple trick!