If a font's encoding is UTF8, 8-bit and 16-bit strings are not taken as "straight" character codes. Instead, 8-bit strings are decoded as UTF-8 and 16-bit strings as UTF-16. For example, the UTF-8 byte sequence C2 A3 represents character &A3, and the UTF-16 halfword sequence D800 DC00 represents character &10000. 32-bit strings are treated normally: each word is taken directly as a character code, so the string is in effect UCS-4.
|
|
|
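The decoding rules described above can be illustrated with a minimal Python sketch. It is not the Font Manager's own code: the function names `decode_utf8`, `decode_utf16` and `decode_ucs4` are hypothetical, and passing unpaired surrogates through unchanged is an assumption, since the text does not say how malformed input is handled.

```python
def decode_utf8(data: bytes) -> list[int]:
    """Decode an 8-bit string as UTF-8 into a list of character codes."""
    # Python's built-in decoder raises UnicodeDecodeError on malformed input.
    return [ord(ch) for ch in data.decode("utf-8")]


def decode_utf16(halfwords: list[int]) -> list[int]:
    """Decode a 16-bit string (a list of halfwords) as UTF-16."""
    codes = []
    i = 0
    while i < len(halfwords):
        unit = halfwords[i]
        is_high = 0xD800 <= unit <= 0xDBFF
        has_low = i + 1 < len(halfwords) and 0xDC00 <= halfwords[i + 1] <= 0xDFFF
        if is_high and has_low:
            # Combine a high/low surrogate pair into one character code.
            low = halfwords[i + 1]
            codes.append(0x10000 + ((unit - 0xD800) << 10) + (low - 0xDC00))
            i += 2
        else:
            # Assumption: anything else, including an unpaired surrogate,
            # is passed through unchanged.
            codes.append(unit)
            i += 1
    return codes


def decode_ucs4(words: list[int]) -> list[int]:
    """A 32-bit string needs no decoding: each word is a character code."""
    return list(words)


# The two examples from the text:
print([hex(c) for c in decode_utf8(bytes([0xC2, 0xA3]))])   # ['0xa3']
print([hex(c) for c in decode_utf16([0xD800, 0xDC00])])     # ['0x10000']
```

In all three cases the result is the same kind of value, a sequence of character codes, which is why 32-bit strings need no special handling.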