WHICH CHARACTER SET?

More than 20 Chinese and Japanese character sets and encodings have evolved over the years, making localization very confusing. Particularly confusing is the difference between a Chinese encoding and a Chinese character set. As mentioned in Chapter 2, “Navigating the Multilingual Internet,” the character set is a group of characters, and an encoding is the mapping of characters to numbers so that computers can display them. Sometimes a character set and an encoding are one and the same, as with Big5 and GB-18030; you can call these coded character sets. Figure 13 charts the more popular character sets and encodings and shows what languages they represent. For a thorough explanation of Asian character sets ...

Get Beyond Borders: Web Globalization Strategies now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.