ABOUT UNICODE

The evolution of Unicode began more than a decade ago, when the Unicode Consortium and an ISO working group set out to create a universal character set. Over the years, Unicode has expanded to include more and more characters. Unicode 3.1, the most current version available at the time of writing, includes 94,140 characters, a far cry from the 128 characters of ASCII (see Figure 1).

Figure 1. For more information, visit the Unicode Consortium web site at www.unicode.org.

Unicode was designed to be backward-compatible with as many major character sets as possible. For example, the first 256 characters of Unicode look strikingly similar to Latin 1, ...
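The overlap with Latin 1 can be checked directly. The following is a minimal sketch, assuming Python 3 and its built-in "latin-1" codec (ISO-8859-1); the function name is hypothetical and chosen only for illustration. It decodes each of the 256 possible byte values as Latin 1 and confirms that the resulting Unicode code point has the same numeric value as the byte.

```python
def latin1_matches_unicode() -> bool:
    """Return True if every Latin 1 byte decodes to the same-numbered code point.

    Hypothetical helper for illustration; not part of any library.
    """
    for byte_value in range(256):
        # Decode a single byte using the ISO-8859-1 (Latin 1) codec.
        character = bytes([byte_value]).decode("latin-1")
        # ord() gives the Unicode code point of the decoded character.
        if ord(character) != byte_value:
            return False
    return True


if __name__ == "__main__":
    print(latin1_matches_unicode())
    # One concrete case: byte 0xE9 in Latin 1 is "é", which is U+00E9 in Unicode.
    print(hex(ord(b"\xe9".decode("latin-1"))))
```

This checks only the numeric correspondence between Latin 1 byte values and Unicode code points. It says nothing about how those code points are stored in a particular Unicode encoding such as UTF-8 or UTF-16.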
