Unicode

This section looks a little more closely at the character set that XML uses: Unicode.

Note

Strictly speaking, XML uses the International Standard ISO/IEC 10646. Unfortunately, the International Organization for Standardization (ISO) does not make its standards freely available on the Web in the way that the World Wide Web Consortium (W3C) does. The Unicode encoding follows ISO/IEC 10646 and is accessible via the Web.

The Unicode Consortium created a character encoding that has gained global acceptance. Before Unicode emerged, many encoding schemes existed that were, for most practical purposes, incompatible with one another. There is, of course, no intrinsic ...
