Chapter 29. Character sets

XML markup and document text can only be recognized when the characters they are comprised of conform to recognized standard encoding schemes. This chapter describes character encoding schemes in general, and the most important standards used today (including ASCII, ISO 8859, Unicode and ISO 10646). An understanding of the intentions and limitations of these formats is fundamental to the appreciation of the purpose and scope of XML.

Get XML Companion, The, Third Edition now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.