The best way to explain how an XML document is composed is to present one. The following example shows an XML document you might use to describe two authors:
<?xml version="1.0" encoding="us-ascii"?> <authors> <person id="lear"> <name>Edward Lear</name> <nationality>British</nationality> </person> <person id="asimov"> <name>Isaac Asimov</name> <nationality>American</nationality> </person> <person id="mysteryperson"/> </authors>
The first line of the document is known as the XML declaration. This tells a processing application which version of XML you are using (the version indicator is mandatory) and which character encoding you have used for the document. In the previous example, the document is encoded in ASCII. (The significance of character encoding is covered later in this chapter.) If the XML declaration is omitted, a processor will make certain assumptions about your document. In particular, it will expect it to be encoded in UTF-8, an encoding of the Unicode character set. However, it is best to use the XML declaration wherever possible, both to avoid confusion over the character encoding and to indicate to processors which version of XML you’re using.
The second line of the example begins an
element, which has been named “authors.” The
contents of that element include everything between the right angle
<authors> and the left angle bracket
</authors>. The actual syntactic