Character Groups

For convenience, we can use character groups to stand for groups of characters. Specifying L is simpler than specifying all letters, particularly if you have to allow for the many thousands of characters defined in the Unicode standard. The character groups are used with the \p and \P operators; \p{L} means all letters, while \P{L} means everything except letters. The character groups and their names are defined in the Unicode Character Database, available online here: http://www.unicode.org/Public/UNIDATA/UCD.html.

The following lists are the character groups.

Get XSLT, 2nd Edition now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.