Where to Get the Unicode Character Database

As the Unicode Character Database is updated relatively frequently, it's a good idea not to rely too heavily on the version on the CD that comes with the Unicode standard. In fact, it's almost obsolete right now, as the structure of the data files was changed when Unicode 3.1 came out. It'll be mostly right, but may differ in some particulars from the most current version, and it'll be organized differently.

You can always find the most current version on the Unicode Web and FTP sites. The URL of the Unicode Data page, which includes links to all the files, is

http://www.unicode.org/unicode/onlinedat/online.html

The current version of the Unicode Character Database is always at

http://www.unicode.org/Public/UNIDATA/ ...

Get Unicode Demystified now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.