Global by Design

Adventures in Web Globalization

Global by Design header image 2

Unicode: Bigger & Better

Written by John Yunker Posted on by John Yunker

John is president of Byte Level Research and author of The Web Globalization Report Card. He is based in San Diego, California.

The Unicode Consortion has released Version 4.0. For those not familiar with it, Unicode is “the fundamental specification for the representation of text, at the core of all modern software, programming languages, and standards, including Windows, Java, C#, Perl, XML, HTML, DB2, Oracle, and many others.”

How is Version 4.0 better than previous versions? Here’s what the press release says:

Version 4.0 encodes over 96,000 characters, twice as many as Version 3.0,

and includes two record-breaking collections of encoded characters. The

largest encoded character collection for Chinese characters in the history

of computing has doubled in size yet again to encompass over 2000 years of

Chinese, Japanese, Korean, and Vietnamese literary usage, including all the

main classical dictionaries of these languages. Version 4.0 also encodes

the largest set of characters for mathematical and technical publishing in

existence.

Unicode is a remarkable achievement. I highly recommend taking a few moments to visit the Web site: www.unicode.org.

Possibly related posts:

Tags: Web Globalization